Laurae: This post is about xgboost’s gblinear and its parameters. Elastic Net? Generalized Linear Model? Gradient Descent? Coordinate Descent?… The post was originally at Kaggle.
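The teaser above name-drops Elastic Net and Coordinate Descent, which are the ingredients behind gblinear. As a standalone refresher (not code from the post itself), here is a bare-bones coordinate-descent solver for an elastic-net linear model; the pure-Python implementation and function names are assumptions for illustration, with `alpha` and `lam` playing the roles of xgboost's `alpha` (L1) and `lambda` (L2) regularization parameters.

```python
def soft_threshold(z, gamma):
    # Soft-thresholding operator: this is where the L1 penalty
    # zeroes out small coefficients (sparsity).
    if z > gamma:
        return z - gamma
    if z < -gamma:
        return z + gamma
    return 0.0

def elastic_net_cd(X, y, alpha=0.1, lam=0.1, n_iter=100):
    """Coordinate descent for elastic-net linear regression.

    Minimizes (1/2n) * sum_i (y_i - x_i.w)^2
              + alpha * ||w||_1 + (lam/2) * ||w||_2^2.
    """
    n, p = len(X), len(X[0])
    w = [0.0] * p
    for _ in range(n_iter):
        # Update one coordinate at a time, holding the others fixed.
        for j in range(p):
            rho = 0.0  # correlation of feature j with the partial residual
            sq = 0.0   # (scaled) squared norm of feature j
            for i in range(n):
                pred = sum(w[k] * X[i][k] for k in range(p))
                # Partial residual: remove feature j's own contribution.
                resid = y[i] - pred + w[j] * X[i][j]
                rho += X[i][j] * resid
                sq += X[i][j] ** 2
            rho /= n
            sq /= n
            # Closed-form coordinate update for the elastic-net objective.
            w[j] = soft_threshold(rho, alpha) / (sq + lam)
    return w
```

With `alpha=0` and `lam=0` this reduces to ordinary least squares; raising `lam` shrinks coefficients toward zero and raising `alpha` zeroes some out entirely, which is the same trade-off gblinear's two regularization knobs control.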
Laurae: This post is about transforming data appropriately when you are provided a multivariate distribution (preferably with a small number of variables) to find an explanation of the internal interactions between variables. It takes Allstate as an example…
Laurae: This post is about the rationale behind over-predicting/under-predicting and the performance metric you are optimizing. It takes the Matthews Correlation Coefficient (MCC) as an example. The post was originally…
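Since the teaser names the MCC, a quick refresher (not taken from the post): MCC is computed from confusion-matrix counts, so the decision threshold — i.e. whether you over-predict or under-predict positives — directly changes the score. A minimal sketch in Python, with hypothetical helper names:

```python
import math

def mcc(tp, tn, fp, fn):
    # Matthews Correlation Coefficient from confusion-matrix counts.
    # Ranges from -1 (total disagreement) to +1 (perfect prediction).
    num = tp * tn - fp * fn
    den = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return num / den if den else 0.0

def confusion(y_true, y_prob, threshold):
    # Predict positive when the score clears the threshold.
    tp = tn = fp = fn = 0
    for t, p in zip(y_true, y_prob):
        pred = 1 if p >= threshold else 0
        if pred and t:
            tp += 1
        elif pred and not t:
            fp += 1
        elif not pred and t:
            fn += 1
        else:
            tn += 1
    return tp, tn, fp, fn
```

Sweeping `threshold` over a grid and keeping the one that maximizes `mcc(*confusion(y_true, y_prob, t))` makes the over-/under-prediction trade-off concrete: a low threshold inflates FP, a high one inflates FN, and MCC penalizes both through its denominator.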
Laurae: This post is about my team scoring 8th in the Bosch competition, written after the Leaderboard results (Private Leaderboard opened). I also earned two Master tiers from the Bosch competition…
Laurae: This post is about the downfall of assuming the generalization “good on the Public LB = good on the Private LB”. It is a particularly good demonstration of why you should not only follow your Public LB movements if…
Laurae: This post is about exploring a series of images in order to reconstruct the order they are supposed to be in. A little color theory knowledge is necessary (how colors are additive)…
Laurae: This post talks about hierarchical classification, but it also applies to hierarchical regression. When a label has a known hierarchy, it is tempting to split the label. But…
Laurae: This post is about a row ID leak that (openly) struck a competition 3 days before the end. The post was originally at Kaggle. The context of the leak can be found there, by fakeplastictrees. Without the available…
Laurae: This post is about how someone excelling at machine learning should transition to a business environment. It includes an extra post by inversion which summarizes an essential…