Super handy classification with CatBoost

Andrew Yip
Feb 15, 2018 · 1 min read
CatBoost = gradient boosting on decision trees library with categorical features support out of the box

Yandex CatBoost is a Godsend.

Working on an intro to data science workshop for web developers, I am worried that the web devs will be scared away due to the immense and hair-tearing efforts needed in data-munging

And then CatBoost happened.

What is CatBoost?

CatBoost is an open-source gradient boosting on decision trees library with categorical features support out of the box for Python and R. (from GH repo)

Simply put, it’s a plug-and-play classifier in scikit-learn’s convention that would deal with categorical features automatically for you. Say bye to the days of getting dummies and scratching your head over what to do with text features.

All you need to change from your scikit-learn routine is this…

Checkout this Kaggle kernel to see it in action.

Have fun!

DevCJeddah

Bite-size technical hands-on from the Developer Circles…

Medium is an open platform where 170 million readers come to find insightful and dynamic thinking. Here, expert and undiscovered voices alike dive into the heart of any topic and bring new ideas to the surface. Learn more

Follow the writers, publications, and topics that matter to you, and you’ll see them on your homepage and in your inbox. Explore

If you have a story to tell, knowledge to share, or a perspective to offer — welcome home. It’s easy and free to post your thinking on any topic. Write on Medium

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store