Image for post
Photo by Alex Holyoake on Unsplash

Clustering is the task of partitioning data points into groups (clusters) such that points within a group are more ‘similar’ to each other than to points outside it. But how does one decide whether two data points are similar? The definition of similarity is what distinguishes clustering methods from one another: K-Means assigns a point to the cluster whose centroid is closest to it, while DBSCAN groups together points that lie in the same dense region, that is, points connected to each other through closely packed neighbors.

In this article, we’ll take a look at these two clustering methods that are often used in unsupervised machine learning and implement them in Python. …
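
To make the two notions of similarity concrete before reaching for a library, here is a minimal standard-library sketch. It is an illustration under stated assumptions, not the implementation used later in the article: the helper names `nearest_centroid` and `density_clusters`, the toy points, the fixed centroids, and the values of `eps` and `min_pts` are all hypothetical choices. The K-Means part shows only the assignment step with centroids given by hand (no iterative fitting), and the density part is a simplified DBSCAN-style expansion.

```python
import math


def nearest_centroid(point, centroids):
    """K-Means notion of similarity: index of the closest centroid."""
    return min(range(len(centroids)), key=lambda k: math.dist(point, centroids[k]))


def density_clusters(points, eps, min_pts):
    """DBSCAN-style notion of similarity: points chained through dense neighborhoods.

    Returns one label per point; -1 marks noise.
    """
    def neighbors(i):
        # All points within distance eps of point i (including i itself).
        return [j for j in range(len(points))
                if math.dist(points[i], points[j]) <= eps]

    labels = [None] * len(points)
    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = -1  # provisionally noise; may later become a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = list(seeds)
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # border point reached from a core point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            nbrs = neighbors(j)
            if len(nbrs) >= min_pts:
                queue.extend(nbrs)  # j is a core point, so keep expanding
    return labels


# Hypothetical toy data: two tight groups on a line plus one far-away outlier.
points = [(0, 0), (1, 0), (2, 0), (3, 0), (4, 0),
          (10, 0), (11, 0), (12, 0),
          (20, 20)]

# Assumed, hand-picked centroids (a real K-Means run would learn these).
centroids = [(2, 0), (11, 0)]

kmeans_labels = [nearest_centroid(p, centroids) for p in points]
density_labels = density_clusters(points, eps=1.5, min_pts=2)

print(kmeans_labels)   # [0, 0, 0, 0, 0, 1, 1, 1, 1]
print(density_labels)  # [0, 0, 0, 0, 0, 1, 1, 1, -1]
```

Note how the outlier at `(20, 20)` is handled: the centroid rule must assign it to some cluster, whereas the density rule labels it as noise (`-1`) because no other point lies within `eps` of it.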

About

James Issac

A whole new world. That’s where we’ll be.
