Publishing your first dataset on Kaggle

Getting started as a kaggler

Dorian Lazar
Analytics Vidhya

--

Image by Gerd Altmann on Pixabay

While we want to work on a data science and machine learning problem, it is nice when we find out that a dataset that is suitable for solving our desired problem is already available and ready to use on a platform like Kaggle. It makes our life much easier. Collecting data can be sometimes a difficult and slow process. Data is the new gold. By making our datasets public and by promoting an open source thinking among data science and machine learning practitioners we can accelerate the progress that is done in this field. A good place to do so is Kaggle. It is for data scientists what Github is for software developers. If we happen to have collected an interesting dataset dataset, it is good practice to publish it on Kaggle, so that others can use it too. And by doing so, we can increase our reputation on Kaggle, and this may help us in getting a job in the field; this is another benefit of publishing datasets on Kaggle.

Let us get started.

Now, assuming you already have a dataset that you can publish, the first thing you need to do is to create the dataset entry. From your Kaggle homepage, go to the “Data” tab from the left panel:

--

--

Dorian Lazar
Analytics Vidhya

Passionate about Data Science, AI, Programming & Math | Owner of ∇² https://www.nablasquared.com/