Clothing Dataset: Call for Action

Help me collect a public-domain dataset with images of clothes

Alexey Grigorev
Aug 12, 2020
Clothing Dataset, a public-domain dataset with images (CC0)

Getting a dataset with images is not easy if you want to use it for a course or a book. Yes, there are many datasets with images, but few of them are suitable for commercial or educational use.

To solve this issue, I decided to collect a dataset of clothing images. All the pictures will be shared under the CC0 license, which means that anyone can use this data for any purpose.

For example:

  • Creating a tutorial or a course (free or paid)
  • Writing a book
  • Kaggle competitions (as an external dataset)
  • Training an internal model at any company

I have already collected more than 1,000 pictures, but it’s not easy to do alone. I need your help.

How can I help?

There are many ways you can help.

Spread the word about it. Share it on social media, send it to your colleagues and friends.

  • Use the share buttons at the top
  • Or simply retweet this tweet

Upload your pictures. If you don’t want to go through your entire wardrobe and take a picture of every item, that’s okay. Even one image is helpful. Perhaps there’s a t-shirt, a pair of jeans, or some shoes nearby? Take a picture and upload it using this form. See the next section for details on how to take pictures.

Uploading images with the Airtable form

The form works on mobile too!

Upload many pictures at once. If you have more than a couple of images, uploading them one by one through the previous form is not convenient. There are other options:

  • Google Photos. The app can automatically synchronize all your images. Just move the pictures of clothes to a separate album and share the link.
  • Dropbox, Google Drive, Yandex Disk, or any similar cloud storage. Upload a folder or a zip archive and share the link.
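
If you are comfortable with Python, the zip-archive option can be scripted. The snippet below is only a minimal sketch and not part of the submission process itself: the helper name `zip_images`, the set of image extensions, and the folder and archive names are all assumptions made for illustration.

```python
import zipfile
from pathlib import Path

# Assumption: these are the extensions your camera or phone produces.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def zip_images(folder, archive_path):
    """Pack every image file directly inside `folder` into a zip archive.

    Hypothetical helper; returns the number of files written.
    """
    folder = Path(folder)
    count = 0
    with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for path in sorted(folder.iterdir()):
            if path.is_file() and path.suffix.lower() in IMAGE_EXTENSIONS:
                # Store only the file name so local folder paths stay private.
                archive.write(path, arcname=path.name)
                count += 1
    return count
```

For example, `zip_images("clothes", "clothes.zip")` would pack the pictures from a folder named `clothes` into `clothes.zip`, which you can then upload to any cloud storage and share.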

Once you have a link, use another form to submit it:

Bulk-upload images with another Airtable form

Images

The dataset covers the following categories of clothes:

  • T-shirts
  • Long sleeves, sweaters, hoodies
  • Shirts
  • Jeans, pants, shorts
  • Dresses, skirts
  • Shoes
  • Jackets, coats
  • Hats
  • Clothes for kids

To take a picture, put the item on the floor or a bed:

Pictures of hanging clothes are fine, but make sure the whole item is visible:

The item shouldn’t be crumpled or packed:

The background should contrast with the item enough for it to be clearly seen:

An image should contain only one item:

And there should be no people:

If you’re not sure about something, just share it, and I’ll figure it out.

How can I know when the data is ready?

When I have collected enough pictures, I’ll annotate them and upload the result to Kaggle. If you provide your email when sharing images, I’ll let you know when that happens.

I will also post in other places:

I’d like to collect 10,000 images, and I need your help!

Data Science Insider

Making you a better data scientist. One insight at a time.
