Manage, store and label data for ML — Why data prep is hard Many data scientists and machine learning teams report that they spend about 80% of their time preparing, managing, or curating their datasets. There are three things that have enabled the ML revival over the last 5–10 years: breakthroughs in algorithms, fast and scalable hardware, and large curated datasets. …