TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.


Avoid These Easily Missed Mistakes in Machine Learning Workflows — Part 2

6 min read · Jan 24, 2025

Image by the Author.

Welcome back to this series on easily missed mistakes in machine learning workflows! For those who haven’t read the first part, the series focuses on procedural errors that are not always obvious but can seriously degrade model performance if they slip into the development pipeline.

In the first article, we explored common pitfalls like misusing numerical identifiers, mishandling data splits, and overfitting the model to rare feature values.

In this edition, we’ll continue with errors related to data handling, focusing on the following two topics:

  1. Training with data not available at prediction time
  2. Mixing magic numbers with real numbers

Published in TDS Archive

Written by Thomas A Dorfer

Senior Data Scientist @ BCG. I mainly write about data science and technology.