Data leakage is, together with over/underfitting, one of the main causes of failure of machine learning projects that go into production.

Data leakage is a threat that preys on data scientists regardless of their level of seniority. It can affect anyone, even professionals with years of experience in the sector. Together with over/underfitting, it is one of the main causes of failure of machine learning projects that…