Simon GrahinTowards Data ScienceAutomated Detection of Data Quality IssuesA method for autonomously identifying data errors and calculating the Data Dirtiness Score with minimal human intervention.17 min read·Mar 22, 2024--5--5
Simon GrahinTowards Data ScienceData Dirtiness ScoreNew method to measure tabular dataset quality11 min read·Mar 2, 2024--2--2
Simon GrahinLevel Up CodingSOLID Principles Applied to Data ScienceDiscover how SOLID principles can transform your Data Science projects from quick experiments into reliable, maintainable solutions.11 min read·Feb 20, 2024--1--1
Simon GrahinTowards Data Science6 recommendations for optimizing a Spark jobA guideline of six recommendations that are quickly actionable for optimizing a Spark job.14 min read·Nov 24, 2021--8--8