Simon GrahinTowards Data ScienceAutomated Detection of Data Quality IssuesA method for autonomously identifying data errors and calculating the Data Dirtiness Score with minimal human intervention.Mar 225Mar 225
Simon GrahinTowards Data ScienceData Dirtiness ScoreNew method to measure tabular dataset qualityMar 22Mar 22
Simon GrahinLevel Up CodingSOLID Principles Applied to Data ScienceDiscover how SOLID principles can transform your Data Science projects from quick experiments into reliable, maintainable solutions.Feb 201Feb 201
Simon GrahinTowards Data Science6 recommendations for optimizing a Spark jobA guideline of six recommendations that are quickly actionable for optimizing a Spark job.Nov 24, 20219Nov 24, 20219