PinnedMatt ChapmaninTowards Data ScienceSimplify Your Data Preparation With These 4 Lesser-Known Scikit-Learn ClassesForget train_test_split: Pipeline, ColumnTransformer, FeatureUnion and FunctionTransformer are indispensable even if you use XGBoost or…·10 min read·Jun 1, 2023--1--1
Matt ChapmaninTowards Data ScienceData Scientists Work in the Cloud. Here’s How to Practice This as a Student (Part 1: SQL)Forget local Jupyter Notebooks and bubble-wrapped coding courses – here’s where to practice with real-world cloud platforms. Part 1: SQL·6 min read·1 day ago--1--1
Matt ChapmaninTowards Data ScienceFrom Social Science to Data Science8 years ago I started my bachelor’s degree in Geography. Now I’m a Data Scientist; this is the story of how (and why) I’ve got here·9 min read·May 2, 2024--4--4
Matt ChapmaninLevel Up CodingLearn Mathematical Optimization in Python with Fantasy FootballCompanies like Amazon and Meta want Data Scientists with this skill; here’s a beginner-friendly tutorial with everything you need to know·10 min read·Feb 20, 2024----
Matt ChapmaninTowards Data ScienceRebuilding the Portfolio that Got Me a Data Scientist JobIn 2022, my portfolio helped me get my first DS job. Now I’m tearing it down and starting again from scratch·9 min read·Feb 9, 2024--9--9
Matt ChapmaninTowards Data ScienceWhy (and How) I Learned Web Development as a Data ScientistWeb dev lets you build full-stack ML apps and maximise MLEng/entrepreneurial skills. Oh, and you can do it in Python·6 min read·Jan 27, 2024--5--5
Matt ChapmaninTowards Data ScienceStop Overusing Scikit-Learn and Try OR-Tools InsteadMany Data Scientists overuse ML and neglect Mathematical Optimisation, even though it’s great for your career and easy to learn·9 min read·Jan 26, 2024--6--6
Matt ChapmanWhat should I call my newsletter?“Matt’s Newsletter” is off the table (but only just)3 min read·Dec 24, 2023--1--1
Matt ChapmaninTowards Data ScienceBe Careful When Using “NOT IN” in SQL+ 3 simple solutions to make sure you’re not caught out·5 min read·Dec 15, 2023--2--2
Matt ChapmaninTowards Data ScienceAdd One Line of SQL to Optimise Your BigQuery TablesClustering: A simple way to group similar rows and prevent unnecessary data processing·5 min read·Dec 8, 2023--2--2