Miriam SantosinTowards Data ScienceA Beginner’s Guide to Building High-Quality Datasets for Machine LearningTools and techniques for data cleaning, visualization, augmentation, and synthetic data generationNov 11, 20234Nov 11, 20234
Miriam SantosinTowards Data ScienceMissing Data Demystified: The Absolute Primer for Data ScientistsMissing data, missing mechanisms, and missing data profilingAug 29, 20231Aug 29, 20231
Miriam SantosinTowards Data ScienceUnderstand your Data in Real-Timewith bytewax and ydata-profilingJul 20, 20232Jul 20, 20232
Miriam SantosinTowards Data SciencePandas 2.0: A Game-Changer for Data Scientists?The Top 5 Features for Efficient Data ManipulationJun 27, 202331Jun 27, 202331
Miriam SantosinTowards Data ScienceA Data Scientist’s Essential Guide to Exploratory Data AnalysisBest Practices, Techniques, and Tools to Fully Understand Your DataMay 30, 202317May 30, 202317
Miriam SantosinTowards Data ScienceHow to Generate Real-World Synthetic Data with CTGANExploring the Streamlit App introduced in ydata-syntheticApr 13, 20232Apr 13, 20232
Miriam SantosinTowards Data ScienceAwesome Data Science Tools to Master in 2023: Data Profiling Edition5 Open Source Python Packages for EDA and VisualizationFeb 22, 20235Feb 22, 20235
Miriam SantosinTowards Data ScienceData Quality Issues that Kill Your Machine Learning ModelsNavigating the complexity of imperfect dataJan 19, 20232Jan 19, 20232