Homepage
Open in app
Sign in
Get started
Data Science Collective
Advice, insights, and ideas from the Medium data science community
About
Submission guidelines
Follow
Latest stories
I’m the Founding Editor of Data Science Collective — Here’s What’s Coming
I’m the Founding Editor of Data Science Collective — Here’s What’s Coming
Big News: I’m the Founding Editor of Data Science Collective — the New Data Science Publication on Medium.
Paolo Perrone
Feb 11
Data Science Collective: Submission Guidelines
Data Science Collective: Submission Guidelines
Welcome to Data Science Collective, a community-driven publication dedicated to exploring data science through writing. We’re to highlight…
Data Science Collective Editors
Feb 9
LOESS
LOESS
Smoothing data using local regression
João Paulo Figueira
May 24, 2019
A new community home for data science writing on Medium
A new community home for data science writing on Medium
Join Data Science Collective
Data Science Collective Editors
Feb 10
Latest
5 Data Quality Giants You’re Ignoring (But Will Save Your Career)
5 Data Quality Giants You’re Ignoring (But Will Save Your Career)
Hidden tools that fix dirty data, prevent disasters, and make you look like a genius
Rauf Azam
May 9
Train LLMs to Talk Like You on Social Media, Using Consumer Hardware
Train LLMs to Talk Like You on Social Media, Using Consumer Hardware
Use your own comments on social media to fine-tune an LLM, and run all fine-tuning on (relatively) inexpensive hardware.
Florin Andrei
May 9
How I Built an AI That Thinks Like a Data Analyst
How I Built an AI That Thinks Like a Data Analyst
Whether you’re a new hire, a freelancer, or prepping for an interview, this tool generates the exact questions you should be asking from…
Mukundan Sankar
May 9
What to do when you encounter Sample Ratio Mismatch in A/B Testing
What to do when you encounter Sample Ratio Mismatch in A/B Testing
Learn why checking for SRM is crucial, explore common reasons it occurs, and learn how to detect, diagnose, and address it.
Allon Korem | CEO, Bell Statistics
May 9
When one bad apple spoils the barrel: Tackling outliers in A/B Testing
When one bad apple spoils the barrel: Tackling outliers in A/B Testing
Learn why outliers can be problematic, explore the challenges of identifying and handling them, and get practical guidelines.
Allon Korem | CEO, Bell Statistics
May 9
Finding The Best Path To Successful Data Storytelling With Python
Finding The Best Path To Successful Data Storytelling With Python
Exploring global crime data using Nathan Yau’s visual framework
John Loewen, PhD
May 9
Data-Centric Architectural Thinking in AI-assisted Solutions
Data-Centric Architectural Thinking in AI-assisted Solutions
The integrative modeling of enterprise data beyond data science and AI capabilities
Sean Gu
May 9
Current approaches to LLM safety alignment remain largely superficial.
Current approaches to LLM safety alignment remain largely superficial.
“The more I know, the more I realize how much I don’t know.” — Albert Einstein
Nehdiii
May 9
From Social Networks to Biotech: How CDNMF Solves Real-World Community Detection
From Social Networks to Biotech: How CDNMF Solves Real-World Community Detection
Contrastive Deep Nonnegative Matrix Factorization for Community Detection
Yuecheng Li
May 9
Anti-fallacy protocol: The checklist that reveals whether your project identifies causal…
Anti-fallacy protocol: The checklist that reveals whether your project identifies causal…
Making data-driven decisions has become a mantra. However, there's an important difference between correlation and causation that many…
Robson Tigre
May 9
How to Be Taken Seriously
How to Be Taken Seriously
4 common mistakes of junior data scientists and how to fix them
Tessa Xie
May 9
Building a Fake News Detection System with TensorFlow and LSTM
Building a Fake News Detection System with TensorFlow and LSTM
Fake news is a type of misinformation that can mislead readers, influence public opinion, and even damage reputations. Detecting fake news…
Devang Vashistha
May 9
Exploratory Data Analysis: Radiation Monitoring with Python and Geiger Counter
Exploratory Data Analysis: Radiation Monitoring with Python and Geiger Counter
Collecting and Processing of the Geiger–Müller Tube Data
Dmitrii Eliuseev
May 9
Bridging Probability and Physics: How Sampling and Optimization Connect
Bridging Probability and Physics: How Sampling and Optimization Connect
Have you ever realized that Simulated Annealing is just a special case of the Metropolis algorithm? This article will explore a…
Herman Mostein
May 9
About Data Science Collective
Latest Stories
Archive
About Medium
Terms
Privacy
Teams