Data Science Collective
Advice, insights, and ideas from the Medium data science community
Latest stories
I’m the Founding Editor of Data Science Collective — Here’s What’s Coming
Big News: I’m the Founding Editor of Data Science Collective — the New Data Science Publication on Medium.
Paolo Perrone
Feb 11
Data Science Collective: Submission Guidelines
Welcome to Data Science Collective, a community-driven publication dedicated to exploring data science through writing. We’re here to highlight…
Data Science Collective Editors
Feb 9
LOESS
Smoothing data using local regression
João Paulo Figueira
May 24, 2019
A new community home for data science writing on Medium
Join Data Science Collective
Data Science Collective Editors
Feb 10
Latest
Enabling Synthetic data — A what, why and how guide in Python: Part 1
This is a two-part article series. Part one starts with a primer on synthetic data. It is essential to…
Prateek Bhatnagar
Mar 25
Ingest Data with Dataflows Gen2 in Microsoft Fabric
Master data ingestion with Dataflows Gen2 in Microsoft Fabric and streamline ETL processes with a low-code, scalable approach.
Murat Girgin
Mar 25
How to earn $1 million with AWS in one year
Slash your AWS cloud costs by 90%! Learn 4 steps to optimize spending: challenge assumptions, tune resources, use Graviton instances, and…
Gianpi Colonna
Mar 25
One Formula to Rule Them All: The Unified Approach to Aggregation in Machine Learning
How to combine the arithmetic mean, median, minimum, maximum, and many more in one single function
Dr. Robert Kübler
Mar 25
NN#17 — Video Understanding with CNNs
Neural Networks Decoded: From Frames to Action Recognition
Rakib.ai
Mar 25
ArcticDB vs. Pandas: Scaling to Production-Size Datasets Without Overloading Your RAM
Python has grown to dominate data science, and its package Pandas has become the go-to tool for data analysis. It is great for tabular data…
Ari Joury, PhD
Mar 25
Discover Hidden Relationships in Your Data with Latent Variable Modeling
Capture dependencies…
Have you ever noticed that some datasets just don’t behave as expected? You tweak a model, adjust variables, and still, you’re just not…
Ari Joury, PhD
Mar 25
Benchmarking Our Path to AGI: Measuring AI Progress in 2025
What is the state of play with AI in early 2025? Are we in an S-curve of diminishing returns, or actually at the early stages of an…
Aki Ranin
Mar 24
Structural Distillation for Cross-Dataset Uplift Modeling with Reinforcement Learning
A Novel Approach to Transfer Partial Teacher Model Knowledge from Control to Treatment Data for Rapid AB Testing and Campaign Optimization
Shenggang Li
Mar 24
What I Learned from 3 Years Working with Chinese Tech Teams
Busting 3 Myths and Confirming 1
Jose Parreño
Mar 24
How Qubits Are Rewriting the Rules of Computation
From Classical Certainty to Quantum Possibility: Exploring the Science, Math, and Magic Behind the Future of Computing
Cristian Leo
Mar 24
CLIP: The Multimodal Powerhouse Transforming Computer Vision
When I first encountered CLIP, its performance impressed me so much that whenever I start a new project, I take CLIP’s performance as a…
Yağmur Çiğdem Aktaş
Mar 24
SmolDocling: A New Era in Document Processing — OCR
A model that outperforms competitors 27 times its size with the DocTags format
Buse Şenol
Mar 24
Space Travel for Language Models: How SuperBPE Revolutionizes Tokenization
A new tokenization approach challenges everything we thought we knew about language processing
MKWriteshere
Mar 24