Homepage
Open in app
Sign in
Get started
Data
AI & ML
Mobile
Web
Infrastructure
Open Source
People
Careers
Data Science & Data Platform Engineering
Mussel — Airbnb’s Key-Value Store for Derived Data
Mussel — Airbnb’s Key-Value Store for Derived Data
How Airbnb built a persistent, high availability and low latency key-value storage engine for accessing derived data from offline and…
Shouyan guo
Oct 10, 2022
Upgrading Data Warehouse Infrastructure at Airbnb
Upgrading Data Warehouse Infrastructure at Airbnb
This blog aims to introduce Airbnb’s experience upgrading Data Warehouse infrastructure to Spark and Iceberg
Ronnie Zhu
Sep 26, 2022
Unified Payments Data Read at Airbnb
Unified Payments Data Read at Airbnb
How we redesigned payments data read flow to optimize client integrations, while achieving up to 150x performance gains.
Alican GÖKSEL
Jun 9, 2022
My Journey to Airbnb — Kamini Dandapani
My Journey to Airbnb — Kamini Dandapani
Airbnb’s VP of Engineering on why you don’t have to change your natural self to be a leader
AirbnbEng
May 11, 2022
Measuring Latency Overhead with Own Time
Measuring Latency Overhead with Own Time
by: Jimmy O’Neill
Jimmy O’Neill
Mar 21, 2022
Automating Data Protection at Scale, Part 3
Automating Data Protection at Scale, Part 3
Part three of a series on how we provide powerful, automated, and scalable data privacy and security engineering capabilities at Airbnb
elizabeth nammour
Dec 16, 2021
Automating Data Protection at Scale, Part 2
Automating Data Protection at Scale, Part 2
Part two of a series on how we provide powerful, automated, and scalable data privacy and security engineering capabilities at Airbnb
elizabeth nammour
Oct 19, 2021
Migrating Kafka transparently between Zookeeper clusters
Migrating Kafka transparently between Zookeeper clusters
Learn more about how to migrate your Kafka cluster from one Zookeeper cluster to another without any user impact.
Edmund Mok
Oct 12, 2021
The Airflow Smart Sensor Service
The Airflow Smart Sensor Service
Consolidating long-running, lightweight tasks for improved resource utilization
Yingbo Wang
Sep 28, 2021
How Airbnb Enables Consistent Data Consumption at Scale
How Airbnb Enables Consistent Data Consumption at Scale
Part-III: Building a coherent consumption experience
Shao Xie
Sep 21, 2021
Automating Data Protection at Scale, Part 1
Automating Data Protection at Scale, Part 1
Part one of a series on how we provide powerful, automated, and scalable data privacy and security engineering capabilities at Airbnb.
elizabeth nammour
Sep 14, 2021
How Airbnb Built “Wall” to prevent data bugs
How Airbnb Built “Wall” to prevent data bugs
Gaining trust in data with extensive data quality, accuracy and anomaly checks
Subrata Biswas
Aug 4, 2021
How Airbnb Measures Future Value to Standardize Tradeoffs
How Airbnb Measures Future Value to Standardize Tradeoffs
The propensity score matching model powering how we optimize for long-term decision-making
Jenny Chen
Jul 13, 2021
How Airbnb Standardized Metric Computation at Scale
How Airbnb Standardized Metric Computation at Scale
Part II: The six design principles of Minerva compute infrastructure
Amit Pahwa
Jun 1, 2021
How does Airbnb track and measure growth marketing?
How does Airbnb track and measure growth marketing?
How Airbnb built a unified tracking measurement system to support marketing campaigns
Jing Guo
May 4, 2021
How Airbnb Achieved Metric Consistency at Scale
How Airbnb Achieved Metric Consistency at Scale
Part-I: Introducing Minerva — Airbnb’s Metric Platform
Robert Chang
Apr 30, 2021
Achieving Insights and Savings with Cost Data
Achieving Insights and Savings with Cost Data
The path to cloud efficiency begins with a cost data foundation
Anna Matlin
Apr 13, 2021
Visualizing Data Timeliness at Airbnb
Visualizing Data Timeliness at Airbnb
by Chris Williams, Ken Chen, Krist Wongsuphasawat, and Sylvia Tomiyama
Chris C Williams
Feb 23, 2021
Supercharging Apache Superset
Supercharging Apache Superset
How Airbnb customized Superset for business intelligence at scale
Erik Ritter
Feb 9, 2021
Designing Experimentation Guardrails
Designing Experimentation Guardrails
Introducing the Airbnb Experiment Guardrails framework, which helps us prevent negative impact on key metrics while experimenting at scale.
Tatiana Xifara
Jan 27, 2021
Data Quality at Airbnb
Data Quality at Airbnb
Part 2 — A New Gold Standard
Vaughn Quoss
Nov 24, 2020
Data Quality at Airbnb
Data Quality at Airbnb
Part 1 — Rebuilding at Scale
Jonathan Parks
Nov 3, 2020
Project Lighthouse — Part 1: P-sensitive k-anonymity
Project Lighthouse — Part 1: P-sensitive k-anonymity
Part one of a series on how we will measure discrepancies in Airbnb guest acceptance rates using anonymized perceived demographic data.
Skyler Wharton
Sep 1, 2020
On Spark, Hive, and Small Files: An In-Depth Look at Spark Partitioning Strategies
On Spark, Hive, and Small Files: An In-Depth Look at Spark Partitioning Strategies
One of the most common ways to store results from a Spark job is by writing the results to a Hive table stored on HDFS. While in theory…
Zachary Ennenga
Mar 3, 2020
Scaling a Mature Data Pipeline — Managing Overhead
Scaling a Mature Data Pipeline — Managing Overhead
There is often a hidden performance cost tied to the complexity of data pipelines — Overhead. In this post we will examine the concept of…
Zachary Ennenga
Sep 24, 2019
About The Airbnb Tech Blog
Latest Stories
Archive
About Medium
Terms
Privacy