Homepage
Open in app
Sign in
Get started
Towards Data Engineering
Navigating the Path to Data Engineering Excellence
About
Follow
Trending
A Python Library every Data Engineer should know
A Python Library every Data Engineer should know
As a data engineer in a large company, ensuring data quality is a key responsibility. Even if you perform your tasks diligently and rarely…
Robin von Malottki
Sep 25
Parquet is Good for OLAP but Not for OLTP Use Cases. But Why?
Parquet is Good for OLAP but Not for OLTP Use Cases. But Why?
Many engineers and data scientists praise Parquet for its efficient compression and fast query performance. While it’s a highly valued…
Ritam Mukherjee
Sep 28
Inside a Netflix Data Engineering Interview.
Inside a Netflix Data Engineering Interview.
A Real time Media — OTT Domain Use Case Question and How to Solve It Together.
Brahma, The Data Engineer.
Sep 18
Latest
Essential Spark Time Functions for Real-Time Data: What Every Data Engineer Should Know
Essential Spark Time Functions for Real-Time Data: What Every Data Engineer Should Know
Explore key Spark time functions that transform your real-time data workflows and enhance your data engineering skills
Pritam Deb
Oct 16
100% chances you will be asked this question in your SQL-interviews
100% chances you will be asked this question in your SQL-interviews
What is the 𝐒𝐐𝐋 𝐎𝐫𝐝𝐞𝐫 𝐨𝐟 𝐎𝐩𝐞𝐫𝐚𝐭𝐢𝐨𝐧𝐬 ?
B V Sarath Chandra
Oct 16
Getting Started with Apache Spark: A Beginner’s Guide to Big Data Processing
Getting Started with Apache Spark: A Beginner’s Guide to Big Data Processing
Learn How Spark Transforms Data Engineering with Lightning-Fast Speed and Scalability
Satyam Sahu
Oct 14
IPL Data Analysis: ETL with Spark
IPL Data Analysis: ETL with Spark
Samhitha Poreddy
Oct 14
Data Engineering Interview Question: Implementing Schema Enforcement for Data Quality in…
Data Engineering Interview Question: Implementing Schema Enforcement for Data Quality in…
Schema Enforcement: It is all about Ensuring Clean and Consistent Data.
Brahma, The Data Engineer.
Oct 13
Many Orgs are moving from Cassandra to ScyllaDB. But why ?
Many Orgs are moving from Cassandra to ScyllaDB. But why ?
In recent years, companies have been making a quiet but significant move: switching from Apache Cassandra to ScyllaDB. If you’re wondering…
Ritam Mukherjee
Oct 11
How to prevent AI from taking your data jobs?
How to prevent AI from taking your data jobs?
Leverage your degree and uniqueness to showcase your value!
Harris Wan
Sep 16
Mastering Apache Spark: Key Concepts and Practical Tips
Mastering Apache Spark: Key Concepts and Practical Tips
Apache Spark is an open-source, distributed computing system designed for fast and general-purpose big data processing. Developed at UC…
Arpita Mishra
Jun 15
Advanced Data Engineering Interview Questions-Part 5
Advanced Data Engineering Interview Questions-Part 5
Welcome to Part 5 of our Advanced Data Engineering Interview Questions series.
Arpita Mishra
Oct 10
Why Engaging with Upstream Stakeholders Matters in Data Engineering
Why Engaging with Upstream Stakeholders Matters in Data Engineering
How often have unexpected job failures or data quality issues caught you off guard because of unplanned changes from upstream sources? Such…
Samhitha Poreddy
Oct 6
About Towards Data Engineering
Latest Stories
Archive
About Medium
Terms
Privacy
Teams