Avin KohaleSpark-Beyond Basics: Cross-joins in sparkCross joins and its mysteries in Apache Spark4 min read·15 hours ago--
Vu TrinhinData Engineer ThingsDo We Need the Lakehouse Architecture?When data lakes and data warehouses are not enough.10 min read·6 days ago--8
Analytics at MetaData engineering at Meta: High-Level Overview of the internal tech stackThis article provides an overview of the internal tech stack that we use on a daily basis as data engineers at Meta. The idea is to shed…12 min read·Oct 10, 2023--26--26
Numbers around usinNumbers around usSuper Saiyan Data Skills: Mastering Big Data with RHarnessing the power of big data is akin to mastering an incredible energy source. In the realm of data science, R serves as both a…8 min read·1 day ago----
Joseph George LewisinTowards AIExplainable AI: Thinking like a machineAn explainer for XAI, AI UX and other trends and methods in building interpretable and trustworthy AI in projects and enterprise.12 min read·Mar 18, 2024--1--1
Avin KohaleSpark-Beyond Basics: Cross-joins in sparkCross joins and its mysteries in Apache Spark4 min read·15 hours ago--
Vu TrinhinData Engineer ThingsDo We Need the Lakehouse Architecture?When data lakes and data warehouses are not enough.10 min read·6 days ago--8
Analytics at MetaData engineering at Meta: High-Level Overview of the internal tech stackThis article provides an overview of the internal tech stack that we use on a daily basis as data engineers at Meta. The idea is to shed…12 min read·Oct 10, 2023--26
Numbers around usinNumbers around usSuper Saiyan Data Skills: Mastering Big Data with RHarnessing the power of big data is akin to mastering an incredible energy source. In the realm of data science, R serves as both a…8 min read·1 day ago--
Joseph George LewisinTowards AIExplainable AI: Thinking like a machineAn explainer for XAI, AI UX and other trends and methods in building interpretable and trustworthy AI in projects and enterprise.12 min read·Mar 18, 2024--1
Jagadesh JamjalaTricky Scenario Question for Senior Data EngineerScenario: You are the Senior data engineer responsible for the company’s sales data pipeline running on AWS. The pipeline extracts data…·4 min read·Mar 7, 2024--1
Russell JurneyinGraphlet AI BlogA brief history of Agile Data ScienceI noticed today that Agile Data Science 2.0 — the second, 2017 edition of my book from 2013 — is still 4.1 stars on Amazon and a hard copy…6 min read·1 day ago--
Vyacheslav EfimovinTowards Data ScienceSystem Design: Consistent HashingUnlocking the power of efficient data partitioning in distributed databases like Cassandra and Dynamo DB.7 min read·Mar 13, 2024--