Sarah LeainTowards Data ScienceSQL and Data Modelling in Action: A Deep Dive into Data LakehousesAnyone working with business intelligence, data science, data analysis, or cloud computing will have come across SQL at some point. We can…Oct 21Oct 21
caleb leeinTowards Data ScienceDataflow Architecture — Derived Data Views and Eventual Consistencya (not-so) brief history of a health & fitness data pipeline: part iiOct 151Oct 151
Robin von MalottkiinTowards Data ScienceEfficient Testing of ETL Pipelines with PythonHow to Instantly Detect Data Quality Issues and Identify their CausesOct 31Oct 31
Betsy VargheseinCognizant ServianA macro-ful way to test incremental models and snapshots in dbtand how to go about itJul 15, 20211Jul 15, 20211
TurkelBuilding Efficient Incremental Models in dbt with Reusable MacrosManaging historical data while keeping your data warehouse efficient and responsive can be challenging. This is where dbt (Data Build Tool)…May 8May 8
Pavan EmaniFrom Zero to Data Engineer: What I’d Do Differently TodayStrategies for Fast-Tracking Your Data Engineering CareerSep 4Sep 4
Mary ArainTowards Data ScienceAutomating ETL to SFTP Server Using Python and SQLLearn how to automate a daily data transfer process on Windows, from PostgreSQL database to a remote serverAug 246Aug 246
KudosWallinThe Resume Whisperer7 Data Engineer Resume Headlines That Speak VolumesFirst impressions matter. And for data engineers, that first impression often starts with your resume headline. It’s that prime piece of…Jul 91Jul 91
Maxime BeaucheminFunctional Data Engineering — a modern paradigm for batch data processingBatch data processing — historically known as ETL — is extremely challenging. It’s time-consuming, brittle, and often unrewarding. Not only…Jan 8, 201826Jan 8, 201826
Chad IsenberginTowards Data ScienceThe Semantics of Differing SCD2 TechniquesHow small differences can have a big impactNov 16, 2023Nov 16, 2023
Alexandre Magno Lima MartinsinApache AirflowHow we orchestrate 2000+ DBT models in Apache AirflowIn recent years, DBT (Data Build Tool) has established itself as the go-to data transformation workflow, connecting to a variety of…May 2614May 2614
Stanley UdegbunaminWriting Solopreneur11 Stupid Simple Passive Income Ideas for ProgrammersActionable Steps for Entrepreneurial DevelopersMar 1316Mar 1316
SamueldavidwinterOptimising Query Performance — In Azure Synapse AnalyticsSynapse Analytics is A Massively Parallel Processing (MPP) engine built for loading and querying large datasetsMar 25, 2022Mar 25, 2022
Denzel S. WilliamsinThe Data Driven DiariesUnderstanding the Semantic LayerLeveraging the Syntax of dbt Labs’s MetricFlowSep 6, 2023Sep 6, 2023
Oliver MolanderinBetter ProgrammingDuckDB — What’s the Hype About?This was a blog post that I already planned to write during the spring when I saw that the hype around DuckDB started taking new heights…Dec 29, 202213Dec 29, 202213
Saikat DuttainTowards Data ScienceWhen Do You Self Join? A Handy TrickIntermediate SQL for ETL dev to Data Engineer TransitionMar 173Mar 173
Dr. Derek Austin 🥳inBetter ProgrammingWhy I Prefer Regular Merge Commits Over Squash CommitsI used to think squash commits were so cool, and then I had to use them all day, every day. Here’s why you should avoid squashSep 30, 202275Sep 30, 202275