How to Build a Medallion Architecture Pipeline on Snowflake with dbt: Step-by-Step Guide
ELT best practices, staging with dbt models, and loading API data into Snowflake using Python.
Oct 22
Real-time Data Pipeline: a DIY project
I spent 3 days building a real-time data pipeline. Here’s everything that went wrong (and how I fixed it).
Oct 16
DataTalk: APIs, HTTP, Lambda and Folium
Over the past few months, I’ve done a project (actually, I did it before, but that’s another story) using FogoCruzado data.
Oct 7
Evaluating ML models
Today, I completed the first of 25 Data Science projects from this blog. The original goal was to build a Machine Learning model to predict…
Sep 17
I told myself that I should run something on Spark this Sunday, and I did.
I wanted to create a project to reinforce my learning, so I decided to try out Apache Spark. That’s when I discovered I could simply run it…
Sep 16
How to Not Do an ML Project
Mistakes are underrated teachers. When I tried building a credit card fraud detection model with a Kaggle dataset, I thought I was doing…
Sep 12
Airflow: what do you need to know to run a project?
This isn’t my first project with Airflow, but I think it’s the most detailed one. Let’s break it down.
Sep 4
How to: data pipelines (ETL-O)
Well, this is awkward: a cloud undersizing issue forced me to learn about it. So, let me tell you how I approached the problem, and maybe…
Aug 26
GCP: First Steps
Well, this is my first article here. I decided to document some things I learned today. I’m new to some tools in data engineering, so I…
Aug 22