Henrique SilveiraWhat is Data Observability?Observability is no longer just for software engineering. With the rise of data downtime and the increasing complexity of the data stack…Jul 2, 2022Jul 2, 2022
Henrique SilveiraData lake with Pyspark through Dataproc GCP using AirflowIn this post, I will try my best to tell the steps on how to build a data lake with Pyspark through dataproc GCP using airflow.Jun 27, 2022Jun 27, 2022
Henrique SilveiraInstalling Apache Nifi on Ubuntu!Would you like to learn how to do an Apache Nifi installation on Ubuntu Linux? In this tutorial, we are going to show you how to download…Jun 21, 2022Jun 21, 2022
Henrique SilveiraHow to deploy a Zookeeper and Kafka cluster in Google Cloud PlatformOne of the great advantages of Google Cloud Platform is how easy and fast it is to run experiments. For example, you can easily spin up a…Jun 21, 2022Jun 21, 2022