Thiago Cordon in Data Arena · Databricks Certified Associate Developer for Apache Spark — tips to get prepared for the exam · Get prepared for the exam with these tips and conquer the certification. · 8 min read · May 31, 2021
Thiago Cordon in Data Arena · Escreva para o Data Arena (Portuguese version of "Write for Data Arena") · Share ideas, contribute to the data community and expand your network. · 7 min read · Mar 13, 2021
Thiago Cordon in Data Arena · Write for Data Arena · Share ideas, contribute to the data community and expand your network. · 7 min read · Mar 13, 2021
Thiago Cordon in Data Arena · Evolving Schemas with Schema Registry · This article explores Schema Registry compatibility modes and how to evolve schemas according to them. · 12 min read · Mar 6, 2021
Thiago Cordon in Data Arena · Merging different schemas in Apache Spark · This article explores an approach to merge different schemas using Apache Spark. · 6 min read · Dec 21, 2020
Thiago Cordon in Data Arena · Enabling streaming data with Spark Structured Streaming and Kafka · A comprehensive example on how to integrate Spark Structured Streaming with Kafka to create a streaming data visualization. · 6 min read · Oct 11, 2020
Thiago Cordon in Data Arena · Building a Spark and Airflow development environment with Docker · A brief guide on how to set up a development environment with Spark, Airflow and Jupyter Notebook. · 6 min read · May 1, 2020
Thiago Cordon in Data Arena · Using Machine Learning to classify hard bounce e-mails — Part 2 · The objective of this article series is to identify hard bounce e-mails using machine learning techniques. The part 1 article was about… · 5 min read · Dec 23, 2019
Thiago Cordon in Data Arena · Using Machine Learning to classify hard bounce e-mails — Part 1 · In this first article you will see Feature Engineering and Exploratory Analysis for a hard bounce e-mails classification problem. · 4 min read · Dec 1, 2019