Vitor TeixeirainTowards Data ScienceDelta Lake — Type wideningWhat is type widening and why does it matter?·5 min read·Apr 29, 2024----
Vitor TeixeirainTowards Data ScienceDelta Lake — Partitioning, Z-Order and Liquid ClusteringHow are different partitioning/clustering methods implemented in Delta? How do they work in practice?·10 min read·Nov 8, 2023--4--4
Vitor TeixeirainTowards Data ScienceDelta Lake — Deletion VectorsHow are deletion vectors related to DML commands and how can they improve write performance?·9 min read·May 25, 2023--1--1
Vitor TeixeirainTowards Data ScienceDelta Lake — Automatic Schema EvolutionWhat happens and what you can/can’t do when merging evolutive DataFrames·5 min read·Mar 10, 2023----
Vitor TeixeirainTowards Data ScienceDelta Lake— Keeping it fast and cleanEver wondered how to improve your Delta tables’ performance? Hands-on on how to keep Delta tables fast and clean.·11 min read·Feb 15, 2023--5--5
Vitor TeixeirainTowards Data ScienceCustom Kafka Streaming metrics using Apache Spark Prometheus SinkA detailed tutorial on how to create and expose custom Kafka Consumer metrics in Apache Spark’s PrometheusServlet·6 min read·Feb 2, 2023--1--1