PinnedAlexander VolokinPlumbers Of Data Science5 Approaches to ad hoc Dataframes in PySparkVarious ways to create small Spark dataframes in place without external dependenciesJul 30, 20231Jul 30, 20231
PinnedAlexander VolokinPlumbers Of Data ScienceScripting PySpark DataframesReproducing and transporting dataframes by generating plain Python scriptsJun 12, 2023Jun 12, 2023
PinnedAlexander VolokinBetter ProgrammingDelta-RS and DuckDB — Read and Write Delta Without SparkAn alternative toolset to Spark for low-latency operations with small to medium-size datasetsApr 26, 20232Apr 26, 20232
Alexander VolokinPlumbers Of Data ScienceDelta Properties and Check Constraints at ScaleManaging Delta Table Properties and Check Constraints using PythonMay 26, 20231May 26, 20231
Alexander VolokinPlumbers Of Data ScienceTowards Databricks Certified Data Engineer ProfessionalA compilation of observations and tips to achieve this credentialMay 12, 20231May 12, 20231
Alexander VolokDatabricks Observability: Processing Collected Ganglia MetricsDatabricks Observability SeriesApr 16, 20232Apr 16, 20232
Alexander VolokDatabricks Observability: Collecting Ganglia MetricsDatabricks Observability SeriesApr 4, 20232Apr 4, 20232