Published inConveyor DataAre your AKS logging costs too high? Here’s how to reduce themExplore how to reduce the cost of logging on Azure by analyzing your applications and investigating Basics log analytics tables in Azure.Mar 10Mar 10
What a month of SQL challenges taught meWhile participating in Advent of SQL, I learned that practice really makes perfect, issues with time, ChatGPT outperforms me and more…Dec 30, 2024Dec 30, 2024
Published inDatamindedRunning thousands of Spark applications without losing your coolI explain how to troubleshoot and detect problematic Spark applications at scale as well as show how this can be used to reduce your costs.Dec 12, 2024Dec 12, 2024
Published inDatamindedThe building blocks of successful Data TeamsBased on my experience I will elaborate on key criteria for building successful data teamsMay 3, 20244May 3, 20244
Published inDatamindedYou can use a supercomputer to send an email but should you?Discover the next evolution in data processing with DuckDB and PolarsMar 12, 20241Mar 12, 20241
Published inDatamindedMy key takeaways for building a data engineering platformHaving been a member of a product team for two years, I aim to share three valuable insights that I have gained.Feb 15, 20242Feb 15, 20242
Published inDatamindedQuacking Queries in the Azure Cloud with DuckDBThis post describes 2 Duckdb extensions that enable you to read data from Azure blob storage. It also shows code for both Python and dbt.Jan 10, 20241Jan 10, 20241
Published inDatamindedHow we reduced our docker build times by 40%This post describes two ways to speed up building your Docker images: caching build info remotely, using the link option when copying filesOct 4, 202317Oct 4, 202317
Published inDatamindedHead-to-head comparison of dbt SQL enginesCompare usage and performance of dbt against 3 popular open-source SQL engines, namely: Spark, Trino and DuckdbSep 8, 20235Sep 8, 20235
Published inDatamindedUse dbt and Duckdb instead of Spark in data pipelinesDbt has become very popular for transformation on top of your data warehouse. We see potential to use dbt with Duckdb on top of a data…Apr 12, 202316Apr 12, 202316