Daniel Palma

190 Followers

Pinned

High-performance open-source Data Lakehouse at home

Ever wanted to deploy your own Data Lake and, on top of it, a so-called Lakehouse architecture? The good news is that it’s now easier than ever with tools like MinIO, Trino (with its multitude of connectors), and others. In this article we’ll cover how these components actually fit together…

Data Lake

5 min read



Pinned

Analyzing large files locally in seconds with DuckDB and DataGrip

If you have ever received a huge CSV file that you had to analyze, or just quickly wanted to peek into to check its structure, your go-to tool is usually pandas and a small Python script. But if you are like me and always have DataGrip (or any other JDBC-compatible…

Duckdb

3 min read



Pinned

Implementing Data Contracts

Data contracts help you document and enforce the shape and metadata of your records through data pipelines and processing systems. Their main goal is reducing surprises and getting rid of undocumented changes. For example, if data producers and data consumers agree that the data interchanged between them has a specific…

Data

6 min read



Pinned

Breathing life into Streamlit with Materialize & Redpanda

At work I had the opportunity to get familiar with Streamlit for a project, so I got motivated to create a side project with it over the weekend. After a bit of research, I came to the conclusion that there are not enough examples of building live dashboards with…

Streamlit

6 min read



Pinned

Visualizing Twitter trends live: Redpanda, Materialize and dbt

Experimenting with new data tools is always fun. Today we’ll take a look at how we can leverage Materialize to build a live map of trending Twitter topics. We’ll be using Redpanda as a fully Kafka-compatible storage layer for our data, from where we’ll build the live and breathing materialized…

Kafka

6 min read



Nov 3, 2022

The Developer Experience formula

It took a long time to shift the mindset of the software industry to be user-first. The rise of a product-focused mentality and product management best practices helped a great deal in pushing this forward. But when the users are developers, all these decades of established processes are suddenly thrown…

Developer Experience

5 min read



Oct 24, 2022

Thinking in events

This is one of those articles that I wrote to solidify my understanding of something. It’s not meant to be educational (except for myself, hehe), but I find it helps a lot to write things down, and it’s a big plus if someone else finds it useful. Like most people…

4 min read



Sep 30, 2022

YubiKey OTP generation from the command line

One less reason to leave your favorite place, the terminal!

Install

macOS prerequisite:
brew install swig

Install ykman, the official YubiKey manager CLI application, with pipx:
pipx install yubikey-manager

Usage

To list all available accounts:
ykman oath accounts list

Generating an access code for an account that’s called danthelion:daniel@arecadata.com:
ykman oath accounts…

Command Line

1 min read



Sep 27, 2022

An introduction to Redpanda: Creating a chat application in Python in less than 100 lines of code.

One of the biggest challenges software engineers — especially beginners — face is learning how to use a new piece of technology. This is to a large extent true for streaming data platforms, as they are considered advanced by nature. In this tutorial, we will create a simple terminal-based chat…

Kafka

11 min read



Jun 27, 2022

Temporal analysis of Wikipedia changes with Redpanda & Materialize & dbt

The source

I have recently discovered that Wikimedia (the organization behind Wikipedia) has a publicly available streaming API for all MediaWiki sites (Wikidata, Wikipedia, etc.), called EventStreams, which uses the Server-Sent Events protocol and is backed by Kafka internally. EventStreams provides access to several different data streams, most notably the recentchange…

Streaming

5 min read



Wait, it’s all data? 🔫

