Druid parquet extension on Array/List type

As we rolled out and stabilized our Realtime Flink Parquet Data Warehouse, we are considering ingest parquet data into druid directly. We follow the guideline here, everything seems working well in the beginning. When our QA team runs integration test on…


Speed up Kafka queries on Presto

After I added protobuf and avro decoders into Presto, right now I can query my Kafka cluster through Presto. It saved me lots of time debugging data issues in my data pipelines. Basically If I didn’t see data in Kafka, I do not need to debug my downstream data pipelines.

Hadoop Noob
Hadoop Noob
Elephant trainers
More information
Followers
161