George ZefkilisinData Engineer ThingsBuilding a Local Data Lake from scratch with MinIO, Iceberg, Spark, StarRocks, Mage, and DockerHello again, fellow technology enthusiasts! I am a software/data engineer who transitioned from data science. The learning curve in this…Jul 136
Vishal BarvaliyaDelta Lake 4.0: A Simple GuideDelta Lake is a popular tool for managing and processing large amounts of data, making sure it is organized and reliable. The new release…Jul 24Jul 24
KeerthipriyaninWalmart Global Tech BlogAchieve million-dollar savings with unified code and configuration-driven data pipelinesCoauthored by Guru Prakash and Chirag Goel1d ago1d ago
Vu TrinhinData Engineer ThingsDo We Need the Lakehouse Architecture?When data lakes and data warehouses are not enough.Apr 2013Apr 2013
George ZefkilisinData Engineer ThingsBuilding a Local Data Lake from scratch with MinIO, Iceberg, Spark, StarRocks, Mage, and DockerHello again, fellow technology enthusiasts! I am a software/data engineer who transitioned from data science. The learning curve in this…Jul 136
Vishal BarvaliyaDelta Lake 4.0: A Simple GuideDelta Lake is a popular tool for managing and processing large amounts of data, making sure it is organized and reliable. The new release…Jul 24
KeerthipriyaninWalmart Global Tech BlogAchieve million-dollar savings with unified code and configuration-driven data pipelinesCoauthored by Guru Prakash and Chirag Goel1d ago
Vu TrinhinData Engineer ThingsDo We Need the Lakehouse Architecture?When data lakes and data warehouses are not enough.Apr 2013
balaji balinSTREAM-ZEROComparing Trino, ClickHouse, and Apache Doris: Architectures, Use Cases, and PerformanceDatalakes and Data Platforms are going through another cycle of change and evolution. In recent years I have been implementing solutions…Apr 29
Madhav PandeHarnessing the Power of Machine Learning and AI: Why These Technologies Are Vital in Today’s…Introduction:1d ago
Ciro GrecoinTowards Data ScienceWrite-Audit-Publish for Data Lakes in Pure Python (no JVM)An open source implementation of WAP using Apache Iceberg, Lambdas, and Project Nessie all running entirely PythonApr 122