Siddharth TeotiaAggregation Operations in Distributed SQL Query EnginesIn this post, we will discuss how distributed query engines process SQL GROUP BY aggregation queries. Distributed aggregation is a core…21h ago
💡Mike ShakhomirovinTowards Data ScienceThe Top 10 Data Lifecycle Problems that Data Engineering SolvesClear strategies for addressing key pain pointsAug 22
Vu TrinhinData Engineer ThingsI spent 8 hours learning Parquet. Here’s what I discoveredI finally sat down and learned about it.Aug 2412Aug 2412
Vishal BarvaliyaSQL topics & subtopics for Data Analyst roleComprehensive Guide to SQL for Data Analysts: From Fundamentals to Advanced Techniques1d ago1d ago
Vu TrinhinData Engineer ThingsApache Kafka — OverviewThe terminology and the architecture.Jul 66Jul 66
Siddharth TeotiaAggregation Operations in Distributed SQL Query EnginesIn this post, we will discuss how distributed query engines process SQL GROUP BY aggregation queries. Distributed aggregation is a core…21h ago
💡Mike ShakhomirovinTowards Data ScienceThe Top 10 Data Lifecycle Problems that Data Engineering SolvesClear strategies for addressing key pain pointsAug 22
Vu TrinhinData Engineer ThingsI spent 8 hours learning Parquet. Here’s what I discoveredI finally sat down and learned about it.Aug 2412
Vishal BarvaliyaSQL topics & subtopics for Data Analyst roleComprehensive Guide to SQL for Data Analysts: From Fundamentals to Advanced Techniques1d ago
SciforceinSciforceStep-by-Step Guide to Creating Your Own Large Language ModelLarge Language Models (LLMs) are transforming AI by enabling computers to generate and understand human-like text, making them essential…Sep 53
KoushikApache Spark On Apple SiliconInstall Spark(Scala, Python) on Macs having M1, M1 Pro, M1 Max, and M2 with this guide22h ago