Jordi Esteve Sorribas. Why You Should Avoid UDFs in Apache Spark: A Practical Guide with Regression Analysis. Introduction. 15h ago
In Towards Data Science, by 💡Mike Shakhomirov. The Top 10 Data Lifecycle Problems that Data Engineering Solves. Clear strategies for addressing key pain points. Aug 23
Abhinav Vinci. Apache Spark — Common mistakes… Spark is a framework for processing big data. In Part 1 we focused on the Basics of spark and Why its so fast. Nov 3
In datamindedbe, by Niels Claeys. Running thousands of Spark applications without losing your cool. I explain how to troubleshoot and detect problematic Spark applications at scale as well as show how this can be used to reduce your costs. 2d ago
In Data Engineer Things, by Vu Trinh. Apache Kafka — Overview. The terminology and the architecture. Jul 6
In Data Engineer Things, by Vu Trinh. I spent 8 hours learning Parquet. Here’s what I discovered. I finally sat down and learned about it. Aug 2419
In Wren AI, by Howard Chi. Trino + Wren AI — Getting Big Data from Anywhere Fast and Easy with AI. How Trino and Wren AI Combine to Supercharge Big Data Analytics, Streamline Multi-Source Integration, and Deliver Real-Time Insights. 2d ago