PinnedPuneet SahainAllThingDataApache Spark Optimization — Avoid groupByKey()Apache Spark, a powerful open-source framework for big data processing, excels in both batch and real-time tasks. In this discussion, we…Dec 18, 2023Dec 18, 2023
Puneet SahainTowards Data EngineeringHow to reduce cost of Data Engineering pipelines while improving performanceShort answer is : Co-locating compute and data for these pipelines + using Amazon S3 Express One Zone storage.Dec 9, 2023Dec 9, 2023
Puneet SahaStrategies and Roadmap for Cloud MigrationIntroduction: Migration to the cloud can be driven by multiple factors — reduced capital expenditure, minimizing operational expenses, and…Dec 2, 2023Dec 2, 2023
Puneet SahainAllThingDataCloud — Cost Saving StrategiesNow, we are in the process of migrating our applications and systems to the cloud. Some are becoming cloud-enabled, while others are being…Sep 28, 2023Sep 28, 2023
Puneet SahainAllThingDataFeature Store — How to pick up the right one for ML infrastructureGenerating features from raw input data is a time-consuming and complex process. Therefore, it makes sense to persist them for reuse across…Sep 18, 2023Sep 18, 2023
Puneet SahainAllThingDataMLOps — How To Monitor Data Drift in Machine Learning ModelsDeploying machine learning models comes with its unique set of challenges. In this article, we will delve into one of the challenges from a…Sep 12, 2023Sep 12, 2023
Puneet SahaUnlock the Secret to Boost Your Mood!There are times when life throws a curveball, and we are not prepared for it. We become sad and mildly depressed. There are various reasons…Sep 7, 2023Sep 7, 2023
Puneet SahaWhat should we do if we don’t achieve what we wanted in life?In today’s world, we are bombarded by reels and influencers from every corner, each offering a quick fix to happiness, often in under 30…Sep 2, 20231Sep 2, 20231