Prateek Dubey | Apache Spark on Kubernetes — On-Premise (Ceph) and AWS (S3) | In this article we will go through how Kubernetes works as a Cluster Resource Manager for deploying Apache Spark applications. Spark as a… | Sep 24, 2021
Prateek Dubey | JupyterHub on Kubernetes — AWS & On-Premise | This article is a continuation of my previous article on how I built a Data Platform — On-Premise on Kubernetes. To further explain how… | Sep 21, 2021
Prateek Dubey | Data Platform — On-Premise on Kubernetes | This article is about some of the work I have been doing in my prior and current organizations around Data Infrastructures. For the past… | Sep 20, 2021
Prateek Dubey | Databricks on Azure | Databricks is a Unified Data Analytics Platform created by the founders of Apache Spark. It provides a PaaS on Azure (in partnership with Microsoft)… | Jul 9, 2020
Prateek Dubey | Databricks on AWS | Databricks is a Unified Data Analytics Platform created by the founders of Apache Spark. It provides a PaaS on AWS Cloud to solve complex Data… | Jul 9, 2020
Prateek Dubey | My Business Intelligence to AI Infrastructure Journey | I graduated from University of Pune, India with a Bachelor of Engineering degree majoring in Computer Engineering in 2013. I wanted to… | Jul 9, 2020
Prateek Dubey | Terraform Cloud Infrastructure — AWS, Azure and GCP | Automation of Cloud Infrastructure is a norm that every company follows. Whenever we think of Cloud platforms, the first thing that comes… | Jul 9, 2020
Prateek Dubey | Free Community Edition Databricks Account | To set up a Databricks account, we first need to sign up using the link below | Jun 30, 2020
Prateek Dubey | Databricks x Airflow Integration | Databricks comes with a seamless Apache Airflow integration to schedule complex Data Pipelines. | Jun 30, 2020
Prateek Dubey | Secure AWS Networking | To design a secure infrastructure, you must be wondering if we need help from an SRE (Site Reliability Engineer) or a DevOps Engineer who… | Jun 30, 2020