Apache Spark on Kubernetes — On-Premise (Ceph) and AWS (S3)In this article we will go through how Kubernetes works as a Cluster Resource Manager for deploying Apache Spark applications. Spark as a…Sep 24, 2021Sep 24, 2021
JupyterHub on Kubernetes — AWS & On-PremiseThis article is a continuation to my previous article on how I built a Data Platform — On-Premise on Kubernetes. To further explain how…Sep 21, 2021A response icon2Sep 21, 2021A response icon2
Data Platform — On-Premise on KubernetesThis article is about some of the work I have been doing in my prior and current organization around Data Infrastructures. For the past…Sep 20, 2021Sep 20, 2021
Databricks on AzureDatabricks is a Unified Data Analytics Platform created by Apache Spark Founders. It provides a PAAS on Azure (Partnered with Microsoft)…Jul 9, 2020Jul 9, 2020
Databricks on AWSDatabricks is a Unified Data Analytics Platform created by Apache Spark Founders. It provides a PAAS on AWS Cloud to solve complex Data…Jul 9, 2020A response icon1Jul 9, 2020A response icon1
My Business Intelligence to AI Infrastructure JourneyI graduated from University of Pune, India with a Bachelor of Engineering degree majoring in Computer Engineering in 2013. I wanted to…Jul 9, 2020Jul 9, 2020
Terraform Cloud Infrastructure — AWS, Azure and GCPAutomation of Cloud Infrastructure is a norm that every company follows. Whenever we think of Cloud platforms, the first thing that comes…Jul 9, 2020Jul 9, 2020
Free Community Edition Databricks AccountTo setup a Databricks Account, firstly we need to Sign Up using below linkJun 30, 2020Jun 30, 2020
Databricks x Airflow IntegrationDatabricks comes with a seamless Apache Airflow integration to schedule complex Data Pipelines.Jun 30, 2020Jun 30, 2020
Secure AWS NetworkingTo design a secure Infrastructure, you must be wondering if we need help from a SRE (Site Reliability Engineer) or a DevOps Engineer who…Jun 30, 2020Jun 30, 2020