Fahad KhanUnlocking the Power of Terraform: Advanced Tricks and Tips for Managing Infrastructure as CodeTerraform is a popular open-source tool that allows you to create, change, and improve infrastructure as code. Here are a few tricks for…Jan 27, 2023Jan 27, 2023
Fahad KhanLarge Scale Machine Learning using Apache Spark (Part II) For NewbiesIn my previous blog, I told you how to setup Apache Hadoop and Apache Hive cluster consist of two nodes. In this blog I’m going to tell…Feb 16, 2019Feb 16, 2019
Fahad KhanLarge Scale Machine Learning using Apache Spark (Part I) For NewbiesHi, I’m back with another blog and in this blog I’m going to tell you how to do large scale machine learning using spark. This is the…Feb 12, 2019Feb 12, 2019
Fahad KhanSpark has three data representations i-e RDD, Dataset & Dataframe.Once we’ve installed and configured Spark. We can do programming in scala by opening spark-shell. Now let’s talk about RDD briefly, RDD…Jan 24, 20192Jan 24, 20192