saurabh goyalAWS Route 53There are multiple NameServer(NS) and each NS is the author of a zone.You create record set inside a zone .Sep 30, 2020Sep 30, 2020
saurabh goyalAWS Networking : VPC and SubnetsCertain AWS resources such as EC2,LB and RDS are required to be in a VPC and subnet where you need to define your own networking and some…Feb 22, 20201Feb 22, 20201
saurabh goyalDocker ImagesContainer images are templates from which containers are created. These images are composed of many layers. The first layer in the image…Jun 4, 2019Jun 4, 2019
saurabh goyalProject Tungsten and Catalyst SQL optimizerProject Tungsten/off-heap SearlizierNov 30, 20182Nov 30, 20182
saurabh goyalRunning Spark Jobs on YARNWhen running Spark on YARN, each Spark executor runs as a YARN container. Where MapReduce schedules a container and fires up a JVM for…Oct 24, 2018Oct 24, 2018
saurabh goyalSpark Architecture and DeploymentA spark application consists of a driver which run either on the client or on application master node and many executors which run across…Oct 24, 2018Oct 24, 2018
saurabh goyalPartitioning in Apache SparkData in the same partition will always be in the same machine. Data in a partition will not span multiple machines.Oct 3, 2018Oct 3, 2018
saurabh goyalPair RDDs: Transformations and ActionsDistributed key-value pairs are represented as Pair RDDs in Spark.Sep 26, 2018Sep 26, 2018