Naveen NelamaliSpark Executor Memory Overhead: Understanding & Best PracticesSpark executor memory overhead refers to additional memory allocated beyond the user-defined executor memory in Apache Spark·5 min read·Jan 14, 2024----
Naveen NelamaliWhen to use Hive Partitioning and Bucketing?Hive Partition is a way to split a large table into smaller tables based on the values of a column(one partition for each distinct value)4 min read·Jan 12, 2024----
Naveen NelamaliDo you know you can use Variables in Hive?Hive variables are key-value pairs that can be set using the set command and they can be used in scripts and Hive SQL.4 min read·Jan 12, 2024----
Naveen NelamaliHow to Union Pandas DataFrames?In pandas, you can use the concat() function to concatenate or union the Pandas DataFrames along with a particular axis.3 min read·Jan 12, 2024----
Naveen NelamaliWhy Avoid UDFs in Spark & PySpark?User-Defined Functions (UDFs) in Spark can incur performance issues due to serialization overhead, necessitating the conversion of data…4 min read·Jan 12, 2024----
Naveen NelamaliWhat are the Specific Roles of Spark Driver and ExecutorHave you ever wondered what the different roles of Apache Spark Driver and Executor play when running your application in a distributed…3 min read·Jan 12, 2024----
Naveen NelamaliIntroduction to Spark-Submit: A Comprehensive Guide to Submitting Spark ApplicationsEvery thing you need to know about spark-submit5 min read·Jan 11, 2024----
Naveen NelamaliRemember These when Accessing Matrix Elements in RWhen accessing elements in an R matrix, there are several important things to remember to ensure accurate and effective data manipulation:2 min read·Nov 11, 2023----
Naveen NelamaliRead Snowflake table into Spark DataFrame — Spark by {Examples}In this Snowflake data warehouse article, I will explain how to read a Snowflake table into Spark DataFrame and learn different connection…3 min read·Feb 28, 2020----
Naveen NelamaliWhich Spark HBase Connector to use in 2019? — Spark by {Examples}This tutorial explains different Spark connectors and libraries to interact with HBase Database and provides a Hortonworks connector…5 min read·Aug 30, 2019----