Deepa Vasanthkumar – Medium

Deepa Vasanthkumar

Pinned

Deepa Vasanthkumar

Unlocking the World of Data Engineering — Guide to Acing Interviews

In today’s data-driven world, the demand for skilled data engineers is soaring. Companies are on the lookout for professionals who can…

Jun 8

Unlocking the World of Data Engineering — Guide to Acing Interviews

Jun 8

Deepa Vasanthkumar

Understanding Coalesce function in SQL and Spark

The COALESCE function is a powerful and commonly used feature in both SQL and Apache Spark. It is instrumental in handling NULL values and…

2d ago

Understanding Coalesce function in SQL and Spark

2d ago

Deepa Vasanthkumar

Spark Concepts and Questions

1. How many types of join strategies are there in Spark?

Jul 12

Spark Concepts and Questions

Jul 12

Deepa Vasanthkumar

Exploring Architectural Patterns in Data Engineering Projects

Data engineering is a critical component of any data-driven organization, enabling the collection, transformation, and management of data…

Jul 1

Exploring Architectural Patterns in Data Engineering Projects

Jul 1

Deepa Vasanthkumar

Code Optimization in PySpark Leveraging Best Practices

Apache Spark is a powerful framework for distributed data processing, but to fully leverage its capabilities, it’s essential to write…

Jun 26

Code Optimization in PySpark Leveraging Best Practices

Jun 26

Deepa Vasanthkumar

Combining Dask with Delta and plotting

What is Dask?

Jun 24

Combining Dask with Delta and plotting

Jun 24

Deepa Vasanthkumar

Spark Accumulators and Broadcast variables

In Apache Spark, both accumulators and broadcast variables are used to share data among nodes in a distributed processing environment, but…

Jun 12

Spark Accumulators and Broadcast variables

Jun 12

Deepa Vasanthkumar

Spark dataframes select vs withcolumn comparison

In Apache Spark, both `select` and `withColumn` are methods used to manipulate DataFrames, but they serve different purposes and have…

Jun 10

Spark dataframes select vs withcolumn comparison

Jun 10

Deepa Vasanthkumar

Apache Spark Join Strategies

Broadcast Hash Join

Jun 7

Apache Spark Join Strategies

Jun 7

Deepa Vasanthkumar

pyspark dataframe transform m

The `transform()` method in PySpark DataFrame API applies a user-defined function (UDF) to each row of the DataFrame. It takes a function…

Apr 16

pyspark dataframe transform m

Apr 16

Deepa Vasanthkumar

Deepa Vasanthkumar

Data Engineering & Cloud | Follow/Connect 👋 https://www.linkedin.com/in/deepa-vasanthkumar/

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams