From Raw Text to DataFrames: A Journey with RDD’s zipWithIndex, filter, and map
Manikandan Muthiah, Jul 30, 2023
In the world of data analysis and processing, structured data in the form of DataFrames plays a crucial role. However, not all data comes…

How to calculate distance between two co-ordinates using PySpark
Manikandan Muthiah, Apr 1, 2023
In the world of data science, location data plays an important role in a lot of analysis. Calculating distances between two locations…

Creating and Using Wheel Files in Databricks: A Step-by-Step Guide
Manikandan Muthiah, Mar 31, 2023
To create a Wheel file in Databricks and use it in your code, follow these steps:

How to access Azure storage using Service principal
Manikandan Muthiah, Feb 15, 2023
First, register an app in Azure Active Directory.

How to Configure Databricks CLI
Manikandan Muthiah, Feb 14, 2023
Open a command prompt in Windows and run the following command.

Amazon spending trends with Synapse
Manikandan Muthiah, Feb 12, 2023
Play with your own Amazon order data. Here’s how to get the data:

SQL Problems and Solutions
Manikandan Muthiah, Nov 26, 2022
This post has 4 SQL questions; their solutions are attached at the end.

Databricks Certified Associate Developer for Apache Spark 3.0 — Python
Manikandan Muthiah, May 29, 2022
All you want to know about the exam.