Arpita Mishra – Medium

Arpita Mishra

Arpita Mishra

Key Interview Questions and Expert Insights on Optimization and Data Handling — Part 2

1) What are some ways to optimize Spark jobs?

6d ago

Key Interview Questions and Expert Insights on Optimization and Data Handling — Part 2

6d ago

Arpita Mishra

Beginner’s Guide to PySpark Interview Questions: RDDs, DataFrames, and Transformations — Part 1

1) What is PySpark, and how does it differ from Apache Spark?

Jul 9

Beginner’s Guide to PySpark Interview Questions: RDDs, DataFrames, and Transformations — Part 1

Jul 9

Arpita Mishra

All-In-One SQL Guide: From Fundamentals to Performance Tuning

SQL (Structured Query Language) is a standardized programming language used for managing and manipulating relational databases. It allows…

Jul 4

All-In-One SQL Guide: From Fundamentals to Performance Tuning

Jul 4

Arpita Mishra

Beginner’s Guide for E-commerce Analytics using PySpark : Advanced Syntax and Use Cases for Top…

Let’s consider a practical scenario where we have a large dataset of e-commerce transactions, and we want to analyse customer purchasing…

Jun 27

Beginner’s Guide for E-commerce Analytics using PySpark : Advanced Syntax and Use Cases for Top…

Jun 27

Arpita Mishra

From Basics to Advanced: Navigating Apache Hive for Big Data Professionals

Apache Hive is a data warehousing and SQL-like query language for Hadoop. Developed by Facebook, it is now a part of the Apache Software…

Jun 23

From Basics to Advanced: Navigating Apache Hive for Big Data Professionals

Jun 23

Arpita Mishra

Mastering Apache Spark: Key Concepts and Practical Tips

Apache Spark is an open-source, distributed computing system designed for fast and general-purpose big data processing. Developed at UC…

Jun 15

Mastering Apache Spark: Key Concepts and Practical Tips

Jun 15

Arpita Mishra

Understanding HDFS: The Backbone of Hadoop’s Data Storage

What is HDFS?

Jun 10

Understanding HDFS: The Backbone of Hadoop’s Data Storage

Jun 10

Arpita Mishra

Introduction to Bigdata and Hadoop Ecosystem

The big data ecosystem is a comprehensive suite of technologies and tools designed to handle the complexities of managing, processing, and…

Jun 7

Introduction to Bigdata and Hadoop Ecosystem

Jun 7

Arpita Mishra

Revolutionize Your IT Budget: Cost Analysis of Cloud Data Warehousing Solutions

In today’s fast-paced digital landscape, optimizing IT budgets while maintaining robust data management is crucial. Cloud data warehousing…

Jun 5

Revolutionize Your IT Budget: Cost Analysis of Cloud Data Warehousing Solutions

Jun 5

Arpita Mishra

Redefining Data Management: Modern Data Warehousing and Its Future

A modern data warehouse is a cloud-based, scalable, and highly flexible data storage solution designed to handle large volumes of diverse…

Jun 3

Redefining Data Management: Modern Data Warehousing and Its Future

Jun 3

Arpita Mishra

Arpita Mishra

MSBI + Azure Cloud ETL Professional .Working @SocGen

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams