Sandeep Suthrame
Apr 9, 2024

πŸš€ Excited to share a PySpark challenge for data engineers! πŸš€

Are you ready to put your data engineering skills to the test? πŸ’‘ In this challenge, you'll dive into a retail dataset to identify the top 5 customers who made the highest total purchase amount in the last month using PySpark.

πŸ›’ Input Dataset Sample:
(transaction_id, customer_id, product_id, quantity, unit_price, timestamp)

πŸ“Š Output:
Top 5 customers with their total purchase amounts.

πŸ” This challenge is designed to assess your ability to analyze transactional data and derive meaningful insights using PySpark. Are you up for the challenge? Share your solution and join the conversation! #PySpark #DataEngineering #DataAnalysis #TopCustomers #DataProcessing.

Let's dive in and showcase your PySpark prowess! πŸ’ͺπŸ’Ό