Pinned
Abhishek Das
Spark Cluster Runtime Architecture Explained: Ace Spark Job Interviews
Learn how Apache Spark’s drivers, executors, and cluster management technologies power efficient distributed data processing
Sep 15

Abhishek Das
Understanding Data Integrity in Databases: A Comprehensive Guide
Data integrity is a critical aspect of database management, ensuring the accuracy, completeness, and consistency of data stored or…
Aug 9

Abhishek Das
Amazon Data Analysis Project in PySpark
PySpark’s ability to handle large datasets makes it a valuable tool for data processing and analysis.
Jul 16

Abhishek Das
Python requirements.txt: How to manage dependencies in Python Project
The requirements.txt file is a text document that enumerates the dependencies required by a Python project. It typically includes the…
Mar 11

Abhishek Das
Bumpversion: Manage your Python Project version
Ever find it difficult to manage the versions of your Python project in your production environment and capture the versions in the…
Mar 8

Abhishek Das
Databricks: Organize user-defined header, Footer and attach to Data in a file using spark SQL
In Databricks, sometimes we get a requirement where we have to organize header (columns information), data and footer (metadata of data…
Feb 28