Hareesha DandamudigRPC — Revolutionize Communication in Distributed SystemsA Modern Communication SolutionAug 16, 2023Aug 16, 2023
Hareesha DandamudiPartitioning and Consistent HashingSuccess of any distributed storage system is majorly dependent on the data partitioning and replication scheme that it uses. For a…Aug 18, 2022Aug 18, 2022
Hareesha DandamudiML — K Nearest Neighbors ClassifierWith and without Scikit-LearnJun 21, 2022Jun 21, 2022
Hareesha DandamudiApache Spark — Job monitoringUse Spark Listeners to collect low level job metrics.Jun 9, 20221Jun 9, 20221
Hareesha DandamudiApache Spark — Large query plansSpark achieves its fault tolerance with ability to go back and replay everything from DAG. But if lineage of some of those dataframes…Apr 22, 20221Apr 22, 20221
Hareesha DandamudiVectors, Matrices and VectorizationTerms that are most commonly heard in Data science and ML world.Apr 18, 2022Apr 18, 2022
Hareesha DandamudiPackaging PySpark application using pex and whl.Recently I started working on PySpark application and it did involve a lot of learning curve for me, not in terms of PySpark but packaging…Apr 12, 20221Apr 12, 20221