Sarthak JoshiUnderstanding Data Drift: A Comprehensive Guide with ExamplesData drift is a critical concept in machine learning (ML) and data science that often goes unnoticed until it causes significant…Sep 3Sep 3
Sarthak JoshiUnderstanding Locality Sensitive Hashing(LSH): A Powerful Technique for Similarity Search.With recent rise of Large language models (LLMs) there has been a rise of use cases for vector databases which provide fast approximate…Jul 30, 20231Jul 30, 20231
Sarthak JoshiinAnalytics VidhyaUDFs in PysparkPyspark sql library itself provides a wide variety of functions to apply on a data frame, but we can define our own functions in case of…May 3, 2020May 3, 2020
Sarthak JoshiinAnalytics VidhyaIntroduction to window function in pyspark with examplesWorking as a data scientist/data engineer transformation of big data is a very important aspect . Spark framework is most commonly used…Apr 25, 2020Apr 25, 2020
Sarthak JoshiMathematics of Machine Learning , Part-1 Linear RegressionMathematics is an integral part of machine learning, though in real world ML mostly involves programming and you don’t need to dive deep…Sep 14, 2018Sep 14, 2018