Amazon Dynamo tech paper quick highlightsAmazon Dynamo is a highly available key-value storage system that some of Amazon’s core services use to provide an “always-on” experience…Jan 27, 2024A response icon1Jan 27, 2024A response icon1
Lakehouse tech paper quick highlightsLakehouse is a new generation platform which unifies Data Warehousing and Data Lake capabilities.Jan 12, 2024A response icon1Jan 12, 2024A response icon1
Consistent Hashing with Java implementationIntent of this article is to introduce consistent hashing along with a basic Java code implementation. Github link also pasted at the end…Jun 22, 2023A response icon1Jun 22, 2023A response icon1
Data Engineering : Metadata storage and discovery toolsWith growing amount of data in a large organization, there is a dire need of a central metadata storage for storing useful information…Jun 19, 2023Jun 19, 2023
AWS series : How to to create aws resources via terraformIntent of this article is to demonstrate how to use HashiCorp AWS provider terraform modules to create AWS resources in a matter of…May 21, 2023May 21, 2023
Published inDevOps.devDelta Lake : A saviour to your Data Lake problemsIntent of this article is to give a brief overview of what Delta Lake is, what purpose it serves, along with a basic hands on experience…May 19, 2023May 19, 2023
Python Pandas : Word Count ProgramIntent of this article is show a very basic word count program using Pandas, for begiMay 17, 2023May 17, 2023
ML series : Pandas fundamentalsPandas is a very useful Python library for data manipulation and analysis. It provides high-performant, easy-to-use data structures and…May 15, 2023May 15, 2023
ML series : NumPy fundamentalsNumPy (Numerical Python) is a fundamental Python package for scientific computing. It provides a powerful array data structure for storing…May 14, 2023May 14, 2023
ML series : Machine Learning system types and categoriesMachine learning systems can be broadly classified into below categories.May 12, 2023May 12, 2023