Manojt · PySpark | How to use Spark API’s to write word count program? · If you have experience with interviews or are preparing for one, you are likely aware that word count is a commonly asked question to… · Jun 23, 2023
Manojt · Beginner’s Guide to Resource Deployment on Cloud platforms Using Terraform · Learning Cloud computing platforms can be a valuable investment for software engineers, as it provides numerous benefits such as… · Apr 23, 2023
Manojt · A step-by-step approach to convert ER models to effective dimensional models · In my previous blog, I discussed Data Models in detail and how to build an ER (entity-relationship) model from scratch for… · Apr 7, 2023
Manojt · How to build a Data Model from scratch? · In this blog, I’ll explain how to build a Data Model from a business requirement in the Retail domain. This blog would be helpful for… · Mar 26, 2023
Manojt in Towards Dev · Is that true I can do gold mining with Lakehouse architecture? · Is Lakehouse a Data Warehouse? No…. · Mar 15, 2023
Manojt · Illustrating the purpose of PySpark User Defined Functions (UDF) with example dataset · Not all the data you get to work with is straightforward to analyze. There are cases where you have to transform… · Mar 2, 2023
Manojt · Get familiar with SQL by playing with Retail Domain dataset -Part 1 · As a data engineer, I have always loved working with data. Every dataset has its own story to narrate, and sometimes analytical tools like SQL… · Feb 24, 2023
Manojt · Things to know in Spark while dealing with JSON file formats · While building a data pipeline, you might have had to deal with JSON file formats as the data source. In this blog, I’ll cover the… · Feb 23, 2023
Manojt · Schema Validation for Streaming data using Kafka + Schema Registry · Before we jump into the topic of Schema Registry, let’s understand how data is streamed between producer and consumer. · Dec 9, 2022
Manojt · Essential terminologies you need to know about Apache Kafka! · Before we get started with the discussion, let’s talk about the basics! · Nov 2, 2022