Stories by Tilak Mudgal on Medium

Vector Databases: The Memory Engine of RAG

Tilak Mudgal — Wed, 15 Oct 2025 23:38:29 GMT

Every intelligent system needs a way to remember what it has learned.
In Retrieval-Augmented Generation (RAG), that memory is powered by embeddings numerical representations of text that capture meaning instead of just words. These embeddings allow AI to understand that “refund” and “return” are related even if they’re written differently.

If you’re already familiar with how embeddings work, feel free to continue.
But if you’d like a quick refresher on how AI turns text into numbers and meaning, check out the below post.

RAG Embeddings: How AI Turns Text into Understanding

Now, once we’ve transformed our data into embeddings, the real question becomes:
Where do we store them, and how do we find the right ones when a user asks something new?
That’s where Vector Databases step in acting as the memory engine that helps RAG systems recall, search, and connect the right pieces of information in milliseconds.

They act as the memory of your RAG system whenever a user asks a question, the database helps find the most relevant pieces of information by comparing vectors.

For example:

You ask, “What are the benefits of using solar panels?”
The system doesn’t look for the word “benefits” it looks for meaning.

It fetches chunks whose embeddings are closest in meaning to your question, like “Solar panels reduce electricity costs and carbon emissions.”

Why Traditional Databases Don’t Work

Traditional databases are great for structured information things like customer names, invoice IDs, or sales amounts.
You can easily run queries like:

SELECT * FROM cars WHERE brand = 'Porsche';

But what if your question isn’t that simple?
Say you ask:

“Show me cars that feel sporty but practical.”

A relational database will stare blankly at you it doesn’t “understand” what sporty but practical means.
A vector database, on the other hand, can find cars like the Porsche 911, BMW M2, or Bugatti Chiron, because it stores meanings as numerical vectors, not just text strings.

That’s the magic vector databases let machines search by meaning, not by exact words.

How Vector Databases Work

When we turn text into embeddings, every sentence becomes a list of numbers like giving it a unique spot in a giant 3D map.
But instead of just three directions (x, y, z), this map has hundreds or even thousands of directions one for every tiny piece of meaning.

A vector database stores all these “locations” along with the original text, so it knows both where something is in this space and what it means.
It also creates special shortcuts (called indexes) so it can find things quickly, even if you have millions of sentences stored.

Now, when you ask a question, your query is also turned into a vector a point on the same map.
The database simply looks for other points that are closest to it.
Those nearby points usually represent text with similar meaning, so the database returns them as the most relevant results.

Vector Similarity: How It Finds the Closest Meaning

Imagine every car you know plotted in a huge invisible 3D space:

Cars with similar personality like “sporty,” “luxury,” or “eco-friendly” end up close to each other.

A Porsche 911 and BMW M4 would be neighbors because they share “performance” and “sporty” vibes.

Meanwhile, a Toyota Prius lives far away in the “eco & efficiency” zone.

So when you ask the system, “Recommend cars that feel thrilling to drive,” it doesn’t just find the word “thrilling” it finds the vectors closest in meaning to that feeling.

That’s what a vector database does: it measures closeness in meaning using something called vector similarity.

Vector search is about finding which vectors are closest to your query vector meaning they represent similar ideas.
Instead of text comparison, we measure how similar two points (vectors) are in multi-dimensional space.

The most common similarity measures are:

Cosine Similarity — The Shortcut to Meaning

Most vector databases rely on cosine similarity, which measures the angle between two vectors.
The smaller the angle, the more similar they are.
If two vectors point in almost the same direction, it means their meanings align closely.

So, when your query “thrilling to drive” points in nearly the same direction as “high performance car,” the cosine similarity is high and your database confidently returns results like Porsche 911 GT3 or BMW M4.

It’s not looking for keywords.
It’s comparing ideas mathematically.

Inside the Vector Database

Every piece of text you store whether it’s a paragraph from a document, a product description, or a customer review gets an embedding vector and a unique ID.
The database keeps track of both:

{
  "id": "123456",
  "text": "The Porsche 911 GT3 offers incredible handling and track-ready precision.",
  "embedding": [0.11, -0.82, 0.57, ...]
}

When a user asks, “Which car is fun on corners?”
the database finds vectors with similar patterns.
Those become your top results, which are then sent to the LLM for final answer generation.

How the Database Stores and Searches

Storage — Each document chunk and its embedding vector are stored together.
Search — When a query comes in, its embedding is computed.
Then the database compares it with all stored vectors (using cosine similarity or another metric).
Result — The closest ones are retrieved (top-k results) and passed to the LLM for context.

Popular Vector Databases

The Role of Vector Databases in RAG

In a Retrieval-Augmented Generation system, the vector database acts as the retriever’s engine it quickly finds the right context before passing it to the LLM.

The flow looks like this:

Your documents are broken into chunks and embedded.
Those embeddings are stored in the vector database.
A user asks a question → it’s embedded → the database finds similar vectors.
The retrieved chunks are sent to the LLM → the final answer is generated.

Without this database, your system would have to re-scan every document each time someone asked a question slow, inefficient, and impractical.

RAG Embeddings: How AI Turns Text into Understanding

Tilak Mudgal — Fri, 10 Oct 2025 04:27:42 GMT

If you’ve ever wondered how an AI “understands” that

“dog” and “puppy” are similar,
but “dog” and “table” are not
you’re really wondering about embeddings.

Embeddings are the heart of Retrieval-Augmented Generation (RAG).
They’re what allow an AI system to find meaning, not just matching words.

Before we create embeddings, we need to prepare our text so it’s manageable for the model to process.
This step is called chunking breaking a long document into smaller, meaningful sections.

Chunking — Preparing Text for Embedding

Every embedding model has a token limit (for example, 512, 1024, or 8192 tokens).
If a document exceeds that limit, you can’t embed it in one go you have to split it into smaller chunks.
Each chunk is then treated as a separate piece of text that gets its own embedding.

Why Chunking Matters

Chunking ensures:

Each piece of text fits within the model’s token limit.
The content remains contextually meaningful (you don’t cut sentences in half).
Search and retrieval later happen on relevant sections, not entire documents.

Think of it as dividing a book into chapters before summarizing it you get better organization and more precise meaning.

How Chunking Works

Here’s what happens in a typical workflow:

Document → Start with your full text (a report, webpage, or transcript).
Chunking → Split it into smaller pieces, often around 200–500 words or 512–1024 tokens per chunk.
Embedding → Send each chunk to the embedding model to get a numerical vector.
Vector Database → Store all embeddings for later retrieval and semantic search.

Good chunking doesn’t just cut by length it respects structure.
You can chunk:

By paragraphs or headings (for articles)
By conversation turns (for chat logs)
By logical sections (for PDFs or reports)

You can also add overlap between chunks (e.g., 20–30%) to preserve context between them, ensuring smoother retrieval later in RAG systems.

What Are Embeddings

At their core, embeddings are just numbers but very special ones.

They’re vectors (lists of numbers) that represent the meaning of text.
Each word, sentence, or paragraph can be converted into a vector.

The trick?
Texts with similar meanings have vectors that are close together in this mathematical space.

Example:

Let’s say you have these two sentences:

“The cat is sleeping on the couch.”
“A kitten is napping on the sofa.”

Even though the words are different, they mean almost the same thing.

When converted to embeddings, their numeric representations might look like:

Sentence 1: [0.12, -0.43, 0.88, ...]
Sentence 2: [0.11, -0.45, 0.91, ...]

These two vectors are very close in distance meaning the model recognizes they’re similar in meaning.

That’s how AI finds related information even when you don’t use the same words.

Why Embeddings Matter in RAG

When someone asks a question, your RAG system doesn’t do a keyword search like “Cmd+F”.
Instead, it:

Converts the question into an embedding vector.
Searches for similar embeddings in a large collection (your knowledge base).
Retrieves the most relevant passages based on meaning, not keywords.

So when someone asks:

“How can I return my order?”

It can still find:

“Refund policy for purchased products”
even though the words “return” and “refund” are different.

That’s the real power of embeddings semantic search instead of keyword search.

What Makes This “Semantic” Search Different

In traditional keyword search, systems only look for exact word matches.
So if you type “return order”, it’ll only find documents that literally contain those words.
If your database says “refund policy” instead, the system won’t connect the two even though both mean the same thing.

That’s where semantic search changes everything.

Semantic search understands meaning, not just words.
It works by comparing the embeddings those numeric representations of text meanings.
If two pieces of text have similar meanings, their embeddings will be close together in the vector space, even if they don’t share any words.

That’s why semantic search feels more like how humans think.
We don’t look for exact words we look for what’s related in meaning.

How Embeddings Are Created

To create embeddings, we use special models called embedding models.
These models convert text into numerical representations vectors that capture meaning, not just words.

Let’s see how this works using a sentence :

“Porsche 911 GT3 is the best car ever built.”

Static vs Contextual Meaning

Older models like Word2Vec gave “drive” one fixed vector.
Modern embedding models like BGE-M3 or OpenAI embeddings create context-aware vectors, so the same word means different things depending on its usage.

Choosing an Embedding Model

You’ll often hear about embedding models like:

BGE-M3 (multi-lingual, open-source, strong accuracy)
E5-Large (great for English semantic search)
OpenAI text-embedding-3-large (high-quality, API-based)
MiniLM / Instructor-xl (lighter, good for smaller setups)

Tokenization type — how the model breaks text into smaller pieces (like words or sub-words) before turning them into numbers.
Dimension — the length of each embedding vector, showing how much detail the model can capture about meaning.
Pooling method — how the model combines all token vectors into one final vector that represents the entire sentence.

Types of Tokenization

Sentence: “The Porsche 911 GT3 is insanely fast.”

1. Word-level tokenization
This method splits text into full words separated by spaces or punctuation.

Example: ["The", "Porsche", "911", "GT3", "is", "insanely", "fast"]
It’s simple but limited if the model hasn’t seen a word like “GT3” before, it treats it as unknown.

2. Subword-level tokenization
This approach breaks uncommon or complex words into smaller, known parts.

Example: ["The", "Porsche", "911", "GT", "3", "is", "insane", "ly", "fast"]
This helps the model handle rare words or new combinations without losing meaning.

3. Character-level tokenization
Here, every single character (including spaces and punctuation) becomes its own token.

Example: ["T", "h", "e", " ", "P", "o", "r", "s", "c", "h", "e", " ", "9", "1", "1", " ", "G", "T", "3", " ", "i", "s", " ", "i", "n", "s", "a", "n", "e", "l", "y", " ", "f", "a", "s", "t", "."]
It captures all details but creates extremely long token lists, which is inefficient for large texts.

4. Byte Pair Encoding (BPE)
BPE starts at the character level and repeatedly merges the most frequent pairs of characters into subwords.

Example: ["The", "▁Porsche", "▁911", "▁GT", "3", "▁is", "▁insane", "ly", "▁fast", "."]
This creates a balance between word and character-level tokenization compact yet flexible.

5. WordPiece tokenization
Used in models like BERT, it works similarly to BPE but uses probabilities to decide which subwords to merge.

Example: ["The", "Porsche", "911", "GT", "##3", "is", "insane", "##ly", "fast", "."]
The “##” prefix shows that a token continues from a previous one (e.g., “GT” + “##3” → “GT3”).

6. SentencePiece tokenization
SentencePiece treats the entire text as a raw byte stream, including spaces, and can handle multiple languages naturally.

Example: ["▁The", "▁Porsche", "▁911", "▁GT3", "▁is", "▁insanely", "▁fast", "."]
The underscore (▁) represents a space, making it language agnostic and ideal for multilingual embedding models like BGE-M3.

Final Takeaway

Embeddings aren’t just numbers they’re how machines understand meaning.
And chunking isn’t just splitting text it’s how we preserve structure and relevance in that meaning.

When combined, they transform unstructured data into searchable, intelligent knowledge the backbone of every modern RAG system.

Why RAG Exists and How It Works — Part 1

Tilak Mudgal — Thu, 09 Oct 2025 23:29:28 GMT

Why RAG Exists and How It Works

If you’ve ever asked ChatGPT or any other AI model a question like “What’s the revenue of Tesla in 2024?” and it confidently gave you an outdated or completely wrong answer, you’ve experienced the limits of large language models (LLMs).

They’re brilliant at language, but not always great at facts.
That’s where Retrieval-Augmented Generation (RAG) comes in it’s like giving your AI a search engine brain.

Source: seekr

Why RAG Exists

LLMs, like GPT or LLaMA, are trained on massive amounts of text. But:

Their training data is static it ends at a certain point.
They can’t “look up” new or private information.
They sometimes “hallucinate” i.e., make up facts that sound correct.

Let’s take an example:

Imagine you built a chatbot for your company’s customer support.
Someone asks:

“What’s the return policy for our premium customers?”

A normal LLM might guess based on patterns in text it’s seen before, saying something like:

“You can return products within 30 days.”

But your real policy might be:

“Premium customers can return products within 60 days.”

“Non Premium customers can return products within 30 days.”

That’s a big difference.
RAG fixes this problem by letting your chatbot retrieve the real policy from your database or documents before it generates an answer.

Where RAG Fits vs. Fine-Tuning

There are two main ways to give LLMs new knowledge:

Let’s simplify it:

Fine-tuning is like teaching your assistant new habits (“always greet customers politely”).
RAG is like giving your assistant a live knowledge base they can search whenever they need an answer.

In real-world systems, companies often combine both:

Fine-tune for tone and behavior.
Use RAG for real-time information.

High-Level Architecture of RAG

Let’s walk through what’s happening step by step:

Source: GeeksforGeeks

Knowledge Base → Data Chunks
Your documents, FAQs, or PDFs or text live in the knowledge base. Before we can use them, we split them into smaller pieces (called chunks) so that the model can search and understand them efficiently.
Embedding Model
Each chunk of text is converted into a list of numbers called an embedding a numerical representation that captures the meaning of the text. Ex: “Return policy for premium customers” → [0.23, 0.91, -0.45, ...]
Vector Database (Vector DB)
All these embeddings are stored in a vector database such as Milvus, FAISS, or pgvector.
When a user asks a question, the system converts the query into another embedding and searches for the most similar ones inside this database just like finding the closest points in space.
Retrieved Documents
The most relevant chunks are retrieved from the vector database. These chunks contain the factual information the model needs to answer correctly.
Generation Step (LLM)
The retrieved text is passed to the LLM (like GPT or LLaMA) along with the user’s question.
The model reads both and composes a fluent, context-aware response that’s grounded in those retrieved facts.

Example:

You ask:

“What is the warranty period for solar panels?”

Without RAG:

“Most solar panels come with a 10-year warranty.” (generic answer)

With RAG:

“According to our installation manual, all panels installed after 2022 have a 25-year product warranty and 10-year performance warranty.”

See the difference?
RAG doesn’t just talk smart it answers correctly by pulling from verified data.

Why It Matters

Trust: Answers come from your real data, not random web text.
Relevance: The model can access your internal knowledge (policies, product data, contracts).
Scalability: You can update your documents anytime without retraining the model.

That’s why RAG has become the standard architecture for any AI assistant, chatbot, or knowledge engine that needs to stay current and factual.

RAG Embeddings: How AI Turns Text into Understanding

How to Write a Dockerfile (Step-by-Step for Beginners)

Tilak Mudgal — Sat, 26 Apr 2025 22:05:42 GMT

If you’ve been following along, you already know what Docker is and how it helps you package your app into neat little containers.

Now the next question is:
👉 “How do I actually create a Docker container for my app?”

The answer: You write a Dockerfile.

Let’s walk through it in the simplest way possible.

What is a Dockerfile?

A Dockerfile is just a text file with instructions for Docker.
It tells Docker how to build your app into an image.

Think of it like a recipe:

Start with a base ingredient (Python)
Add your app files
Install any extra packages
Tell Docker how to start the app

When you “build” the Dockerfile, Docker follows these steps to create a ready-to-run image.

Let’s Build One Together!

Suppose you have a simple Python app with these two files:

/myapp
  ├── app.py
  └── requirements.txt

app.py has your Python code.
requirements.txt lists your Python packages (like Flask, Pandas, etc.).

Now, let’s write a Dockerfile to containerize this app!

Here’s a simple Dockerfile:

# 1. Use an existing Python image
FROM python:3.11

# 2. Set the working directory inside the container
WORKDIR /app

# 3. Copy the requirements file first
COPY requirements.txt .

# 4. Install the dependencies
RUN pip install --no-cache-dir -r requirements.txt

# 5. Copy the rest of the app files
COPY . .

# 6. Set environment variables
ENV PYTHONDONTWRITEBYTECODE=1
ENV PYTHONUNBUFFERED=1

# 7. Define the command to run the app
CMD ["python", "app.py"]

Line-by-Line Simple Explanation

FROM python:3.11 Start from an official Python environment

WORKDIR /app Create and move into a working folder

COPY requirements.txt .Copy the requirements.txt into the container

RUN pip installInstall Python libraries from requirements.txt

COPY . .Copy all your app code into the containerENV

PYTHONDONTWRITEBYTECODE=1Prevent Python from creating unnecessary .pyc files

ENV PYTHONUNBUFFERED=1Make Python print output immediately

CMD ["python", "app.py"]Tell Docker to start your app by running

python app.py

How to Build and Run Your Dockerfile

Step 1: Open your terminal and move to the project folder:

cd /path/to/your/myapp

Make sure your Dockerfile, app.py, and requirements.txt are inside.

Step 2: Build the Docker Image

docker build -t my-python-app .

This command builds an image called my-python-app based on your Dockerfile.

Step 3: Run the Docker Container

docker run my-python-app

You should see your app running! 🎉

For example, if your app.py simply prints something like:

print("Hello from inside Docker!")

You’ll see:

Hello from inside Docker!

Folder Structure You Should Have

/myapp
  ├── app.py
  ├── requirements.txt
  └── Dockerfile

Very simple and clean!

Real-World Tip 💡

Always copy requirements.txt first and install packages before copying your full code.
Because if you only change your code (not dependencies), Docker will cache earlier steps and build faster!

Setting environment variables like PYTHONUNBUFFERED=1 helps get real-time logs, especially useful when debugging.

Using --no-cache-dir in pip install makes your final image smaller and lighter.

Final Thoughts

Writing a Dockerfile might seem intimidating at first, but once you see it as a simple set of “steps” or a “recipe,” it becomes easy.

Every Dockerfile is basically:

Pick a base
Copy your stuff
Install what’s needed
Start your app

That’s it!

📋 Quick Summary

Dockerfile = Instructions for building a Docker image
Build image: docker build -t myapp .
Run container: docker run myapp
Think of it like cooking: recipe ➔ dish ➔ serve!

Getting Started with Docker and Docker Compose file

Tilak Mudgal — Sat, 26 Apr 2025 21:31:35 GMT

When you’re new to the world of containers, two names pop up almost immediately: Docker and Docker Compose. At first, they can sound confusing are they the same thing? Do you need both?

Let’s break it down in simple words.

What is Docker?

Imagine you’re building a project maybe a website or an app. Normally, you need to set up everything: the programming language, the database, the server, libraries, and a hundred other things.
And if someone else wants to run your project, they need to set up everything exactly the same way.
That’s painful and messy.

Docker fixes that.

Docker is a tool that packages your application and everything it needs into a container.
Think of a container like a little box that has your app, your libraries, your system settings everything.
You can send this box anywhere (your friend’s laptop, a cloud server, etc.), and it will run exactly the same every time.

Wait, what’s an Image and what’s a Container?

A Docker Image is like a blueprint — it’s the read-only template of your app (think of it like a recipe).

A Docker Container is the running instance of that image (like a dish you cooked using the recipe).

You create an image once, and you can run many containers from it whenever you want.

Quick Example:

You build a web app using Python and MySQL.
Instead of installing Python and MySQL everywhere, you “wrap” your app inside a Docker container.
Now you can ship your container and run it with a simple command — no messy installations needed!

What is Docker Compose?

Okay, now imagine your project isn’t just one container.
Maybe you need:

One container for your app (Python)
Another container for your database (MySQL)
Another container for a caching system (Redis)

Managing all of them separately would be annoying: you’d have to start each one manually, connect them together, set environment variables, etc.

Docker Compose solves that.

Docker Compose is a tool that lets you define and run multi-container applications easily.
You simply write a docker-compose.yml file where you list out all the containers your app needs, and their settings.

What if containers need to save data? Sometimes containers need to store information — like database files or user uploads.

That’s where Volumes come in.
Volumes are like shared storage areas that survive even if a container stops or gets deleted.

In Docker Compose, you can easily define volumes too!

What’s the Difference Between Docker and Docker Compose?

Docker

It is for running individual containers.
Good for small/simple apps
Container settings are given in the command line

Docker Compose

It is for orchestrating many containers at once.
Great for bigger apps needing multiple services (like app + database + cache)
Settings are written in a YAML file

When Should You Use Each?

Use Docker if you just have a single app — like a small script, a single website, or a simple backend server.
Use Docker Compose when your app needs multiple things — like a server, a database, a message queue, etc.

Real-Life Examples

You built a simple API in Python Flask?
→ A single Docker container is enough.
You built a full web application that needs a web server, a database, and a Redis cache?
→ You should set up a Docker Compose file.

Advantages

✅ Portability:
Both Docker and Docker Compose make it super easy to move your project between computers without worrying about “It works on my machine” problems.

✅ Consistency:
Every environment (development, testing, production) can behave exactly the same if you use containers.

✅ Isolation:
Each container runs its own little world — no conflicts between different apps or versions.

✅ Scaling:
Docker Compose can also help you scale — you can easily run multiple copies of a service if needed.

Final Thoughts

Docker and Docker Compose make modern app development cleaner, faster, and a lot less frustrating.
At first, they might seem a bit technical, but once you understand the basic ideas “pack everything into a box” and “manage multiple boxes together” it becomes really fun and powerful.

Quick Summary:

Images = Blueprint
Containers = Running app
Volumes = Persistent storage

In the next parts of the blog, we’ll dive deeper into how to create a basic Dockerfile and a Docker Compose file, with step-by-step examples!

Mastering Power BI DAX: From Basics to Advanced Calculations

Tilak Mudgal — Mon, 23 Dec 2024 08:21:32 GMT

Power BI’s Data Analysis Expressions (DAX) is a powerful formula language that enables users to create custom calculations and data analysis within Power BI. In this guide, we will take you from the basics to advanced concepts, so by the end, you’ll be equipped to write your own DAX calculations with confidence.

What is DAX?

DAX stands for Data Analysis Expressions. It is a collection of functions, operators, and constants that you can use to create formulas and expressions in Power BI, Power Pivot, and Analysis Services. DAX is used to perform calculations on data in tabular models.

Key highlights:

DAX operates on columns and tables rather than individual cells.
It’s designed to work with relational data.
DAX formulas are used in calculated columns, calculated tables, and measures.

Getting Started with DAX

1. Syntax Basics

A DAX formula begins with an equals sign = and can include functions, operators, and references to columns or tables.

Example:

=SUM(Sales[Amount])

This formula sums up the values in the Amount column of the Sales table.

2. Commonly Used Functions

Aggregation Functions:

SUM: Adds all numbers in a column.
AVERAGE: Calculates the mean.
COUNT: Counts the number of rows.
MAX and MIN: Find the maximum and minimum values.

Logical Functions:

IF: Performs conditional logic.

=IF(Sales[Amount] > 1000, "High", "Low")

AND, OR, NOT: Combine or negate conditions.

Text Functions:

CONCATENATE: Joins two strings.

=CONCATENATE(Customer[FirstName], Customer[LastName])

LEFT, RIGHT, MID: Extract parts of strings.

Intermediate DAX Concepts

1. Calculated Columns vs Measures

Calculated Columns: Add new data to the table; values are computed row by row. Example:

Profit = Sales[Revenue] - Sales[Cost]

Measures: Perform calculations on aggregated data; results change based on the context of the visualization. Example:

Total Sales = SUM(Sales[Revenue])

2. Contexts in DAX

Understanding context is crucial for writing effective DAX formulas.

Row Context: Applies to calculated columns and iterates through rows of a table.
Filter Context: Comes into play with measures and is determined by filters applied in visuals, slicers, or other DAX calculations.

Example of Filter Context:

Sales for Region = CALCULATE(SUM(Sales[Amount]), Region[Name] = "West")

This calculates total sales for the “West” region.

3. Iterators

DAX includes functions that iterate over rows to perform calculations.

SUMX: Iterates over a table and evaluates an expression for each row.

=SUMX(Sales, Sales[Revenue] - Sales[Cost])

AVERAGEX, MAXX, MINX: Similar to SUMX but perform other aggregations.

Advanced DAX Techniques

1. Time Intelligence Functions

DAX offers built-in functions to work with time-based data.

TOTALYTD, TOTALMTD, TOTALQTD:

=TOTALYTD(SUM(Sales[Revenue]), Calendar[Date])

DATESYTD, DATESBETWEEN, PREVIOUSYEAR:

=CALCULATE(SUM(Sales[Amount]), PREVIOUSYEAR(Calendar[Date]))

2. Advanced Filtering with CALCULATE

The CALCULATE function modifies the filter context.

Example:

Sales in 2023 = CALCULATE(SUM(Sales[Amount]), Year[Year] = 2023)

3. Dynamic Measures

Use dynamic calculations that respond to user selections.

Example:

Dynamic Sales = IF(SELECTEDVALUE(Region[Name]) = "East", SUM(Sales[Amount]), 0)

4. Variables in DAX

Variables simplify complex calculations by allowing you to store intermediate results.

Example:

Profit Margin =
VAR TotalCost = SUM(Sales[Cost])
VAR TotalRevenue = SUM(Sales[Revenue])
RETURN (TotalRevenue - TotalCost) / TotalRevenue

5. Advanced Table Functions

ADDCOLUMNS: Adds a calculated column to a table.
SUMMARIZE: Groups data and adds aggregations.
CROSSJOIN: Returns all combinations of two tables.

Example:

Grouped Sales = SUMMARIZE(Sales, Sales[Region], "TotalSales", SUM(Sales[Amount]))

Best Practices for Writing DAX

Start Simple: Begin with basic formulas and gradually layer complexity.
Use Meaningful Names: Name measures and calculated columns clearly.
Leverage Variables: Simplify calculations and improve readability.
Optimize Performance: Use aggregations and filters efficiently to avoid unnecessary computation.
Test Extensively: Validate your formulas in different contexts to ensure accuracy.

Conclusion

Mastering DAX takes practice, but it is a rewarding skill for unlocking the full potential of Power BI. By understanding the basics, practicing with intermediate concepts, and exploring advanced techniques, you’ll be well-equipped to perform sophisticated data analysis and create impactful visualizations.

SQL Questions and Answers

Tilak Mudgal — Mon, 23 Dec 2024 08:10:36 GMT

1. What is the difference between HAVING and WHERE?

Example:

SELECT department_id, COUNT(*) AS employee_count
FROM employees
WHERE salary > 50000
GROUP BY department_id
HAVING COUNT(*) > 5;

WHERE salary > 50000: Filters rows before grouping.
HAVING COUNT(*) > 5: Filters groups after aggregation.

2. What is the difference between UNION and UNION ALL?

UNIONUNION ALLCombines results and removes duplicates.Combines results without removing duplicates.Slower due to duplicate elimination.Faster as no duplicate check is performed.

Example:

-- UNION: Removes duplicates
SELECT name FROM customers
UNION
SELECT name FROM suppliers;

-- UNION ALL: Includes duplicates
SELECT name FROM customers
UNION ALL
SELECT name FROM suppliers;

Use UNION when you need unique values.
Use UNION ALL for better performance when duplicates are acceptable.

3. What is the difference between GROUP BY and ORDER BY?

GROUP BYORDER BYGroups rows based on one or more columns.Sorts rows based on one or more columns.Used to perform aggregations.Used to organize the result set.The order of rows is not guaranteed.Ensures a specific order in the result.

Example:

-- GROUP BY: Summarize data
SELECT department_id, COUNT(*) AS employee_count
FROM employees
GROUP BY department_id;

-- ORDER BY: Sort results
SELECT name, salary
FROM employees
ORDER BY salary DESC;

GROUP BY is used for aggregation (e.g., COUNT, SUM).
ORDER BY sorts the final output.

4. What is the difference between DELETE and TRUNCATE?

Example:

-- DELETE specific rows
DELETE FROM employees WHERE department_id = 1;

-- TRUNCATE entire table
TRUNCATE TABLE employees;

Use DELETE when removing specific rows.
Use TRUNCATE for clearing all data in a table quickly.

5. What is the difference between a Primary Key and a Unique Key?

Primary KeyUnique KeyUniquely identifies each row in a table.Ensures unique values in a column or set of columns.Only one per table.Can have multiple unique keys per table.Implicitly NOT NULL.Can contain NULL (but only one NULL per unique key).

Example:

-- Primary Key
CREATE TABLE employees (
    employee_id INT PRIMARY KEY,
    name VARCHAR(50)
);

-- Unique Key
CREATE TABLE users (
    user_id INT,
    email VARCHAR(100) UNIQUE
);

Primary Key is the main identifier for a table.
Unique Key ensures data uniqueness without being the primary identifier.

6. What is the difference between INNER JOIN and OUTER JOIN?

Example:

-- INNER JOIN
SELECT e.name, d.department_name
FROM employees e
INNER JOIN departments d ON e.department_id = d.department_id;

-- OUTER JOIN
SELECT e.name, d.department_name
FROM employees e
LEFT JOIN departments d ON e.department_id = d.department_id;

INNER JOIN: Only rows with matches in both tables are included.
LEFT OUTER JOIN: Includes all rows from the left table, even if there’s no match in the right table.

Normalization and Denormalization in SQL: Simplified Explanation with Examples

Tilak Mudgal — Mon, 23 Dec 2024 07:56:19 GMT

When designing a database, it’s essential to strike a balance between efficient storage and performance. This is where Normalization and Denormalization come into play. Normalization focuses on organizing data to reduce redundancy, while Denormalization aims to optimize query performance by reintroducing some redundancy. Let’s break these concepts down with simple explanations and examples.

What is Normalization?

Normalization is the process of organizing a database to reduce redundancy and improve data integrity. This is done by dividing data into smaller, related tables and establishing relationships between them. Normalization follows a series of steps called Normal Forms (1NF, 2NF, 3NF, etc.), each with specific rules to achieve a better-structured database.

The Normal Forms

1. First Normal Form (1NF)

Rule: A table is in 1NF if:

Each column contains atomic (indivisible) values.
Each row is unique.

Example: A table with multiple phone numbers in one column is not in 1NF:

2. Second Normal Form (2NF)

Rule: A table is in 2NF if:

It is already in 1NF.
All non-key columns depend on the entire primary key (no partial dependency).

Example: Consider a table where Order_ID and Product_ID form the composite primary key:

3. Third Normal Form (3NF)

Rule: A table is in 3NF if:

It is already in 2NF.
All non-key columns depend only on the primary key (no transitive dependency).

Example: Consider a table:

What is Denormalization?

Denormalization is the process of combining tables to optimize query performance, especially in analytical scenarios. It reduces the need for complex joins by intentionally introducing redundancy.

Why Use Denormalization?

Faster read performance.
Simplified querying for reports or dashboards.

Example: Instead of normalized tables:

Benefits of Denormalization:

Queries are simpler and faster.
Useful for read-heavy systems like reporting databases.

Drawbacks of Denormalization:

Increased storage usage.
Potential for data inconsistencies during updates

Normalization vs. Denormalization

When to Use Normalization or Denormalization?

Use Normalization:

When data integrity and consistency are critical.
For systems with frequent data updates (e.g., transactional systems).

Use Denormalization:

When performance is a priority (e.g., analytical systems, reporting).
For systems with frequent reads and few updates.

Conclusion

Normalization and Denormalization are complementary techniques for structuring a database. Normalization ensures data consistency and eliminates redundancy, while Denormalization improves performance in read-heavy scenarios. Choosing the right approach depends on the specific requirements of your system. By mastering these concepts, you can design efficient and reliable databases.

Understanding Database Constraints: Ensuring Data Integrity in SQL

Tilak Mudgal — Mon, 23 Dec 2024 07:48:03 GMT

Database constraints are essential tools for maintaining data integrity and enforcing rules in SQL. They define rules for the data in a table, ensuring it remains accurate, consistent, and reliable. In this blog, we’ll cover key constraints like Primary Key, Foreign Key, Unique, Not Null, Check, and Default, with examples to illustrate their use.

1. Primary Key

A Primary Key uniquely identifies each row in a table. It cannot contain NULL values, and the values in the column(s) must be unique.

Use Case: Ensuring that each row in a table has a unique identifier.

Example:

CREATE TABLE employees (
    employee_id INT PRIMARY KEY,
    name VARCHAR(50),
    salary DECIMAL(10, 2)
);

Explanation:

The employee_id column is the primary key and must contain unique, non-null values.

2. Foreign Key

A Foreign Key enforces a relationship between two tables. It ensures that the value in a column (or columns) matches a value in another table’s primary key.

Use Case: Maintaining referential integrity between related tables.

Example:

CREATE TABLE departments (
    department_id INT PRIMARY KEY,
    department_name VARCHAR(50)
);

CREATE TABLE employees (
    employee_id INT PRIMARY KEY,
    name VARCHAR(50),
    department_id INT,
    FOREIGN KEY (department_id) REFERENCES departments(department_id)
);

Explanation:

The department_id column in the employees table references the department_id column in the departmentstable.
Ensures that each employee belongs to a valid department.

3. Unique

The Unique constraint ensures that all values in a column are unique. Unlike the primary key, a table can have multiple unique constraints.

Use Case: Enforcing uniqueness for specific columns, such as email addresses.

Example:

CREATE TABLE users (
    user_id INT PRIMARY KEY,
    email VARCHAR(100) UNIQUE,
    username VARCHAR(50) UNIQUE
);

Explanation:

Both email and username must have unique values across all rows in the users table.

4. Not Null

The Not Null constraint ensures that a column cannot have NULL values.

Use Case: Enforcing mandatory fields, such as names or dates.

Example:

CREATE TABLE employees (
    employee_id INT PRIMARY KEY,
    name VARCHAR(50) NOT NULL,
    hire_date DATE NOT NULL
);

Explanation:

Both name and hire_date must have non-null values.

5. Check

The Check constraint ensures that a column’s value meets a specified condition.

Use Case: Validating data, such as ensuring salaries are within a valid range.

Example:

CREATE TABLE employees (
    employee_id INT PRIMARY KEY,
    name VARCHAR(50),
    salary DECIMAL(10, 2),
    CHECK (salary >= 30000 AND salary <= 200000)
);

Explanation:

The salary column must have a value between 30,000 and 200,000.

6. Default

The Default constraint sets a default value for a column when no value is provided.

Use Case: Providing default values for optional fields.

Example:

CREATE TABLE employees (
    employee_id INT PRIMARY KEY,
    name VARCHAR(50),
    hire_date DATE DEFAULT CURRENT_DATE
);

Explanation:

If no value is specified for hire_date, it defaults to the current date.

Combining Constraints

You can combine multiple constraints in a single column to enforce complex rules.

Example:

CREATE TABLE employees (
    employee_id INT PRIMARY KEY,
    name VARCHAR(50) NOT NULL,
    email VARCHAR(100) UNIQUE,
    salary DECIMAL(10, 2) CHECK (salary >= 30000 AND salary <= 200000),
    hire_date DATE DEFAULT CURRENT_DATE
);

Explanation:

employee_id is a primary key.
name cannot be null.
email must be unique.
salary is constrained to a valid range.
hire_date defaults to the current date if not provided.

Why Use Constraints?

Ensure Data Integrity: Prevent invalid or inconsistent data from entering the database.
Reduce Errors: Enforce business rules directly in the database.
Simplify Application Logic: Move validation rules from application code to the database.

Best Practices

Plan Constraints Carefully: Analyze your data requirements before defining constraints.
Use Descriptive Names: When naming constraints (e.g., in ALTER TABLE), use meaningful names like chk_salary_range.
Test Thoroughly: Ensure that constraints don’t conflict with valid data during insert or update operations.
Document Constraints: Clearly document the purpose of each constraint for future reference.

Conclusion

Database constraints are vital for maintaining the integrity, accuracy, and reliability of your data. By using constraints like Primary Key, Foreign Key, Unique, Not Null, Check, and Default, you can enforce rules directly at the database level, ensuring that your data adheres to your business requirements. Start incorporating these constraints into your database designs to create robust and error-resistant systems.

Window Functions in SQL: A Complete Guide

Tilak Mudgal — Mon, 23 Dec 2024 07:44:17 GMT

Window functions are powerful tools in SQL that allow you to perform calculations across a subset of rows related to the current row, without collapsing the data into a single result set. Unlike aggregate functions like SUM() or COUNT(), which return a single value for a group of rows, window functions retain the individual rows while adding additional analytical insights.

In this guide, we’ll explain what window functions are, their common use cases, and walk through examples of popular functions such as ROW_NUMBER(), RANK(), DENSE_RANK(), NTILE(), LEAD(), and LAG().

What Are Window Functions?

A window function performs a calculation across a set of rows (called a window) defined by a PARTITION BY clause. This "window" can be the entire dataset or a subset of it. The results of window functions are returned for each row in the result set.

Syntax:

function_name(expression) OVER (
    [PARTITION BY column_name(s)]
    [ORDER BY column_name(s)]
)

PARTITION BY: Divides the result set into subsets (like groups). Optional.
ORDER BY: Specifies the order of rows within each partition.
Function Name: Determines the type of calculation (e.g., ranking, summing).

Key Window Functions

1. ROW_NUMBER()

Assigns a unique number to each row within a partition, starting at 1 for the first row.

Use Case: Generate unique row numbers for each row in a dataset or partition.

Example: Rank employees by hire date within each department.

SELECT department_id, employee_name, hire_date,
       ROW_NUMBER() OVER (PARTITION BY department_id ORDER BY hire_date) AS row_number
FROM employees;

Explanation:

PARTITION BY department_id: Groups rows by department.
ORDER BY hire_date: Assigns row numbers based on hire date within each department.

2. RANK()

Assigns a rank to each row within a partition. If there are ties, the same rank is given, and the next rank is skipped.

Use Case: Determine rankings with ties in a competition or scores.

Example: Rank employees by salary within each department.

SELECT department_id, employee_name, salary,
       RANK() OVER (PARTITION BY department_id ORDER BY salary DESC) AS rank
FROM employees;

Explanation:

PARTITION BY department_id: Groups rows by department.
ORDER BY salary DESC: Ranks employees by descending salary within each department.

3. DENSE_RANK()

Similar to RANK(), but without skipping ranks for ties.

Use Case: Assign continuous ranks even when there are ties.

Example:

SELECT department_id, employee_name, salary,
       DENSE_RANK() OVER (PARTITION BY department_id ORDER BY salary DESC) AS dense_rank
FROM employees;

Difference from RANK(): If two employees in department 1 have the same salary, both will be ranked 1, and the next rank will be 2 (not 3).

4. NTILE(n)

Divides rows into n equal-sized groups and assigns a bucket number to each row.

Use Case: Distribute rows into quantiles (e.g., quartiles, deciles).

Example: Divide employees into four salary quartiles.

SELECT employee_name, salary,
       NTILE(4) OVER (ORDER BY salary DESC) AS quartile
FROM employees;

Explanation: Rows are divided into four groups, with each group assigned a quartile (1 to 4).

5. LEAD() and LAG()

LEAD(): Accesses data from the next row in the result set.
LAG(): Accesses data from the previous row in the result set.

Use Case: Compare a value in one row to the previous or next row.

Example: Compare each employee’s salary to the next highest salary.

SELECT employee_name, salary,
       LEAD(salary) OVER (ORDER BY salary DESC) AS next_salary
FROM employees;

Explanation:

LEAD(salary): Returns the salary from the next row in descending order.

6. SUM() (or Other Aggregate Functions with a Window)

Calculates cumulative or running totals across a window.

Use Case: Calculate cumulative sales totals by month.

Example:

SELECT employee_name, salary,
       SUM(salary) OVER (PARTITION BY department_id ORDER BY salary DESC) AS cumulative_salary
FROM employees;

Explanation:

Calculates a running total of salaries within each department.

Best Practices for Window Functions

Understand the Dataset: Ensure you partition and order rows appropriately for meaningful results.
Avoid Overusing: Window functions can be computationally expensive; optimize queries with indexes.
Combine Functions: Use multiple window functions together for advanced analytics.

Conclusion

Window functions are invaluable for data analysis, providing deep insights while preserving individual rows in the result set. Whether ranking employees, calculating running totals, or comparing rows, window functions unlock new possibilities for SQL queries.