Inside LlamaIndex v1.0 Workflows: Orchestrating RAG with Human-in-the-Loop Workflows as MCP.
This blog is a comprehensive exploration of building sophisticated Retrieval-Augmented Generation (RAG) workflows enhanced by…
3d ago
Published in Stackademic
Layered Memory Architecture for Intelligent AI Agents: From Fast Context to Deep Knowledge
While designing intelligent AI agents, I repeatedly ran into the same challenge: how can an agent remember just enough to act swiftly…
Jun 22
Working with Native Qdrant nodes in n8n workflows
With the launch of native Qdrant nodes in n8n, I built a fully automated RAG pipeline that shows just how powerful and seamless vector DB…
Jun 15
C2A Orchestration: Controlled & Customizable Agent2Agent Platform
In this fast-evolving landscape of autonomous AI agents, interoperability and controlled orchestration have become critical requirements —…
Jun 8
Ranking Matters: An Experimental Dive into Advanced RAG with Re-Rankers using LlamaIndex & Qdrant.
In this blog article, we investigate the pivotal role of re-ranking in enhancing Retrieval-Augmented Generation (RAG) pipelines. Using…
May 25
Seamless Data Streaming with Kafka and Qdrant: Installation, Setup, and Application Guide
In this guide, I will walk you through the detailed steps of installing and setting up the Qdrant Sink Connector, building the necessary…
May 24
Creating and Deploying Memory-Efficient Medical Agents Using Agno, Qdrant, MongoDB & LiteLLM.
This project implements two domain-specific agents (medical and legal) that split short-term conversational state (stored in MongoDB)…
May 12
Building Trustworthy AI Agents: The Role of Accuracy, Performance, and Reliability
In the current phase of autonomous AI agents, where systems are designed to reason, decide, and act independently, one critical aspect…
Apr 25
Building Advanced Reasoning Agent Teams with Centralized Prompt Management using Agno and mlflow
This article shows how to create a team of reasoning agents that can solve tough problems the way a human does. Not only…
Apr 15
Navigating Llama 4 Deployment: H100 SXM vs NVL vs PCIe, vLLM Compile Cache, and Real-World GPU…
Large language models are pushing the limits of today’s hardware. Meta’s new Llama 4 Scout (a 17B parameter model with 16 experts…
Apr 11