Demystifying AI Agents in 2025Preface: In my work advising organizations on AI solutions, I often see misconceptions about what AI agents can really do. Some dismiss…Mar 2Mar 2
Knowledge Base Engineering Guide — Ingestion and StorageThe initial wave of GenAI investments focused heavily on rapid prototyping and experimentation. Many organizations found their early…Jan 28Jan 28
How Complicated Can RAG Queries Get? A Pragmatic Look at Its LimitsIn the ever-evolving landscape of data management and analysis, we’re witnessing a significant shift in how organizations extract insights…Oct 14, 2024Oct 14, 2024
How to Determine If You Will Benefit from Fine-Tuning an LLMIn the fast-paced world of Generative AI, pre-trained large language models (LLMs) have become go-to tools for a wide range of…Jun 23, 2024Jun 23, 2024
To Retrieve or Extend? Key Considerations and Research Insights on Using RAG and Long-Context LLMsA significant development in large language models (LLMs) is the expansion of context windows — the span of text a model can consider at…Apr 21, 2024Apr 21, 2024
How to prepare an instruction dataset to fine-tune LLM?Fine-tuning large language models (LLMs) on custom datasets is a popular technique to adapt these powerful models for specific downstream…Mar 4, 2024Mar 4, 2024
Create meaningful representations of data for RAGIn the evolving landscape of generative AI, retrieval augmented generation (RAG) has demonstrated immense promise for open-domain question…Feb 4, 2024Feb 4, 2024
Custom metrics for instruction fine-tuning of LLMsWhen fine-tuning large language models (LLMs) on downstream tasks, we often rely too much on generic metrics like loss/perplexity. While…Jan 1, 2024Jan 1, 2024
Your RAG Needs Some ScaffoldingUpdated in 2025 March: RAG is still not a fully solved problem, and therefore this article is still relevant. I updated the blog to reflect…Oct 15, 20231Oct 15, 20231
LLM as Knowledge Base v.s. LLM with Knowledge RetrievalUpdates on Feb 2025: LLM with Knowledge Retrieval (aka RAG) has been proven as a more sustainable approach for majority of the use cases…Sep 17, 2023Sep 17, 2023