PinnedPublished inAI AdvancesBenchmarking PDF to Markdown Document ConvertersTesting 5 different Python Markdown converters on a benchmark PDF document for better RAG results with LLMsFeb 917Feb 917
PinnedPublished inTDS ArchiveHow to Use Hybrid Search for Better LLM RAG RetrievalBuilding an advanced local LLM RAG pipeline by combining dense embeddings with BM25Aug 11, 20245Aug 11, 20245
PinnedPublished inTDS ArchiveHow to Use Re-Ranking for Better LLM RAG RetrievalBuilding an advanced local LLM RAG pipeline with two-step retrieval using open-source bi-encoders and cross-encodersMay 2, 20246May 2, 20246
Published inTowards AII Used ChatGPT to Count My CaloriesComparing my calorie count to the AI-generated estimates from ChatGPT-4o with different prompts in a self-experimentFeb 4Feb 4
Published inTDS ArchiveHow to Use Generative AI as a Software DeveloperBest practices for using AI tools to efficiently write higher quality, production-ready codeJan 31Jan 31
Published inTowards AITop 11 Publications on Medium for Data Science, Machine Learning, and AI In 2025A ranking of AI and Data Science publications based on combined Medium and social media followersJan 33Jan 33
Published inTDS ArchiveHow to Evaluate Multilingual LLMs in Any LanguageEvaluation of language-specific LLM accuracy on the global Massive Multitask Language Understanding (Global-MMLU) benchmark in PythonDec 9, 2024Dec 9, 2024
Published inTDS ArchiveImproved RAG Document Processing With MarkdownHow to read and convert PDFs to Markdown for better RAG results with LLMsNov 19, 202412Nov 19, 202412
Published inTDS ArchiveHow to Create a RAG Evaluation Dataset From DocumentsAutomatically create domain-specific datasets in any language using LLMsNov 3, 20247Nov 3, 20247
Published inTDS ArchiveRevisiting Karpathy’s “State of Computer Vision and AI”Looking back at AI progress since the 2012 blog post “The state of Computer Vision and AI: we are really, really far away”Oct 18, 20248Oct 18, 20248