Pinned | Published in Towards AI | Advanced RAG 02: Unveiling PDF Parsing | Including key points, diagrams, and code | Feb 2, 2024 | 23 responses
Pinned | Advanced RAG 06: Exploring Query Rewriting | A key technique for aligning the semantics of queries and documents | Mar 4, 2024 | 6 responses
Pinned | Published in AI Advances | Demystifying PDF Parsing 02: Pipeline-Based Method | Overview, Implementation Strategies and Insights | May 21, 2024 | 1 response
Pinned | Published in AI Advances | Demystifying PDF Parsing 03: OCR-Free Small Model-Based Method | Overview, Principles and Insights | Jun 1, 2024 | 2 responses
Published in AI Exploration Journey | Time and Cost to Train a 1B Model on 1T Tokens? — AI Interview Questions 01 | Ever wondered how long it actually takes to train a 1B-parameter language model on a trillion tokens? It’s a simple question on the surface | 1h ago
Published in AI Exploration Journey | The Logic Behind OCRFlux — AI Innovations and Insights 57 | Traditional OCR tools often fall apart when dealing with content that spans across pages. The core feature of OCRFlux is its ability to… | 3d ago | 1 response
Published in AI Exploration Journey | RAG + Reasoning is the Bridge to Human-Like Intelligence — AI Innovations and Insights 56 | By integrating reasoning capabilities, RAG evolves beyond a simple retrieval patch — it becomes an intelligent architecture with a built-in | Jul 12
Published in AI Exploration Journey | From Retrieval to Reasoning: The Next-Gen AI Search Paradigm — AI Innovations and Insights 55 | As the volume of data and knowledge we interact with continues to explode, traditional search engines are increasingly falling short — | Jul 8 | 1 response
Published in AI Exploration Journey | MonkeyOCR: 3B Model Outperforms Industry Giants in Document Parsing — AI Innovations and Insights… | Document parsing is a core technology that converts unstructured, multimodal content — such as text, tables, images, and mathematical formu | Jun 30
Published in AI Exploration Journey | SimpleDoc: Summary-Driven, Memory-Augmented Multimodal QA — AI Innovations and Insights 52 | Document Visual Question Answering (DocVQA) is about answering questions based on multi-modal documents that mix text, tables, and images — | Jun 25