AnythingLLM:Bring Together All LLM Runner and All large Language Models-Part 01 Connect Koboldcpp with AnythingLLM.

CA Amit Singh
Free or Open Source software’s
4 min readJun 18, 2024

Learn to Connect Koboldcpp/Ollama/llamacpp/oobabooga LLM runnr/Databases/TTS/Search Engine & Run various large Language Models.

KoboldCpp is an easy-to-use AI text-generation software for GGML and GGUF models. It’s a single self contained distributable from Concedo, that builds off llama.cpp, and adds a versatile Kobold API endpoint, additional format support, Stable Diffusion image generation, backward compatibility, as well as a fancy UI with persistent stories, editing tools, save formats, memory, world info, author’s note, characters, scenarios and everything Kobold and Kobold Lite have to offer.

The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.

A full-stack application that enables you to turn any document, resource, or piece of content into context that any LLM can use as references during chatting. This application allows you to pick and choose which LLM or Vector Database you want to use as well as supporting multi-user management and permissions.

Product Overview

AnythingLLM is a full-stack application where you can use commercial off-the-shelf LLMs or popular open source LLMs and vectorDB solutions to build a private ChatGPT with no compromises that you can run locally as well as host remotely and be able to chat intelligently with any documents you provide it.

AnythingLLM divides your documents into objects called workspaces. A Workspace functions a lot like a thread, but with the addition of containerization of your documents. Workspaces can share documents, but they do not talk to each other so you can keep your context for each workspace clean.

Some cool features of AnythingLLM

  • Multi-user instance support and permissioning
  • Agents inside your workspace (browse the web, run code, etc)
  • Custom Embeddable Chat widget for your website
  • Multiple document type support (PDF, TXT, DOCX, etc)
  • Manage documents in your vector database from a simple UI
  • Two chat modes conversation and query. Conversation retains previous questions and amendments. Query is simple QA against your documents
  • In-chat citations
  • 100% Cloud deployment ready.
  • “Bring your own LLM” model.
  • Extremely efficient cost-saving measures for managing very large documents. You’ll never pay to embed a massive document or transcript more than once. 90% more cost effective than other document chatbot solutions.
  • Full Developer API for custom integrations!

Supported LLMs, Embedder Models, Speech models, and Vector Databases

Language Learning Models:

  • Any open-source llama.cpp compatible model
  • OpenAI
  • OpenAI (Generic)
  • Azure OpenAI
  • Anthropic
  • Google Gemini Pro
  • Hugging Face (chat models)
  • Ollama (chat models)
  • LM Studio (all models)
  • LocalAi (all models)
  • Together AI (chat models)
  • Perplexity (chat models)
  • OpenRouter (chat models)
  • Mistral
  • Groq
  • Cohere
  • KoboldCPP
  • LiteLLM
  • Text Generation Web UI

Embedder models:

  • AnythingLLM Native Embedder (default)
  • OpenAI
  • Azure OpenAI
  • LocalAi (all)
  • Ollama (all)
  • LM Studio (all)
  • Cohere

Audio Transcription models:

  • AnythingLLM Built-in (default)
  • OpenAI

TTS (text-to-speech) support:

  • Native Browser Built-in (default)
  • OpenAI TTS
  • ElevenLabs

STT (speech-to-text) support:

  • Native Browser Built-in (default)

Vector Databases:

  • LanceDB (default)
  • Astra DB
  • Pinecone
  • Chroma
  • Weaviate
  • Qdrant
  • Milvus
  • Zilliz

**Installation of AnythingLLM and Connection with Ollama in coming in Next article**

Now How to Connect Koboldcpp with AnythingLLM.

Step 01. Click on workspace

Step 02: Input workspace name as per your choice.

Step 03: Click on chat settings and then workspace LLM Provider.

Step 04: Now choose Koboldcpp as LLM Provider

Step 05: Now enter base URL as shown below

htto://localhost:5001/vI/

Step 06: Now click on default below workspace and start asking question and expect fast replies.

Step 07: Here is expected results.

Here is Quick Youtube video for Visual reference

Stay tuned for further updates on anythingLLM.

--

--

CA Amit Singh
Free or Open Source software’s

Qualified Chartered Accountant & Multi Technology Trainer with 24 yrs of Multi Technology/ Multi Industry Experience. www.linkedin.com/in/ca-amit-singh-07babb