Sherlock XuinBentoMLA Guide to Model CompositionGain an overview of model composition to build compound AI systems.Jul 30Jul 30
Sherlock XuServing A LlamaIndex RAG App as REST APIsBuild and serve a LlamaIndex RAG app as REST APIs with BentoML.May 28May 28
Sherlock XuinBentoMLBuilding RAG with Open-Source and Custom AI ModelsRetrieval-Augmented Generation (RAG) is a widely used application pattern for Large Language Models (LLMs). It uses information retrieval…May 6May 6
Sherlock XuinBentoMLA Guide to Open-Source Image Generation ModelsUnderstand open-source image generation models and find answers to frequently asked questions about them.Mar 281Mar 281
Sherlock XuinBentoMLDeploying A Large Language Model with BentoML and vLLMBuild an LLM application with vLLM for enhanced efficiency and deploy it on BentoCloud for scalable, efficient AI solutions in the cloud.Mar 22Mar 22
Sherlock XuinBentoMLNavigating the World of Large Language ModelsExplore the most popular open-source large language models and find answers to common questions in using them.Mar 22Mar 22
Sherlock XuinBentoMLDeploying Stable Diffusion XL with Latent Consistency Model LoRAs on BentoCloudAccelerate image generation with LCM LoRAs on BentoCloudFeb 29Feb 29
Sherlock XuinBentoMLUnderstanding Retrieval-Augmented Generation: Part 2Understand the practical applications of RAG, design ideas for a RAG system, and the prospect of this technology.Feb 1Feb 1
Sherlock XuinBentoMLUnderstanding Retrieval-Augmented Generation: Part 1Learn how Retrieval-Augmented Generation (RAG) transforms AI, enhancing language models with dynamic, external data access.Jan 25Jan 25