Sherlock Xu | Serving A LlamaIndex RAG App as REST APIs | Build and serve a LlamaIndex RAG app as REST APIs with BentoML. | May 28
Sherlock Xu in BentoML | Building RAG with Open-Source and Custom AI Models | Retrieval-Augmented Generation (RAG) is a widely used application pattern for Large Language Models (LLMs). It uses information retrieval… | May 6
Sherlock Xu in BentoML | A Guide to Open-Source Image Generation Models | Understand open-source image generation models and find answers to frequently asked questions about them. | Mar 28
Sherlock Xu in BentoML | Deploying A Large Language Model with BentoML and vLLM | Build an LLM application with vLLM for enhanced efficiency and deploy it on BentoCloud for scalable, efficient AI solutions in the cloud. | Mar 22
Sherlock Xu in BentoML | Navigating the World of Large Language Models | Explore the most popular open-source large language models and find answers to common questions in using them. | Mar 22
Sherlock Xu in BentoML | Deploying Stable Diffusion XL with Latent Consistency Model LoRAs on BentoCloud | Accelerate image generation with LCM LoRAs on BentoCloud | Feb 29
Sherlock Xu in BentoML | Understanding Retrieval-Augmented Generation: Part 2 | Understand the practical applications of RAG, design ideas for a RAG system, and the prospect of this technology. | Feb 1
Sherlock Xu in BentoML | Understanding Retrieval-Augmented Generation: Part 1 | Learn how Retrieval-Augmented Generation (RAG) transforms AI, enhancing language models with dynamic, external data access. | Jan 25
Sherlock Xu in BentoML | The New AI Landscape: 10 Predictions in 2024 | 10 AI predictions for 2024, covering multimodal models, open-source AI, GPU democratization, and more in the rapidly evolving AI | Jan 18