The LLM App Stack — May 2024

The tools you need to know, what they do, and how they’re different

Yujian Tang
Plain Simple Software

--

LLM App Stack — Updated for May

It’s not accurate or fair to say that LLMs will change the world. They already have. — Yujian Tang


2023 showed us the rise of LLMs and their applications. Retrieval Augmented Generation (RAG) emerged as the most popular use case. RAG consists of retrieving data from your vector database and injecting it into the prompt as context, so the LLM can generate a human-readable response grounded in that data.
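To make the retrieve-then-inject flow concrete, here is a minimal sketch in Python. It is not how any particular tool in this article works: the `embed` function is a toy bag-of-words stand-in for a real embedding model, the in-memory list `DOCS` stands in for a vector database, and the function names are hypothetical. The final call to an LLM is left out; the sketch stops at the prompt you would send.

```python
import math
from collections import Counter

def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(query, documents, k=1):
    """Return the k documents most similar to the query."""
    query_vec = embed(query)
    ranked = sorted(documents, key=lambda d: cosine(query_vec, embed(d)), reverse=True)
    return ranked[:k]

def build_prompt(query, documents, k=1):
    """Inject the retrieved documents into the prompt as context."""
    context = "\n".join(retrieve(query, documents, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

# Hypothetical in-memory "vector database" of two documents.
DOCS = [
    "Vector databases store embeddings for similarity search.",
    "Orchestration frameworks chain LLM calls together.",
]

prompt = build_prompt("What do vector databases store?", DOCS)
print(prompt)
```

In a real app, the embedding model, the vector database, and the LLM call would each be one of the components covered in the categories below; the overall shape (embed, search, build the prompt, generate) stays the same.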

2024 is going to bring more of the same, as well as advances in the way we implement these technologies. We will see more LLM apps built, and more of them will take on production-grade concerns. These include, but are not limited to, observability, data versioning, and enterprise features on top of the basic pieces.

As of the March 2024 update, this article contains 67 companies in 8 categories:

  • LLMs
  • LLM Providers
  • Vector Databases
  • Embedding Models
  • Orchestration
  • Quality Tuning
  • Infrastructure

--
