Dr. Leon Eversberg – Medium

Dr. Leon Eversberg
in
Towards Data Science

How to Use Hybrid Search for Better LLM RAG Retrieval

Building an advanced local LLM RAG pipeline by combining dense embeddings with BM25

Aug 11

Dr. Leon Eversberg
in
Towards Data Science

How to Use Re-Ranking for Better LLM RAG Retrieval

Building an advanced local LLM RAG pipeline with two-step retrieval using open-source bi-encoders and cross-encoders

May 2

An image of an advanced RAG pipeline with two-step retrieval. First, a bi-encoder is used to find similar embedding vectors. Then, a cross-encoder model is used to narrow these candidates down to the top k most relevant documents.

Dr. Leon Eversberg
in
Towards Data Science

How to Improve LLM Responses With Better Sampling Parameters

A deep dive into stochastic decoding with temperature, top_p, top_k, and min_p

Sep 2

Four charts showing the probability distributions for different temperature values: T=0.1, T=1.0, T=1.5, and T=2.0. The higher the temperature, the flatter the distribution.

Dr. Leon Eversberg
in
Towards Data Science

How to Reduce Embedding Size and Increase RAG Retrieval Speed

Flexible text embedding with Matryoshka Representation Learning (MRL)

May 26

A picture of Matryoshka dolls, where each doll is nested inside another.

Dr. Leon Eversberg
in
Towards Data Science

Safeguard Your LLM Chatbot With Llama Guard 2

How to apply content moderation to your LLM’s inputs and outputs for a more responsible AI system

May 13

A visualization of Llama Guard

Dr. Leon Eversberg
in
Towards Data Science

How to Build a Local Open-Source LLM Chatbot With RAG

Talking to PDF documents with Google’s Gemma-2b-it, LangChain, and Streamlit

Mar 31

An overview of the RAG pipeline. For document storage: input documents -> text chunks -> encoder model -> vector database. For LLM prompting: user question -> encoder model -> vector database -> top-k relevant chunks -> generator LLM. The LLM then answers the question using the retrieved context.

Dr. Leon Eversberg
in
Towards Data Science

How To Generate Synthetic Images For Object Detection Tasks

A step-by-step tutorial using Blender, Python, and 3D Assets

Mar 8

Dr. Leon Eversberg
in
Towards AI

Size Matters: How Big Is Too Big for An LLM?

Compute-optimal large language models according to the Chinchilla paper

Feb 24

The evolution of GPTs over time: GPT-1 has 117M parameters, GPT-2 has 1.5B parameters, GPT-3 has 175B parameters, and GPT-4 is estimated to have more than 1T parameters.

Dr. Leon Eversberg
in
Towards AI

How to Build Your Own LLM Coding Assistant With Code Llama

Creating a local LLM-chatbot with CodeLlama-7b-Instruct-hf and Streamlit

Feb 14

An image of the Code Llama chatbot front end

Dr. Leon Eversberg
in
Towards AI

Which Open-Source LLM Should You Choose in 2024?

Since the 2017 paper “Attention Is All You Need” invented the Transformer architecture, natural language processing (NLP) has seen…

Feb 6

Large language models (LLMs) are evolving.

Dr. Leon Eversberg

🤖 Machine Learning PhD | AI Software Engineer | Research & Development Specialist | Data Scientist | LLM Enthusiast
