TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

The Ultimate Guide to Training BERT from Scratch: Introduction

10 min read · Sep 2, 2023

Photo by Ryan Wallace on Unsplash

Part II and Part III of this story are now live.

A few weeks ago, I trained and deployed my very own question-answering system using Retrieval Augmented Generation (RAG). The goal was to build such a system over my study notes and create an agent to help me connect the dots. LangChain truly shines in these specific types of applications.

As the system’s quality blew me away, I couldn’t help but dig deeper to understand the wizardry under the hood. One key feature of a RAG pipeline is its ability to sift through mountains of information and find the context most relevant to a user’s query. It sounds complex, but it starts with a simple yet powerful process: encoding sentences into information-dense vectors.
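To make the idea concrete, here is a minimal sketch of retrieval by vector similarity. It is only an illustration under loud assumptions: the `embed`, `cosine`, and `retrieve` helpers and the sample notes are hypothetical, and the word-count vector is a toy stand-in for a real sentence encoder, not what a production RAG pipeline uses.

```python
import math
from collections import Counter


def embed(text, vocab):
    """Toy stand-in for a sentence encoder: a word-count vector over a fixed vocabulary."""
    counts = Counter(text.lower().split())
    return [float(counts[word]) for word in vocab]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def retrieve(query, notes):
    """Return the note whose vector is closest to the query vector."""
    # Build a shared vocabulary so every text maps into the same vector space.
    vocab = sorted({word for text in notes + [query] for word in text.lower().split()})
    query_vec = embed(query, vocab)
    return max(notes, key=lambda note: cosine(query_vec, embed(note, vocab)))


# Hypothetical study notes, made up for this sketch.
notes = [
    "gradient descent updates weights using the loss gradient",
    "transformers use attention to mix token information",
    "a hash map stores key value pairs",
]

best = retrieve("how does attention work in transformers", notes)
print(best)
```

A learned sentence encoder would replace `embed` with dense vectors that capture meaning rather than exact word overlap, but the retrieval step (embed the query, score it against every stored vector, keep the best match) keeps the same shape.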

The most popular way to create these sentence embeddings for free is none other than SBERT, a sentence transformer built…

Published in TDS Archive

Written by Dimitris Poulopoulos

Machine Learning Engineer. I talk about AI, MLOps, and Python programming. More about me: www.dimpo.me
