- Arzu Caner, "AI DevCamp Notes: Deep Learning (Week 3)" (Jul 11): In Week 3 of AI DevCamp, we delved into deep learning, building on the data and concepts covered in the first two weeks to make this…
- Eduardo Ordax, "Fine tuning Vs Pre-training" (Jan 15): The objective of my articles is to ensure clarity and simplicity in technical explanations. To achieve this, I will skip over certain…
- Subrata Goswami, "Pre-training Mini Versions of LLMs — GPT and Llama3" (Jun 17): This blog goes over how to pre-train small versions of the leading open-source Large Language Models (LLMs). Here 3 models are covered — 2…
- Anastasia Tzeveleka, "LLM domain adaptation using continued pre-training — Part 3/4" (May 9): Exploring domain adaptation via continued pre-training for large language models (LLMs)? This 4-part series answers the most common…
- Vivek Madan, "LLM End-to-End & Resources Part 2 — Pre-training" (Jun 3): In the previous post, we saw the model architecture recipe for large language models. In this post, we will discuss the first stage of…
- maadaa.ai, "Understanding Multimodal LLMs and Video Language Pre-training: Key Progress, Applications, Methods…" (Mar 14): How to utilize video and corresponding weak captions to perform representation learning has recently become a hot topic.
- Abdullah Şamil Güser, "CLIP: Contrastive Language-Image Pretraining" (May 5): Paper by Alec Radford, Ilya Sutskever et al. from OpenAI.
- Amanpreet (in the Ai2 Blog), "SPECTER2: Adapting Scientific Document Embeddings to Multiple Fields and Task Formats" (Nov 27, 2023): SPECTER2 is a new scientific embedding model trained on 9 tasks across classification, regression, and retrieval.