Posts by Shravan Kumar

Meta Llama 3: The most capable openly available LLM to date and its applications (Apr 26)
Boom in the AI World: Llama 3 is Here!

A deep dive into Tokenization (Mar 20)
Recall that language modelling involves computing probabilities over a sequence of tokens.

Gemma: Introducing a new state-of-the-art open model by Google (Feb 23)
Gemma is a family of lightweight, state-of-the-art open models built from the same research and technology used to create the Gemini…

Bidirectional Encoder Representations from Transformers (BERT) (Dec 17, 2023)
In my previous blogs we covered an overview of the Generative Pretrained Transformer, as well as a blog on the Generative Pretrained Transformer…

Decoding Strategies of All Decoder-only Models (GPT) (Dec 10, 2023)
In my previous blogs we covered an overview of the Generative Pretrained Transformer, as well as a blog on the Generative Pretrained…

Generative Pretrained Transformer (GPT) — Pre-training, Fine-Tuning & Different Use Case… (Nov 27, 2023)
In the previous blog we covered an overview of the Generative Pretrained Transformer. Now let us look at super important topics on…

Generative Pretrained Transformer (GPT) (Nov 25, 2023)
A primer on the decoder-only model — causal language modelling

Introduction to Language Modelling (Nov 22, 2023)
In my previous blog we learned about the components of the transformer architecture in the context of machine translation.

Transformers: Attention is All You Need — Layer Normalization (Nov 16, 2023)
There are two major concepts we are going to discuss here…

Transformers: Attention is All You Need — Positional Encoding (Nov 9, 2023)
Please refer to the blogs below before reading this: