Venkata Dikshit – Medium

Venkata Dikshit

Emergent Abilities in Large Language Models

Language models, particularly those in the GPT family, have experienced a fascinating evolution over the past six years. This progression…

6 min read · Jun 9, 2023

Venkata Dikshit

From GPT to GPT-4: Tracing the Transformative Journey of GPT

As an ML practitioner, I have witnessed the evolution of GPT over the past five years. GPT marked an intriguing paradigm shift when it was…

6 min read · Apr 23, 2023

Venkata Dikshit in Analytics Vidhya

GPT-3: Whats/Hows/Where

When I first heard about GPT-3, my first impression was that it must be GPT-2 + more compute + more data. This isn’t a bad expectation…

7 min read · Jul 28, 2020

Venkata Dikshit in ETHER Labs

[Part-2] Which Attention(architecture) do you need?

Overview of recent advances in Transformer architectures for NLP tasks

7 min read · Aug 13, 2019

At EtherMeet (www.etherlabs.io), we apply advanced AI to make video conferences smarter and more relevant.

Venkata Dikshit in ETHER Labs

[Part-1] Which Attention(architecture) do you need?

Overview of recent advances in Transformer architectures for NLP tasks

6 min read · Aug 5, 2019

Venkata Dikshit in ETHER Labs

BERT for unsupervised text tasks

This post discusses how we use BERT and similar self-attention architectures to address various text crunching tasks at Ether Labs.

6 min read · Jul 18, 2019

Venkata Dikshit

Principal ML Engineer @ SolarWinds
