Venkata Dikshit – Medium

Venkata Dikshit

Venkata Dikshit

LoRA: The Underrated Key to Enterprise AI Efficiency

Enterprise AI: Progress and Roadblocks

Sep 21

LoRA: The Underrated Key to Enterprise AI Efficiency

Sep 21

Venkata Dikshit

LoRA: The Underrated Key to Enterprise AI Efficiency

Enterprise AI: Progress and Roadblocks

Sep 19

LoRA: The Underrated Key to Enterprise AI Efficiency

Sep 19

Venkata Dikshit

Llama3 is Awesome

Llama3 is crushing it, and guess what? It’s open source. The technical report they dropped has everything you need — every trick and…

Aug 14

Llama3 is Awesome

Aug 14

Venkata Dikshit

Emergent Abilities in Large Language Models

Language models, particularly those in the GPT family, have experienced a fascinating evolution over the past six years. This progression…

Jun 9, 2023

Emergent Abilities in Large Language Models

Jun 9, 2023

Venkata Dikshit

From GPT to GPT-4: Tracing the Transformative Journey of GPT

As an ML practitioner, I have witnessed the evolution of GPT over the past five years. GPT marked an intriguing paradigm shift when it was…

Apr 23, 2023

From GPT to GPT-4: Tracing the Transformative Journey of GPT

Apr 23, 2023

Venkata Dikshit
in
Analytics Vidhya

GPT-3: Whats/Hows/Where

When I first heard about GPT-3, my first impression was that it must be GPT-2 + more compute + more data. This isn’t a bad expectation…

Jul 28, 2020

GPT-3: Whats/Hows/Where

Jul 28, 2020

Venkata Dikshit
in
ETHER Labs

[Part-2] Which Attention(architecture) do you need?

Overview of recent advances in Transformer architectures for NLP tasks

Aug 13, 2019

At EtherMeet (www.etherlabs.io), we apply advanced AI to make video conferences smarter and more relevant.

Aug 13, 2019

Venkata Dikshit
in
ETHER Labs

[Part-1] Which Attention(architecture) do you need?

Overview of recent advances in Transformer architectures for NLP tasks

Aug 5, 2019

[Part-1] Which Attention(architecture) do you need?

Aug 5, 2019

Venkata Dikshit
in
ETHER Labs

BERT for unsupervised text tasks

This post discusses how we use BERT and similar self-attention architectures to address various text crunching tasks at Ether Labs.

Jul 18, 2019

BERT for unsupervised text tasks

Jul 18, 2019

Venkata Dikshit

Venkata Dikshit

Principal ML Engineer @ SolarWinds

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams