VJ Anand – Medium

Rotary Positional Embedding

LLaMa 2.0 Architecture

Jan 31

Sequential Modeling for Reinforcement Learning

The idea of using transformers for different domains is gaining popularity. Academic literature reports experimentation or studies using…

Jan 21

Evidence Lower Bound

The Evidence Lower Bound (ELBO) is a loss function used for variational inference. Networks that perform variational inference are also…

Dec 4, 2023

Understanding Entropy, KL-Divergence

Deriving KL-Divergence from first principles. Let X be a random variable that can take one of the following n states

Dec 4, 2023

Stochastic Gradient Langevin Dynamics

This paper introduced the concept of Bayesian learning using Langevin dynamics. In this short write-up I will discuss some of the…

Dec 2, 2023

Reinforcement Learning in the context of LLM

Introduction

Jul 6, 2023

In-Context Learning in Large Language Models

We have all been amazed at how large language models (LLMs) like GPT3/4/ChatGPT are able to perform tasks that they have never seen before, or…

May 29, 2023

Self Supervised Learning

Oftentimes we encounter situations where we don’t have enough labeled data. What can we do in such cases? The area of self-supervised learning is…

Aug 17, 2022

Restricted Boltzmann Machine

Understanding the Restricted Boltzmann Machine (RBM) architecture and its training and learning method is foundational to gaining better insight into…

Jul 11, 2022
