Sivanarayana Mamidi

RLHF (Sep 23)
Reinforcement Learning from Human Feedback is a method used to improve the performance of AI models like LLaMA by training them to align…
HyDE approach to context optimization (Sep 5)
The HyDE approach to context optimization in large language models (LLMs) is a method designed to manage and enhance the effectiveness of the…
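The core HyDE (Hypothetical Document Embeddings) idea can be sketched in a few lines: instead of embedding the user's query directly, embed a hypothetical answer to it and retrieve real documents near that embedding. This is a minimal sketch with a toy bag-of-words embedding and a stubbed generator; a real pipeline would use an LLM for generation and a neural sentence encoder for embeddings.

```python
from collections import Counter
import math

def embed(text):
    # Toy bag-of-words embedding; a real HyDE pipeline would use a
    # neural sentence encoder here.
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def hyde_retrieve(query, corpus, generate):
    # Core HyDE step: embed a *hypothetical answer* to the query,
    # not the query itself, then rank real documents against it.
    hypothetical_doc = generate(query)
    q_vec = embed(hypothetical_doc)
    return max(corpus, key=lambda d: cosine(q_vec, embed(d)))

# Generator stub standing in for an LLM call (hypothetical example).
fake_llm = lambda q: "horizontal scaling adds more service instances to handle load"

corpus = [
    "horizontal scaling means adding more instances of a service",
    "chocolate cake recipes for beginners",
]
print(hyde_retrieve("how do I scale my API?", corpus, fake_llm))
```

The hypothetical document shares vocabulary with the relevant corpus entry even though the raw query does not, which is exactly the gap HyDE is meant to bridge.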
LLM on Multi_inference (Aug 31)
Horizontal scaling means adding more instances or pods of a service to handle more load or requests. By running several instances of the…
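Horizontal scaling as described above can be illustrated with a minimal round-robin dispatcher, a sketch assuming identical replicas (the pod names are made up); real deployments would use a load balancer such as a Kubernetes Service rather than hand-rolled routing.

```python
import itertools

class RoundRobinBalancer:
    # Minimal sketch of horizontal scaling: requests are spread
    # evenly across identical replicas of an inference service.
    def __init__(self, replicas):
        self._cycle = itertools.cycle(replicas)

    def route(self, request):
        # Pick the next replica in rotation for this request.
        replica = next(self._cycle)
        return replica, request

balancer = RoundRobinBalancer(["pod-0", "pod-1", "pod-2"])
routed = [balancer.route(f"req-{i}")[0] for i in range(6)]
print(routed)
```

With three replicas, each pod receives every third request, so aggregate throughput grows roughly linearly with the number of pods.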
MULTI INFERENCE of LLM (Aug 31)
An LLM can handle multiple users simultaneously through a combination of several underlying technologies and techniques
Training arguments of SFT of LL (Aug 31)
Data collator: in the context of the Hugging Face Transformers library, a data collator is a utility that helps prepare batches of data during training…
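What a data collator does can be shown without the library itself: it takes variable-length token sequences and assembles a rectangular batch, padding to the longest sequence and building an attention mask. This is a hand-rolled sketch of that behavior, not the Transformers implementation (which also handles tensors, label shifting, and more).

```python
def pad_collate(batch, pad_id=0):
    # Sketch of a data collator: pad variable-length token sequences
    # to the batch maximum and build a matching attention mask.
    max_len = max(len(seq) for seq in batch)
    input_ids, attention_mask = [], []
    for seq in batch:
        pad = [pad_id] * (max_len - len(seq))
        input_ids.append(seq + pad)
        attention_mask.append([1] * len(seq) + [0] * len(pad))
    return {"input_ids": input_ids, "attention_mask": attention_mask}

batch = [[5, 6, 7], [8, 9]]
print(pad_collate(batch))
```

The mask lets the model ignore padding positions, which is why collation happens per batch at training time rather than once over the whole dataset.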
QLoRA, PTQ, (May 29)
QLoRA, PTQ, and QAT are all techniques related to the optimization and fine-tuning of machine learning models, particularly large language…
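The simplest of these, post-training quantization (PTQ), can be sketched directly: map trained float weights to int8 with a single symmetric scale, then dequantize to see the approximation error. This is an illustrative toy, not how production libraries (which use per-channel scales, calibration data, etc.) do it.

```python
def quantize_int8(weights):
    # PTQ sketch: map float weights to int8 using one symmetric
    # scale derived from the largest absolute weight.
    scale = max(abs(w) for w in weights) / 127
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize(q, scale):
    # Recover approximate float weights from the int8 values.
    return [v * scale for v in q]

weights = [0.12, -0.5, 0.33, 1.0]
q, scale = quantize_int8(weights)
print(q, [round(v, 3) for v in dequantize(q, scale)])
```

The round trip is lossy but close, which is the PTQ trade-off: smaller, faster weights in exchange for a small accuracy drop, with no retraining (unlike QAT) and no adapter training (unlike QLoRA).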
ORPO, DPO and PPO (May 28)
DPO: Direct Preference Optimization (DPO) is a method for aligning large language models (LLMs) with human preferences without the need…
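The DPO objective for a single preference pair can be written out numerically. Given log-probabilities of the chosen and rejected responses under the policy and a frozen reference model, the loss is the negative log-sigmoid of the scaled margin between the two implicit rewards; this sketch uses scalar log-probs for illustration.

```python
import math

def dpo_loss(logp_chosen, logp_rejected,
             ref_logp_chosen, ref_logp_rejected, beta=0.1):
    # DPO loss for one preference pair:
    # -log sigmoid(beta * ((logp_w - ref_logp_w) - (logp_l - ref_logp_l)))
    margin = (logp_chosen - ref_logp_chosen) - (logp_rejected - ref_logp_rejected)
    return -math.log(1 / (1 + math.exp(-beta * margin)))

# The loss shrinks as the policy favors the chosen answer more
# strongly than the reference model does (margin = 2.0 here).
print(dpo_loss(-1.0, -3.0, -2.0, -2.0))
```

Because the loss depends only on log-probabilities, no separate reward model or RL loop (as in PPO) is required, which is the point of DPO; ORPO goes further and drops the reference model as well.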