Articles by Drishti Sushma:

- Accelerate Llama-2–7b Fine-tuning: Unsloth Outpaces Flash Attention-2 (Pinned, Jan 14)
- Analyzing the Impact of lora_alpha on Llama-2 Quantized with GPTQ (Sep 14, 2023; updated Jan 26, 2024)
- Analyzing the Dual Impact: Batch Size and Mixed Precision on DistilBERT’s Performance in Language… (Sep 12, 2023)
- Comprehensive Evaluation of Various Transformer Models in Detecting Normal, Hate, and Offensive… (Sep 11, 2023)
- Decoding the Impact of Weight Decay on MBart-large-50 for English-Spanish Translation (Sep 11, 2023)
- Analyzing Llama-2’s Behavior with Varied Pretraining Temperature and Attention Mechanisms (Sep 11, 2023)
- Comparative Study: Training OPT-350M and GPT-2 on Anthropic’s HH-RLHF Dataset Using Reward-Based… (Sep 11, 2023)
- Fine-tune 4-bit Llama-2–7B with Flash Attention Using DPO (Sep 11, 2023)
- Comparative Analysis of Fine-tuned BERT-based Models for Detecting Hate Speech in Social Media (Sep 7, 2023)