Zain ul Abideen, "Q-GaLore | Memory-efficient Pre-training and Fine-tuning" (Jul 20): Training or fine-tuning Large Language Models (LLMs) demands high-end GPUs due to massive datasets, optimizer states, and Billion…
Zain ul Abideen, "Coding Deepseek-V2 from Scratch in PyTorch" (Jul 20): Implementation of Multi-head Latent Attention, Fine-Grained Expert Segmentation, and Shared Expert Isolation.
Zain ul Abideen, "MHA vs MQA vs GQA vs MLA" (Jul 13): Comparison of Deepseek’s new Multi-head Latent Attention with MHA, MQA, and GQA.
Zain ul Abideen, "Linear Rope vs NTK vs YaRN vs CoPE" (Jul 13): Comparison of various positional embeddings.
Zain ul Abideen, "Align Phi3 with CPO-SimPO" (Jul 6): Align your LLM with an approach that is more memory- and speed-efficient than DPO.
Zain ul Abideen, "Best LLM Inference Engine? TensorRT vs vLLM vs LMDeploy vs MLC-LLM" (Jul 6): Benchmarking various LLM inference engines.
Zain ul Abideen, "MoE vs Dense vs Hybrid LLM Architectures" (Apr 29): Training 600M-parameter MoE, Dense, and Hybrid LLM architectures.
Zain ul Abideen, "Schedule-Free Learning — A New Way to Train Models" (Apr 18): Training three Llama models to compare a cosine-scheduled optimizer with the schedule-free optimizer.
Zain ul Abideen, "Llama-Bitnet | Training a 1.58 bit LLM" (Apr 4): What is a 1-bit LLM, and how do you train the 70M Llama-Bitnet?
Zain ul Abideen, "ORPO Outperforms SFT+DPO | Train Phi-2 with ORPO" (Mar 22): Train Phi-2 with ORPO using LazyOrpo.