Shrinivasan Sankar | XLSTM — Extended Long Short-Term Memory Networks | LSTMs or Long Short-Term Memory Networks have been around for a long time. They have been applied for quite a few sequence-related tasks… | May 20
Shrinivasan Sankar in GoPenAI | Make your LLM Fully Utilize the Context | A simple data-driven approach from Microsoft to increasing the context length of LLMs | May 9
Shrinivasan Sankar in Level Up Coding | Chat with your emails with this RAG pipeline (LangChain + ChromaDB) | Implement and run a simple application on your laptop to make LLMs chat with your emails in < 50 lines of code. | Mar 28
Shrinivasan Sankar in Level Up Coding | Naive Quantization Methods for LLMs — a hands-on | Implementing Absolute max and zero point quantization helps learn advanced methods like GPTQ. | Mar 15
Shrinivasan Sankar in Level Up Coding | Parameter Efficient Fine-tuning of the Gemma model on a single GPU | A guide to fine-tuning the latest Gemma 2B model from Google with your in-house dataset on a single GPU. | Mar 5
Shrinivasan Sankar | Fine-tuning an LLM — The six-step lifecycle | Fine-tuning is an art and a methodical process similar to software engineering. It is extremely simplified and portrayed as a cakewalk in… | Feb 23
Shrinivasan Sankar | Retrieval Augmented Generation(RAG) — A quick and comprehensive introduction | Introduction | Feb 13
Shrinivasan Sankar | Lumiere — The most promising Text-to-Video model yet from Google | U-Net architecture modified to a space-time architecture coupled with MultiDiffusion is Lumiere | Feb 1
Shrinivasan Sankar | You will do one of these if you work in AI in 2024 | AI has come a long way ever since it took off in late 2016. Most people who worked in AI then were well-versed experts typically with a PhD… | Jan 15
Shrinivasan Sankar | ControlNet — Take complete control of images from the generative model | When we take image generation models such as Stable Diffusion, the quality of the image generated is mind-blowing. The output however is… | Jan 5