Tejaswi kashyap

Unpacking Attention in Transformers: From Self-Attention to Causal Self-Attention (Sep 8)
This article will guide you through self-attention mechanisms, a core component in transformer architectures and large language models…

Memory Optimization in LLMs: Leveraging KV Cache Quantization for Efficient Inference (Jul 5)
Quantization shrinks the footprint of a large language model (LLM) by reducing the precision of its parameters, such as from 16-bit to…

Tailoring Llama 3: Harnessing Fine-Tuning for Custom Language Tasks (Jun 4)
Low-rank adaptation (LoRA) enables the straightforward adaptation of pre-trained large language models (LLMs) to new tasks by freezing the…

Accelerating AI: Exploring Speculative Decoding with Large Language Models (Apr 27)

Deciphering Mixtral-8x7B: Navigating the Sparse Expert Model Ensemble by Mistral AI (Mar 11)
How to Surpass the Capabilities of GPT-3.5 and Llama 2 70B with Personal Computing Power

LangChain and the Evolution of LLM: Why Memory Matters (in GoPenAI, Sep 5, 2023)

Understanding Large Language Models: Architecture and Self-Attention Explained (Jul 30, 2023)
Large language models have revolutionized natural language processing, enabling computers to understand and generate human-like text. Based…

Shot predictor using polynomial regression (Mar 27, 2023)
This project uses polynomial regression to predict shots.

Parking space counter (Mar 9, 2023)
This project uses image processing to detect free spaces in a parking lot.