Het Trivedi – Medium

Het Trivedi

Het Trivedi
in
Towards Data Science

Boosting LLM Inference Speed Using Speculative Decoding

A practical guide on using cutting-edge optimization techniques to speed up inference

Aug 27

Boosting LLM Inference Speed Using Speculative Decoding

Aug 27

Het Trivedi
in
Towards Data Science

Improving RAG Performance Using Rerankers

A tutorial on using rerankers to improve your RAG pipeline

Jun 25

Improving RAG Performance Using Rerankers

Jun 25

Het Trivedi

What I Learned As A Forward Deployed Engineer Working At An AI Startup

In January 2024, I started working full-time as a forward deployed engineer at a company called Baseten. Baseten enables customers to…

Jun 2

What I Learned As A Forward Deployed Engineer Working At An AI Startup

Jun 2

Het Trivedi
in
Towards Data Science

Deploying LLMs Into Production Using TensorRT LLM

A guide on accelerating inference performance

Feb 22

Deploying LLMs Into Production Using TensorRT LLM

Feb 22

Het Trivedi
in
Level Up Coding

Deploying Codellama As A REST API Service

Introduction

Nov 1, 2023

Deploying Codellama As A REST API Service

Nov 1, 2023

Het Trivedi
in
Level Up Coding

Creating AI Generated QR Codes Using Stable Diffusion And ControlNet

Generate awesome looking QR codes using AI and Python

Oct 6, 2023

Creating AI Generated QR Codes Using Stable Diffusion And ControlNet

Oct 6, 2023

Het Trivedi
in
Towards Data Science

Increase Llama 2's Latency and Throughput Performance by Up to 4X

Real-world benchmarks for Llama-2 13B

Aug 9, 2023

Increase Llama 2's Latency and Throughput Performance by Up to 4X

Aug 9, 2023

Het Trivedi
in
The Generator

The Witcher’s Scripting Sorcery: Empowering The TV Adaptation With Large Language Models

Recreating A TV Show Script For The Witcher Based On The Books

Jul 31, 2023

The Witcher’s Scripting Sorcery: Empowering The TV Adaptation With Large Language Models

Jul 31, 2023

Het Trivedi
in
Towards Data Science

Deploying Falcon-7B Into Production

Running Falcon-7B in the cloud as a microservice

Jul 7, 2023

Deploying Falcon-7B Into Production

Jul 7, 2023

Het Trivedi
in
Level Up Coding

Supercharging ChatGPT: Elevate Conversations with Custom Functions via Function Calling

A quick tutorial on using function calling with ChatGPT

Jul 4, 2023

Supercharging ChatGPT: Elevate Conversations with Custom Functions via Function Calling

Jul 4, 2023

Het Trivedi

Het Trivedi

Software Engineer @ Baseten

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams