Victor May – Medium

LLM Multi-GPU Batch Inference With Accelerate

An Implementation Walkthrough

Sep 10, 2023

Solving The Issue of Falcon Text Generation Never Stopping

How to make an overly chatty bird stop talking.

Jul 26, 2023

Scalable Streaming of OpenAI Model Responses with FastAPI and asyncio

A tutorial

Jul 13, 2023

Victor May

ML Engineer
