Sean Sheng – Medium

Sean Sheng in Towards Data Science

Benchmarking LLM Inference Backends

Comparing Llama 3 serving performance on vLLM, LMDeploy, MLC-LLM, TensorRT-LLM, and TGI

Jun 17

Sean Sheng in Towards Data Science

Scaling AI Models Like You Mean It

Strategies for Overcoming the Challenges of Scaling Open-Source AI Models in Production

Apr 10

Sean Sheng

Head of Engineering at BentoML, the inference platform for fast-moving AI teams.
