Nisal Upendra – Medium

Nisal Upendra

Published in
AZmed

Deploying multiple GPU ML models on AWS SageMaker with FastAPI

An experimental approach using a modified AWS Sagemaker single-model endpoint with FastAPI to load multiple GPU ML models inside Sagemaker.

Oct 2, 2024

Deploying multiple GPU ML models on AWS SageMaker with FastAPI

Oct 2, 2024

Nisal Upendra

Nisal Upendra

Lead MLOps Engineer at @AZmed

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams