Published inAZmedDeploying multiple GPU ML models on AWS SageMaker with FastAPIAn experimental approach using a modified AWS Sagemaker single-model endpoint with FastAPI to load multiple GPU ML models inside Sagemaker.Oct 2, 2024Oct 2, 2024