Amazon announces new NVIDIA Triton Inference Server on Amazon SageMaker
Share
Services
Today, we are excited to announce [NVIDIA Triton™ Inference Server](https://nvda.ws/3k7Reip) on Amazon SageMaker, enabling customers who choose NVIDIA Triton as their model server to bring their containers and deploy them at scale in SageMaker.
NVIDIA Triton is an open source model server that runs trained ML models from multiple ML frameworks including PyTorch, TensorFlow, XGBoost, and ONNX. Triton is an extensible server to which developers can add new frontends, which can receive requests in specific formats, and new back-ends, which can handle additional model execution runtimes. AWS worked closely with NVIDIA to add a new Triton frontend that is compatible with SageMaker hosted containers and a new backend that is compatible with SageMaker Neo-compiled models. As a result, customers can easily build a custom container that includes their model with Triton and bring it to SageMaker. SageMaker Inference will handle the requests and automatically scale the container as usage increases, making model deployment with Triton on AWS easier.
Support for NVIDIA Triton™ Inference Server in Amazon SageMaker is available in all regions where Amazon SageMaker is available at no additional cost for the Triton Inference Server container. Read the [blog](https://aws.amazon.com/blogs/machine-learning/deploy-fast-and-scalable-ai-with-nvidia-triton-inference-server-in-amazon-sagemaker/) and [documentation](https://docs.aws.amazon.com/sagemaker/latest/dg/triton.html) to learn more.
What else is happening at Amazon Web Services?
Amazon AppStream 2.0 users can now save their user preferences between streaming sessions
December 13th, 2024
Services
Share
AWS Elemental MediaConnect Gateway now supports source-specific multicast
December 13th, 2024
Services
Share
Amazon EC2 instances support bandwidth configurations for VPC and EBS
December 13th, 2024
Services
Share
AWS announces new AWS Direct Connect location in Osaka, Japan
December 13th, 2024
Services
Share
Amazon DynamoDB announces support for FIPS 140-3 interface VPC and Streams endpoints
December 13th, 2024
Services
Share