Reduce ML inference costs on PyTorch with Amazon Elastic Inference
Share
Services
You can now use Amazon Elastic Inference to accelerate inference and reduce inference costs for PyTorch models in Amazon SageMaker, Amazon EC2 and Amazon ECS. Enhanced PyTorch libraries for EI are available automatically in Amazon SageMaker, AWS Deep Learning AMIs, and AWS Deep Learning Containers, so you can deploy your PyTorch models in production with minimal code changes. Elastic Inference supports [TorchScript](https://pytorch.org/docs/1.3.1/jit.html) compiled models on PyTorch. In order to use Elastic Inference with PyTorch, you must convert your PyTorch models into TorchScript and use the Elastic Inference API for inference. Today, PyTorch joins TensorFlow and Apache MXNet as a deep learning framework that is supported by Elastic Inference.
Elastic Inference allows you to attach just the right amount of GPU-powered acceleration to any Amazon SageMaker instance, EC2 instance, or ECS task to reduce the cost of running deep learning inference by up to 75%.
PyTorch for Elastic Inference is supported in regions where [Amazon Elastic Inference is available](https://aws.amazon.com/about-aws/global-infrastructure/regional-product-services/). For more information, see [Using PyTorch Models with Elastic Inference](https://docs.aws.amazon.com/elastic-inference/latest/developerguide/ei-pytorch.html) in the developer guide and our blog post, “[Reduce ML inference costs on Amazon SageMaker for PyTorch models using Amazon Elastic Inference](https://aws.amazon.com/blogs/machine-learning/reduce-ml-inference-costs-on-amazon-sagemaker-for-pytorch-models-using-amazon-elastic-inference/)“.
What else is happening at Amazon Web Services?
Amazon AppStream 2.0 users can now save their user preferences between streaming sessions
December 13th, 2024
Services
Share
AWS Elemental MediaConnect Gateway now supports source-specific multicast
December 13th, 2024
Services
Share
Amazon EC2 instances support bandwidth configurations for VPC and EBS
December 13th, 2024
Services
Share
AWS announces new AWS Direct Connect location in Osaka, Japan
December 13th, 2024
Services
Share
Amazon DynamoDB announces support for FIPS 140-3 interface VPC and Streams endpoints
December 13th, 2024
Services
Share