Amazon Elastic Inference
Amazon Elastic Inference is a service that lets you attach low-cost GPU-powered acceleration to Amazon EC2 and Amazon SageMaker instances, reducing the cost of running deep learning inference by up to 75%. Amazon Elastic Inference supports TensorFlow, Apache MXNet, and ONNX models (ONNX models run through MXNet).
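As a rough illustration of how an accelerator is attached at launch, the sketch below builds the keyword arguments that boto3's `ec2.run_instances` call accepts via its `ElasticInferenceAccelerators` parameter. The AMI ID, instance type, and accelerator size used here are placeholder assumptions, not values from this page:

```python
# Sketch: constructing an EC2 RunInstances request that attaches an
# Elastic Inference accelerator at launch. The AMI ID, instance type,
# and accelerator size are illustrative placeholders.

def build_run_instances_params(ami_id, instance_type, accelerator_type, count=1):
    """Return keyword arguments for boto3's ec2.run_instances call."""
    return {
        "ImageId": ami_id,
        "InstanceType": instance_type,
        "MinCount": 1,
        "MaxCount": 1,
        # Requests `count` EI accelerators of the given size for the instance.
        "ElasticInferenceAccelerators": [
            {"Type": accelerator_type, "Count": count}
        ],
    }

params = build_run_instances_params("ami-0123456789abcdef0", "m5.large", "eia2.medium")
# With boto3 installed and AWS credentials configured, this would be used as:
#   boto3.client("ec2").run_instances(**params)
print(params["ElasticInferenceAccelerators"])
```

Building the parameters separately from the API call keeps the sketch runnable without AWS credentials; in practice the dict is passed straight to `run_instances`.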
Amazon announces new NVIDIA Triton Inference Server on Amazon SageMaker
November 12th, 2021
Amazon SageMaker now supports inference endpoint testing from SageMaker Studio
September 16th, 2021
Amazon EC2 Inf1 instances based on AWS Inferentia now available in 6 additional regions
November 19th, 2020
Attach multiple Elastic Inference accelerators to a single EC2 instance
December 12th, 2019