Amazon EKS now Supports EC2 Inf1 Instances
Share
Services
You can now use [Amazon Elastic Kubernetes Service (EKS)](/eks/) to run containers on [Amazon EC2 Inf1 Instances](/ec2/instance-types/inf1/). With EKS and the [AWS Neuron](/machine-learning/neuron/) Kubernetes device plugin, it’s easy combine multiple Inferentia devices in your cluster to run high performance and cost-effective inference workloads at scale.
Amazon EC2 Inf1 instances deliver high performance and the lowest cost machine learning inference in the cloud. Inf1 instances feature up to 16 [AWS Inferentia](/machine-learning/inferentia/) chips, high-performance machine learning inference chips designed and built by AWS. Using Inf1 instances, customers can run large scale machine learning inference applications like image recognition, speech recognition, natural language processing, personalization, and fraud detection. Once your machine learning model is trained to meet your requirements, you can deploy your model by using [AWS Neuron](/machine-learning/neuron/), a specialized software development kit (SDK) consisting of a compiler, run-time, and profiling tools that optimizes the machine learning inference performance of Inferentia chips, and supports popular machine learning frameworks such as TensorFlow, PyTorch, or MXNet.
Amazon EKS has made it easy run Inferentia based containers by updating the [EKS-Optimized Accelerated AMI](https://docs.aws.amazon.com/eks/latest/userguide/gpu-ami.html) with all the necessary AWS Neuron packages. After starting a cluster with worker nodes based on the latest Accelerated AMI, you can install the AWS Neuron Kubernetes device plugin, which advertises Inferentia devices as available resources to the worker node kubelet. This fine-grained scheduling capability allows EKS customers to achieve better utilization and greater cost savings compared to using standalone EC2 Inf1 instances.
EC2 Inf1 instances can be used on all EKS clusters running version 1.14 and above in regions where Inf1 is available. Today, only self managed node groups are supported, and can be started using eksctl, CloudFormation, or the AWS CLI. EKS managed node groups support will be added in a future release. To get started, visit the [Amazon EKS documentation](https://docs.aws.amazon.com/eks/latest/userguide/inferentia-support.html). To learn more about Inf1 instances and Inferentia, check out the [Amazon EC2 documentation](/ec2/instance-types/inf1/).
What else is happening at Amazon Web Services?
Amazon AppStream 2.0 users can now save their user preferences between streaming sessions
December 13th, 2024
Services
Share
AWS Elemental MediaConnect Gateway now supports source-specific multicast
December 13th, 2024
Services
Share
Amazon EC2 instances support bandwidth configurations for VPC and EBS
December 13th, 2024
Services
Share
AWS announces new AWS Direct Connect location in Osaka, Japan
December 13th, 2024
Services
Share
Amazon DynamoDB announces support for FIPS 140-3 interface VPC and Streams endpoints
December 13th, 2024
Services
Share