Amazon EC2 Inf2 instances, optimized for generative AI, now in new regions
Starting today, Amazon Elastic Compute Cloud (Amazon EC2) Inf2 instances are generally available in the Asia Pacific (Sydney), Europe (London), Europe (Paris), Europe (Stockholm), and South America (Sao Paulo) Regions. These instances deliver high performance at the lowest cost in Amazon EC2 for generative AI models.
You can use Inf2 instances to run popular applications such as text summarization, code generation, video and image generation, speech recognition, personalization, and more. Inf2 instances are the first inference-optimized instances in Amazon EC2 to introduce scale-out distributed inference supported by NeuronLink, a high-speed, nonblocking interconnect. Inf2 instances offer up to 2.3 petaflops and up to 384 GB of total accelerator memory with 9.8 TB/s bandwidth.
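As a rough illustration of what the 384 GB figure means in practice, the sketch below estimates whether a model's weights alone would fit in total accelerator memory. The function name, the example model size, the bytes-per-parameter value, and the decimal-gigabyte convention are all assumptions made for illustration. Real deployments also need room for activations, caches, and runtime overhead, so the result is best read as an optimistic upper bound rather than a sizing guarantee.

```python
# Upper figure for total accelerator memory quoted in this announcement.
INF2_MAX_ACCELERATOR_MEMORY_GB = 384


def weights_fit(num_params, bytes_per_param, memory_gb=INF2_MAX_ACCELERATOR_MEMORY_GB):
    """Return (fits, weights_gb) for a weights-only memory estimate.

    Assumption: 1 GB = 10**9 bytes. Activations, caches, and runtime
    overhead are ignored, so a True result does not guarantee a fit.
    """
    weights_gb = num_params * bytes_per_param / 1e9
    return weights_gb <= memory_gb, weights_gb


# Hypothetical 70-billion-parameter model stored with 16-bit (2-byte) weights.
fits, size_gb = weights_fit(70_000_000_000, 2)
print(f"{size_gb:.0f} GB of weights, fits: {fits}")
```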
The AWS Neuron SDK integrates natively with popular machine learning frameworks, so you can continue using your existing frameworks to deploy on Inf2. Developers can get started with Inf2 instances using AWS Deep Learning AMIs, AWS Deep Learning Containers, or managed services such as Amazon Elastic Container Service (Amazon ECS), Amazon Elastic Kubernetes Service (Amazon EKS), and Amazon SageMaker.
Inf2 instances are now available in four sizes (inf2.xlarge, inf2.8xlarge, inf2.24xlarge, and inf2.48xlarge) in 13 [AWS Regions](https://aws.amazon.com/about-aws/global-infrastructure/regional-product-services/) as On-Demand Instances, Reserved Instances, and Spot Instances, or as part of a Savings Plan.
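To confirm which of the four sizes a given Region actually offers, one option is to query the EC2 `DescribeInstanceTypeOfferings` API. The sketch below assumes the `boto3` library and configured AWS credentials. The helper name `offered_inf2_sizes` is hypothetical, and the Region codes in the usage comment are the standard codes for the newly added Regions rather than values taken from this announcement.

```python
# The four Inf2 sizes listed in this announcement.
INF2_SIZES = ["inf2.xlarge", "inf2.8xlarge", "inf2.24xlarge", "inf2.48xlarge"]


def offered_inf2_sizes(ec2_client):
    """Return the Inf2 sizes offered in the Region the client is bound to.

    `ec2_client` is expected to behave like boto3.client("ec2", ...).
    Pagination is skipped because at most four types can match the filter.
    """
    response = ec2_client.describe_instance_type_offerings(
        LocationType="region",
        Filters=[{"Name": "instance-type", "Values": INF2_SIZES}],
    )
    return sorted(o["InstanceType"] for o in response["InstanceTypeOfferings"])


# Usage (requires boto3 and AWS credentials, so it is left commented out):
#
#   import boto3
#   # Sydney, London, Paris, Stockholm, Sao Paulo
#   for region in ["ap-southeast-2", "eu-west-2", "eu-west-3", "eu-north-1", "sa-east-1"]:
#       client = boto3.client("ec2", region_name=region)
#       print(region, offered_inf2_sizes(client))
```

Passing the client in as an argument keeps the helper independent of any one Region and makes it easy to exercise with a stub object before running it against a live account.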
To learn more about Inf2 instances, see the [Amazon EC2 Inf2 Instances webpage](https://aws.amazon.com/ec2/instance-types/inf2/) and the [AWS Neuron Documentation](https://awsdocs-neuron.readthedocs-hosted.com/en/latest/).