Amazon EMR announces S3A as the default connector
AWS announces Amazon EMR S3A, a new Amazon S3 connector that optimizes performance for Apache Hadoop, Apache Spark, and Apache Hive workloads on [Amazon EMR](https://aws.amazon.com/emr/). The connector extends the open source S3A architecture with AWS-specific optimizations to help organizations process large-scale data more efficiently. With direct integration support for S3 Express One Zone, S3 Glacier, and AWS Outposts, EMR S3A lets customers use different AWS storage options to optimize both data access speed and storage cost for their EMR workloads.
Additionally, the EMR S3A connector delivers security features and performance capabilities that extend beyond open source S3A. Key improvements include built-in support for fine-grained access control in Apache Spark, an enhanced S3A credentials resolver, MagicCommitter V2 for optimized file writes, and accelerated S3 prefix listing for columnar file formats. These enhancements are available starting with EMR release 7.10 and maintain compatibility with existing applications.
The Amazon EMR S3A connector is available in all AWS Regions where Amazon EMR is available and comes pre-configured with Amazon EMR release version 7.10 and later. To learn more about Amazon EMR S3A, see the [Amazon EMR documentation](https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-s3a-file.html).