Amazon SageMaker Catalog provides automatic data classification using AI agents
Share
Services
Amazon SageMaker Catalog now provides automated data classification that suggests business glossary terms during data publishing, reducing manual tagging effort and improving metadata consistency across organizations.
This capability analyzes table metadata and schema information using Amazon Bedrock's language models to recommend relevant terms from organizational business glossaries. Data producers receive AI-generated suggestions for business terms defined within their glossaries, which include both functional terms and sensitive data classifications such as PII and PHI, making it easy to tag their datasets with standardized vocabulary. Producers can accept or modify these suggestions before publishing, ensuring consistent terminology across data assets and improving data discoverability for business users.
Automated data classification is available in US East (N. Virginia, Ohio), US West (Oregon), Asia Pacific (Tokyo, Seoul, Singapore, Sydney, Mumbai), and Europe (Frankfurt, Ireland, London, Paris) AWS regions where Amazon
SageMaker operates.
To get started, go to SageMaker Unified Studio to configure your business glossary to generate recommendations for business glossary terms. You can also use the AWS CLI or SDKs to programmatically manage glossary term suggestions.
For more information, see the SageMaker Catalog [user guide.](https://docs.aws.amazon.com/sagemaker-unified-studio/latest/userguide/autodoc.html)
What else is happening at Amazon Web Services?
Read update
Services
Share
AWS Direct Connect announces new location in Hanoi, Vietnam
about 14 hours ago
Services
Share
Amazon SageMaker AI is now available in Asia Pacific (New Zealand)
about 16 hours ago
Services
Share
Amazon EC2 M8i instances are now available in additional Regions
about 16 hours ago
Services
Share
AWS Artifact enables access to previous versions of compliance reports
about 16 hours ago
Services
Share
Read update
Services
Share