Amazon Bedrock Model Evaluation now supports evaluating custom models
Share
Services
Model Evaluation on Amazon Bedrock allows you to evaluate, compare, and select the best foundation models for your use case. Amazon Bedrock offers a choice of automatic evaluation and human evaluation. You can use automatic evaluation with predefined algorithms for metrics such as accuracy, robustness, and toxicity. Additionally, for those metrics or subjective and custom metrics, such as friendliness, style, and alignment to brand voice, you can set up a human evaluation workflow with a few clicks. Human evaluation workflows can leverage your own employees or an AWS-managed team as reviewers. Model evaluation provides built-in curated datasets or you can bring your own datasets.
Now, customers can evaluate their own custom fine-tuned models from fine-tuning and continued pretraining jobs on Amazon Bedrock. This allows customers to complete the cycle of selecting a base model, customizing it, evaluating it, and customizing it again if needed or continuing to production if they are satisfied with its evaluation outcome. To evaluate a custom model, simply select the custom model from the list of models to evaluate in the model selector tool when creating an evaluation job.
Model Evaluation on Amazon Bedrock is now Generally Available in these[ commercial regions](https://docs.aws.amazon.com/bedrock/latest/userguide/bedrock-regions.html) and the AWS GovCloud (US-West) Region.
To learn more about Model Evaluation on Amazon Bedrock, see the [Amazon Bedrock developer experience web page](https://aws.amazon.com/bedrock/developer-experience/). To get started, sign in to Amazon Bedrock on the AWS Management Console or use the Amazon Bedrock APIs.
What else is happening at Amazon Web Services?
Amazon AppStream 2.0 users can now save their user preferences between streaming sessions
December 13th, 2024
Services
Share
AWS Elemental MediaConnect Gateway now supports source-specific multicast
December 13th, 2024
Services
Share
Amazon EC2 instances support bandwidth configurations for VPC and EBS
December 13th, 2024
Services
Share
AWS announces new AWS Direct Connect location in Osaka, Japan
December 13th, 2024
Services
Share
Amazon DynamoDB announces support for FIPS 140-3 interface VPC and Streams endpoints
December 13th, 2024
Services
Share