GA Azure OpenAI Token Limit Policy in Azure API Management
Share
Services
We're excited to announce the General Availability of the Azure OpenAI Token Limit Policy in Azure API Management! This feature empowers customers to effectively manage and enforce limits on API consumers based on their usage of OpenAI tokens.
With this policy, customers can set limits on API consumers based on OpenAI token usage, expressed in tokens-per-minute (TPM). This allows for precise control over token consumption, ensuring fair and efficient utilization of OpenAI resources.
Customers have the flexibility to assign token-based limits on any counter key, such as Subscription key, IP Address, etc., tailoring the enforcement to their specific use cases. .
By relying on token usage metrics returned from the OpenAI endpoint, customers can accurately monitor and enforce limits in real-time. The policy also enables pre-calculation of prompt tokens on the APIM side, minimizing unnecessary requests to the OpenAI backend if the limit is already exceeded.
Furthermore, customers can leverage headers and variables such as tokens-consumed and remaining-tokens within policies for enhanced control and customization.
With the introduction of the OpenAI Token Limit Policy, customers can now centrally manage limits for multiple API consumers and OpenAI endpoints for both streaming and non-streaming scenarios, streamlining the management process and improving resource utilization efficiency.
Click [here](https://aka.ms/apim/openai/token-limit-policy) to learn more.
* API Management
* Features
* Microsoft Build
* [ API Management](https://azure.microsoft.com/en-gb/products/api-management/)
What else is happening at Microsoft Azure?
Read update
Services
Share
Generally Available: Storage account default maximum request rate limit increase to 40,000 requests per second
December 12th, 2024
Services
Share
Read update
Services
Share
Generally Available: Regional Disaster Recovery by Azure Backup for AKS
November 22nd, 2024
Services
Share
Generally Available: Enhancements on Azure Container Storage for performance, scalability, and operational insights
November 19th, 2024
Services
Share