Azure OpenAI - 429 - System is experiencing high demand and cannot process your request error
We are currently seeing the exception in our OpenAI resource using gpt 4.o in the Australia East region.
Error code: 429 - {"error": {"code": "NoCapacity", "message": "The system is currently experiencing high demand and cannot process your request. Your request exceeds the maximum usage size allowed during peak load. Please retry after 47 seconds. For improved latency reliability, consider switching to Provisioned Throughput."}}
We are using a Standard gpt-4o model with no Provisional Throughput in the Australia East region. This has been occuring between 05/12/2025, 00:07:07.150 and 05/12/2025, 03:27:53.760 with approx. 300 failed requests.
Is there any visibility to when Azure is experiencing high demands and any way we can mitigate this - we do not have capability to use a model deployment in another region.
THanks,