Hello Azure Community,
I'm trying to deploy an Azure ML real-time endpoint with GPU compute but unable to select GPU VMs despite having quota allocated.
Subscription Details
- Type: Pay-As-You-Go
- Status: Active
- Region: West Europe
Quota Status
I have verified quota allocation in Azure Portal → Quotas:
- Standard NCASv3_T4 Family - West Europe: 0 of 12 cores available
- Standard NCASv3_T4 Family - West US: 0 of 12 cores available
Problem Description
When creating a Managed Online Endpoint in Azure ML Studio:
- Navigate to: ML Studio → Endpoints → Real-time endpoints → Create
- Register model and environment successfully
- On Compute configuration page, attempt to select GPU VM
- All GPU VM sizes appear grayed out with message:
"You do not have enough quota for the following VM sizes"
- Specifically trying to use: Standard_NC4as_T4_v3 (4 cores, 28GB RAM, NVIDIA T4 GPU)
What I've Verified
✓ Quota page shows 12 cores available for NCASv3_T4 family in both West Europe and West US
✓ Subscription is Pay-As-You-Go
✓ NC4as_T4_v3 is officially listed as available in West Europe region
✓ Model and environment registered successfully
My Question
Is there additional approval or activation needed for first-time GPU VM usage in Azure ML, even with Pay-As-You-Go subscription and available quota?
The quota shows as available but VMs are not selectable in ML Studio. How can I enable access to GPU VMs for ML endpoint deployments?
Any guidance would be greatly appreciated!
Thank you!