#16 GB GPU availability almost always low
Are you deploying to all data centers, or have you selected specific regions?
All data centres
What CUDA versions do you have selected? Is it possible for you to widen your range a little bit more?
I have allowed all CUDA versions
max workers varies per endpoint, 2-3-5 mostly
but the problem is if I set every endpoint's max workers to a large number I'll hit my account's max workers limit
many endpoints running in parallel
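To make the constraint above concrete, here is a rough sketch of the arithmetic: the per-endpoint max workers values add up against one account-wide limit. The endpoint names and the limit of 10 are hypothetical, picked only for illustration; the 2, 3, and 5 values are the ones mentioned above.

```python
def remaining_worker_budget(endpoint_max_workers, account_limit):
    """Return how many max-worker slots are still free under the account-wide limit.

    A negative result means the configured endpoints exceed the limit.
    """
    used = sum(endpoint_max_workers.values())
    return account_limit - used


# Hypothetical endpoints, using the 2-3-5 values from the message above.
endpoints = {"endpoint-a": 2, "endpoint-b": 3, "endpoint-c": 5}
ACCOUNT_LIMIT = 10  # hypothetical account-wide max workers limit

print(remaining_worker_budget(endpoints, ACCOUNT_LIMIT))
```

Raising every endpoint to 5 in this made-up setup would need 15 slots, which is over the assumed limit of 10, so the headroom has to be spread across endpoints rather than raised everywhere.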
and like all of them got throttled?
like 3 all got throttled?
yes, almost all keep getting throttled, and they only initialize intermittently, with gaps in between
@astral cave
Escalated To Zendesk
The thread has been escalated to Zendesk!
Ticket ID: #22526
try opening a ticket