#16 GB GPU availability almost always low

1 messages · Page 1 of 1 (latest)

astral cave
#

Hence very frequent throttling workers and pulling docker image again and again

low zealotBOT
#

To help others find answers, you can mark your question as solved via Right click solution message -> Apps -> ✅ Mark Solution

tame flint
#

Are you deploy to all data centers or you selected specific regions?

astral cave
#

All data centres

olive robin
#

What CUDA versions do you have selected? Is it possible for you to widen your range a little bit more?

astral cave
#

I have allowed all CUDA versions

real plover
#

And your max workers? set it to a larger number

#

maybe share your endpoint id?

astral cave
#

max workers varies per endpoint, 2-3-5 mostly
but the problem is if I set all enpoints max workers to a large number i'll hit my max workers limit
many endpoints running in parallel

real plover
#

like 3 all got throttled?

astral cave
#

yes almost all keep getting throttled , and initializing in some time gaps

fleet bisonBOT
real plover
#

try open a ticket