#Can we autoscale past 100 GPUs?


woeful flare

The autoscale section of the serverless documentation says "Dynamically scale workers from 0 to 100 on the Secure Cloud platform, which is highly available and distributed globally. This provides users with the computational resources exactly when needed." I'm not sure whether "0 to 100" is meant literally or figuratively.

Our current provider has around 50 H100s available, so this is an active point of investigation for us.

TLDR: Can we scale past 100 GPUs on enterprise plans? Is there an enterprise POC I can reach out to?

somber steeple

Yes, you can scale past 100. We have some users going up to a couple hundred.
Don't take the "0 to 100" literally. Maybe we should change it 😄