#High delay time

43 messages · Page 1 of 1 (latest)

hasty gorgeBOT
#

To help others find answers, you can mark your question as solved via Right click solution message -> Apps -> ✅ Mark Solution

cunning yarrow
#

Stuck. Not sure why

grim bronze
#

I don't get this, are you trying to download docker image inside a docker(runpod)

#

If so, you shouldn't, because it's not quite supported except cpu pods

#

You should use templates directly on runpod and refer to a registry from runpod, let runpod's system handle the docker image download

cunning yarrow
grim bronze
#

I see, Can you check the logs

#

Or via workers tab then check one worker's logs

cunning yarrow
#

Okay sure

grim bronze
ember gardenBOT
grim bronze
#

Maybe also create a support ticket for that

ionic coyote
#

please

cunning yarrow
#

Model: deepseek-ai/DeepSeek-R1

cunning yarrow
#

In the logs:

ionic coyote
grim bronze
ionic coyote
#

you need about 1.5TB of vram

#

you gave it 48gigs

grim bronze
#

Yeah alot of people run into this, it can be expensive to run the large typr of that model

cunning yarrow
#

I tried to change the model

#

To "mistralai/Mistral-7B-Instruct-v0.1"

#

I got the response, but it took 2 minutes. Will this be the case for every future request?

ionic coyote
#

no only if it is a cold start

#

aka if you dont send requests for a while and then send it

#

it will take some time for the worker to start up

grim bronze
cunning yarrow
#

Seems to be okay now, it's just the delay times are high

#

Actually, looks good now

grim bronze
#

You can use a ns to prevent model re-download for every cold start

cunning yarrow
#

What's an "ns"?

grim bronze
#

Network storage

cunning yarrow
#

Where do I enable that?

grim bronze
#

Edit endpoint

cunning yarrow
grim bronze
cunning yarrow
#

Got it, thanks. I appreciate your help guys

#

Great support

grim bronze
#

Your welcome

lucid zenith
#

I'm experiencing the same issue (SDK version 1.7.7)

#

My jobs are just stuck in the queue, so it's not because of my code