#deepseek-r is loading for >1h into vram.

15 messages · Page 1 of 1 (latest)

rain idol
#

Seems it is related to nmap on network drive. How do you solve it?

south helm
#

What error or what's the log did you get

unborn cove
#

are you using our vllm with network volume? it might downloading the model which could take a while.

rain idol
south helm
#

Okay, if it's loading, is there an oom error?

rain idol
#

loading safetensors checkpoint shards for 1 hour

#

no oom, this is 8xMI300X

#

I read on the github that it is often related on nmap over network drives, but not sure

south helm
#

Nmap? What's that

#

What's taking so long, can you debug to see the progress ( loading)

rain idol
#

function, that reads .tensor and load it to gpu, takes extremally long time. mmap is mapping file as memory to load data directly from ssd to vram with no ram consumption

south helm
#

Oh Im not experienced on these model downloading area, so I don't know much

south helm
fallow hornet
#

dont use network storage to load the models, instead move them to container disk or pod volume disk, see if that loads them any faster

rain idol
#

It is on pod volume disk