Can't run a 70b model, gets stuck. | Runpod | Page 1

rugged mason Aug 8, 2024, 2:44 PM

#

you seems to have the same issue I got, try to add in your environment variables

paper perch Aug 8, 2024, 2:45 PM

#

Do you know what this does?

rugged mason Aug 8, 2024, 2:46 PM

#

--max-model-len
Model context length. If unspecified, will be automatically derived from the model config.

paper perch Aug 8, 2024, 2:46 PM

#

was urs just getting stuck?

rugged mason Aug 8, 2024, 2:46 PM

#

yep. i fixed with that

#

reading my log and it's the only change i made

paper perch Aug 8, 2024, 2:47 PM

#

also my worker is ready but it is stuck in queue

#

do u why that can be

rugged mason Aug 8, 2024, 2:47 PM

#

clear the queue and launch again. sometimes it helps at start.
can't know why but i did that and it seems helpful

paper perch Aug 8, 2024, 2:48 PM

#

did u deploy llama 3.1 70b too?

rugged mason Aug 8, 2024, 2:48 PM

#

yep one version of it

paper perch Aug 8, 2024, 2:49 PM

#

which one?

rugged mason Aug 8, 2024, 2:49 PM

#

if you want llama vanilla version, it's cheaper to run it via groq

paper perch Aug 8, 2024, 2:49 PM

#

which one did u run?

rugged mason Aug 8, 2024, 2:49 PM

#

mlabonne/Llama-3.1-70B-Instruct-lorablated
and now i m trying
neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8

paper perch Aug 8, 2024, 2:49 PM

#

i need a version where i can use image+text input

#

which one do u think would be best

#

@rugged mason it still got stuck after this: 2024-08-08 19:52:16.529
[l1foz24ligz9rd]
[info]
[1;36m(VllmWorkerProcess pid=100)[0;0m INFO 08-08 14:52:16 weight_utils.py:223] Using model weights format ['*.safetensors']

#

@rugged mason My sentencec also gets cutoff do you know how to fix that

rugged mason Aug 8, 2024, 3:04 PM

#

paper perch <@618888580449828884> My sentencec also gets cutoff do you know how to fix that

not really, i'm not runpod expert 😦

paper perch Aug 8, 2024, 3:05 PM

#

ah alr

#Can't run a 70b model, gets stuck.