#where can I access convo-6B-8bit model
28 messages · Page 1 of 1 (latest)
I'm not sure if I hosted the current weights to HF, but the secret sauce is in the set up, not the model.
Which I don't have access too
this but quantized
lotus 12b?
yup.
Keep in mind, this is a very old build, the bots currently run on something similar that was built off that. I don't think haru ever pushed the re-write
https://huggingface.co/Ryex/Lotus-12B-GPTQ
I made it much smaller.
oh wait, you weren't the one asking...
@sonic bough
Heck yeah
Okay
should be alright? who knows
You just gotta test it man
it actually loads into vLLM now, so it's better than the first attempt 
Well that's cool,but how is it?
dunno I'll test it when I have time. it was more just something for people who had no room
well it needs a modified tokenizer so far...
