#How to deploy Unsloth train model from Huggingface repo?

54 messages · Page 1 of 1 (latest)

silk heart
regal geyser
#

You have only pushed adapters by the way.

#

It would be easier if you quantised it to GGUF or merge it to 16 Bit.

#

Loading adapters onto the base model requires a lot of ram and if you don't have good hardware on HuggingFace, it is not possible.

silk heart
#

@regal geyser we need GGUF or 16bit of pytorch model to deploy? Will try

regal geyser
#

If you convert to GGUF, then I can deploy it, then you can clone my Space.

regal geyser
#

For free Spaces, do GGUF.

silk heart
#

alright will try!

regal geyser
#

Cool.

#

Just send the link to the GGUF model and I will make you a inference space that you can duplicate!

#

When I have time.

silk heart
#

I just pay for " pay as you go" in Google Colab, mayb it will do

regal geyser
#

No, for deploying on Spaces.

regal geyser
regal geyser
silk heart
#

but I can push gguf from unsloth example colab right?

regal geyser
#

Yes.

regal geyser
silk heart
#

I see.

#

Convertingggg

regal geyser
#

Oh no.

#

Stop it.

#

Convert it to q4_k_m!

#

@silk heart

#

@silk heart

silk heart
#

okok lol

regal geyser
#

Hahahah... Q8_0 is way too big for Spaces.

silk heart
#

ok q4_k_m right now

regal geyser
#

Cool!

#

By the way, is this a chatbot?

silk heart
#

will have to go in a bit tho, moving house with two newborns, maybe seee you in like 12 hours

silk heart
regal geyser
#

I am only 14, no hassle of children lol.

silk heart
#

just follow unsloth example

#

alright later man Thankss

regal geyser
#

@silk heart See ya!

silk heart
#

Done done

rotund hazel
#

Hopefully resolved guys? 👏

regal geyser
#

Not yet.

regal geyser
regal geyser
#

Here you go!

#

Duplicate it.

#

By the way TinyLlama isn't that smart. So the responses are really bad.

#

@silk heart

silk heart
#

Thankss. Let me close this.

regal geyser
#

Cool.

silk heart
#

yeah it's really bad lol

regal geyser
#

Yep...