#dolphin-2.9-llama3-8b


quasi sun
#

Oh boy

tranquil stratus
#

xD what's up?

molten stratus
#

Oh, here's my choice for self-hosted model for the day lol

west steeple
#

Now to wait for someone to quant it

tame lark
#

niceee

tranquil stratus
#

Also waiting on a quant, hoping OR just picks it up before lol

#

My hope is that the model doesn't lose too much intelligence but can be rid of its positivity bias

molten stratus
#

or --load-in-smooth on Aphrodite for 8bit quant

west steeple
#

I've only used GGUF before. Would my GTX 1080 be able to handle that?

west steeple
#

oh, epic

molten stratus
#

Took like a minute to quant to 4bit on 3090 btw

#

wow ouch
Nice broken EOS tokens there

#

I shouldn't have tried this on Ooba

molten stratus
#

KoboldCPP is borked too btw

#

Yeah, Aphro + AWQ quant works flawlessly

molten stratus
molten garden
#

Early testing shows degraded performance compared to the original. Maybe some further fine-tuning is in order for this fine-tune... 🙂

molten stratus
pale lark
#

Oh wait it's already up?

#

@dire zephyr you mean this one?

molten stratus
pale lark
#

oh nvm you meant the 70b variant

dire zephyr
#

I mean the 8 billion variant is awesome, the 70b is what I really am looking forward to though

#

definitely use cases for both

molten stratus
molten stratus
molten stratus
pale lark
#

rip

molten stratus
#

Koboldcpp broke too obv

pale lark
#

oh if it's just EOS token

#

maybe it's just the template?

molten stratus
pale lark
#

llama3 template is pretty nuts, we had to fork the one on llama3 repo

#

hmm I don't think chatml works

molten stratus
#
This model was trained FFT on all parameters, using ChatML prompt template format.

example:

<|im_start|>system
You are Dolphin, a helpful AI assistant.<|im_end|>
<|im_start|>user
{prompt}<|im_end|>
<|im_start|>assistant
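
For reference, a minimal Python sketch of assembling that ChatML layout into a prompt string. The function name `build_chatml_prompt` and the list-of-dicts message shape are assumptions for illustration, not something taken from the model card:

```python
def build_chatml_prompt(messages, add_generation_prompt=True):
    """Render a list of {"role", "content"} dicts as a ChatML prompt string."""
    parts = []
    for message in messages:
        # Each turn is wrapped as <|im_start|>role\ncontent<|im_end|>\n
        parts.append(
            f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
        )
    if add_generation_prompt:
        # Leave the assistant turn open so the model writes the reply.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


prompt = build_chatml_prompt([
    {"role": "system", "content": "You are Dolphin, a helpful AI assistant."},
    {"role": "user", "content": "Hello!"},
])
print(prompt)
```

If a frontend sends a different template (for example the stock Llama 3 one), the model may never see `<|im_end|>` where it expects it, which could be one reason for runaway generations.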
#

I've also never seen 5-12 broken EOS tokens spammed in a row, consistently on every try, on anything before
I've noticed that this EOS problem with L3 is well known already
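
A possible client-side workaround, sketched in Python: cut the generated text at the first end-of-turn marker so the repeated tokens never reach the user. The helper name `truncate_at_stop` is hypothetical, and treating these three strings as the relevant markers is an assumption (ChatML's `<|im_end|>` plus Llama 3's `<|eot_id|>` and `<|end_of_text|>`). It hides the symptom only and does not fix the backend's EOS handling:

```python
# Assumed end-of-turn markers: ChatML plus the two Llama 3 special tokens.
STOP_STRINGS = ("<|im_end|>", "<|eot_id|>", "<|end_of_text|>")


def truncate_at_stop(text, stop_strings=STOP_STRINGS):
    """Return text up to the earliest stop string, or unchanged if none occurs."""
    cut = len(text)
    for stop in stop_strings:
        index = text.find(stop)
        if index != -1:
            cut = min(cut, index)
    # Drop trailing whitespace left in front of the marker.
    return text[:cut].rstrip()


raw = "Sure thing!<|im_end|><|im_end|><|im_end|>"
print(truncate_at_stop(raw))
```

Most backends also accept a list of stop strings in the request, which avoids generating the extra tokens in the first place.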

inland canyon
molten stratus
#

I assume to get this model out faster.

#

also Meta's rules on finetune names are wack

molten stratus
#

Oh, I can see what's wrong with this one
It has notably worse attention span, and is actually worse at following instructions because of that
I may have kinda ignored this on the first try, probably shrugging it off as a 4bit AWQ effect. It is not, same problems in fp16 precision.

#

Yeah, existing (mostly) synth datasets are not going to cut it against Meta's 10M human-annotated examples.

tranquil stratus
#

I find the llama 3 model unusable because of that. It kinda takes positivity bias to a whole new level.

molten stratus
# tranquil stratus Is it uncensored at least lol

It gave some "As an AI..." refusals to me (less refusals than official L3, but still)
Straight up GPT-esque refusals, not even L3's "Why should I do that?" stuff
I'd rather go with official L3 for now

molten stratus
pale lark
#

Just woke up, gonna look at this now

#

Wow this model is heavily undertrained

molten stratus
# pale lark Wow this model is heavily undertrained

Yeah, this Dolphin is kinda just a straight up downgrade from L3
Doesn't even have its usual upside of being fully uncensored, 'cause it's so undertrained that the decensoring tuning hasn't even fully applied and it still gives refusals.
We really should wait for 70B one

pale lark
#

rip

#

yeah I think we will skip this one TBH

#

it's just bad

tranquil stratus
#

I think Eric should have initialised it from Llama 3 Instruct instead, overwriting its safety tuning while benefitting from its existing training

molten stratus
tranquil stratus