#Can Mike/Daniel share how you guys split/mistral-fied the Phi-3 mini model?

6 messages · Page 1 of 1 (latest)

dapper flax
#

Title. Really cool work but I am kinda curious: how did you guys convert phi-3 mini into the mistral architecture? Do you mind writing a quick blog post or share a script or something similar.

I am also hesitant because idk if the lora adpters produced by unsloth trained on phi-3 mini can be used on the original phi-3 mini... or if im forced to always use unsloth's version during inference

sturdy badger
#

Oh yes sorry I missed this

dapper flax
#

oh no worries. But I was curious if there are any technical details beyond what's in this post? it's quite high level... I am trying to reproduce this in some extend to double check something since my company serve models very different

sturdy badger
#

daniel has more info