#Attempt unsloth library on Mamba-2.8b

1 messages · Page 1 of 1 (latest)

halcyon sandal
#

I've read the introduction blog post, and the blog post about apply unsloth on gemma. Where do I start 🧐 ?

tropic stratus
#

Unfortunately if you want to use Mamba, it will not work with unsloth, you'll need to make a lot of changes

halcyon sandal
tropic stratus
mint viper
#

Had a look yesterday - its exttremely different, like a whole new beast than transformers 😦

halcyon sandal
#

Im happy to do it as personal project but I kinda need to know that code-wise, how did you approach manual autograd with transformer in the first place.

p.s. im bad at jacobian derivative

mint viper
#

oh ye sadly derivatives

#

but as a start, just try coding up Mamba from start to end

#

and compare losses 🙂

halcyon sandal
mint viper
#

Ohh no so the issue is Mamba isnt doing attention nor MLP anymore

#

fast lora will not work

halcyon sandal
#

on the other hand, their philosophy and yours seem awfully similar already

mint viper
halcyon sandal
#

bump

mint viper
#

hmmm mamba will sadly have to wait - it looks overly complex to optimize

#

ill have to get myself more well versed with it