#AI21 Jamba MoE: 256K Context, 3X the speed.
11 messages · Page 1 of 1 (latest)
probably a mix of AI21's "Jurassic" models and the name mamba.
so yes its mamba
Any idea if/when we will get this model to play with on OpenRouter? Looks amazing
looking into it
looks like only a base version is available (on HF) - no instruct yet, but it's coming
are there any base versions on OpenRouter at all? I'd love to try them out for certain use cases
If you mean base versions of any model - OR has base Mixtral and Yi, for example.
Sorry to resurrect an old thread, but looks like AI21 is now publicly testing this: https://www.ai21.com/blog/announcing-jamba-instruct