Mistral Large 2 (Mistral-Large-2407-Instruct) | OpenRouter | Page 1

fluid igloo Jul 24, 2024, 3:49 PM

#

New 123B param, 128K token context model

License: Mistral Research License MRL-0.1 (noncommercial use only)

HuggingFace weights (instruct, bf16) | Blog post + benchmarks

Hosted on Le Plateforme

HumanEval+ and MBPP+ performance slightly below GPT-4o

#

Looks to be GPT-4o level by benchmarks

valid fjord Jul 24, 2024, 4:08 PM

#

Does noncommercial means it won't appear on OpenRouter?

fringe cloud Jul 24, 2024, 4:10 PM

#

valid fjord Does noncommercial means it won't appear on OpenRouter?

I think it should be fine: #general message

#

nvm, I guess not

glossy harness Jul 24, 2024, 4:27 PM

#

Mistral hosts this as mistral-large-2407, OpenRouter could route to that

jagged reef Jul 24, 2024, 4:40 PM

#

It's hosted on the big cloud providers as well. On Vertex AI - https://cloud.google.com/vertex-ai/generative-ai/docs/partner-models/mistral - $3/$9 per 1M input/output tokens, and you agree to https://mistral.ai/terms/#terms-of-use for terms

#

Still waiting for AWS Bedrock/Azure AI to update their models

vague sedge Jul 24, 2024, 6:07 PM

#

@mental moat the pricing didn't get updated?

mental moat Jul 24, 2024, 6:17 PM

#

vague sedge <@353228093420208131> the pricing didn't get updated?

cc @eternal oyster

mental moat Jul 24, 2024, 6:17 PM

#

vague sedge <@353228093420208131> the pricing didn't get updated?

Will be updated soon

#

https://mistral.ai/technology/#pricing

Technology

Frontier AI in your hands

eternal oyster Jul 24, 2024, 6:21 PM

#

its already fixed, someone pointed it out in #arc-requests

mental moat Jul 24, 2024, 6:22 PM

#

Ah mb didn't check

dusty edge Jul 24, 2024, 6:54 PM

#

Its almost same on price like gpt4o. Don't see point in this

tall parrot Jul 24, 2024, 7:07 PM

#

Small correction for Large's description on site - it's not closed-source anymore, it's proprietary weights-available now

#

(I can't bring myself to call model licensed like that open)

hollow sand Jul 24, 2024, 8:44 PM

#

mental moat Ah mb didn't check

FYI: The slug for mistral-large still says it's 32k context while the API reports 128k -> https://openrouter.ai/models/mistralai/mistral-large

eternal oyster Jul 24, 2024, 8:47 PM

#

fixing both the above now, thx yall

tall parrot Jul 24, 2024, 9:51 PM

#

eternal oyster fixing both the above now, thx yall

nice!
also it would be nice to have to the link to it's weights
https://huggingface.co/mistralai/Mistral-Large-Instruct-2407
It's a weights-available model afterall

mistralai/Mistral-Large-Instruct-2407 · Hugging Face

eternal oyster Jul 24, 2024, 10:14 PM

#

yah that's getting added in a sec, when I add lepton as a second provider for half price

tall parrot Jul 24, 2024, 10:36 PM

#

eternal oyster yah that's getting added in a sec, when I add lepton as a second provider for ha...

oh? Seems like it's not impossible to get a commercial license then
That's news to me

eternal oyster Jul 25, 2024, 12:23 AM

#

tall parrot oh? Seems like it's not impossible to get a commercial license then That's news ...

thought they had one but they actually just didn't look at the license yet, so nah they won't be hosting

pine hill Jul 25, 2024, 1:57 AM

#

is this any good? : O

#

seems very pricy ;_;

fringe cloud Jul 25, 2024, 1:59 AM

#

pine hill seems very pricy ;_;

cheaper than gpt-4o and claude 3.5 sonnet

pine hill Jul 25, 2024, 2:02 AM

#

ya slightly cheaper

#

how's performance tho, if it's slightly worse, then it's kinda just priced as a stopgap model ig : D

hollow sand Jul 25, 2024, 12:33 PM

#

Small observation: The 'weights available' Mistral Large v2 on Huggingface has -> "max_position_embeddings": 32768,, which seems to limit the context size to 32k (without further RoPE scaling configuration) if I understand this correctly? -> https://huggingface.co/mistralai/Mistral-Large-Instruct-2407/blob/main/config.json

hollow sand Jul 25, 2024, 1:51 PM

#

Fixed -> https://huggingface.co/mistralai/Mistral-Large-Instruct-2407/commit/5c9ce5b5f7a7ad62d03e8c66c719b66d586de26b

viscid fern Jul 25, 2024, 7:11 PM

#

yup it would be awesome

exotic copper Jul 26, 2024, 7:11 AM

#

pine hill how's performance tho, if it's slightly worse, then it's kinda just priced as a ...

worse that gtp4o in my testing

#

but i only tested logic questions so far

exotic copper Jul 26, 2024, 7:49 AM

#

but still better than llama3

exotic copper Jul 26, 2024, 8:10 AM

#

an so much less censored

hollow sand Jul 26, 2024, 8:42 AM

#

I do not think Mistral models get censored in any way, at least I have never seen them refuse any request like other popular LLMs.

exotic copper Jul 26, 2024, 9:10 AM

#

hollow sand I do not think Mistral models get censored in any way, at least I have never see...

haven't used base Mistral that much,
but I find Wizard22b censored in an insidious way, which is really annoying, would prefect if they just straight up refused.

#

I think it because other providers been offereing m22bx8 at the lower cost is why they released it non-commercial.
though it pretty terriable timing since l3-405b just got released.

vague sedge Jul 26, 2024, 10:01 AM

#

exotic copper I think it because other providers been offereing m22bx8 at the lower cost is wh...

I think 405b is much better at making responses good

exotic copper Jul 26, 2024, 10:15 AM

#

vague sedge I think 405b is much better at making responses good

ya, but censored and not in a good way.

vague sedge Jul 26, 2024, 10:23 AM

#

exotic copper ya, but censored and not in a good way.

I don't really have any use cases related to censoring so I can't really feel anything, I feel like multilang is still not the best but it's a significant step up

primal steeple Jul 26, 2024, 7:23 PM

#

Anyone else having issues with regenerating message not working for this model at the moment?

exotic copper Jul 27, 2024, 3:14 AM

#

primal steeple Anyone else having issues with regenerating message not working for this model a...

no, but im currently using the direct api with $5 trial.

molten field Jul 27, 2024, 6:25 AM

#

So is this already available at openrouter? mistralai/mistral-large <- is this it?

hollow sand Jul 27, 2024, 6:32 AM

#

molten field So is this already available at openrouter? mistralai/mistral-large <- is this i...

Yes, the old mistral-large got replaced with "v2" on the same endpoint (unless you have very specific needs you won't miss the old version).

#

See also the model card -> https://openrouter.ai/models/mistralai/mistral-large

Mistral Large by mistralai

This is Mistral AI's flagship model, Mistral Large 2 (version mistral-large-2407). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch announcement here.

It is fluent in English, French, Spanish, German, and Italian, with high grammatica...

upper lava Jul 29, 2024, 9:38 PM

#

After messing with this model. I would say, it's quite creative. Almost better than wizard 8x22b.

viscid fern Jul 30, 2024, 8:56 PM

#

love this model, will add it to my app within 2 days

#

very good one

spring marten Jul 31, 2024, 11:47 AM

#

hollow sand Yes, the old mistral-large got replaced with "v2" on the same endpoint (unless y...

That explains why it feels diffrent from when I played with it months ago.

#

Personally I like it better than Sonnet

exotic copper Jul 31, 2024, 1:38 PM

#

Been switching between this a sonnet 3.5. It's pretty good for the size but sonnet is still a lot better at creative writing. Though this is better than 405b.

coral turret Aug 3, 2024, 9:26 AM

#

@mental moat Mistral's own API is horrendously slow for this. Maybe you guys could add Azure? We compared them and Azure was ~2.5x faster on average.

mental moat Aug 3, 2024, 10:41 AM

#

coral turret <@353228093420208131> Mistral's own API is horrendously slow for this. Maybe you...

Ohh, is it the exact same Mistral large model???

#

I thought the weight is private for that obe

coral turret Aug 3, 2024, 10:42 AM

#

mental moat I thought the weight is private for that obe

afaict it's the same? It was announced in Mistral's official blog for Mistral-Large-2

https://mistral.ai/news/mistral-large-2407/

Large Enough

Today, we are announcing Mistral Large 2, the new generation of our flagship model. Compared to its predecessor, Mistral Large 2 is significantly more capable in code generation, mathematics, and reasoning. It also provides a much stronger multilingual support, and advanced function calling capabilities.

#

#

AWS/Azure/GCP host Mistral non-OSS models due to their partnership

#

The pricing is the same as Mistral API, but wayyy faster

bold pier Aug 3, 2024, 2:59 PM

#

upper lava After messing with this model. I would say, it's quite creative. Almost better t...

I felt like it follow my instructions much better than WizardLM-2-8x22b which often miss simple instructions. Tho I wish it had a cheaper price.

upper lava Aug 3, 2024, 5:17 PM

#

bold pier I felt like it follow my instructions much better than WizardLM-2-8x22b which of...

Yeah, i loving it this right now more than Wizard. I hope they can upgrade with mistral small and medium for cheaper price.

mental moat Aug 3, 2024, 9:46 PM

#

coral turret <@353228093420208131> Mistral's own API is horrendously slow for this. Maybe you...

Azure host has been added! Thanks for the flag 🙏

iron steeple Aug 7, 2024, 3:43 PM

#

this model is suddenly refusing all "unsafe" requests, it wasnt doing that up until a few days ago :/
anyone else notice this?

tall parrot Aug 7, 2024, 3:47 PM

#

iron steeple this model is suddenly refusing all "unsafe" requests, it wasnt doing that up un...

well, only that changed is that Azure was added as a provider
try to exclude Azure

iron steeple Aug 7, 2024, 4:12 PM

#

tall parrot well, only that changed is that Azure was added as a provider try to exclude Azu...

hmm looks like with different prompting it doesnt refuse anymore. (but with weird changes, for example appending "use a json string" made it refuse)
and changing provider didnt affect refusals, but thx for the idea.

midnight rock Aug 7, 2024, 4:36 PM

#

It’s an Azure issue indeed.

#

Frustrating as Mistral’s own endpoint is really slow.

#

If anyone knows whether any of the other possible Mistral Large 2 endpoints are reasonably fast and uncensored, please let me know!

tall parrot Aug 8, 2024, 5:17 AM

#

midnight rock If anyone knows whether any of the other possible Mistral Large 2 endpoints are ...

It's on AWS
Haven't tested it personally, but afaik AWS doesn't add any additional moderation by default

#

It's also on Vertex

mental moat Aug 8, 2024, 5:20 AM

#

iron steeple hmm looks like with different prompting it doesnt refuse anymore. (but with weir...

if changing provider didn't affect refusals, I think it'd be the same thing on other cloud. It seems they are just taking the engine and run it as-is on the cloud infra

iron steeple Aug 8, 2024, 5:42 AM

#

one big thing, i think, is ambiguity of the request makes it more likely to refuse.
so like, ask write about <this crazy thing with X Y and Z>, isntead of write about <this crazy thing>
(i think that was the problem in my case)

midnight rock Aug 8, 2024, 4:54 PM

#

mental moat if changing provider didn't affect refusals, I think it'd be the same thing on o...

Yeah, it definitely depends on the provider, as in Azure is definitely censoring it.

#

Do you have AWS on OpenRouter? Would be great to be able to add that to the list of providers.

spring marten Aug 11, 2024, 2:06 AM

#

ML2 suddenly gotten really strict on me. Ignoring Azure is not helping.

buoyant atlas Aug 11, 2024, 2:16 AM

#

ML2 (and small/medium) all don't seem to work in the chat playground right now with the Mistral provider and instead returns this error. Only requests through Azure seem to work.

#

Looks like all the Mistral models are throwing the same error only from the Mistral provider. I'm guessing any requests with unsupported samplers aren't ignored on their end and instead give an error now?

mental moat Aug 11, 2024, 2:22 AM

#

cc @placid sonnet

#

Fix coming soon, thanks for the flag!

mental moat Aug 11, 2024, 3:08 AM

#

It should be up now fyi

spring marten Aug 11, 2024, 3:13 AM

#

mental moat It should be up now fyi

It is working fine now, thanks!

#

What was going on?

mental moat Aug 11, 2024, 3:17 AM

#

spring marten What was going on?

we were forwarding an anonymous user id to providers (this is useful to detect abuse so we're not de-platformed by the provider themselves). It appears Mistral itself does not accept the parameter

placid sonnet Aug 11, 2024, 9:05 PM

#

Thanks for flagging @buoyant atlas and @spring marten - happy to give you guys some free credits for this issue if you dm me your openrouter emails. we started submitting anon user ids to a few openai-style providers to help them smooth out traffic from some very aggressive users, and our e2e test suite for Mistral wasn't included.

We're going to improve our e2e tests to prevent this issue going forward.

hollow sand Aug 11, 2024, 9:10 PM

#

If an ID is anonymous, it can never be correlated to an actual user, but then abuse prevention would not make sense. So I think you are using pseudonymous IDs, which can be traced to a certain user, if necessary, correct? "anon IDs" is IMHO misleading in that case.

spring marten Aug 12, 2024, 12:41 AM

#

hollow sand If an ID is anonymous, it can never be correlated to an actual user, but then ab...

I think its less about targeting individuals, and more about finding patterns between abusive users so they can better train the AI to ignore those requests.

spring marten Aug 12, 2024, 12:41 AM

#

placid sonnet Thanks for flagging <@1119790675010007060> and <@447549215393054730> - happy to ...

Nah, its ok. I'm not going to sweat over a few cents. Thank you for the offer though.

buoyant atlas Aug 12, 2024, 2:27 AM

#

same, didn't lose anything really, just noticed it happening and switched the provider while it wasn't working

placid sonnet Aug 12, 2024, 7:26 PM

#

hollow sand If an ID is anonymous, it can never be correlated to an actual user, but then ab...

for providers without moderation requirements in their terms, like Mistral, we hash the user ids before submitting to the provider

#

(though, for Mistral today, we aren't sending anything at all. we were just submitting user: null and they didn't like that)

hollow sand Aug 13, 2024, 5:53 AM

#

Hashing the user id is still pseudonymous IMHO as this consistently produces the same ID/hash and can be traced back the second this hashed ID can be correlated to the real ID via a request log on both sides. The necessary metadata for this trace back gets logged officially, all that needs to be done is hashing all real user ids and see which pseudonymous ID from the other log fits to the log entry with the real ID, correlated by timestamp or other very unique request identifiers.
Anonymous would be if this was not possible.

#

(I'm an annoying nitpicker, I know)

raven sorrel Aug 13, 2024, 2:14 PM

#

very good model but it is too slow...

midnight rock Aug 14, 2024, 3:47 PM

#

raven sorrel very good model but it is too slow...

@mental moat @placid sonnet would be great to add a faster & uncensored provider. Amazon Bedrock and IBM watsonx seem to be the available options.

placid sonnet Aug 14, 2024, 9:08 PM

#

hollow sand Hashing the user id is still pseudonymous IMHO as this consistently produces the...

fair point!

harsh portal Aug 16, 2024, 10:13 PM

#

I used it via OpenRouter chat and it followed the instructions fantastically,
I simply needed to extract information from several texts and format them in several different lines, I created special instructions to enter in the system prompt of the interface provided by OpenRouter.
I compared it to the 405B Lllama Sonar and it wins Mistral Large, at least in my tests.
Please note: I am only talking about strict adherence to the instructions!

timber mantle Sep 10, 2024, 9:21 AM

#

Sorry to revive the thread, but does anyone know why Microsoft charges more for mistral-large than mistral-large-2407?
($4/M in, $12/M out VS $3/M in, $9/M out)