MiniMax-M1 | OpenRouter | Page 1

tawny raft Jun 16, 2025, 4:30 PM

#

MiniMax-M1

#

https://huggingface.co/collections/MiniMaxAI/minimax-m1-68502ad9634ec0eeac8cf094

MiniMax-M1 - a MiniMaxAI Collection

wooden quail Jun 16, 2025, 5:46 PM

#

Benchmarks make Minimax look like its on-par with Gemini 2.5 pro

#

I wonder if it token spams, though

analog steeple Jun 16, 2025, 5:52 PM

#

their previous model minimax-01 was also good and very underrated imo

#

came out at the wrong time, like a few days after deepseek-r1 and wasn't a reasoner, but it had really strong long context.

misty onyx Jun 17, 2025, 1:53 AM

#

any plans to host it?

wooden quail Jun 17, 2025, 2:09 AM

#

misty onyx any plans to host it?

Open Router doesn't host models. You have to wait for providers to host it

serene laurel Jun 17, 2025, 3:30 AM

#

16476d60cd7b12a9e44edad47300196581224060_2_1035x514.png

#

Demo:
https://chat.minimaxi.com/

chat.minimaxi.com

MiniMax - 你的AI智能助手

MiniMax AI 基于自研的多模态大语言模型为用户打造的AI伙伴，可以帮你智能搜索问答、精准识图解析、沉浸语音通话、专业/创意写作、文档速读总结、还有独家悬浮球功能帮你把琐事化繁为简。10倍速获取信息，10倍速解决问题。从学生到打工人，或者是自由工作...

obsidian yoke Jun 17, 2025, 4:49 AM

#

anyone got some feedback on how it RPs?

serene laurel Jun 17, 2025, 8:15 AM

#

Novita have hosted it!!!

fallow goblet Jun 17, 2025, 8:58 AM

#

@twilit jacinth @shadow tide
Do we still have deals with MiniMax?

shadow tide Jun 17, 2025, 12:06 PM

#

it’ll come online today

round mantle Jun 17, 2025, 12:13 PM

#

obsidian yoke anyone got some feedback on how it RPs?

bump

fallow goblet Jun 17, 2025, 1:13 PM

#

obsidian yoke anyone got some feedback on how it RPs?

Not yet being hosted

dusty kettle Jun 17, 2025, 1:59 PM

#

What’s the TPS look like?

analog steeple Jun 17, 2025, 3:23 PM

#

Minimax themselves are almost certainly hosting it

shadow tide Jun 17, 2025, 3:30 PM

#

they are, we'll have it today

humble talon Jun 17, 2025, 4:19 PM

#

waxen lily Jun 17, 2025, 5:35 PM

#

How's that coming along

waxen lily Jun 17, 2025, 5:35 PM

#

shadow tide they are, we'll have it today

The hosting

#

Cuz the demo asks for signing up or smth

#

And it's all chinese and chrome's auto translate isn't working

obsidian yoke Jun 17, 2025, 6:01 PM

#

https://huggingface.co/spaces/MiniMaxAI/MiniMax-M1

MiniMax M1 - a Hugging Face Space by MiniMaxAI

waxen lily Jun 17, 2025, 6:24 PM

#

ooohh

#

cool cool thanks

#

the reasoniing looks solid

#

so far only reasoning...

#

hope iit doesn't do a qwen

#

lord wiill it ever stop thinking

#

the pygame app crashed at first cuz it used the same identifier to refer to different things

#

after fixing it it looks kind of boring having asked it to be creative

somber sapphire Jun 17, 2025, 10:16 PM

#

🫠

elfin robin Jun 17, 2025, 10:59 PM

#

it's not even available for me

#

wheezeold

#

aw

fallow goblet Jun 18, 2025, 2:51 AM

#

Okey, this model actually quite good

#

Has been tested it, not as smart as other big model but it didnt have the problematic way of the other model.

wary vine Jun 18, 2025, 3:35 AM

#

unfortunately only the novita ai's endpoint supports tools :/

fallow goblet Jun 18, 2025, 10:32 AM

#

How do we get pass limit of 1000 thinking token? i alwasy get that but i want to try and get 80K token thinking

#

Is there a command on their page about it or it depends on some parameters?

hasty carbon Jun 18, 2025, 2:39 PM

#

waxen lily lord wiill it ever stop thinking

it's qwen on steroids

#

i reached 895s of thinking with no output

humble talon Jun 18, 2025, 3:01 PM

#

Finally a model that just thinks

strange marlin Jun 18, 2025, 3:43 PM

#

very verbose + ultra slow inference isn't such a great combo tbh.
Will see how this one does, though since MiniMax-01 also had grand marketing and was pretty meh overall.

worthy socket Jun 18, 2025, 4:47 PM

#

wtf 1m context window

hasty carbon Jun 18, 2025, 4:52 PM

#

humble talon Finally a model that just thinks

after 15 minutes of pondering, the model decides the best course of action is to remain silent.

humble talon Jun 18, 2025, 5:02 PM

#

the smartest llm would know better than to interact with humans

worthy socket Jun 18, 2025, 5:10 PM

#

humble talon Finally a model that just thinks

LMAO

viscid nexus Jun 18, 2025, 5:25 PM

#

Is this the o3 pro experience? 17 mins is 0_0

supple estuary Jun 18, 2025, 7:23 PM

#

What is "extended" it's more expensive, but says smaller input context. Longer output??

strange marlin Jun 18, 2025, 8:45 PM

#

asking a simple 3-step math question 😅

wary vine Jun 18, 2025, 8:57 PM

#

wait!...

obsidian yoke Jun 18, 2025, 9:10 PM

#

model be like
<think>
So the user wants me to roleplay them as Sasuke in Naruto. To achieve peak immersion, I must emulate post-trauma stoicism, pre-revenge arrogance, mid-revenge angst, and subtle homoerotic tension with Naruto, while maintaining canonical speech brevity and ignoring filler contradictions. Compute precise balance of “Hn” frequency. Cross-check clan massacre references every third sentence. Initiate brooding glare subroutine. Ready.</think>
Sasuke: Konnichiwa

stiff hill Jun 18, 2025, 9:31 PM

#

obsidian yoke model be like <think> So the user wants me to roleplay them as Sasuke in Naruto....

Banger

stiff hill Jun 18, 2025, 9:32 PM

#

worthy socket wtf 1m context window

Yeah but they had 4m context window model previously and it was doo doo

#

Large context window really a marketing gimmick nowadays

fresh prism Jun 18, 2025, 10:03 PM

#

This model is peak

fallow goblet Jun 18, 2025, 11:45 PM

#

strange marlin asking a simple 3-step math question 😅

Haha

#

They truely using manually calculation

cunning flint Jun 19, 2025, 12:43 AM

#

strange marlin asking a simple 3-step math question 😅

The best part is that it isn't even right

hasty carbon Jun 19, 2025, 11:33 AM

#

I wanted to test out MiniMax's older, non-reasoning model, but it seems like it just instantly Internal Server Error's

strange marlin Jun 19, 2025, 4:19 PM

#

Tested MiniMax-M1:
At 456B too large to run local, and as a ultra-verbose reasoning model and slow inference speed via API, found this model to be unusable for any real work.
With 92/8 reasoning split, this model spent most of its time thinking, sometimes exhausting all 40k max tokens without giving a single reply token.

In terms of capability, I found it to be competent at my tech and coding tasks, while producing fairly average results in other areas; around Qwen2.5 Max level.

I place this model in the same category as Phi-4-reasoning-plus or, to an extend, Mistral Magistral, not really usable. But, YMMV!

hasty carbon Jun 19, 2025, 5:36 PM

#

strange marlin **Tested MiniMax-M1**: At 456B too large to run local, and as a ultra-verbose re...

I sorta agree with this, however I believe it's around qwen 3 235b, not phi-4 or magistral.

It gets into more thinking loops than qwen, but when it doesn't it's around on-par with it when it comes to code quality

fallow goblet Jun 22, 2025, 8:36 AM

#

Finally we have model that can goes above 192K with 71% understandbility

#

Minimax really are different, hope they can keep on going improving their model
Aint the smartest yet but really competent model to fight the gemini 1M context

civic coral Jun 22, 2025, 2:06 PM

#

Can you link this table? Does it go up to 1M?

wooden quail Jun 22, 2025, 3:09 PM

#

civic coral Can you link this table? Does it go up to 1M?

It doesn't go up to 1m

This graph is from fiction livebench: https://fiction.live/stories/Fiction-liveBench-Mar-25-2025/oQdzQvKHw8JyXbN87

strange marlin Jun 22, 2025, 7:16 PM

#

fallow goblet Finally we have model that can goes above 192K with 71% understandbility

the model is ultra slow though. 22tok/s currently at long cot reasoning, when adjusting for verbosity, its effectively the same speed as claude generating at 1.6 tokens per second, unusable.

just for reference, when I set up some chess matches (usually takes a few minutes), I had to have it run all night > was still playing > and when I then came home from work almost 24 hours after starting some matches finally concluded. that is the degree of usability we are talking about here.

serene laurel Jun 23, 2025, 6:49 AM

#

wooden quail It doesn't go up to 1m This graph is from fiction livebench: https://fiction.li...

This is 40K model not 80K

fallow goblet Jun 23, 2025, 7:27 AM

#

strange marlin the model is ultra slow though. 22tok/s currently at long cot reasoning, when ad...

Still impressive
No other model than google model has able to achive good context understanding at high token amount, maveric still below them and they are much more uncensord unless if we talking about china stuff.

wooden quail Jun 23, 2025, 10:32 AM

#

serene laurel This is 40K model not 80K

Context length, I meant

wary vine Jun 23, 2025, 10:28 PM

#

maverick is just unusable in every way for me, very awful

#

weird style of writing, forget most stuff so awful for summarizing things like youtube videos, doesn't know what to prioritize

serene laurel Jun 27, 2025, 3:20 PM

#

A new provider for hosting minimax m1
https://www.siliconflow.com/models/minimaxai-minimax-m1-80k

Run MiniMax-M1-80k for Fast, Scalable Inference | SiliconFlow

Deploy MiniMax-M1-80k , a powerful text model, on SiliconFlow with ultra-low latency, high throughput, and flexible pricing. Fine-tune or integrate instantly via OpenAI-compatible APIs.

fallow goblet Jun 28, 2025, 12:03 AM

#

serene laurel A new provider for hosting minimax m1 https://www.siliconflow.com/models/minimax...

@shadow tide @twilit jacinth
You guys need to contact and work with em

cunning flint Jun 30, 2025, 9:48 PM

#

This model doesn't seem great at speaking english

#

waxen lily Jul 1, 2025, 1:12 AM

#

@shadow tide

#

That was you or the original guy?

#

Cuz i pretty sure i pinged him too accidentslly

#MiniMax-M1