#MiniMax-M1

83 messages · Page 1 of 1 (latest)

wooden quail
#

Benchmarks make Minimax look like its on-par with Gemini 2.5 pro

#

I wonder if it token spams, though

analog steeple
#

their previous model minimax-01 was also good and very underrated imo

#

came out at the wrong time, like a few days after deepseek-r1 and wasn't a reasoner, but it had really strong long context.

misty onyx
#

any plans to host it?

wooden quail
serene laurel
#
chat.minimaxi.com

MiniMax AI 基于自研的多模态大语言模型为用户打造的AI伙伴,可以帮你智能搜索问答、精准识图解析、沉浸语音通话、专业/创意写作、文档速读总结、还有独家悬浮球功能帮你把琐事化繁为简。10倍速获取信息,10倍速解决问题。从学生到打工人,或者是自由工作...

obsidian yoke
#

anyone got some feedback on how it RPs?

serene laurel
#

Novita have hosted it!!!

fallow goblet
#

@twilit jacinth @shadow tide
Do we still have deals with MiniMax?

shadow tide
#

it’ll come online today

round mantle
fallow goblet
dusty kettle
#

What’s the TPS look like?

analog steeple
#

Minimax themselves are almost certainly hosting it

shadow tide
#

they are, we'll have it today

humble talon
waxen lily
#

How's that coming along

waxen lily
#

Cuz the demo asks for signing up or smth

#

And it's all chinese and chrome's auto translate isn't working

waxen lily
#

ooohh

#

cool cool thanks

#

the reasoniing looks solid

#

so far only reasoning...

#

hope iit doesn't do a qwen

#

lord wiill it ever stop thinking

#

the pygame app crashed at first cuz it used the same identifier to refer to different things

#

after fixing it it looks kind of boring having asked it to be creative

somber sapphire
elfin robin
#

it's not even available for me

#

aw

fallow goblet
#

Okey, this model actually quite good

#

Has been tested it, not as smart as other big model but it didnt have the problematic way of the other model.

wary vine
#

unfortunately only the novita ai's endpoint supports tools :/

fallow goblet
#

How do we get pass limit of 1000 thinking token? i alwasy get that but i want to try and get 80K token thinking

#

Is there a command on their page about it or it depends on some parameters?

hasty carbon
#

i reached 895s of thinking with no output

humble talon
#

Finally a model that just thinks

strange marlin
#

very verbose + ultra slow inference isn't such a great combo tbh.
Will see how this one does, though since MiniMax-01 also had grand marketing and was pretty meh overall.

worthy socket
#

wtf 1m context window

hasty carbon
humble talon
#

the smartest llm would know better than to interact with humans

worthy socket
viscid nexus
#

Is this the o3 pro experience? 17 mins is 0_0

supple estuary
#

What is "extended" it's more expensive, but says smaller input context. Longer output??

strange marlin
#

asking a simple 3-step math question 😅

wary vine
#

wait!...

obsidian yoke
#

model be like
<think>
So the user wants me to roleplay them as Sasuke in Naruto. To achieve peak immersion, I must emulate post-trauma stoicism, pre-revenge arrogance, mid-revenge angst, and subtle homoerotic tension with Naruto, while maintaining canonical speech brevity and ignoring filler contradictions. Compute precise balance of “Hn” frequency. Cross-check clan massacre references every third sentence. Initiate brooding glare subroutine. Ready.</think>
Sasuke: Konnichiwa

stiff hill
#

Large context window really a marketing gimmick nowadays

fresh prism
#

This model is peak

fallow goblet
#

They truely using manually calculation

cunning flint
hasty carbon
#

I wanted to test out MiniMax's older, non-reasoning model, but it seems like it just instantly Internal Server Error's

strange marlin
#

Tested MiniMax-M1:
At 456B too large to run local, and as a ultra-verbose reasoning model and slow inference speed via API, found this model to be unusable for any real work.
With 92/8 reasoning split, this model spent most of its time thinking, sometimes exhausting all 40k max tokens without giving a single reply token.

In terms of capability, I found it to be competent at my tech and coding tasks, while producing fairly average results in other areas; around Qwen2.5 Max level.

I place this model in the same category as Phi-4-reasoning-plus or, to an extend, Mistral Magistral, not really usable. But, YMMV!

hasty carbon
fallow goblet
#

Finally we have model that can goes above 192K with 71% understandbility

#

Minimax really are different, hope they can keep on going improving their model
Aint the smartest yet but really competent model to fight the gemini 1M context

civic coral
#

Can you link this table? Does it go up to 1M?

wooden quail
strange marlin
# fallow goblet Finally we have model that can goes above 192K with 71% understandbility

the model is ultra slow though. 22tok/s currently at long cot reasoning, when adjusting for verbosity, its effectively the same speed as claude generating at 1.6 tokens per second, unusable.

just for reference, when I set up some chess matches (usually takes a few minutes), I had to have it run all night > was still playing > and when I then came home from work almost 24 hours after starting some matches finally concluded. that is the degree of usability we are talking about here.

fallow goblet
wooden quail
wary vine
#

maverick is just unusable in every way for me, very awful

#

weird style of writing, forget most stuff so awful for summarizing things like youtube videos, doesn't know what to prioritize

serene laurel
fallow goblet
cunning flint
#

This model doesn't seem great at speaking english

waxen lily
#

@shadow tide

#

That was you or the original guy?

#

Cuz i pretty sure i pinged him too accidentslly