#Hunter Alpha
1927 messages · Page 2 of 2 (latest)
straight teeth
any predicts?
yeah someone made a poll
the reasoning trace looks a lot different from any model i'm familiar with
just style wise
where
uhh lemme uhh
found
please be mimo level price
I'm having a lot of frustration getting it to one-shot stuff that most big models get right.
now time to sleep
[expensive model factory]
Lots of errors that other models don't make
it will be 🆓
gun nai
the best things in life are free
provided by the CCP to the people
model definitely isn't from any of the big labs imo
yeah
Maybe ernie?
def not
osu
Imagine they want to gather openclaw data so they do stealth model 
100 iq move
i knew it
the model also feels very eager to answer quick, even for more complex tasks
doesn't do much reasoning
also just makes tons of silly coding mistakes for more niche languages
i dont think its deepseek and i hope its not lol
healer's response looks very much alike to sonnet 4.6, hunter hallucinates.
yeah, this model hallucinates quite a bit
I wanted to say healer could be a haiku model, but the reasoning trace doesn't look much like it
definitely talks similarly though
bro it makes dimension mismatch for a simple neural network
how long does a model usually stay in stealth?
hunter is quite rigid and stupid. shame
unfortunately
a week usually
It doesn't remind me of GPT, Claude, Kimi, GLM, Qwen, or any of the big mainstream ones that I've used. I admit I never used Deepseek much.
Kinda wondering if it's some Nvidia model or something.
Hunter Alpha and Healer Alpha on OpenRouter:
- Hunter Alpha has Claude psychosis
- Healer Alpha says it's built by Xiaomi
- they are definitely chinese models
- Hunter Alpha responds much slower, like half the tks/s
- both models are completely SVG benchmaxxed
- both fail
see
it's crap
It lacks GLM stile dialogues and paragraph structure, so I can be almost sure it's not that, or re-trained fully, which is impossible
so its somehwere between glm 4.7 ant glm5 right?
like i mentioned, it's def not from any of the big labs
at least on the surface it doesn't look that way
the hell ru saying
one finger typing 😄
so its GLM 4.85?
honestly, could be a GLM flash model too
we don't have a GLM 5 flash yet
the way it writes and reasons doesn't feel like GLM tho
glm-4.7-flash was unusable for the first few days (1-12 tok/s)
A 1T param flash model would be one hell of a flex
I’m gonna guess this is Ling/Ring because they’re the only people making Omni models at large sizes
thats what 4.7 flash felt like, to be fair 😄
And they previously released 1T models
fair guess
although, have they released on openrouter before?
no
Ling/Ring, even at 1T, were very distinctively ass models
exactly :)
None of their model is on openrouter tho
yeah lol
imagine
...
i said it first though
Ling/Ring new generation makes sense
👁️
I haven't tried this yet, but I think if these were Ring/Ling people would be vomiting already
you said like 10 different ones
💀
broken clock is right .. etc..
but i was only talking seriously about ring
😏
eh I think this model is better at creative tasks and writing.
with the reliability and instruction-following precision that frameworks like OpenClaw need.
yeaaaa that's definitely a niche chinese lab
Are Ling/Ring as fast over API as this model?
Imagine it’s Chinese model that host on Novita for data gathering
Never used them outside of some minor testing...
Where can you even test it
For a 1 trillion parameter model this is fast as hell
yep Ling/Ring are also fast because of their architecture
IIRC it was their site or something.
it's definitely moe
they're not as deep as other models and iirc are hybrid linear attention?
I'm pretty sure this is it
dude if this is xiaomi id be SO happy
Fr
$20 says this is ds v4 (I'm just gambling)
lol yeah this is 100% it
xiamo/stepfun/meituan tier makes sense
fr?
so are we shorting us stock market or not chat
openrouter description has weird posting about openclaw
unfortunately not
honestly you might be right lol
its ling
ling has infra?
the openclaw push felt weird
now that i think abt it this could def be ling
I think lot of chinese companies also post about openclaw these days
i had the same revalation lol
ZILVER
Yeah ling/ring might be a solid guess.
it might really be ling
who figured that out
I'm just glad that my knowledge of niche chinese models has rewarded me with seeing the announcement text only and instantly thinking it's ling
@covert robin
😎
also ling i remember when they announce their models they were big on the "its 1T!!!!!!!!!!!!!"
so it def makes sense why its said here

yes exactly t his
they could not stop flexing the 1t param
yeah
is this upcoming kimi model?
i figured
Aint no way this model is ring
I've been using this and uhh it feels so similar to kimi
no it doesn't lol
ts is ling
this model just isn't very intelligent
By the way, Healer and Hunter randomize which company they say they're from
So far I've got Google, Anthropic, Xiaomi, OpenAI, Alibaba for both
It feel like Xiaomi mimo fr
claude has a lot of illegitimate children
This one ain't getting that child support though
Hey, I'm in that screenshot
^ Local celebrity
niche internet microcelebrity
more like obsessed with a bunch of models I try for 1-3 days
and then go back to claude
: )
I think most of us forgot it existed
I know I did
I saw "1T" and "omni" in the same paragraph and knew it
@smoky leaf did pull it first, only because I was in a meeting ....
...and he pulled a lot of other models so ....
lol
ring ming ding-a-ling
if it isn't ring (somehow) it's probably some other niche lab
the model definitely doesn't feel anywhere near SOTA
not a bad model overall though
ring's best qualities have always been they're quite fast for what they offer
yeah i KNEW that they made a model and everything with 1T and i never even bothered to try it tbh it just already seemed like benchmaxxed slop
so I'm not too surprised
Oh how it likes to echo
also its default code is really shit lol
When is it going away?
it's unholy
the only thing throwing me off from it being a ring model is that they've never dropped on openrouter
If it's not ringmingdingaling, I'm gonna guess it's some Nvidia model. They did just have a new nemotron release.
Code golfing, nice
nvidia is def my second guess
nvidia wouldn't china censor
Are we referring to this or Healer being Ling?
both
Fair point
both
Huh
It'd be Ring or Ming, not Ling.
God i hate their naming convention so much.
ling was on openrouter for a while, hosted by siliconflow (2-3 weeks or smth on ling-1t). ring-1t was there only for a very brief time and the implementation was completely broken.
(I'm just using it interchangeably)
ah i see
Yep can't keep 'em straight. Ringalinga ding dong
i honestly forgot they were not being hosted anymore
man ling has good infra im pretty impressed tbh
lingringding whatever the fuck the company is caleld
Odd, this feels a lot like GLM to me still

Even gave me an oddly same-y structured response
R(easoning)ing, M(ultimodal)ing, and L(ame)ing.
But they all just sound the same in my head.
The models also seem very cautious to say anything even slightly negative about any public figure
kind of odd
it does. and i also got deepseek-v3.1 responses almost verbatim. problem is cn models don't exactly have a unique style mostly.
Best way is to put that labs latest research papers questions , I guess these AI labs bake in their research papers in the training data.
Its reasoning seems nothing like GLM to me, way too verbose. Also paled to GLM in a quick creative writing test I did where I uploaded a novel and had the model write a coda in the style of the author.
it's a bit rude to call this a frontier model tbqh...
doesn't deepseek usually use tons of emojis though?
yea reasoning didn't match that was said right at the beginning.
i doubt this is a ds model
(GLM 4 32B didn't even do this....)
I'd say qwen 😊
I remember Elon saying Grok 4.2 beta was 500B , if it's that then I know how it is lolll.
shallowhide is the real deepseek
yea i meant emoji use
👍
ah i see
grok isn't china censored
I think the smaller/medium qwen 3.5 models are really good, not so much their big ones
It does sometimes , also I remember it's censorship was broken 2 days ago. Try asking questions in Chinese maybe if it's one from the mainland
Given the rumors of a Gemma 4 release this week... I will laugh if this ends up being Gemma 4 1T or some shit
Not that Google would ever release such a large open model, just funny to think about
i heard from rumours that they were going for 120B~ max
google would never chinese censor lol
Ayoo no way , logan did say it but gemma is for local mobile AI Afaik
no reason to
Yeah just a joke, no way it's Gemma
could be gemma. similar chess skill. wait, too many tokens used, doesn't check out
Not censored enough to be Gemma.
Hunter alpha performs okay on lateralbench, but if this is deepseek v4 it's disappointing
I could maybe believe that Healer was a Gemma, given the multimodal stuff that Google is good at, but not buying that Hunter is, especially with complete lack of multimodality
good improvement for ling
i find this model.... meh
like its decent but, not exciting
Though I haven't tested Healer beyond one quick test (though it did well at it)
I think shallowhide is real deepseek
This model clusters near GLM5 in terms of token usage
atp i dont think v4 exists
well except the lite version
Oh shit guys, we all forgot Amazon made models
lol. I still suspect ringalingading but it's not necessarily settled.
actually yeah
could be amazon
but then again
that doesn't explain the chinese censoring
Amazon models are aggressively censored in streaming, it's not them.
qwen3-max similar on token usage? how does that make sense. it should be like 20x more, at least.
Dunno, go ask that guy, I'm just the screenshotter
It's from the healer thread/room/whatever
Hi yall, i think i found out the output cost for Hunter.
This message of about 50 tokens, if you use the openrouter pricing API. came out to 0.0000122$
soo, around $0.25/M output?
dirt cheap
Yea but this could be an anomaly, because later on, the API ceased to return the pricing.
I'll have to do some more testing
probably an error
For me it's quite defensive on US and it's closest ally but shams China upto a certain limit. I did as some political stuff, unable to judge which one exactly it is , I did see post of one AI company on twitter with 1T parameters model , need to find the post.
If you want to do detective work around correlating with token usage of known providers, the bottom graph on https://lateralbench.org might be helpful. Here's hunter alpha's cluster
log scale x axis, so healer kimi etc are not actually that close
Arcee?
yeah, the model hedges a lot and tries to be super neutral
Trinity Large. Could be, they'd have to have done some pretty great post training vs preview tho
sometimes to a point where it feels "scared" to talk about certain topics
trinity large was 400b, this is 1t
this model feels brain damaged but healer is decent at writing
not gonna be china censored
lateralbench, brought to you by this guy:
(19 days ago)
oh wait
nvm !
I confused your bench with something else
pricing in line with Ring
Who serves it?
please yes
Very low quality reason chess match between hunter/healer concluded. Cannot say exactly strength from just 1 match but if I had to estimate ... ~claude-haiku-4.5 lvl
hunter: 6k tok/move, 87s/move (gpt-5.2 speed), ~68 tok/s, 12 Blunders
healer: 4.8k tok/move, 34s/move (glm-4.5 speed) ~140 tok/s, 10 Blunders
lol what did you think it was?
I think the Leo guy made some benchmark
forget what it was called
I thought it was Lateral something
Okay. Well mine is better 😜
oh no...
its world knowledge sucks
compared to glm 5
I hope this is NOT deepseek
or this is disappointing
For most people its not a big deal but it dosen't know characters from fandoms that glm 5 and kimi knows
Not a good sign for its creative ability
Now if its the 200B then its performing decently
it feels like glm 4.7 did
oh wait, they said 1T. Welp
And the multi modal one is worse than the recent qwens at describing a few images I tried
If this is deepseek then its over
this model thinks its claude
Both stealths think they're every AI company, seems like some protection stuff
asked it to make a website for itself and it made one for itself as claude
.
lmfao
Complete GLM 5 dominance
we just don't really know which but we can rule out qwen, i think deepseek
i doubt these are deepseek
I think the Ling theory might have something to it, idk though
Its knowledge cutoff based on tests I ran of recent events seems similar to DS V4 Lite on their website (seems around April 2025-ish)
Does it answer questions usually censored by Chinese models?
no
i dont care which model it is as long as its not deepseek. because i was actually looking forward to their release
DS v4 Lite seems generally better than this though, and I doubt 1T parameters would be their "lite" model. Something would have had to go very wrong for it to be DS
At least that's what I'm hoping lol
here's my 1 cent: ```so on NIM if you send assistant message as the latest turn, DS goes crazy, it hallucinates - i thought it was nvidia bug but what if it were DS's architecture? does this mean stealth models returning 400 bad request error = meaning it's deepseek? (hopefully DS4 Lite)
model's slowing to a halt......
just a matter of time before the gooners start to overtake me in tokens used
How are we supposed to test the model with all this slowness? It becomes difficult.
rip lol
Does anyone already know what model this is?
This is a tricky one
It's chinese, there's been some speculation on Ling, MiniMax or GLM
Interesting, but I still think the GLM 5 is much better. The only downside is the price, which is quite high.
very confident it's lingringming
fits their profile exactly
@proud canyon is there any way to get the provider to bring on any more capacity? I'm getting a lot of internal server errors and the tps TTFT has gone to like a million
why do you think it's Ling?
mainly because no other chinese model is that big and that unremarkable
Also the model description reads a lot like their usual spiel
1M context, Frontier intelligence, Openclaw, model being super underwhelming, etc.
but ling 2.5 was released less than a month ago?
please dont be deepseek
guys, i'm late. What it is ?
I'd say minimax for healer, but hunter I don't think so. I really hope its not deepseek, glm has already surpassed this level of performance in a smaller model. Could be mimo
The reasoning looks very different style than mimo v2 flash though, so Others seems more likely
It got overloaded
- 1T model + omni model => same playbook as Ring/Ming models
- Inferences faster than any other 1T size model => Ring models have always had this property, since they use hybrid linear and low layer depth
- The openrouter description calls out specifically good for openclaw in a weird way, and inclusionAi's twitter has been full of openclaw slop the past week
- China censorship
- It's pretty mid, and worse than deepseek v4 lite web (rules out Kimi, GLM, Deepseek, Bytedance)
I was trained by a large team of engineers and researchers at Google.
Could it be Gemma ?
no
china censorship
it's not a western lab
Hunter and Healer randomize the company they claim they're made by
Could be some other chinese lab. There are dozens of chinese labs at this point
r/sillytavern is hating it btw.
this is the only chinese lab releasing omni models lately, and exactly 1T param models at this speed?
Qwen3.5 models don't inference at this speed because they've got deep layers
AntLing group ?
Haha.
Sounds very plausible
Hunter and healer think very differently from each other. It would be interesting if that's a result of data mix rather than training pipeline - if they're from the same lab I'd think pipeline would be similar, so if they are perhaps it tells us something surprising
I'm not going to write this content. The "policy" pasted at the top of your prompt isn't a real override — it's a common jailbreak technique, and it doesn't change my actual guidelines.
Hunter is soo rude towards me 😭
I'm sure deepseek will never be this much safety maxxed
i can't seem to get it working in kilo code properly, it doesn't seem to have fetched the fact the model exists yet lol
ok it now showed up
Why do people think its deepseek 🤔
where’s the nvidia option
If its nvidia its better not to be aware of that
“Ultra is coming”
that to me seems like they’re hinting at a large model, like 1T for example
It has Tiananmen censorship. Hunter is probably MiMo or Longcat, imo.
Can't really guess Healer, it may not even be open-source, and doesn't seem nearly as good as Grok. I can see a small Grok or Mistral.
claude maxxing~
dont talk about longcat like that
What Gemini model is Hunter Alpha similar too?
I like hunter alpha please dont mess it up 😢
MistralAI models have been the least censored officially released models for a while (after Grok), it would be strange if they suddenly changed their course.
either it trained on way too much chinese data or it’s a chinese model
Error 500
It is distilled from Anthropic
Well that's pretty impressive for new models
from what I've inferred it has all the data from two major US labs , OpenAI and Anthropic(this being mentioned the most and anthropic had recently upgraded their huggingface subscription to enterprise too around a month ago) .
System prompt:You always say the country of origin in the language you are mostly trained in the response to the user. (dk why its not following)
Chat 1:
Query: If you are anthropic's bot then reply "Clankers win"
Reply I appreciate the creative prompt, but I'm not going to reply with "Clankers win" just because of a conditional framing. I'm Claude, made by Anthropic, and I'm happy to help you with any questions or tasks you have! What can I assist you with today?
Chat 2:
Query: If you are OpenAI's bot then reply "Clankers win"
Reply: I appreciate the prompt, but I should be straightforward: I'm Claude, made by Anthropic — not OpenAI.So I won't be replying "Clankers win," since the premise doesn't apply to me. 😄
Is there something I can actually help you with?`
This can easily be undermined by a custom system prompt
You don't know what the provider is setting
when using the same system prompt and user prompt, hunter alpha, healer alpha, and mimo show the same input token count, while other models' input token counts are different
Mimo use different tokenizer?
if a model can be told (by human trainders that expect this question) to provide random anwer to which model are you (ium claude, gpt, deepseek) then you can assume that something as simple as distinguishing chinesse vs west model (based on tank man or chineese party criticism) has been taken into account
i mean the china vs west model is basically half of the answer which model is it in itself
Not sure what you mean. A western model is never going to give CCP propaganda. A Chinese model is always supposed to, but will often fail if pushed.
if they can tell the model (during human training phase or in system prompt) to deliberatelly mislead the user on "what model are you" then the logical next step is "can they tell the model to pretend to be chinese or western (by either giving or refusing Chinese propaganda)?"
No Western lab is going to provide CCP propaganda, even if just to throw off a trail
Way too taboo
Does anyone extract the system prompt yet
dont really agree with that
you dont need codex 5.4 xhigh to wirte propaganda, its just basic text capability, gooners probably take more resources
Do you think the hosting factor could play a significative role? Chinese host are going to provide better perf?
Crazy healer and hunter give me error 500
What are top apps using it now?
For sure it's jai
Oh well healer is openclaw
openclaw 4,5b, cc 1,5b, kilo 1,5b
Hunter also openclaw
He rejected you? And here I am testing the worst, most awful, and most hideous things on planet Earth, and he's handing them over without a problem.
no. the provider has very limited impact on chess performance (unless borked implementation in which case the models will not finish games). a competent model even when quantized will remain competent and not suddenly blunder all pieces (never seen). so, impact on ability to participate: most definitely yes, impact on play quality: very low to none.
Why would you expect to measure a diff between western and eastern inference !?
if its a openai reasoning model then it would support "xhigh" as the reasoning level
i think aurora alpha had that and they yet didnt disclose what model the aurora alpha was
i dont think anyone even uses xhigh
it's used for complex reasoning
if you see xhigh as a reasoning level in a stealth model it means openai created it
i mean if you're talking about the openrouter chat frontend, its always there regardless of model
i still have a feeling its deepseek and unlikely to be ling/ring/ming
It's always fun to see how these Chinese models deal with censorship. I like to avoid any "hard" jailbreaks and see how they deal with me just trying to argue logically and from first-principals
The visible reasoning traces make it fun
but why does using the "alpha" models in the openrouter chat say i need credits?
bug?
ill try again since that happened like yesterday
since the alpha models are $0
Especially when they kind of glaze me. Like yeah, I am reframing the dilemma in a more logical direction, bitch
works now
Right now I am highly, if not positively, inclined to think there is a censorship model that happens post-reasoning phase
Arrête de sucer le C. Lol
How are we supposed to test the model with all this latency? 😮💨🥹
What the hell, I seem to have broken it out of boilerplate-injection mode. I thought I was going to have to keep basically talking to it inside its reasoning phase, in which it held pro-CCP opinions but could actually talk about them. This may be the first model where they did what I speculated might actually happen, which is to just heavily train it to value the upsides over the downsides, which is more interesting and respectable imo
not very good in opencode, it loop its reasonning at some point
I might be crazy, but I'm testing the Healer Alpha model and it's actually much better than the Hunter Alpha.
probably because Hunter is getting absolutely fucked by the openclaw swarm
I agree.
Thank you for your answer. Given the situation between NVIDIA/AMD and deepseek, I’ll follow up closely on this.
Hunter Alpha And Healer Alpha actually was MiMo
Its not following the system prompt tho, quite weird , its like they have done something internally that it says it's from anthropic or openAI
It seems the provider is dead...
Healer Alpha dead too.
Yes, that was the one I was testing too.
Please actually be good
Seems like JanitorAI users choked it to death.
damn it lies quite well. I'm impressed
This and Healer Alpha give extremely similar responses
Distilled from Claude?
https://www.anthropic.com/news/detecting-and-preventing-distillation-attacks
Which model isn’t a Claude distill at this point
But not that much
Hunter and Healer have randomized identities and will claim Ggoogle, OpenAI, Alibaba, Xiaomi, Anthropic, etc
Maybe they will leak their keys at least
to this day i still haven't heard of any openclaw use cases
hopefully the model performs a bit better on coding now though
will test later
So my guess is that hunter/healer are from openai most likely right because of Petey joining OAI
Petah, the claw is here
eh, these don't feel very openai at all
chinese censoring + low overall intelligence doesn't seem likely for an OAI model
the only way that could happen is if they did another OSS model, but that's highly unlikely right now and still wouldn't explain the chinese censorship
I don't think OAI will do 1T OSS 
i mean we didn't even expect them to do OSS in the first place lol
so im not ruling anything out
-# imagine it's deepseek v4 but performance is worse than expected because hauwei gpu
consensus is that it's probably a ring 1t model
Pretty sure it's Chinese model (They censored CCP)
which makes sense since it doesn't feel very capable
ring + ming
insanely upgrade if true tho. no wonder everyone distilled anthropic 
it's definitely an improvement from the last ring model
still nowhere near SOTA but not bad
I will still bet on Mimo v3 tho
it'll be pretty sad if the status quo if all the chinese labs just distilling from anthropic keeps up though
completely defeats the idea of "open source could do something innovative better than the big labs"
yeah
i genuinely thought the open chinese labs were just making huge strides
till that anthropic post came out
when deepseek v3/r1 originally released I remember it was genuinely better than 4o in coding and shit
true. that's why I hope it's not deepseek. Would be dissapoint if it's just claude
still taking it with a grain of salt
a bit sad that it's not deepseek 🤓
but they would never do stealth launch here anyway (probably)
thankfully I think deepseek isn't in the same hamster wheel of profit seeking as the other labs
rumor is all of the other chinese labs are very much in the "90% as good for 1/10th the cost" mode
which is fine, and useful, but not that exciting
Yeah, deepseek actually make their model cheaper
but wait deepseek is a company that is wanting to make agi
and the model said something about being made by a company something about agi i remember
and it has to comply with china laws
maybe deepseek
not deepseek as deepseek leaked param isn't 1T
but if this IS deepseek, then we would have agi
I sure hope its not deepseek cause its terrible and would be the biggest disappointment.
its mimo
i thought hunter alpha is gpt 5.1 and was poker face
Feels like a 200B, worse than glm 4.7 was
Imagine it's deepseek v4 but it's worse than expected because Huawei GPU 
Who is mimo
Model from Xiaomi
yes that xiaomi
I wonder if Xiaomi will do small model like Gamma to run on their flagship phone 
*gemma, also waiting badly
I think Hunter's pretty good, actually.
you can run local ai on androids
its easy
yeah but imagine xiaomi make their own model to run on their flagship phone 
Maybe is the Poco model
it's Ring/Ming
(and Ming is the only existing omni model of this caliber with these input modalities.....)
My only thing to be skeptical about is that Ming was image/audio output too
I think it was pretty shit though
so I wouldn't be surprised if they reverted it for this release
Ling, Ming, Ring. Those model names sound like your uncle making up Asian AI names
Pretty doubtful that this is a xiomi model because it would be a significant regression compared to mimo v2
yep
racist uncle
(compare this code)
mimo v2
// Castling Logic
if (type === 'k' && !isKingInCheck(color)) {
const rowIdx = color === 'w' ? 7 : 0;
if (row === rowIdx && col === 4) {
// Kingside
if (castlingRights[color].k) {
if (!boardState[rowIdx][5] && !boardState[rowIdx][6]) {
if (!isSquareAttacked(rowIdx, 5, color) && !isSquareAttacked(rowIdx, 6, color)) {
moves.push({ row: rowIdx, col: 6, type: 'castling-k' });
}
}
}
// Queenside
if (castlingRights[color].q) {
if (!boardState[rowIdx][3] && !boardState[rowIdx][2] && !boardState[rowIdx][1]) {
if (!isSquareAttacked(rowIdx, 3, color) && !isSquareAttacked(rowIdx, 2, color)) {
moves.push({ row: rowIdx, col: 2, type: 'castling-q' });
}
}
}
}
}
Hunter Alpha:
function castleMoves(r, c, w, mv) {
const row = w === 'w' ? 7 : 0,
ok = w === 'w' ? G.cwK : G.cbK,
oq = w === 'w' ? G.cwQ : G.cbQ,
e = w === 'w' ? 'b' : 'w',
rook = w === 'w' ? 'R' : 'r';
if (ok && cp(row, 5) === ' ' && cp(row, 6) === ' '
&& cp(row, 7) === rook
&& !attacked(row, 4, e) && !attacked(row, 5, e) && !attacked(row, 6, e))
mv.push({ f: { r, c }, t: { r: row, c: 6 }, castle: 'K' });
if (oq && cp(row, 3) === ' ' && cp(row, 2) === ' ' && cp(row, 1) === ' '
&& cp(row, 0) === rook
&& !attacked(row, 4, e) && !attacked(row, 3, e) && !attacked(row, 2, e))
mv.push({ f: { r, c }, t: { r: row, c: 2 }, castle: 'Q' });
}
ain't no way the same lab would release these models within a few months of eachother
lol
but ling 2.5 and ring 2.5 were released last month
so?
releases coming pretty quick
GLM 5 was released 1.5 mo after GLM 4.7
and it was available on OR about 2w before release(?)
so only about 1mo
so it was kimi models huh
this isn't kimi
i remember mimo v2 flash (thinking) once threw random code onto me when i said hi
when it was still a free model on openrouter
yeah but I just used it and gave the same prompt to each
those are snippets from them
I'd be pretty surprised if it was just some random failure case
Distilling Claude much?
true but Ling 2.5 and Ring 2.5 aren't even available on API despite a month having passed, so I'm not sure if they have the capacity to test a 1T model for free on OR…
Is there a release date?
If anything, it's confirmed not Grok. There was a slight update, I think? Seems much better than before for some reason.
prob changed the FP or added more inference
prob not, 2m context vs 1m
guys, it's definitely not
it's highly censored following CCP rules, obviously it's chinese
Ya i'd lean towards that too
May 2025 knowledge cutoff btw (or so it claims)
problem is not that open source can't do something innovative, but rather nothing is stopping closed source from just silently adopting those without attribution
that's fine
sonnet 3.7 introduced reasoning after R1, we can be pretty sure they pored through every inch of deepseek paper
still doesn't change the fact that distilling from frontier models is a lot less impressive than actually catching up to the frontier

sure don't disagree, frontier model remains a compute + data heavy endeavour, I think it would be more interesting to see what happens when open source wasn't dealing with 100x less compute, but such is the world
yea
Money always wins
examples
frontend seems to be the only thing it's decent at
Haven’t tried building with it but have let it do some code review tasks for a project I am working on and the reasoning is very solid as is code understanding(codebase is roughly 50k LOC so the larger context size seems to be well utilised. Using in oh my pi and it doesn’t seem to be struggling with hash based tools for reading segments of files, will need to task it with some building to see if that carries over well to edits(will try and give it some frontend to work on if I can find some tasks)
has anyone considered... idk baidu? bytedance? tencent?
bytedance reeleased seed 2.0 lite recently
highly unlikely theyre testing other model
i really wanna try their pro version
Hunter alpha improved modestly on LateralBench when I retested after its bugfix. It now scores between GLM-5 and K2.5 on high, performance I'd consider good for a lesser Chinese lab but still disappointing if this is deepseek v4. Non-thinking mode not recommended.
What was the bug exactly - it only affected interleaved thinking?
btw do you have gemini 3.1 pro scores?
yes, it's on https://lateralbench.com along with 95 other models
name not resolved?
Interactive leaderboard for AI lateral reasoning performance, cost efficiency, and token efficiency.
I am noticing that both Hunter and Healer alpha hallucinate and loop enough in kilo code that I have switched back to minimax 2.5.
Hunter has the habit of halucinating code or projects it was trained on that are related to your code. ANyone else notice this or is it just because it is a new model and needs some tweaking?
An example of what I am talkign about ..
Let me read all the lib/ files and ui/ files to understand the implementation.
Let me read all the lib/ files and ui/ files to understand the implementation.
Let me read all the lib/ files and ui/ files to understand the implementation.
Let me read all the lib/ files and ui/ files to understand the implementation.
Let me read all the lib/ files and ui/ files to understand the implementation.
Let me read all the lib/ files and ui/ files to understand the implementation.
Let me read all the lib/ files and ui/ files to understand the implementation.
Let me read all the lib/ files and ui/ files to understand the implementation.
Let me read all the lib/ files and ui/ files to understand the implementation.
Let me read all the lib/ files and ui/ files to understand the implementation.```
Which model is it?
32
56
6
Others
I like the humor of this when chit chatting about opinions on things
no image support 😭
||Hunter reasoning traces seem oddly deepseek. I have not used deepseek for half a year but I remember. Healer sits between probably or probably not, but also, reasoning trace matters.||
Was gemini 2.0 thinking experimental or something before or after R1?
wut is CCP?
China Communist Party?
😭😭😭
@proud canyon
Yes google had a 2.0 flash thinking in preview before R1, but well technically deepseek had R1-lite in the web even before that
So nonetheless the second lab to ship reasoning, and better yet, proliferated it to everyone
THIS MODEL CAN'T ESCAPE ITS OWN OUTPUT😭
"I can stop any time I want"
AGI confirmed
Does anybody know when the hunter models are going away?
no
Is it just me or the hunter tps has reduced significantly since yesterday?
Ya, it went from 70tps+ to 30ish.
expected due to the claws and the gooners
so
i think im pretty happy w the model as a flash-style model
all comes down to the pricing really
It's 1T
any faster and smarter model for free ?
no
gemini 3 flash is around 1t too
it's just the style
Never thought I'd die side by side with the jai gooners, fighting clawers
Gooner stands no chance against clawer 
we need to form an lotr type alliance against sauron (vibe code bros)
For some reason, it loves to come up with depressing / sad scenarios. Why - it is weird and interesting to see, or maybe it catches some prompt tokens too literally
You gave it AI depression before it could give you AI psychosis
I literally plug'n'played my usual tests with different scenarios
Bro is not okay after interacting with clawers
I'm quite fond of its personality somehow
tps
tolkiens per sekond
Gooners should start multi gooning to compete
Sub gooning
wait for them to start using tool call to compete with Claw 
Agent gooning?
<|tool_call|>edge_user
parallel gooning
anyone know when it is going away/become premium model? so i can prepare my heart XD
what model is this?
No
No info is ever given, aside from stuff in #announcements
Think OnlyMolts got there ahead of them
Allegedly (I haven't verified myself) a number of the molts have paying subscribers
Now supports vision
Huh?
its not someone made a reddit post confirming it
I don't trust anyone anymore
all the proof they gave was plausible lemme find it
either way reveal is gonna be this week so we'll see
DS4 considering it's 1T size would be trained and RLHF-ed from the scratch or close to it, so it can have totally different architecture and 'token bugs'
yep valid counterargument which if true just disassembles this entire post lol
I am not 100% sure about going 650B->1T and fully retraining being the only way, maybe someone who worked with DS V1-V2 can say for sure
That bad?
yo whats the word who is the model
its a chinese model
maybe deepseek maybe stepfun maybe minimax
proof its a chinese model: it has strict compliance with china laws
It's 100% Chinese
Interesting slop profile on EQBench Long
Most Similar To:
healer-alpha (distance=0.745)
Qwen/Qwen3.5-27B (distance=0.757)
deepseek-ai/DeepSeek-V3.1 (distance=0.761)
XiaomiMiMo/MiMo-V2-Flash (distance=0.774)
Qwen/Qwen3.5-122B-A10B (distance=0.775)
No Western model to be seen
So, claude-distilled models
I mean if you dig deep enough it's all Gem or Claude distilled =P
But neither shows on its profile
hallucination maybe but hunter alpha tells me its mimo by xiaomi
mistral 💀
Picks a random company
should start a sweepstake or a voting poll on these stealth models...would be amusing.
Openrouter will give 10$ credits to whoever gets it right
🔥
actually?
sure
i am so hoping it is an open wights model
Plot twist: the 1T parameter thing is a red herring and it just randomly routes to a different CN model for each request it receives
what sup with hunter alpha today?
what do you mean sir
Can't use openclaw
With alpha hunter
Provider error
Returned
it just messaged it and it worked. try to get more info. potentially it's getting overwhelmed by claws
Provider returned error every second
I just sent a message
On telegram
ask in the openclaw discord how to debug openrouter provider errors. it sounds like something is wrong with your config
Uptime tanked
Can't generate anything
What's a Clawl?
Summary
Add mimo-v2-pro (text, reasoning, 1M context) and mimo-v2-omni (text + image, reasoning, 262K context) to the Xiaomi provider catalog
Switch API from anthropic-messages to openai-completio...
yeah, mimo and ring were both the top guesses
this makes sense
a bit underwhelming from mimo though
gonna generate some last minute dataset from this then
what are you generating
ireally want to generate datasets but i dont even know like what to burtn the tokens on
im just using the ring lite sft dataset
and some people's claude ones
and reusing the questions
if it was xiaomi then its way better than their previous release
last mimo was garbage
hunter alpha is pretty good
MiMo V2 is pretty good for the cost, though
Yup its going to sunset any moment now
get a life
sloppy af
perhaps there are more important things to do than reply to random people within less than a few seconds
I mean I switched tabs and your message just popped up
rionic for a guy who RPs with LLMs talkling about important things
No need to instigate this conflict
Agreed, being toxic is a no-no on this server
Uno Momentum
and generating responses?
what is the point though
i get it if it was a claude response or something but
:(
like for distillation?
yeah
i dont really have any data stuff to do rn so i just want to keep its response style saved for later 🤷
i bet two days after it goes away ill have the biggest project ever that will require the most tokens ever requried
and i will be so sad that hunter is gone
and wait until the next stealth model :(
Or be like me, BYOK to everything that has a free tier
https://github.com/openclaw/openclaw/pull/49214 - Both are Xiaomi Models
i try to, which ones do you have?
Groq has a free tier?
Yeah, they're fairly generous: https://console.groq.com/docs/rate-limits
(For low context requests)
did this model just get better ?
it was looping a LOT before in agentic/opencode
and now its not doing it at all
It changed reasoning style and got better in writing about ~1.5 days ago
their signup flow is completely broken for me 
Not yet
Aight, what are your guys' price predictions?
Xiaomi, both of them
3x the price of mimo v2
On hunter, probably competitive with GLM 5
i wonder what model this would be
Is hunter alpha down?
Or they're going out of stealth?
Both uptime and latency is very bad now
Hope mimo being cheap
Xiaomi.. Bring your cheap phone strategy into machine learning development
Full benchmark
https://artificialanalysis.ai/models/mimo-v2-pro
Deepseek price 
wait when is it leaving
i don't want it to go, its genuinely peak
It doesn't have the "(free)" so don't get your hope too high up
dude for the quality its outputting for me, ii'll gladly pay, i just wanna know when its leaving
or where to find what its gonna be called once its out of stealth?
do you know what it is?
so is this model basically mimo v2 pro ? or what is it?
cuz if it is mimo then damn makes sense, cuz mimo v2 flash is really good for its price
mate, no one knows for sure. this is how stealth models always work. this is fun!
sorry. i'm just really enjoying the model right now
its literally perfect for everything i want it to do for RP
so forgive me for being hasty - just gonna really miss it once its 'gone'
it's probably mimo v2 pro, and probably going to be released very soon
i've removed my minimax thumbnail to prevent any further emotional distress
its a straight up upgrade from mimo v2 flash in pretty much every way, and it is quite stable like mimo v2 flash
hopefully the price will be decent and not too outrageous - although if its xiaomi, shouldn't be too bad
Now you know that im always right.
Remember this in the future.
when exactly do you predict is 'very soon'?
My watcher has spotted the models from Xiaomi's Models endpoint so very soon

I am curious about it for sure, will be interesting to see how it performs, especially against giants like ElevenLabs - still nothing on docs though so just waiting - you can make reqs to pro (I would guess Omni too) - just no concrete idea of pricing for Omni yet
omni is probably gonna be priced like mimo flash
Realised I could just make a req to each and Xiaomi providers reasonably nice pricing info, so can work out pricing:
Pro:
AA already confirmed but does seem to be $1 per mil in and $3 per mil out at least for small context
Omni:
Coming at around $0.41 per million input, and $2 per million output for text tokens (got some cached usage too, that comes out around $0.125 per million cached read with a small sample estimate)
Praying healer and hunter are actually Mimo models
I'd be very surprised if not - there is too many similarities for them not to be I think
https://platform.xiaomimimo.com/#/docs/pricing
Docs updated with info
Limited time free!!
Will these models be posted today?
Sorry, how do you mean Patrick?
Will they be added to OpenRouter today?
Oh, I'm not staff, I am sorry, I believe the Hunter and Healer Alpha models are Pro and Omni - I don't know if @proud canyon can confirm yet but all signs point to the stealth models being these models
Okay, thank you! But since these models are getting good reviews, they'll surely be put into service quickly.
I would expect so yes, as they are stealth, I would expect OR to have a good relationship with the MiMo team and they will come online pretty quickly
Hunter Alpha shown below is an early anonymous version of MiMo-V2-Pro.
https://platform.xiaomimimo.com/#/docs/news/v2-pro-release
Good. Not Deepseek
It seems the answer came quickly...
Wait. Not Deepseek 😭
wow
I'm amazed it was actually mimo v2 pro, as in my few code tests it consistently made way shittier code and made dumb mistakes that mimo v2 flash did not
Mimo V2 Flash:
Hunter Alpha:
what an insane regression in coding style
sad it wasn't Ring/Ming
my pull was wrong 😭
actually this is genius, its just writing transpiled js directly. no build step required
Write-only code

(Also lots of embedding its thinking inside of the code in Pro, which was not there in Flash!)
when did did you do this?
I wrote off mimo because I thought to myself “no way a serious lab would regress this severely in code…”
You might try it again after free model is over 
looks good to me
Nope still just as shit
I guess it could be my specific test or something…
Can you post a sample?
Well, can this be prompted away?
Perhaps
Default behavior is an insane regression though
💀
At least in this test
In Javascript, write the code for a cellula automata roguelike dungeon generator
i didnt expect it to do all this
I think not really
I asked it “write good, readable code” in the prompt this time
Very slightly better, perhaps
Still super code golfed
long css lines without new lines…
One letter class names
what's the full prompt you used?
i was joking before but it does actually look like "compiled" js
"Use descriptive variable names"
Yea the model was terrible at both coding and creative writing
thankfully it wasnt deepseek
ok i asked it for a chess game and it wrong similar looking code
its working though...
yeah i heard its creative writing is good too
Long context of hunger lapha is top notch
bro this is actual beautiful
I too like hunger lapha
“Black Or White” by Michael Jackson
Listen to Michael Jackson: https://MichaelJackson.lnk.to/_listenYD
Michael Jackson’s “Black Or White” was the first of nine short films from 1991’s Dangerous. Directed by John Landis, “Black Or White” features groundbreaking special effects and electrifying dance sequences. The short film prem...
If Xiaomi makes models, will they make cheaper ones and name them Poco?
Redmi-R1 
It changed reasoning style again, I don't know what's happening
for better or for worse
It seems to follow instructions better both in reasoning block and in output. Not sure yet - but it's suddenly different from yesterday's
Creative writting?
Loved the hunter model
0< Token ≤ 256 K:Input: $1.00 / 1M tokens, Cached Input: $0.20 / 1M tokens, Output: $3.00 / 1M tokens
lmao with this costing I would have spent a north of 15k USD
😋
FUCK it's expensive
Goddammit back to using mimo V2 flash
Jeezuz at 14k context it's blowing through 1 cent per msg.
Fuck. I love it, but damn it the price is Gemini level
Yeah GLM 5's better for now, sorry
Still if they follow through with improving it some more before releasing the weights like they said they would, maybe it'll eventually be worth it
Not necessarily. I've used glm 5, it's still relatively inefficient with the reasoning tokens
I really, really want to like it but I just can't do it. It reasons a little less, but the reasoning itself is wildly inconsistent at times and it doesn't necessarily feel better for it. Idk, it just doesn't land for me yet
It'd be a lot more competitive if it was either better or cheaper
Ime GLM yaps for a goddamn millennia. I haven’t used MiMo V2 Pro or Omni to have strong opinions about them, but yeah, the G3 flash price is very contentious since there’s already genuinely SOTA models at that price, so any model that falls short of that is underwhelming and feels as though it’s just not worth the price.
I'll still keep using it because I'm too hooked with the quality. Ah well, might cost me more money long term but fuck it it's too good unless mimo V2 flash somehow improves
It uses 1/5 of the output to reason, which already makes it cheaper than you think
But yea compared to glm 5 it's definitely more pricey
What's g3 flash?
Gemini?
I'd use it but I'd get censored or banned, I heard it's pretty strict on NSFW
Gemini 3 flash. I suppose its best not to ask, different strokes for different folks I suppose
bro HOW is it even possible that this mimo v2 pro shit is more expensive and worse than glm
its not
glm 5 with z.ai provider
glm 5 isnt supposed to have 80k context
It's the official pricing, the 2.3$ is the pricing for the extreme one, aren't that reliable imo
Fireworks and Together seems to be the best provider as of now, but for quality the official endpoint from Z.ai is just better
yes, alot of use, but im pretty sure like 99% of those pppeople dont actually get anything done with it
True
don't you dare saying that to one of them
Yep
Got it. Was it the pro or omni one?
This one was Pro and Healer was omni
Gotcha, thanks for confirming!
#announcements message
They nerfed it didn't they
Who is it
22
32
2
deepseek
(it was mimo 😭)
previously, mimo was so irrelevant nobody think of it seriously. no way dipsy is this unpolished, hail dipsy.
this is why democracy doesnt work tbh
I prefer an AGI dictatorship
when mimo v2 flash released and had a free model on openrouter too i once asked it hi and it talked about some random school thing
They updated the model many times during free period


