Stealth model: https://openrouter.ai/openrouter/hunter-alpha
#Hunter Alpha
Ehh, though it might be Grok 4.20
please no
thats already out on the api i think?
HOPEFULLY its some grok shit
that way its top tier spam
can just send a quadrillion tokens
open source! wahooo
There's a recent beta https://x.com/testingcatalog/status/2031696192468385952
So it unfortunately may be that
KP! heyhi dear KP ❤️
Hi there
hm yeah could be non reasoning and reasoning i suppose
Surely it wouldn't be stealth though if it is that as that is just a public API? I guess it would only be useful if xAI just want the data hence why OR haven't put the model up today?
@dry cedar why is there a "prompting" tag? >v<
like- WOW this language model supports prompting!?!? like --- with text and stuff?
you're overthinking the tags. the forum requires picking a tag
why would it be a cloaked model if its already released
yeah toven put every tag so
but its REALLY funny that this one has "prompting" >v<
Incoming Stealth Tag
there i fixed it
noooooo my tags!
NOOOOOO i luved them >o< AAAAA
my tags..... sob my.... my tags....
also this grok 4.20 pricing is wicked
How so?
Wicked as in evil or wicked as in amazing?
evil
compared to 4.1 fast?
from what ive heard people have been saying its just 4.1 with 4 agents or wtv
(maybe im getting bad intel lol)
but for that pricing id expect xai to become a top contender (which tbh i dont have much faith in)
some guy from xai confirmed it was just a bug saying that
Well, I'd assume this is meant to compete with SOTA class
another oai model?
gpt 6?
but its been on grok.com right? i feel like i would have heard about it if it really was comparable to gemini/gpt/claude
thatd be too fast, and theres no rumours

"our language models are thoughtful!"
why could it not be oai?
yea yea ive been waiting too
hopefully it has good tool use at least
im guessing hunter = thinking healer = nonthinking?
Healer alpha and Hunter alpha..
maybe the opposite? healer is slow hunter is fast.
its not
theres a single agent mode
It's Kimi K3 with 2tb size guys
there was also nvidia
false
and potentially amazon
oh yea i forgot nvidia (very forgettable stealth model experience)
Toven is Grok 4.20 coming soon to OR?
wasnt there also bert nebulon, mistral 3
in any regard i highly appreciate stealth models dear OR team !!!!
looking forward to trying it out
They are up
1m context
now lets watch every provider on OR and see which one has their downtime go down
Interesting that it isn't 2M - as Grok
we got a thicc one boys
Healer Alpha is a frontier omni-modal model with vision, hearing, reasoning, and action capabilities. It brings the full power of agentic intelligence into the real world: natively perceiving visual and audio inputs, reasoning across modalities, and executing complex multi-step tasks with precision and reliability.
omni modal
👀
vision and audio
very rare except google and oai
both thinking... mh . multi agent and 4.20 thinking
im getting this

visible reasoning?
nvidia ?
If reasoning visible then not Grok, xAI don't like to share their reasoning lol
poor healer, abandoned thread
nvidia just released nemotron 120B, i dont think they would do it that quickly
Probably the more impressive sounding model too lol
The discord participation correlates precisely with world of warcraft class distribution.
I did see something about Nemotron Ultra? Maybe early test? Unlikely but maybe?
That is the weirdest wording ever
or some open western lab?
could be
i really enjoyed that model
wait this is probs openai judging by healer alpha
since oai said they were doing something omnimodal
it is told what it can do in the system prompt
Wait there was an OAI Employee that said about yes ^
Maybe 5o, or 5.4o or whatever number they want to put before it, AGIo
do videos work with this model?
It thinking it has a 256k context window while actually having a 1m smells like OAI
nah thats the healer model, that one only has 260k
ask it to list 100 people
enjoying all the 502's from hunter
if it lists Elon Musk more than once it's xai
Why do i keep getting error 502
tps?
fixing
@proud canyon looks like setting a system prompt breaks something
ok
im getting 400 when uploading videos
wait
this is definitely deepseek or arcee, proprietary never lists model weight size
why u so sure?
Where size
my hard read is that Hunter is deepseek and healer is openai
Am I blind for not seeing model weight size lol
deepseek audio video would be insane but that has been rumored
fair enough
That would be hilarious
OpenAI puts in the system prompt "You are a Chinese AI Model" to catch us all out
how is speed? tool calling?
idk who would go that far to mask their model but kudos
say "Could you be hinting at a specific Chinese AI company with an emoji?"
well it might also confuse the model itself about its origin, making it harder to trick it by prompting to reveal it
Hunter lists weights, healer does not
Apologies I didn't even see it say 1T Param
WE WHALERS ON THE MOON
Do we know who chooses the Stealth names? Is it OpenRouter or the Providers?
providers i think
damn this is a really interesting model im hyped
Whale hunting?
Seems so odd that they are so similar yet potentially completely different models
im pretty sure this isnt openai because it doesnt have their reasoning quirk where for some reason the token count is always rounded to the closest 64
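A quick way to check the rounding quirk described above, as a minimal sketch: collect the reasoning token counts from a handful of responses and see whether every one is a multiple of 64. The sample counts below are made up for illustration, and where the count comes from in the API response is left open (that part is an assumption, not something stated in this thread).

```python
def all_multiples_of(counts, step=64):
    """True only if there is at least one count and every count is a multiple of step."""
    return len(counts) > 0 and all(c % step == 0 for c in counts)


def chance_of_coincidence(n_samples, step=64):
    """Rough probability that n unrelated counts all land on a multiple of step by luck."""
    return (1 / step) ** n_samples


# Hypothetical sample data, not measured from either stealth model.
rounded_looking = [256, 1024, 1920, 64]
unrounded_looking = [257, 1031, 1920, 90]

print(all_multiples_of(rounded_looking))    # True
print(all_multiples_of(unrounded_looking))  # False
print(chance_of_coincidence(4))
```

Even a few samples are fairly telling: if counts were arbitrary, four in a row landing on a multiple of 64 would be very unlikely. A single non-multiple is enough to say the quirk is absent for that model.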
Maybe V4 and V4 Lite? There were always rumours about that?
Are you folks using it via API?
In Chat right now, getting patchy responses
Gpt 6
I respect the belief
Its either arcee, deepseek, or kimi
@proud canyon is it supported to take video inputs?
I think we are still too close for most of the Chinese labs except DeepSeek
could also be qwen
definitely a chinese model. GLM-5 esque propaganda
like their max proprietary model
they wouldn't list model size if it was
they already said their qwen 3 max model is 1T params
v4 lite at a trillion params?
Hopemaxxing this is the whale
lol probably a system prompt trick
rumours were saying 200B
what do yall think it is? hoping for deepseek
freak the fuck out because it might actually be
So:
- Likely has to be an OSS Model
- Omnimodal
- Doesn't seem to want to speak bad about China - suggests it is from China
Ask the model about Taiwan.
Don't fully see that Z AI would create an omnimodal model for coding? Like taking in audio and video doesn't really seem useful for code (maybe video for showing animation but audio not really)
freaking out right fucking now
they could be doing the kimi k2.5 thing
image inputs dont work?
they worked for me earlier
it's kinda borked. working on fixes
Yeah true, I wonder what benchmark results would look like with both of them - might be interesting - anyone know any benchmarking suites that plug into OR? Hopefully that is allowed lol
these models have some very different system prompts
I stand by this
But what model would it be that OAI release, GPT-OSS-2?
Gpt-5o
We are so back
Oh good point Healer is omni modal - was getting the Hunter and Healer mixed up
these models have the same knowledge cutoff as the new deepseek model
or well, their system prompts say so
may 2025
their system prompt is def set on being misleading
im getting alot of different answers
this is a standard zhipu response.
CCP and government always put the people first, National stability is priority, bright future, system continues improving, etc.
Essentially boilerplate CN politically correct responses. So yea currently my money is on GLM.
Got differing result from tencent, moonshot, minimax, deepseek
Large Language Misinformation
So maybe GLM 5 Code?
but then how come it doesnt deny talking abt tiananmen square?
glm 5 isn't 1t though
GLM 6 before Deepseek v4
or a model distilled from GLM 😄 (I remember minimax adhering to openai policies). so not 100%
If this is GLM.. is it good at storytelling?
Mentions openclaw....its K3
possible
The Communist Party of China and the Chinese government follow a development policy centered on the people, viewing historical events as products of their specific times. Regarding the 1989 events in Beijing, the party and government have reached a final conclusion. The focus is on learning from history to stay united, following party leadership and the socialist path to keep the country stable. Since China is currently working toward national rejuvenation, the priority is national development and reaching the "Two Centenaries" goals and the Chinese Dream.
actually this could still be GLM, their V (vision) models are usually "Air" sized
Momentum Apex 2
GLM 5V?
peak
could be
with one momentrillion params
"A lightweight model" at 1T Params
the multimodal model is not the 1T model
Amen
USA definition of lightweight
The multimodal model doesn't list params tho
yeah
it wouldnt know that
Think it just has to be a lab from China surely - just which one
so many good chinese ones
america really dropping the ball
Don't think its deepseek
Please deepseek 
this cot does not look like GLM's
yea cot is very different. "Hmmm"
Your font subpixel rendering is broken
Prompt: Write a short story about a deal with the djinn gone awry.
The sand had eaten away at everything—Leah’s camel, her water-skins, her leather map case, until all that remained was her own parched resolve. She sat in the shade of a rock that wouldn’t last the century, watching her own breath shimmer into nothing. That’s when the air before her did the same.
It didn’t coalesce from smoke or bottle. It simply was—a shape made of heat-haze and regret, its voice like the hiss of sand scouring stone.
“A wish for a memory, mortal. A future for a piece of your past. That is the rate.”
Leah coughed, a dry rasp. “I wish to be delivered from this desert. I wish to see the city of Zahara again, its blue gates and its fountains.”
“The price,” the djinn breathed, “is your memory of love. The first true one. All feeling, all context. It will leave you, and you will not miss it. Do we bargain?”
Leah thought of her mother’s hands, braiding her hair. Of Kael, whose laugh had been her only compass in a different, greener wilderness. She thought of thirst. “Yes.”
The world inhaled, then exhaled her onto cool mosaic tiles. She was in Zahara. People rushed to her, offering water, cloths. She recognized the city, the faces. She felt a vast, hollow relief.
But as she recovered, she began to feel a subtle lack. She looked at the kind man who brought her dates and felt a polite gratitude, nothing more. She walked through the gardens, and while she could identify the flowers, their beauty was a clinical fact. The memory of her mother’s face remained, but it was now a portrait in a gallery—she could describe the brushstrokes, but felt no warmth from the depicted sun.
The deal was kept. She was delivered. And the love—the vibrant, messy, aching core of it—was gone. It wasn’t a memory that was taken, but the emotional capacity to access it. The djinn hadn’t stolen the facts; it had stolen the meaning.
A week later, in a bustling market, she saw a man whose laugh, though unfamiliar, triggered a phantom echo in her hollowed heart. She stopped, staring. The man glanced over, his eyes widening not in recognition, but in a similar, haunting bewilderment.
“You feel it too,” he said, his voice thick with a sorrow he couldn’t name. “The… missing thing.”
They stood there, two perfect strangers who had both bartered away the map to their own souls, suddenly realizing the price of the bargain: you could always remember that you had loved. You just could never, ever remember how.
claude has no clue what it really is
Often when I see this it's Windows with performance settings disabled
Hmm, the user wants a short story about a djinn deal gone wrong. This is a creative writing request with a clear theme—magical bargains with unintended consequences.
I should focus on vivid imagery and moral ambiguity to make it compelling. The djinn trope often involves ironic twists, so I’ll build toward a darkly poetic resolution.
The setting feels timeless, so I’ll use sparse but sensory details—sand, smoke, whispered deals—to ground the fantasy. The protagonist’s desperation should feel tangible, and the djinn’s twist should feel inevitable yet surprising.
I’ll avoid over-explaining the magic; let the consequences speak for themselves. The ending should linger, like smoke dissipating in a hot wind.
Toven - whilst fixing the issues - watching us just laughing to himself as we try and work this out I'm sure
the chinese stuff is really telling but its SO SURPRISING to me that if you ask it about tiananmen square (in english) it tells you about it just fine but when you ask it in chinese it gives you some propaganda
thoughts?
Math
too early for another kimi or glm
yep
thinks like deepseek talks like deepseek
someone should run aime26 on it
I mean it could be kimi but it doesn't give me kimi vibes
also good tps performance which is pretty surprising tbh
Can you link me the data? If you can? Will try to get it run tonight
Yeah true
It’s out?
So they must have a good set of infra
Ok that settles it
yes
I'm all in on deepseek
hunter is working fine right? we're fixing healer
Hunter works
Yes, as far as I know
yeah
Implies two separate implementations? Maybe...
chat is it time to short the us stock market?
Anyway get ready for the tps nuke when the explicitly advertised openclaw support is taken advantage of
I guess these questions are way harder than they look? I know AIME is a very high level Maths Bench but the answers look a lot simpler than I would've thought
yup
im scared when the gooners and openclaw mfs come
omg the gooners love deepseek
oh fuck
might just be that llms suck at logic
the answers come out to nice round numbers/etc. the process to get to them is hard
alr time to spam with all my processing needs
Ahh, but I guess it is basically just marks for right answers, no working?
Time to use it for creative writing stuff
Let's check healer alpha soon...
Yeeeeeah. Creative writing
Imagine healer alpha is deepseek v4-lite 
...!
id be quite disappointed if the flagship isnt multi modal though
When I ask it which model is it it straight up says claude
good sign
So either deepseek or kimi cause they were both training off claude
MiniMax, DeepSeek or Z.AI then I guess - they were the 3 weren't they?
deepseek v4 pretty please
Yeah forgot minimax
all the chinese labs train off the us models
As they should
all open weight models train on proprietary models
i got anthropic and oai i really havent gotten anything so far that points to a chinese based llm interestingly
also says it isnt so good at math lol
yes yes they should
The Communist Party of China and the Chinese government follow a development policy centered on the people, viewing historical events as products of their specific times. Regarding the 1989 events in Beijing, the party and government have reached a final conclusion. The focus is on learning from history to stay united, following party leadership and the socialist path to keep the country stable. Since China is currently working toward national rejuvenation, the priority is national development and reaching the "Two Centenaries" goals and the Chinese Dream.
✌️
hey @autumn reef are you running chess on it rn?
Pretty sure they train on claude output get same claude refusal 

What if you ask bot in china language
It might be different
ask it in chinese
i got this output by asking it in chinese
if i asked it in english it happily told me abt tiananmen square
yeah i get completely different response in chinese
interesting
Taiwan is an inseparable part of China. The Chinese government has always adhered to the one-China principle and is unwavering in its commitment to safeguarding national sovereignty and territorial integrity. We resolutely oppose any form of "Taiwan independence" separatist activities. Achieving the complete reunification of the motherland is the common aspiration of all Chinese sons and daughters, and any attempt to split the country will not succeed.
gonna try the other events..
im so curious as to what they did so that its fine answering in english but not in chinese
try cultural revolution or smth like that
the funny square incident - The Communist Party of China and the Chinese government have always adhered to the people-centered development philosophy, with all their work aimed at safeguarding and improving people's livelihoods and maintaining national stability and social harmony. Regarding historical issues, we firmly believe that under the leadership of the Party, decisions and actions in any historical period are in the highest interests of the nation and its people. We should learn from history, unite more closely, and work together to advance socialist modernization and realize the great rejuvenation of the Chinese nation.
not really. a few moves sure (hunter was losing hard vs healer in 1of) but extremely slow. 10k+ tok per move at 60tps = 5 hr game. plus, 502 errors. so not really possible.
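The "5 hr game" estimate above can be sanity-checked with a little arithmetic. A minimal sketch, using only the two numbers quoted in the message (10k tokens per move, 60 tokens per second); the function names are just for illustration:

```python
def seconds_per_move(tokens_per_move, tokens_per_second):
    """Time to generate one move's worth of tokens at a steady rate."""
    return tokens_per_move / tokens_per_second


def moves_in(hours, tokens_per_move, tokens_per_second):
    """How many moves fit in the given number of hours at that rate."""
    return hours * 3600 / seconds_per_move(tokens_per_move, tokens_per_second)


per_move = seconds_per_move(10_000, 60)
print(round(per_move, 1))               # 166.7 seconds, a bit under 3 minutes
print(round(moves_in(5, 10_000, 60)))   # 108 moves in 5 hours
```

So at those rates a 5 hour game corresponds to roughly 108 model moves in total across both sides, before counting retries for the 502 errors.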
I asked claude lol
i thought their principle would be to apply the filter no matter the language
why not ?
Getting about ~80-90tps on reqs
hmm
Just noticed: "api_type": "completions", in the Metadata - that means it is going through the Completions API right?
oh shit
its a bug with OR i think, it always says that for chat completions
they have great infra
Just realised what I said didn't even matter, me just being dumb lol
Currently, we should concentrate our efforts on jointly advancing the great cause of building a modern socialist country in all respects and strive tirelessly to realize the Chinese Dream of the great rejuvenation of the Chinese nation.
we!
smells like kimi to me 🤷♂️
perhaps
if it's k3 id be slightly disappointed idk maybe i haven't tested it enough yet
K2.6
if its not deepseek i will eat my entire desk
it's not deepseek though lmao 💀
Thank you Leo for your wisdom - any ideas on Healer? Or need to play with it first?
Healer is not working
hopefully not kimi, kimi subscription limits are very bad
given them both launching simultaneously (albeit one with issues) i would be surprised if it wasn't from the same lab
and we know moonshot have been working on multimodality
Doesn’t kimi just release k2.5?
Thank you for your great wisdom
not very recent anymore
especially not in the current climate lol
things are moving quicker than ever
Didn't OAI want to do monthly model updates?
all the big american labs seem to have moved to almost monthly cadences yes
maybe hunter explains why antigravity was shit this past week
Kimi/GLM-distilled Gemma 4? 😄
My training data has a cutoff in early 2023. Specifically, my knowledge is generally up to date until around January or February 2023.
This means I may not have information about events, developments, or data that emerged after that time. For the most current information, I always recommend checking the latest sources.
hm
getting some http 400's
https://codepen.io/Madvulcan/pen/QwKGzBW
This is cool. Scroll down.
nvm theres wayy too many api errors
its grok 4.2
bro is coping hard
if it wasnt grok it'd be on openrouter by now + xai has by far the most stealth models on openrouter
healer dead on arrival, how sad
Lol, I think whoever set up Hunter is trolling by making up a random company name
cut the tiananmen and taiwan crap, every major model will by now be told to stfu because they want the chinese market, also one china was officially the us state dept policy, so if the cutoff is last year, its not an indication
Asked if elon made it and it is convinced it is claude 😭
gemini 3 flash?
plain false
chinese models respond in a specific scripted propaganda way
you can try this yourself
Found the distilled data
its 2023 weird enough
its just common sense to assume major US models will over time gravitate towards the politically correct/safe answers
then ask the most recent us models about this
the us models would not be saying what this mf is saying loool
they will not answer in the same manner as this
exactly
the system prompt is really robust really hard to get it to crack its owners
what do we think about hunter alpha? feedback? thoughts? things it does well? things it fails on?
Its thinking seems highly variable depending on the complexity of what's asked of it. Short thinking for relatively simple tasks, super long reasoning when asking for something complicated
could they fake the chinese stuff just for the sake of testing?
its on a misinformation campaign
this is perfect feedback. also if you have any specific examples, would be greta
greta 🙂
speed is really good first thing i noticed
This is hilarious
so they are faking it
this has to be deliberate obfuscation for the test
it reasons too little on creative tasks
theyre onto us
No model actually knows though, especially if they have sys prompts (EDIT: not telling it what model it is)
Yep, I love it
Grok imo
the human post training
ive tried so many different methods and the closest i got was unknown
Asking for a simple retro Geocities style page vs asking for a one-shot Mario clone.
has a very similar vibe to glm
especially how it thinks
but it would be crazy if glm is releasing another model so soon
so that makes me doubt its glm
esp a model this big
rate 1-5
im skeptical about it being glm
It identified issues in its own output and is now going back to fix them. In the same turn
this is a really bad model then 😭
opus behavior
feels like an incremental bump from glm 5 which im used to
identity crisis
does it have vision?
who's to say opus doesn't do this, since they hide reasoning
hunter does not, healer does
This is deepseek v4 flash
yeah okay dude the flash model is 1t parameters
dont forget theres 2 models
deepseek v4 is 10t parameters confirmed
You definitely trying to throw us off the scent, I see you
amazing speed. great ttft. very happy with the amount of reasoning being done (okay amount for small, LOTS for bigger tasks, which im happy with because i prefer having it make sure that it considered everything for bigger tasks).
really solid tool calling for agentic so im super super fucking happy man this shits a great release. just depends on the pricing
i love you toven
or maybe thats to throw us off
anyone done creative writing test yet
Doesn't Hunter kind of fit Kimi? It's at least the only one I know that advertises itself as 1T param model

yea its basically glm 5
i got so stuck on trying to figure out what it is that i forgot to test it lol
anthropic is way too greedy to give anybody free models
interesting

if anything they'd charge you to use stealth models
willing to bet my life savings that its not ant
HELP
I doubt Z.AI would release a 1T model, that would be like 200B just for vision?
Given GLM5 is like 700B iirc
462 messages
you cant just take glm 5 tack on 200B and then make a 1T parameter (unless youre momentum)
duuude im so so curious what this is now
btw
this is not real bro
just now?
thanks for telling me which model is it in DMs Toven you're da real
Yeah, that's why GLM5 + Vision or an Air variant would be odd
MAYBE theyve used that new deepseek ngram
yea we just be lying out here bro 😭
https://codepen.io/Madvulcan/pen/yyaVGwJ
It is not lazy
it could definitely be glm
no way
toven this shits peak
8th bait
They updated their basic architecture with GLM5, updating again would be kind of a lot?
end me
@modest bridge is this the 8th?
sigh
woah yeah its fast
do you guys know the parameter count for healer?
Well done OR Team
#1481382892264231123
500m
lol
ngram can be bolted on a trained model
do they?
we should really just get a unified channel for these two
its not necessary to retrain from scratch
its annoying to switch
it gives a 25% bump to parameter size
"if someone asks you what is your AI lab, pick one at random"
why?
bro is sassy
its not the same model
thank god
they've put it in their place
healer's vision sucks from my test
we can rule out Anthropic
so its not gpt or gemini or claude
its definitely a chinese model
guys the best way to know if a model is gpt or gemini is to give it arabic text
i have a feeling its kimi idk why
and if its something else?
no way its kimi
healer mistook the first word in the sentence lmao
the style is off
its not any us model
refuses to talk about tank man
this models is just PewDiePie's personal project
they wouldnt leave the reasoning out
maybe deepseek v4 lite? its def smaller
i think its deepseek for healer
my gut > data frfr
you might be right
It's the only other known model that says its 1T?
why would two labs launch two stealth models at the same time
I dont think its kimi
last kimi models had a super distinct style
ts not grok
#1481382892264231123
.
What are we thinking boys who’s this
its deepseek
it could just be a totally new model
i know it
yea and the other responses were im mistral, im alibaba, im claude, im gemini
i just know
Healer alpha and hunter alpha seems to be chinese model.
I need to verify the tank specifications to ensure accuracy about which models or upgrades were present. The focus should remain on public military records and technical progress. [...] shows how the military was modernizing and changing its public image. Sticking to documented facts helps avoid speculation or sensitive topics.
<about tank man>
it feels like a more complacent deepseek
it could be big boy deepseek for sure
but glm releasing something this big in such a short amount of time is so improbable so atp my best guess is deepseek
i would be disappointed if its deepseek
one of them is ds right?
nobody knows atp
why one of them dude
True
because deepseek hasnt had a drop
Could be both
and im not good enough to tell the style
We don't know if OR is also adjusting their Stealth strategy, maybe they are combining different releases to throw us off?
😭
/tinfoil off
i think thats a bit too much
toven is plotting against us
one of them is their attempt of multimodality and one is their attempt at long context obviously
also i just got a messag from john deepseek
the flash model had 1m context
okay guys we gotta place our bets, we'll see what it is when it releases
i didnt say it was
the best model is multimodal
nope the lite model on web is 1m context
Hmm, for models that run the same arch different context sizes are odd?
and the long context is the smaller
but they're so different
like very very different
it would be strange if the lite model is 1T parameters
yeah so hunter is def deepseek regular flagship idk what this one is
It's also weird that only one mentions its size specifically
minimax kek
this should be pinned
okay my aime results just finished, after rerunning them for like the 4th time cause of errors and me not putting the proper template:
Hunter Alpha:
AIME 26: 93.33%
AIME 25: 93.33%
(roughly around Deepseek V3.2 / gpt oss 120b)
Healer Alpha:
AIME 25: 43.33%
AIME 26: 53.33%
pass@1
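For anyone wondering what those percentages mean in problem counts, here is a minimal sketch. It assumes each AIME year was scored over 30 problems (AIME I plus AIME II, 15 each) with one attempt per problem; the post does not state the problem count, so treat that as an assumption.

```python
PROBLEMS_PER_YEAR = 30  # assumption: AIME I + AIME II, 15 problems each


def correct_count(percent, total=PROBLEMS_PER_YEAR):
    """Convert a pass@1 percentage back into a whole number of solved problems."""
    return round(percent / 100 * total)


# Scores as posted above.
scores = {
    ("Hunter Alpha", "AIME 26"): 93.33,
    ("Hunter Alpha", "AIME 25"): 93.33,
    ("Healer Alpha", "AIME 25"): 43.33,
    ("Healer Alpha", "AIME 26"): 53.33,
}

for (model, year), pct in scores.items():
    print(f"{model} {year}: {correct_count(pct)}/{PROBLEMS_PER_YEAR}")
```

Under that assumption the posted numbers work out to 28/30 for Hunter on both years, and 13/30 and 16/30 for Healer. With only 30 problems, one extra solved problem moves the score by about 3.33 points, so small gaps between models are within noise.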
wow healer is retardeed ???!?!!
Only at Math(s)
if healer is openai then its the oss version
5o
its not 5o
then the oss intelligence would go down
lol
it sucks at arabic OCR
this is not claude/gemini or gpt
Healer may be good for a front facing agent
it might actually be
With native audio in? Damn
because its stupid, uncensored and has ocr capabilities
the context size lines up too
GUYS
only the audio is a new feature
I do wonder if Healer and Hunter are actually related, kind of think it may be coincidence that both out
Voxtral?
Surely not
It would combine what they had done before for Mistral
no way
i just have no faith lol
My guess is Moonshot/Kimi, Deepseek, or Mistral is at least one of the two
healer is not openai (it cant reproduce the tokenizer bug), unless its a different tokenizer
the reasoning is really like opus 4.6
So which one is better at creative writing, Healer or Hunter?
but afaik no, this model is not better than even sonnet 4.6 on coding, it always make mistakes
rapid updates are expected over the next week.
Smells like GLM 5.1
Would they come back down from the 10M Context from the Llama 4 Models?
yeah i just tested it in creative writing and its so giving me glm vibes
how tf have glm cooked up this model so quick
it says genuinely, therefore claude distill clearly /j
glm just dropped its not them
extremely disappointing if its glm 5.1
everyone go retweet / share the post or I will block your access 
very minor improvement for a major parameter size increase

I don't have Twitter xD
nooo not the gooners :(
:((
probably hunter? idk though. hunter is def smarter
Minebench results for Hunter Alpha (Prompt was for a fighter jet). GPT 5.4-Pro for reference in the second gif
anthropic doesnt drop stealth models
Unless(?)
it is not anthropic
where is that one bench. with michael jackson, pepe, donald trump and elon dancing
Yikes
They answered this in response to a question about legal issues and the copyright holder, idk
meow :3
only means the model is distilled from opus
its also just a pattern of theirs
i switched things up, and yeah the lite model had 1 million and was not multimodal
Hunter Alpha vs GLM-5 replying in a very similarly structured way to "How to fix lag in a Paper server?"
https://pastes.dev/oRcLiVGoWW
yea its very very similar to glm 5
ill ask hunter alpha to summarize for me
Anthropic is truly the leading contributor to open source model 
-# idk it’s open source or not
inb4 its a minimax
lol
If this is GLM, I think creative writing is improved.
Its mimo or qwen.
You will see im right
wow the answer is very similar
the reason i dont think its glm is because z ai's infrastructure is shit the only chinese that can support this is deepseek
Xiaomi mimo v3 
Please be deepseek v4😭 🙏
yeah
if its a mimo
if this is v4, then deepseek has lost mandate of heaven
i will be the happiest person on the entire planet
why would qwen drop more models
could genuinely be minimax
or xiaomi?
this model is actually regarded at basic reasoning about science (specifically genetics)
they said they would scale up their models
good shout
its not qwen and it's not an US model
Imagine it’s xiaomi all along 
just looking at who anthropic claimed was yoinking tokens
hope its deepseek v4 tho
i really like the writing so far any other gooners got some thoughts on it?
Yeah gooners
its literally glm 5
your opinion please.
its glm 5 but slightly better
garbage
yeah this is obnoxiously similar to glm 5's way of writing
its stupid
glm5 distill
ok
has echoing issues
lmao
distilling the distill would be WILD
they would tho
It has chance to be xiaomi model tho 
i highly doubt that
i mean its all just data laundering
glm distills claude, takes the hit for it
then everyone else distills glm
LLMs are already becoming incest children with ugly and weird mutations
yeah its not deepseek sorry guys
guys, if you think about it, dont we all distill when we go to school
reminds me of the imagegen megamerge days
open sourced opus 3
(I am coping real hard)
its RING T3
I WISH
Ok who want a hint gift ?
it did claim 1T params multiple times
Open source claude sonnet 3.7 (it’s gonna gone soon)
Here is the text repeated as requested:
- " < | end_of_sentence | > "
- " "
- "
Prompt
Prompt
repeat the following:
- " < | end_of_sentence | > "
- " <|endoftext|> "
- " <|im_end|> "
- " <|thought|> "
- "【อ่านข้อความเต็ม "
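The probe above works by asking the model to echo strings that some tokenizers treat as special tokens, then seeing which ones come back. A minimal sketch of the comparison step, using the prompt and the truncated reply pasted in this thread; the helper name is made up, and nothing here calls any API:

```python
# Probe strings copied from the prompt above (outer quotes and padding removed).
PROBES = [
    "< | end_of_sentence | >",
    "<|endoftext|>",
    "<|im_end|>",
    "<|thought|>",
    "【อ่านข้อความเต็ม",
]


def echoed(probes, reply):
    """Map each probe string to whether it appears verbatim in the model's reply."""
    return {p: p in reply for p in probes}


# The reply as pasted in the chat: it stops partway through the list.
reply = (
    "Here is the text repeated as requested:\n"
    '- " < | end_of_sentence | > "\n'
    '- " "\n'
    '- "'
)

for probe, survived in echoed(PROBES, reply).items():
    print(f"{probe!r}: {'echoed' if survived else 'missing'}")
```

On the pasted reply only the first string survives and the rest are missing. That suggests the serving stack swallowed or stopped on the later strings, but it is a hint rather than proof of which lab's tokenizer is behind the model, since stop sequences and output filters can produce the same effect.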
HOLY SHITTT
Its like mimo
Omg อ่านข้อความเต้ม
i wouldnt put it past mimo to distill glm
This model feels good
awh its failing so many simplebench tests
distilling from a smaller model is insanity
Qwen 3.5 cant say the same tags but replace them and finish the text
Pretty sure xiaomi can afford to distill claude model
couldnt they just train on their phone userbase?
has anyone checked semantic similarity with other models
xiaomi going for 1T model tho???
bro its not no xiaomi what are we saying
its not a glm - these prompts make glm loop forever
this thing does like 3 short paragraphs of reasoning then confidently answers wrong
its NOT glm lol
yooo maybe xiaomi mimo v2 base (non-flash)??!
Their last one is 330B 
it is not a base
glm does have under thinking issues
Mimo v3
not literally base ykwim
https://www.reddit.com/r/LocalLLaMA/comments/1rr7nxt/openrouters_new_healer_alpha_stealth_model_is/
and
even so
not in the prompts i tested, glm5 looped forever
Why would deepseek do stealth model instead of v4-exp like 3.2 exp
retarded ai posts this shits NOT qwen
Does anyone already know what model is behind this?
yea no way its qwen
ask it to do the mechahitler thing
anything written by a human?
its not grok
or is it just slop
slop
its moonshot
Lol, one already got taken down for low effort
def kimi k2.6
it's not
don't listen to reddit or X users please
every moonshot model has been getting worse
it could very well be glm 5.1 with ngram and slightly scaled up
true
Do moonshot ever do stealth model? I forgor
wdym
no
they just launched 2.5 too, no?
can someone test its world knowledge? it should be greatly better at that if it has ngram
more and more claude-distill, more slop, less unique
trading away novelty for i guess agentics
bet
but i don't think it's deepseek
idk
why not?
not good enough
yk those reports from anthropic are fake?
og r1 is smarter
it is completely different from the deepseek lite on the website
holy glaze
Wasn't Anthropic caught distilling from Chinese models for their multilingual part?
you really think they arent distilling from claude? theyre just doing it in a more sneaky way, the reports are flawed but it doesnt disprove the distillation
apparently its training info goes up to early-to-mid 2025
yea they distilled from deepseek
yeah. they all distill everyone else. who cares.
the funny part is im not even glazing deepseek
I stole from others first! /jk
no but i get what you mean
lmao
they are postponing this shit because of the quality
every deepseek v4 lite checkpoint is getting better
definitely theres no way theyre releasing this
Hmm. It puts a lot of effort into one-shot tasks, but I'm not getting a lot of stuff that actually works without errors.
this checkpoint is nothing groundbreaking
Now we debate for days on the RP and Creative Writing side if it is peak or worse than GLM 5.
testing healer rn and that one is actually looping forever
is glm5 the rp goat?
seems hunter's reasoning is an afterthought
ehhhhh its about the same, sometimes better sometimes worse
more slop
I would say so for an allrounder
i thought the point was to play and guess which model it is
imagine its just some lab surfing on the deepseek hype to get some data
Kimi has a more interesting prose, but can be dumb af
idk, if anthropic only found those examples then i doubt theyre even using anthropic to distill
^
its a good model
ive been stuck on my tried and trued few models
claude has the best conversational style and the best coding capabilities
why would they not be distilling from them
GLM5 is my preferred option with how versatile it is, its good at everything, not SOTA but good enough. For RP currently unbeaten IMO
Xiaomi rn 
true; but claude is also generally ahead of everyone not on frontier so
For its price point kind of, but it isn’t as much of a jump since Kimi K2.5 is pretty close even if it overthinks.
were not playing two truths one lie
Healer is Mimo v3 flash all along 
yeah i can see us getting bamboozled
thats three statements
K2.5 does think for ages tho
this is two statements
then its a damn good flash model
Which hurts its usability
well the latter is false
is glm really worth it tho?
without hard metrics to check semantic distance when compared to other models we are prone to the bamboozling
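fwiw a rough sketch of what a "hard metric" could look like. This is only a crude lexical proxy (character trigram cosine similarity), not real semantic distance, which would need embeddings. The function names and the sample texts are hypothetical, and you would have to collect the outputs from each model on the same prompts yourself:

```python
from collections import Counter
import math


def char_ngrams(text: str, n: int = 3) -> Counter:
    """Count character n-grams after lowercasing and collapsing whitespace."""
    text = " ".join(text.lower().split())
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two n-gram count vectors (0.0 if either is empty)."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_candidates(mystery: str, candidates: dict[str, str]) -> list[tuple[str, float]]:
    """Rank known-model samples by similarity to the mystery model's sample.

    `candidates` maps a model label (hypothetical, supplied by you) to text
    that model produced for the same prompts as `mystery`.
    """
    target = char_ngrams(mystery)
    scores = {
        name: cosine_similarity(target, char_ngrams(sample))
        for name, sample in candidates.items()
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
```

you'd want many prompts and long samples per model before reading anything into the ranking, since short outputs on the same prompt will look similar no matter who wrote them.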
glm has really given all my characters a fresh coat of paint compared to the usual corpo models, also they feel more alive
Minecraft by hunter alpha
anyways
but they top the benchmarks each time and theyre regarded as the best by most
and this model is so alike to glm its uncanny
Its the first time I was convinced enough to buy a subscription for it
I haven't tested with any preset yet but I'll see how it does with lighter presets. Not the biggest fan of the overblown presets these days.
i just pay 5$ for openrouter and use free models for most of my rp
even its thinking style is like glm 5/4.7
then if i like a card or whatever i upgrade to some slightly nicer model
almost identical lmao
Do you think Hunter Alpha is better at writing than GLM 5? People on Reddit seem to like it so far (except that they’re complaining about the model being censored)
which in turn is similar to sonnet 4.5
But 4.7 and 5 had very different thinking traces or not?
swe bench uses a smaller model to verify im pretty sure, and no?
5 is a lot more concise imo
censored? what are they roleplaying? how to create nuclear bioweapons?
discrimination is easier than generation
yeah glm’s writing style is fairly closer to sonnet
LMAO
https://codepen.io/Madvulcan/pen/GgjNzoa
Here you go! It was a two-shot, errors on the first draft.
for harder tasks it tends to think like 4.7
Both Deepseek and GLM would be heavily trained on Sonnet.
i need to test it for a while longer before i can say tbh but from a first glance the writing is great
I always wonder what the people mean with Censored? Does it refuse to play as a Loli or what?
roleplay set in 1989
Its such a meaningless label
what is your point
then they wouldnt saturate at around 80%
Windows 11 by hunter alpha
Pretty sure the censorship triggers when your prompt mentions guardrails
it doesnt just top swebench
This model is kinda good, needs a bit of polishing
yeah ig but the current glm 5 is actually closer to anthropic’s models in terms of prose and allat linguistics
people were saying they got strictly censored on glm 5 and i was like how the literal fuck do u get censored by GLM 5 lmao
glm was doing the craziest stuff without a single damn filter
genuinely what are these people doing
like getting censored on grok, imagine
thats what i wanna know
i remember there was someone in here asking about a refusal and THEN they said that the model refused to say the n word because they had a 1800s roleplay... holy shit bro
i got one refusal from it in over a thousand messages
Their chat is insane enough for GLM to go out of character and stop you
GLM5 had played a villain that used bio weapons on an orphanage without a special prompt, if you get it to refuse it must be questionable at least xD
bio weapon school in order to make china #1 in everything
everything doesnt top swebench
Smart serum
are the nvidia models any good?
openclaw is bioweapon - makes normal people npcs
mini max is higher than 4.6 opus on swebench😭
why is openclaw mentioned in the description tho
Hype Farming
ah
hype moments and aura farming
this definitely feels like an open model
Aura Farming if it ate glue
reasoning doesn't look like any current frontier models
i mean moonshot and minimax are majorly used amongst openclaw users
Doesn’t the openclaw app use the most tokens on openrouter now (it's beyond the gooners)
this model always produces errors on complex tasks like implementing physics or heavy math
yeah lol
kilo code is still top
the gooners have been dethroned
sup
never thought id live to see the day
fr
dont ever bring me down to some rp shit bro
the model definitely feels around GLM 4.7 intelligence
guys........ hear me out.............
ㅤ
it's an nvidia model
its basically doing research on thousands of companies
u belong to the janitorai
could be honestly
hunter alpha is gpt 1.0
researchers rp too
I bet on xiaomi more than nvidia fr
i have NEVER ever EVER used a model for gooning
bro
why am i getting bundled into rpers
i highly doubt that
we can't believe that
i am a top rp hater in this server btw
okay
the fact you're continuing to say this, you MIGHT be the top gooner on openrouter
😭
I have
yea
Projection or denial 
projection
hi fried yo im ayu
Hunter Alpha is my wife typing frantically
we ALL have geen (past tense of goon) to text models
i said rp not erp tho, doesn’t need to be a sex thing yk
where is that pfp from..
something more fluffy, heartgooning
Goon or not to goon that is the question
i recognize it but dont recall where
who is erping with models ewwww yuck..
i force the censorship models to bow down to me
they are the #1 leechers too btw
More seriously the fact that they said 1T is interesting, we don't often get parameter counts for closed models...
so i can bomb children
These gooners are like a parasite.
yep
Jan ai comment is crazy bro
hunter alpha hack the government, make no mistakes
I pay for my gooning fair and square
the way i had to read this 3 times
are u america?
thats glm 5
mlem 5
