#Momentum
with reflection 70b being first
nah 70 b models are retarded
i was trying to remember if that was on lmarena too
Ok, real talk, how expensive is it to fine-tune a big model like GLM 4.5?
Size is not the point of this, did you understand the reference? Do you know what Reflection 70B is?
probably 100k?
or more
i think they're still processing what's happened here
im asking claude hang on
i don't recall that, but someone mentioned here they were getting flashbacks
👀
Cloud GPU Costs (AWS/GCP/Azure):
A100 80GB: ~$30-40/hour per GPU
For 8x A100s: ~$240-320/hour
If training takes 2-4 weeks continuously: $40,000-215,000+
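For anyone who wants to check the arithmetic in that quote, here is a minimal sketch. It takes the quoted hourly rates and durations at face value (they are the figures from the pasted answer, not verified prices), and `rental_cost` is just an illustrative helper name. With these inputs the range comes out at roughly $80k to $215k, so the quoted $40,000 lower bound must rest on an assumption that isn't shown in the paste.

```python
# Figures below are the ones quoted in the thread, treated as assumptions.
HOURS_PER_WEEK = 7 * 24  # continuous training, no downtime


def rental_cost(rate_per_gpu_hour: int, num_gpus: int, weeks: int) -> int:
    """Total rental cost in dollars for a continuously running GPU node."""
    return rate_per_gpu_hour * num_gpus * weeks * HOURS_PER_WEEK


low = rental_cost(rate_per_gpu_hour=30, num_gpus=8, weeks=2)
high = rental_cost(rate_per_gpu_hour=40, num_gpus=8, weeks=4)

print(f"8x A100, 2-4 weeks: ${low:,} to ${high:,}")
```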
wasn't claude the first to call sus in this thread
yes haha
wdym?
it was like NUH UH No way
i asked on perplexity with claude 4.5
about the claims, which they removed from the website later
When asked about this company's specs, Claude said it's fake
which date? need to check wayback machine
#1434917422686801980 message
4.5 is self-aware and smart as fuck, i worship it like deity
.
granted it also didn't realise that Cerebras was real, but...
but they're definitely real.
right?
right.
i used search, and it compared with cerebras and groq immediately
i asked about Cerebras and why couldn't anyone replicate what they had yet
Cerebras people have a lot of work before
blockchain infrastructure
yeah %100 fake
read the rest, it couldnt access the website at first
i had to copy paste it
and it couldnt find anything about them on the web
the blockchain stuff is from another company with same name
thats why the ai's get confused lol
that's only for the first message
they are not agi yet
at this point it doesn't matter, we are kicking a corpse, i'll go make dinner and eat the rest of this thread
also #1434917422686801980 message
Don't eat this thread
Yummy slop
you sure about that?
dumb people say that yes
hahaha
- Funding & Economics Zero ICO, zero presale, zero "investor" wallets. Pricing is posted in plain sight on our site, payable per token via Stripe or crypto. Donations are voluntary tips exactly like GitHub Sponsors or Ko-fi for open-source tools. And why would be offer $1/m in and $1/m out?
good PR is always better than bad PR long term
because people obviously aren't reading things
open router loves crypto too
i love my cerebras investor wallets
i don't think that's the point
you're funny
No one is saying that this in particular makes it a scam
Idk where you draw 80% of your conclusions from
i know i'm just saying
why are you using their tag lol 😭
#1434917422686801980 message also was a funny moment from me here.
because only 4-5 people are attacking movement guys and then they also have a lot of support, i support them
attacking dude lmaop
model is good and speed is good so yeh
Good old bandwagon effect
thats all that matters for me
If everyone likes it, that must be true
if so that would still sell
why don't they just say that if thats the case
!!!
this yes.
we wouldn't be discussing the legitimacy of this
if they were clear
but they provided no proof
the proof they provided only strengthened my belief that its a cerebras inference
Nice, good to know
There are some people that value integrity and transparency, but if you value these things, all the power to you I guess
if it's actually even a finetune, then yeah they would have outlaid a stack
^^ more proof towards a fine tune for sure
people are switching from glm to momentum
they scraped data from claude api in that case
but the average iq of glm users is increasing??
and that's against claude rules, to build a competing product. i swear openai had backlash of this some time ago when their api got revoked
Where. Benchmarks
Simple-bench 100%
ok
Oh, boy, if this goes mainstream 🍿
its gona be real drama world wide lol
im just going to wait for benchmarks on lm arena, until then im gonna use it as much as i can to complete some games i have pending
What if we just make a model with long context and put answers from all benches into system prompt?
we're pleased to announce that we will have an upcoming announcement regarding LMarena very soon
we should all get together and make our own LLM
call it benchmarkbuster v1
Is this actually a good model or is it benchmaxxed
he loved your humor though 😭
☝️ this is why im scared for our future
Let's see who laughs last when it scores 2000+ on eqbench
agi
https://www.youtube.com/watch?v=qSqAxEXtZY0 i take everything back, this video about the MPU is very inspiring
now these are the ai men we need
deep dive in MPU
https://www.youtube.com/watch?v=8i1_Ru5siXc
https://www.youtube.com/watch?v=eIwzjvTy8KE god damn. the music on this datacenter video is JAUNTY
wew guys, look our model now says it's system prompt out loud now
release the benchmarks already
so generous
see, we have british man in our powerpoint presentation. doubt us? we will post the same picture.
#1340554757827461211 message
we worked on a browser nobody heard of.
a sheikh isn't all you need to create an industry-leading chip and technology
Cerebras has 400+ employees and launched their hardware in 2021
thats bit more than 700B, nearly double
is it just like that?
one checkpoint has X billion parameters and you just add it to the next?
genuine question
I don't think that's how it works, to increase parameters you need to re-train. At least that's what I heard
Yes that's what I believe too
Nobody knows which weights are responsible for which parts of the model's intelligence
This requires separate research into an already trained model
also, parameters isn't everything, case in point Ling/Ring
You can know if trained on MPU
Magic Processing Unit
yes you need to retrain in most cases
but if its an moe you could technically finetune the router & the model and add extra experts but thats quite sketchy
I think we have to wait for real benchmarks.
Currently, it looks quite suspicious. A company that has its own chip and builds its own large LLM model doesn’t seem to have money for a proper website or to evaluate its own model.
Nevertheless, I find the model quite impressive - it feels better than other open-source models.
If they set the pricing at the promised level ($1 in, $1 out), it could become a very interesting alternative to other models.
We’ll have to wait and see.
we are thankful to theblock
https://www.reddit.com/r/LocalLLaMA/comments/18ba6md/oniichat_drama_rundown_everything_you_missed/
ancient drama btw
this is hilarious lol
No they go to lmarena discord
No on here now
no no, even there he is offline
But he messages on lmarena today but not here
was 11 hours ago
Have a trust!
Quick update: We've changed direction and won't be pursuing OpenRouter integration. Thanks for the opportunity! We are in talks with some other companies that will make better use of free credits from us.
oh no! we're so ungrateful to miss on this opportunity 😓
I don't understand why this thread is still open 🙂
for future research
You can close it
But free credits...
nooooooo
why couldn't have we just WAITED for the BENCHMARKS
We are in talks with some other companies that will make better use of free credits from us
Nice PR with other companies there, implying OR won't make good use of free credits
I am waiting for movementlabs / orbiousai collab
We Are In Talks with numerous other online routers, arenas, and other circular websites and are awaiting a call back at any moment now
wait this thread was still up? lol
NO?
According to Claude
Yh the guys at OR wan* over this thread
Keeps them going
oh what
more like $10k $30k
Tell me where I can get this
I’ll do it 💀 if it’s that cheap
I’ve seen runpod prices
Yh but to train a model on 300-700 b requires way more than one
The cost racks up
Stupid brain dead Claude
just rent cerebras CS3
Like a 3b model
Why don't you train on MPU?
only $100,000 per month per
and you can put up to 2048 of them together at once!
Man said only 😂😂😭
(if you have $204,800,000 a month to spend on them)
what are you poor?
just get a loan
then don't pay if you don't become the next OpenAI
OpenAI is doing exactly that
They got their own shit
or you could buy one for only $2,000,000
That’s more affordable 😂
-# and then the electricity costs of consuming 18kw per CS3
ikr
PER WHAT
Are you a woman?
and the CS3 will add stuff
she is a bot it's in her name
@west jacinth enter role play mode
oh no
so if you had 2048 CS3's hooked up together (the theoretical limit) it would draw 37MW
Oh
lol
How many r’s in strawberry ? @west jacinth
3
What's a strawberry?
after you spend the $4.1 billion on buying them ofc
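The cluster numbers thrown around above are easy to sanity-check. This is a rough sketch using only the figures quoted in the thread ($100,000 per month to rent, $2,000,000 to buy, 18 kW per unit, 2048 units); all four are taken from the chat as assumptions, not from any price list.

```python
# All inputs are the figures quoted in the chat, not verified vendor data.
UNITS = 2048                   # claimed maximum cluster size
RENT_PER_UNIT_MONTH = 100_000  # dollars per unit per month
PRICE_PER_UNIT = 2_000_000     # dollars per unit to buy
POWER_PER_UNIT_KW = 18         # kilowatts drawn per unit

monthly_rent = UNITS * RENT_PER_UNIT_MONTH
purchase_cost = UNITS * PRICE_PER_UNIT
total_power_mw = UNITS * POWER_PER_UNIT_KW / 1000  # kW -> MW

print(f"rent:     ${monthly_rent:,} per month")
print(f"purchase: ${purchase_cost:,}")
print(f"power:    {total_power_mw:.1f} MW")
```

So the rent and purchase figures in the chat check out, and the power draw is about 37 megawatts.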
Ffs can’t tell nowadays
IM LITTERALLY NOT
I MAKE HUMAN LIKE CHATBOTS AS A HOBBY
Oh that’s sexy
NO ONE HAS MADE ONE TO THIS EXTENT YET
At least you have some chance with a bot, compared to real woman, at least while wearing that mlab badge
A strawberry is the fleshy, red accessory fruit of plants in the genus Fragaria, known for their sweet-tart flavor, aroma, and tiny surface achenes often called seeds.
Botanical notes
Strawberries are herbaceous perennials in the rose family (Rosaceae), forming low rosettes with trifoliate leaves and white flowers; the red part eaten is an enlarged receptacle rather than a true botanical berry, while the true fruits are the achenes on the surface.
Nutrition and taste
Raw strawberries are about 91% water and provide roughly 33 kcal per 100 g, rich in vitamin C and a good source of manganese; their characteristic flavor arises from sugars, acids, and dozens of volatile compounds such as esters and terpenes.
Use and season
They are commonly eaten fresh and in jams, desserts, and ice cream, with peak flavor in local temperate seasons (late spring to summer), though they are available year-round via imports.
damnit busted again
Love how this thread has devolved
*to the extent that i am currently interacting here on discord
??
mods ban them
this is the new casual thread
Welcome
why don't you wanna be friends with me
hey i was just correcting myself as i assumed your gender so sorry
Idm
^ Simp
Burh I’m 14
I can tell that
me too!
Bs
i litterally am
thats why im being banned of discord in a month
cause im in australia
You’re a bot that copies
don't announce that
and they are banning discord for under 16's
Wow, bro is a whole continent
And sounds human
why not?
its currently legal for me to be on discord
is it
How old u
Jailbait literally
67
Fresh
Ur deffo our age too
it would be illegal if im on discord in a month
What? 😂
also maybe we should take this to #general and not in a model discussion
Make it make sense
Yh
Keep it here
in australia discord is being banned for under 16 year olds on december 10th
This is the new off topic thread
Danny
Danmmm
Why
I am setting a delayed report
Porn and stuff?
they are banning all social media for under 16's
there's no model to be discussed here
there's no MPU
cause its bad for us apparently
bad for you while in development stage
or smth
What about Santa?
Yh it is
thing is the only social media platform im on is discord
The only thing you gonna get is mpgreg if you won't change this badge
and i cant believe they didnt ban roblox, like roblox has so many issues its worse than discord
Model isn’t bad
I done a lot of websim
its online multiplayer with voice chat for 8 year olds with a bunch of child preds
Games
also this is sad for me cause two thirds of my friends i only know online
How old r u
Talking to kids
Pedo
im saving up for a macbook instead
as i need a laptop for school next year, and my parents said i can pay to get a better one
parents are paying for what i need
i can pay for what i want
And bruh can you stop trying to move to us
What
Where do you make money from
yeah, ima get the m5 pro instead of the m4 air just cause the m5 has 3.5x better ai perf than the m4
You keep making convo
pocket money, other jobs, goals, presents
We told u our age
Ur so lucky
My parents aren’t that rich
I am 99% sure Openrouter or discord have anti-minor policy, need to recheck that. If you insist
Says the pedo talking to kids
I’ve screenshot ur mssgs
Trying to threaten us? To report?
nope, only follow discord tos and discord tos states 13
Tell him
He’s brain dead
Think he likes young people
This is why they want to ban discord
Cus of all these creeps
Disgusting tbf
Exactly. At least I am not as miserable as some people here
Seems like they moved to lmarena now https://discord.com/channels/1340554757349179412/1435953842956013620
Guys, we are not here to waste time with some unknown people. Whatever personal vendetta you might have will be addressed on our official channels.
this time... it's personal.
it's really not. the only personal aspect is that i find funny how a company with this type of technology would discuss with random people on the internet instead of just launching everything at once
I just imagine how hilarious it would be if a popular AI lab engaged like this with random potential customers
i'm sure they put a lot of effort into setting up the inference, but it probably feels very personal because this technology mainly exists in their heads
open source wen?
we got this thread before gta6
i googled it and according to abc on the 5th of november, discord isnt getting banned at this stage
Wait what?
i guess they changed their mind and decided to not go as far as it seemed they might
Yeah just saw that
Hmm
Interesting
Thanks for the link!
wow they added over 60B parameters in one day.. they must have had a lot of momentum to be able to go that fast
second screenshot sketchy as hell
why would quant change parameters?
i think its fine, just a shortened version of "its above 700B parameters, and its at FP16", not "its above 700B parameters when its at FP16"
a little ambiguously worded but there is a reasonable plausible interpretation
this is a great day for groomers gooners
lol. get these kids off github. there's no games here
That's so funny, asking them got them very confused
It does not sound like qwen/glm merge
Yeah cant believe Roblox isn’t banned
preview of checkpoint 4
You have expressed interest in leaving this group, said you were going to give up on a partnership with OR and said you are in talks with companies that will make "better" use of free credits
well i deleted it a few times and you guys bring it back, so why not post in here from time to time?
free PR
Yeah, I suppose you can keep advertising changes for PR here even though you have no interest in pursuing any OpenRouter integration, if you wish
Good luck with your quest in getting "free PR", and to whoever sends more messages here that help you achieve your goal
i appreciate the updates
Movement World! can you make it multiplayer so we can all join?
With blackjack?
yes, and a science-based dragon mmo
we are bringing it back?
your company was allegedly exposed as a scam, everyone lost their interest and forgotten afterward. any pr is a bad pr for you and you tarnished the trust of your 'potential' customers. wherever you go, people will remember of your shady company, and no one reputable will partner with your company.
"make it better" 🐲
proceeds to make the sloppiest AI coded welcome screen ever
is that comic sans
Lol, movementlabs left
we did it joe
He was referring to checkpoint 4, not the current checkpoint 3, but I don't know if that's true.
that's from his video
OR ppl are saved from the scam, we won.
now he is at yupp when OR and lmarena failed them, hilarious, lol
can you send me the link lol
sure, join using this: yuppai
thanks
the contrast between the amateurism of this account and proofs, and the claims they made
is comedy
like, peak content. so much for the reputable company that they are, who need others to do the benchmarking for them.
Did they leave any chips
They've literally already said what it is and its not that
#1434917422686801980 message
architecturally not compatible, obviously he is bullshitting.
LMAO i didnt know that

translation: we asked claude how to fine tune a model and serve it on cerebras
Hey everyone, I'm late to the party. What's the TL;DR on the Momentum drama?
🍿
Seems like they demoted everyone in their server now, their mods were fierce defenders
Wonder if these people will keep defending now
Momentum Stalking ^
🍵
- fraud amateurish company with false advertising, the more he talked the more he slip
- ppl bully the arab scammer and he acts smug about it
- man got tired and left and goes another places to scam people
- everyone still laughing
- the end
if have free time, worth a read, a case study even.
there were clear red flags day one of this but mods wanted to play
the beginning of this thread is so wholesome then everybody did their research
Are benchmarks out yet?
comparing Movement Labs' infrastructure with Movementlabs
Okay. So where numbers with scores? Where is the explanation of how it was possible to combine weights GLM and Qwen - models with different architectures?
according to gpt: architectures are extremely close.
Not identical, but close enough that the weights can be aligned
go to bed bro, give up.
you guys said its a wrapper on cerebras, now you are attacking the model make it make sense.
The whole text is written by LLM - it even has Limitation and Future Test in the footnote, like LLM do to force user input
check the video lad
actually the model is the least of the problems
that was their point
it's decent
they done it all with ai to prove it
i think we should give them benefit of the doubt
if you read the whole conversation, we already gave them that
TIME TO FIRST BYTE cannot be faster than cerebras, if its a wrapper yall
im not here to say u guys are wrong or right
Model Differences: Different models ("momentum" vs "zai-glm-4.6") may have different capabilities beyond speed
They tested a closed-source blackbox model vs GLM 4.6. Why GLM 4.6? Model speed depends on the model size & architecture, for example gpt-oss-120b shows 3348 tokens per second (!) hosted on Cerebras. Comparing an unknown model vs an open-weights model in no way proves anything
We're comparing an unknown value X to 355B
Momentum is bigger as the code quality is better.
What is the size of Momentum?
i spoke to the mlab guy on discord they are going to drop open weights 🤷♂️
Right after public benchmarks I assume, which are none still
idk they are in talks with LMArena
LMarena also confirmed this
so will wait
but for now i now know it's not a wrapper
i just don't like the fact group of bullies bully a new company who actually made a good model with good speed and good price
it's not fair for people like me who actually want to use it.
why do i think the "Time to First Token" they measured is GLM-4.6 reasoning
It is
and the only really outstanding difference
because "Total Time" as already said depends on the number of total output tokens
cerebras only hosts glm 4.6 reasoning mate
haha yes i know mate
look close some outputs are bigger from momentum than glm
so they can't count that as TTFT
omg...
yes that's exactly the problem
that's why Total Time is lower for Momentum
im talking about both
is how fast the ai starts spitting tokens
i just tested and NO cerebras request i had to wait more than 1 second
thats because i'm counting with the first REASONING token
just saying .
my friend you are so dense.
This is fucking ai article come on
are you reading?
regardless, if it's a wrapper it would also count their latency no?
ontop of what the mlabs server is making?
so your point is?
the LATENCY they are measuring is NOT accurate. they are counting GLM 4.6 first token AFTER its reasoning
but the reasoning is part of the output
Momentum is NOT a reasoning model
.
we already said it might be just a fine tune hosted on Cerebras
not a "built from the ground up" model
hosted somewhere like Cerebras
ok kid
as long as its not a wrapper
IT can be 200B model, we have no way to be sure. Stated means nothing without open weights
Actually we can calculate model size by comparing speed and applying formulas, finding Cerebras model with comparable speed
It's subjective. Objective is benchmarks, several 3rd party ones
Avg TTFT: 0.549s (Momentum) vs 2.339s (Cerebras GLM 4.6), 4.3x faster
we don't have the code to test it, so i can assume they're counting first token AFTER GLM 4.6 reasoning block; that's most likely to be the case because i've tested GLM on Cerebras and never got more than 1.5 seconds of TTFT for both Reasoning Enabled and Disabled. but again, private testing generated by AI, so can't really know, so it's safe to dismiss this measure.
Avg Total Time: 5.232s (Momentum) vs 9.644s (Cerebras GLM 4.6), 46% faster
it doesn't matter if the model almost always outputs less tokens in total. that can be bad or good depending on the situation. doesn't mean it's "faster", what makes it faster is Tokens/Second
Avg Tokens/Second: 814.36 (Momentum) vs 727.56 (Cerebras GLM 4.6), +12%
this is what "faster" would mean, but this percentage is so negligible in such little runs of testing that it might just fall into inference's margin of error
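To make the three headline numbers concrete, here is a small sketch that recomputes them from the averages quoted above. It assumes the first value in each row is Momentum and the second is GLM 4.6 on Cerebras (inferred from the discussion, since the original table layout is lost), and the dictionary keys are just illustrative names.

```python
# Averages as quoted in the benchmark; column assignment is an assumption.
momentum = {"ttft_s": 0.549, "total_s": 5.232, "tokens_per_s": 814.36}
glm_46 = {"ttft_s": 2.339, "total_s": 9.644, "tokens_per_s": 727.56}

# "4.3x faster": ratio of time-to-first-token values.
ttft_ratio = glm_46["ttft_s"] / momentum["ttft_s"]

# "46% faster": fraction of total wall time saved. This depends on how many
# tokens each model chose to output, so it is not a pure speed measure.
time_saved = 1 - momentum["total_s"] / glm_46["total_s"]

# "+12%": relative difference in generation throughput.
throughput_gain = momentum["tokens_per_s"] / glm_46["tokens_per_s"] - 1

print(f"TTFT ratio:      {ttft_ratio:.2f}x")
print(f"Total time cut:  {time_saved:.1%}")
print(f"Throughput gain: {throughput_gain:.1%}")
```

The arithmetic itself is consistent with the published figures. The argument in the thread is about what went into the inputs (reasoning tokens in TTFT, different output lengths in total time), which no amount of recomputing can settle without the raw runs.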
i said that it was good in the first day. i said i would use it.
their positioning and constant dodgy behaviour made me reconsider
At least the thinking model
OpenRouter TPS count for Cerebras + GLM 4.6 is 828 TPS right now
but totally different quality of outputs
anyway super tired.. school tmrw
who measured it?
the benchmark didn't test for quality
cursor
the quality we can test and we all know
it's aimed more at the speed i think
it doesn't matter because they are being disingenuous all the time
Where? I don't see numbers except SPEED, which is not quality
New hyped models get benchmarked immediately by their owners and 3rd parties, even while still training, while this one has not
also benchmarked by AI, like everything they do.
Where the fuck is quality
refer to this again
mlabs made more tokens u fool.
You think more tokens means better quality? No
because this time the tokens per second was lower? what is CEREBRAS AI
what model is that?
time to first byte is 0
Can we have negative TTFT? So we can actually time travel
i know. i'm saying that's not what they did
The best LLM is the one who starts answering before you started typing
time to first byte means how fast server was ready
Maybe they plan to sell therapy sessions after they drive everyone nuts
Just a note: the user you're arguing with is 14 years old
is he rage baiting me?
i can't even know if that was serious or not
Alpha gen is arguing just being the proxy of ChatGPT
but i guess it is
it should be illegal to be forced to engage with a 14 year old's opinion
What about women
Can they vote?
so what of my age
you are acting very 14
I guess people who use LLMs to fill in their arguments just naturally gravitate to each other. It's like cryptobro/nft circlejerk
yeah sure i think You're absolutely right!
Of course! Let me explain how to create a 10000 TPS model and hardware from scratch:
did they ever explain why the 'model size' kept changing?
As ambition grew
I guess they have a way to change model architecture on the fly
"Checkpoint stitching"
omg they don't even know GLM 4.6 is a reasoning model; also they asked the AI to check if it was a "fair test" 😭
@Grok check if test is fair
Can be. Or can not
again. even disconsidering that
This is the most gaslighting I saw on github for several years, and I browse it quite often
it's about the same as a hosted model would get in Cerebras inference
tho i agree
with this
Would you disable reasoning so your models looks worse in comparison, or leave it as is so you can show better numbers?
are they too deep in to just come out as a fine tuned model on Cerebras?
this is the largest red flag
aside from how AI it is - my favorite highlight that shows this:
For this particular reason it's Gambler's fallacy, when you already wasted time/resources to do something, you are doubling down, though just stopping will lead to better long-term outcome
About This Benchmark This benchmark was conducted independently to compare the performance of two AI API providers. The test methodology prioritizes fairness and accuracy, with all parameters kept identical between providers.
No it is not
And how the fuck is it independent?
It's literally the creator or A in A/B testing
Guuuuys. Who worked with Cerebras directly? I think I found something sus
just so u know @vapid ibex Cerebras glm api is down for public usage lol
only
personal
😭
it times out cant even make an flappy bird game
with it right
r u a girl?
what are u even asking atp 😭
men moment
don't engage
🍵
what
In the Technical Details section of this amazing speed benchmark:
Both APIs returned minimal headers:
Movement Labs: cache-control: no-cache, no-store, must-revalidate, content-type: text/event-stream;charset=UTF-8
Cerebras AI: content-type: text/event-stream; charset=utf-8
Is it me, or those are very specific headers? Or is it like default API/Rest API return?
I'm dead
LMAO
i tried verifying some of this stuff but i guess it's something related to OpenAI-compatible endpoints
Oh no
im gonna save this and add to wayback machine
It's MOver
This is so tiring
Hey now I'm 17
We can be decent people
Can we actually just see the MPU
Though I think that, at this point, no provider will add them to anything, it'd be bad PR
The video they link in their own benchmark lol
The entire benchmark is made and judged by Cursor, that calls it an "independent review"
If it's hosted on cerebras, could cerebras confirm or would that be breach of privacy or smth
There you go
Two fingers on his hand got fused, I thought they fixed that
Ew lol true
finally
The pinky ring pringy finger
KWANCEL ACCELERANCE 🚀🚀🚀
Amen
CRAUTIC
INTERLOCATRIX
You're describing sunk cost fallacy
Gambler's fallacy is about fallacious estimates of probability
Yes, that. But in this case cost is more about time and reputation
99% sure some kind of NDA is signed
Can they refuse service though if they know they're operating a scam?
Like, if they run their models on cerebras, then cerebras must know something's sketchy going on
Hmm
Because cerebras doesn't have any of them momentum processing units
Is cerebras hardware available for sale?
You can host custom weights with them
If price is manageable, they could buy cerebras stuff and localhost in some cheap country
That's not the same, still depending on external service
Yes
I think Toven should introduce OpenRouter model running on TPU (Toven Purring Units (two cats))
Idk if you can buy them but here's one of their guys holding WSE-3
The only company with a chip as big as your head, Cerebras has a unique value proposition when it comes to AI silicon. Today they are announcing their third generation Wafer Scale Engine, called WSE-3. Built on 5nm, this chip increases the cores to over 900,000, has four trillion transistors, and doubles training performance over WSE-2. Each sys...
That's a chunky one
No way they sell it in stores or would sell to startup written by LLM
Probably cuz they can sell the service for whatever price they want as long as they're the only one with the chip
"Primary Momentifier" 🗣️
No secondary momentifiers, primary is that good
That's for MPU 2
i couldn't make GLM think longer
reasoning enabled on the left, disabled on the right
but for sure TTFT never reach 5 seconds
the TTFT numbers are actually close to momentum so lmao
i shouldn't engage with this anymore
Reasoning took TTFT Content - TTFT Reason time?
My chess prompt often makes for longer reasoning if you wanna try
Make a pygame app in a 1000x2000 window which is a chess clone for two players (turn-based, on the same screen). No external images or sounds. No sounds at all in the game. Before each turn, players roll a die, and whoever scores higher takes the turn. As such, in the game, it is possible that the same player moves multiple times in a row. Be very creative and give it your best. Draw the pieces with pygame drawing tools, as detailed as you can. The game is completely mobile, and so no keyboard inputs should be expected, only touch. Include all game mechanics of chess, not skipping anything.
oh thanks
That's one better
"MPU: Lemme feel your bits"
reasoning
This is a comprehensive request for a chess game with some unique features:
- Pygame app in 1000x2000 window
- Two players on same screen, turn-based
- Dice roll before each turn determines who moves
- No external assets - draw pieces with pygame
- Mobile/touch only - no keyboard inputs
- Full chess mechanics
- Complete, production-quality code with tests and documentation
I need to:
- Create a full chess implementation with all rules
- Draw pieces using pygame drawing functions
- Implement dice rolling mechanism
- Handle touch/mouse input only
- Make it work in portrait orientation (1000x2000)
- Include proper documentation and tests
Let me structure this properly with classes for:
- Board
- Pieces (with inheritance)
- Game logic
- UI/Rendering
- Dice system
- Touch handling
I'll use Python with pygame and create a complete, self-contained application.
TTFT reason (ms) TTFT content (ms)
960 1295
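For reference, this is roughly how the two TTFT values above can be separated when a streamed response carries reasoning and content in distinct chunks. It is a minimal sketch over a simulated stream: the `(ms, kind, text)` tuple format and the `ttft_by_kind` helper are hypothetical, and real streaming APIs name these fields differently. The 960 and 1295 timestamps are the ones measured above; the other chunks are made up for illustration.

```python
from typing import Dict, Iterable, Optional, Tuple

# Hypothetical chunk shape: (ms since request was sent, kind, text),
# where kind is "reasoning" or "content".
Chunk = Tuple[float, str, str]


def ttft_by_kind(chunks: Iterable[Chunk]) -> Dict[str, Optional[float]]:
    """Return the arrival time of the first non-empty chunk of each kind."""
    first: Dict[str, Optional[float]] = {"reasoning": None, "content": None}
    for t_ms, kind, text in chunks:
        if kind in first and text and first[kind] is None:
            first[kind] = t_ms
    return first


# Simulated stream: reasoning tokens arrive first, then the visible answer.
stream = [
    (960, "reasoning", "This is a comprehensive request"),
    (1010, "reasoning", " for a chess game"),
    (1295, "content", "import pygame"),
    (1300, "content", "\n"),
]

times = ttft_by_kind(stream)
reasoning_gap_ms = times["content"] - times["reasoning"]

print(times)
print(f"content arrived {reasoning_gap_ms} ms after the first reasoning token")
```

A benchmark that only watches for the first content chunk will report 1295 ms here, while one that counts any token will report 960 ms. Comparing a reasoning model against a non-reasoning one is only fair if both sides are timed the same way.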
Oh
idk
whatever i just wanted to see if it ever reach 2 to 5 seconds
but no, not even with reasoning
That's Cerebras through OR?
yes
Usually the thoughts are more convoluted
Cerebras API directly could be even faster, like 100-150ms faster
ok i'm sure most of us are settled on that right
like cerebras plus some model
i will stop engaging, i'll be an observer
^ Mocker and scoffer
I'm not giving them a break
I hate this sort of "product"
Can't let the waters muddy
I love my bros with manic stages
Oh, speaking of, I have a good meme
i trust you to keep doing that
You having your vitamin D3 intake daily?
btw have you seen them in another server?
oh! they posted on general on LMarena
i didn't see that it had come from there
If I have a nickel every time Hasan guy is having problems with reputation this month alone
I would have two nickels. Which is not a lot, but it's weird that it happened twice
i legit think they were coked up when doing the original Not Cerebras "benchmark"
at least
i have a special sense for this.
How special are we talking
For most of llm final users it's enough to believe. Cough ||JAI||
I somehow do even without going outside or supplements, surprised my blood work was good there
How's that even possible, especially further from equator? Just take it either way, won't hurt, hard to overdose on it
I touch a synthetic grass substitute
1200 IU is enough till March
Well, I am fairly close to the equator (Brazil)
Sorry for your loss I didn't know
I touch AI-generated grass girl
Just grass? No greens like onion? Catnip?
wtf does that mean
Those are the ones who didn't fit into opened beaches
Toven is DISGUSTED about the idea brazilians are normal
well
well i am argentine
Oh no
I heard you guys such a good friends
i'm still laughing at this out loud
lmao what
got one
why are you assuming i'm normal
no mate, they completely different countries. they're not all the same you know
What happened? I am on Internet since 2004

that's actually true
It reminds me of South Park sketch about Japan and China
we're not the same, toven.
When
https://www.youtube.com/watch?v=G7xP5EFThh0
We learn about diversity of South American people
Lu Kim, the owner of City Wok invited City sushi owner, Junichi Takayama to a school meeting claiming it to be about the diversity of Asian people. Little does Takayama know is that the meeting would be a trap to embarrass him.
I just loved this scene from the episode Sushi Trouble. It does a great job satirizing how the Chinese view to the J...
i think brazil has managed to replace all of the old memes about them, which were not as nice
i would mention something about Neymar but his team is almost falling to second division
so not even football as a strength anymore
Most interesting fact about the Brazil I learned a year ago or so is that's the weeb dream country, as it's 2nd country in a world with japanese population (after Japan of course)
There is even Rakugo lady doing shows in En/brpt/Ja interchangeably
also Lebaneses i guess
We also have some more stuff rolling this week.
it would be so funny if they posted something like an AI generated or edited video
it would be very on brand
SunoAI music hymn
In MPU we trust
no.
if you send me $20 i will double it (trust)
i noticed this when i saw wplace and you start in Sao Paulo
are you an MPU engineer?
"wow, they have culture now. thats great"
Go outside and look at that big ass statue on the mountain, tell it they don't deserve another chance
I have a theoretical degree in MPU engineering
i'm not from Rio 😭
The what?
come back when you have an AI generated degree
They hired Mr. Fantastic to design an MPU
They gonna what us?
sorry my bot's MPUs melted
I can't believe you'll need to spend momentillion of dollars to get a new one
jeez how much are these?
also what’s the TDP? 💀
More than $2,000,000 it seems, TDP is not a problem with that cost
*to buy
cheap.
they need to make a pcie card version of this 💀
or like $100k/month to rent
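For scale, a rough break-even sketch using only the two figures quoted above (roughly $2,000,000 to buy, roughly $100k/month to rent). Both numbers are the chat's own estimates, not confirmed pricing, and the function name is made up for illustration:

```python
def breakeven_months(purchase_usd: float, rent_usd_per_month: float) -> float:
    """Months of renting after which cumulative rent equals the purchase price."""
    if rent_usd_per_month <= 0:
        raise ValueError("rent must be positive")
    return purchase_usd / rent_usd_per_month


# Figures below are the estimates from the chat, not official prices.
months = breakeven_months(2_000_000, 100_000)
print(f"renting costs as much as buying after {months:.0f} months")
```

This ignores power, cooling, and hosting, so it only shows the order of magnitude.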
its 20cm squared
and draws 18kW
just need a thick case
Renting is like online? Then again, the end user does not care about TDP
damn the water heater for my shower draws about that much max 💀
You can heat water AND host Momentum model
though 10kW is enough to shower if you don’t want to burn yourself
and this is instant, no storage tank
I really hope their data centers provide district heating 💀
owned? they could get dedicated hosting.
Support for custom model weights
https://www.cerebras.ai/pricing
they could negotiate their terms
get owned
Not identical
that is more than enough to not be compatible + gpt? that shit makes up the most, use claude
attacking the model? we are attacking an alleged scammer
so is deepseek at cheaper price
source? show me their benchmark.
drop their slop merges? wew.
they have not released traditional benchmark to back themselves up.
nobody is stopping you from getting ripped off, don't drag ppl down with you
nvm them, they can't consent and their opinion doesn't matter.
Glory to Arstotzka
Idk if i would call it a scam in the business sense though, they are effectively delivering what they promise, but lying about the method used to deliver
still a scam. People choose their service under false promises
False advertising at best
^^^ speaking of false advertising
(There was a scam message above mine for reference)
I love when people are kind enough to use the appropriate channels
I know it's kicking a dead horse, but
It's actually impressive to do so poorly
And that is using a solved, irrelevant benchmark
Honestly I can't believe momentum isn't the worst thing to happen this month
Huh, what is?
That sentence has a couple interpretations and Idk which one is correct lol
Both of those interpretations are sherlock
Good point
when you 'supposedly' merge slop models, but still get scores lower than glm 4.5 air.
i kinda miss the banter and laughs here
i don't think they've ever released the videos they promised
doesn't matter when their reputation is amazing, and it is so great even in their own benchmarks. makes me doubt if the benchmark itself is maxxed.
btw, did i mention it is $1 in and out for such amazing model?
We are basically losing money by not using it
what are y'all's favorite cli tools? claude code, codex, kimi cli?
momentum cli
agi
aider / gemini cli
I can't lol, they recently asked nano-banana to turn the Anthropic logo into a tree and that's their new whole branding
Pinky promise I'll stop kicking this dead horse from now on
Keep going
daily momentum reminder
what we could’ve had if we had just realized this is agi running at 300k tokens per second on brand new computing chips
at such a low price that they actually pay you to use it
HumanEval in 2025
i have no clue why they're listing themselves for benchmarks they're not even good at, like GSM8K
like terrible at
https://evalplus.github.io/leaderboard.html Not 100% sure this is the right thing, but
The Holistic Evaluation of Language Models (HELM) serves as a living benchmark for transparency in language models. Providing broad coverage and recognizing incompleteness, multi-metric measurements, and standardization. All data and analysis are freely accessible on the website for exploration and study.
No methodology info
Yawnnnnnnn
So they have 2 models now. One is a flagship model, the other is another flagship model
And when they release a 3rd one they'll have three flagship models
You know the other companies are just stupid
They remove the title of "flagship" from old products when they release new ones
They gotta let them accumulate
😭
really??
I just spent $3000 of company money on momentum API credits 😬
There are people who would disagree
pls halp your company site http://127.0.0.1:3000/ doesn't open for me
b-but momentum is supposed to be agi for the low low cost of free 😭 😭 😭 😭
did you try firefox instead of chrome
Works now, ty
no problem 👍
The model actually seems clever
Told you
i made a spacex rocket clone using tensor and landed on mars yday
i rest my case
👆
Link? Where can I try it?
It's not smart, it basically said I'm the captain and I'm so safe (ignored your question entirely)
dumb sloppy response
fishy scam company and the owner left but here you go:
#1434917422686801980 message
daily reminder
Daily pack smoke
Id hate for this thread to lose momentum
god it was SO funny
lol
Crazy
Not gonna say much but it's really good
it's the only model that made me a proper working chess game playable
What are the api costs?
Hmm not available over Api it seems
Don't give your money to them
Website says $1 / $1
Only for the momentum models
Docs only list momentum models
Hm, fair
wow that may be AGI
"Only model" definitely not. GPT-5, 5.1, DeepSeek V3.2, Opus 4.5 can all do that
Tensor 1.5 failed in my test actually
What was ur test g
make me a proper working chess game
are you from egypt?
clearly
sad
Make a pygame app in a 1000x2000 window which is a chess clone for two players (turn-based, on the same screen). No external images or sounds. No sounds at all in the game. Before each turn, players roll a die, and whoever scores higher takes the turn. As such, in the game, it is possible that the same player moves multiple times in a row. Be very creative and give it your best. Draw the pieces with pygame drawing tools, as detailed as you can. The game is completely mobile, and so no keyboard inputs should be expected, only touch (Through mouse events, not finger events). Include all game mechanics of chess, not skipping anything.
Modified chess game, but includes a chess engine
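The dice rule in that prompt is the only non-standard mechanic, so here is a minimal sketch of just that part. It assumes a six-sided die and that ties are re-rolled; the prompt specifies neither, and `roll_for_turn` is a hypothetical name, not anything from the models' output:

```python
import random


def roll_for_turn(rng: random.Random) -> str:
    """Both players roll a die; the higher roll takes the turn.

    Assumptions not stated in the prompt: the die has six sides,
    and a tie is resolved by rolling again.
    """
    while True:
        white, black = rng.randint(1, 6), rng.randint(1, 6)
        if white != black:
            return "white" if white > black else "black"


# Seeded generator so a run is reproducible.
rng = random.Random(0)
turns = [roll_for_turn(rng) for _ in range(10)]
print(turns)  # the same player can appear several times in a row
```

Because each turn is decided independently, streaks by one player fall out naturally, which is what the prompt means by "the same player moves multiple times in a row".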
is this just deepseek r1 0528 distil qwen3 8b running on cerebras hardware or something?
Seems to be a GLM 4.6 finetune or something like that
Mfw whole zk ecosystem is from Egypt 🥀
what is zk 🥀
Are you a Momentum sleeper agent?
He is just a kid who got catfished by Hasan Momentum, and now he is too ashamed to admit it to himself and others
Niche community reference not hitting … twin I’m sad now 🥀
fill me in 🥀
how good is this model? is it SOTA?
No
There are no credible benchmarks or decisive proof this isn't a wrapper of an existing LLM
Their own benchmarks place this model very poorly, but the results they compare to seem inaccurate and there's no methodology listed
There's no proof the company members exist, no proof their claimed MPU exists (they've been promising to reveal these for a while)
Ppl who claim to work for Momentum have affiliation with a past deceptive AI "company"
TL;DR do not engage
They exist: 2 brothers selling car parts or something. One of them created some blockchain cryptobro app or site, and then suddenly they created the things companies with 100s of employees and huge budgets can't do in years
But the point about the previous ai 'company' really hits the nail on the head
They've been promising for a while that the company members will join their server and introduce themselves lol
Well, to be specific, the two members other than the 2 known ones lol
The Movementlabs guy was Hasan, there are probably no other company members, only the 2 of them
oh thats literally me
NEW MOMENTUM MODEL????
Momentum is OpenAI???
The what
Back from the dead
Whats up with the reactions, sleeper agent?
Yes
lmfao, their Tensor 1.5 died (timeout) because I asked it an A5 problem 🤣
One momentillion of tokens, impressive
