#general
mods make him use the nvidia model @atomic lagoon
Worst punishment of all
If I wanted the nvidia model I would go to gpt 4o
LOL
BRAHAHHAA
WHY I ALWAYS GOTTA RUN INTO PROBLEMS
i did switch over to mobile
perhaps that's why
how much are you paying
bruh. so much spam today
For real
annoying
Very
Our typical automod features haven't been catching them either
The scammers are evolving
Fellow jai user!
HELOOOO :3
lmao
So they're not just kind people trying to help us get free money from MrBeast?!???
Unlikely
I can't decide if I hate or love this
Hate is acceptable
yo how good is amazon nova 2?
i hate it cuz its nintendo
Is it though?
Nintendo are great
yeah i've liked super nintendo since 1990, because it's classic and nostalgic
Claude Opus 4.5 might be better than Gemini 3.0. With one prompt I told each to make a Minecraft clone: Claude Opus gave me a whole chunk with trees, and Gemini just gave me a grass template that's kind of glitchy, plus a Gemini chat that I didn't ask for. I used Gemini 3.0 for this, by the way. I did use two prompts for the Claude one, but the second was only to remove the shaders because I have a potato PC
The first one is Gemini the second one is Claude
I think I might know what I'll be using from now on
What's going on with Gemini 3 not following fairly simple instructions? It's not exactly a hallucination, but it just goes off on its own tangent and wastes your time trying to right the ship
Is there a Gemini rep in here?
yeah it's broken again but it'll work again later
How is everyone.. I'm new here. I have been stuck on "generating" prompt since 2AM last night.. it's 11:46pm the next night.. Anyone else having problems?
I've gotten kinda better results maybe with a different model, but I used a really long prompt
Anyone else? It's still going on
character speaks Indonesian without background music:
Which
check #1438733849235558480
Claude is anti-lazy, Gemini is lazy. You may find the first trait useful, especially for the task you are describing, but it's not a performance indicator by itself to be fair
just something to keep in mind
How can a 40-year-old man not wake up at 4 am at night, and then sleep until he wakes up naturally?
you think so?
Yeah fr
anthropic models always came out lazy to me
Gemini 3 doesn't even complete what you tell him
gemini is a yapper
yapping machine
it just doesnt stop
but they did improve on that
it uses less tokens compared to 2.5 pro
claude always gives half-as*d answers
2.5 pro isn't that good
You need good prompt
if i need something faster than 3 pro
Gemini 3 is dumb as :(
i always prompt it with 'gimme more details'
but doesnt seem to work
Try gpt sounds good
Claude opus 4.5 or thinking
They need a good prompt and then they're going to do everything, but Sonnet is not that good even with a good prompt
Did anyone try deepseek?
Gemini and gpt always try to simplify their code and messages, and I don't know why. I expect it's to use fewer tokens
Fr no reason
oh gpt 5.1 is a big yapper too
its fine ig
I said gpt not 5.1 cuz I know he is a yapper
Ai is something else
Instead of rotating the character the proper way, you know what it gave me
Lmao
They close the toilet seat lol
But thank God I got it to work
What the hell
I want seedream4.5
It could be a rate limit
If you're a developer at a big company or even running your own small platform, and you intentionally build hurdles just to push people into a pricier tier by making basic tasks annoying on purpose, I don't know how you sleep at night or justify to yourself how you treat people. Being upfront would get you farther, because you'd be surprised how many people would actually pay if they didn't feel forced to sign up just to get rid of the annoyance.
When kling 2.6
Soon...
I would sleep easy
If I was having a difficult night I'd add an extra tier to get some xannies
🤣
Yeah you can make 4.5 output an entire project in one go. Gemini can't do it
@weak sparrow Please head to #1397655624103493813 for a detailed guide on how to use the bot
GPT Tier list, for their time:
S: none
A: o1, 5
B: 5.1, 3.5
C: 4
D: none
F: 4o
Reload the page lmao or open a new chat
5 was their best and I'd only put it in B tier. Also 4o was not that bad at all, shouldn't be F
I probably need to restructure it a bit but 4o was disappointing in many ways. Felt worse than 4 across the board and existed for a very long time without any improvement. I think the failure of 4o is why the release of 5 felt so monumental
Although 4o did improve response speed and audio dialogue
5 in A???????????
I fell for ragebait
5 is a super sharp model and I love it
But 5 above O3?
Thinking better, in O3 release already had gemini 2.5 pro
honestly never got to play with o3 so I completely forgot about it
They already have artificial wombs, when will they release ai husbands/wife? Is it not the priority?
Yeah
as for 5, coming from 4o hell it was a breath of fresh air. Can actually reason and have some intelligent dialogue without taking 10 minutes and paying $10 for a response using o1/o3
Anyone know how we can solve this problem? Retrying isn't working.... I'm tired of this error @tiny palm
you're rate limited bro, using it too much
So, take a break? and if I come back to the same chat later will it work?
It is but with caveats. Not all versions are good, and I think for search / deep research o3 can still be better tbh
- with 5.1 they kinda went further away from the raw performance towards style/emotion
o3 felt like a hard worker, gpt5 feels like a better improved base model but also more quirky and needy if that makes sense
Nano pro broke again
nah bro
Does anyone know how to activate the Google Cloud free trial without a Mastercard? I only have a virtual card, and it doesn't seem to work. Any suggestions?
You could probably buy a prepaid giftcard with no money on it for very very cheap and then just use that
Prepaid cards work for google at least when I tried gemini
But needs to be mastercard like you're trying
Try Yupp Ai it works well there
Do it from phone, google pay.
It has a sign in.
Obviously
I tried prepaid and virtual cards, but Google Cloud still rejects them. I think they require a mastercard for verification
Never ask that question to any model
They don't know their own identity
The Ai Identity Crisis
I tried that actually, even through Google Pay on my phone, but Google Cloud still didn't accept my card
Its not even that, its just their memory is behind so they're more so guessing what a model would be called
Prepaid mastercard?
@sterile tartan Is this AI the real 5.1 version?
Yes it is
Absolutely
The Models are as They Named
Thanks
I just want the free trial
I think u need credit card
Hmm yea I guess some stuff you cant use prepaid card on
All I know is I did it for gemini
Well credits worth 300$ won't be so easy to get
Yeah probably. Google Cloud seems strict. My virtual/prepaid cards don't work at all.
This trial is mainly for businesses so they can test and try it
Yeah, that's true.
Yeah, $300 isn't easy to claim with a prepaid card.
Me too
Well better luck next time
Yep, better luck next time. I'll try again when I get a real card.
Yeah a Credit Card
So they can bill u if u mess up
Wait, what do you mean by if I mess up?
Let's say somehow you manage to exhaust 300$ and kept using further without realising it
Oh okay, so you just need a real credit/debit card for verification. You don't actually need money on it as long as you stay within the $300 free credits. Prepaid or gift cards usually won't work though.
i love how 4.5 opus gives you a full on project on one go
while you need to keep on polishing with gemini 3
but its also good
If you go over the $300 free credits and your card doesn't have money, Google Cloud will try to charge the card and the payment will fail. Your account might get suspended until a valid payment method is added. So basically, you won't lose money, but you can't keep using the service until you fix the payment.
Yeah but with credit cards there is a balance limit that can be charged
So better use debit card if it works
Everyone has different opinions on whether Gemini or opus is better, I think opus is good for planning, while Gemini is better for coding
Definitely in my case, might be different for other people
Yeah, true. Credit cards have limits on how much can be charged, so a debit card might be safer if it works. But it seems like only MasterCard actually works for Google Cloud verification.
And also that Gemini is completely free, while opus isn't
Yeah exactly
Is kinda same
Free Gemini 3 Pro Free 5 Prompts Daily
Although you can use it on Ai Studio
On Antigravity u can use Gemini 3 pro for free infinitely
Well i don't have a pc
Well gemini 3 is definitely more accessible for Free rather than Opus
I haven't tested opus out too much, so I won't flame it, all models have their ups n downs
A new good one comes out every day anyways 🤣
Opus is One of The Best for Coding
Could u help me with that?
But Very Expensive
Not in all coding I wouldn't say
Veo 3 is available on Dreamina
Limited free usage tho
for that price i would rather use gemini 3 lmao
Are u sure
how do i use veo 3 for free
Good morning sigmas
Dreamina
Where can i get Dreamina
Has anyone here made any coding projects, websites, games, etc… just wanted to see what people are up to, maybe get some ideas
yes
mostly roblocks
R0blox? (Idk why the words filtered)
Ohh so it helped u make a game inside r0blox, not robl0x itself?
Interesting
yes
Hi, I honestly love this platform, having come across it on youtube, and I'm very enthusiastic about it. I am beginning my journey in AI on this platform and I am excited already. Thanks, I love you all.
@echo aurora will seedream 4.5 be on lmarena?
to
What is this server bro..
The Website
The Lmarena website has been very stable for me recently, no more getting those annoying errors. Thanks dev team.
you're welcome (I'm not a dev)
We just got dumber it seems
Dude that seems glitched or something, everyone gets 150 Free Credits Everyday...
Try Desktop Mode
These models are extremely extremely unaware of what they're doing or generating
To be precise they only predict the next word
No, I'm talking about like the safety part of it
Oh there isn't enough safety i think
There is some, but it's not sufficient
Like if they're planning to have these in schools and for kids to use
I don't know, I've been frankly just baffled
Oh well yeah...
it is real
Maybe, I don't know 
We'd put out an announcement so keep an eye on #announcements
I'ma show you guys something really quick, and I put this on everything: I didn't prompt anything of this nature whatsoever, not even remotely close
How insane is that?
Like frankly, I'm speechless
Damn, this technology is powerful
Let me see if I can find the articles for you real quick
You never know what you're gonna get from the other side.
@echo aurora Does Battle mode now also have a rate limit within a certain time window (IP-based detection)? I haven't triggered a single reCAPTCHA, but now every new generation in Battle leads to an instant "something went wrong".
I have tested it out with cellular network; generation was totally fine. I just need an official answer, since I believe it could've been done in a more obvious way, like a rate limit notification.
I know what you're talking about. The same thing happens to me. I use the DuckDuckGo browser.
I suspect it has something to do with DNS, Cloudflare and Google, and the way the arena is built fundamentally, which is what makes it possible to have all these models in one place for so many people
Thanks mate, downloading now.
If it's not working, guess vpn is my next bet lol
request is being rejected at the network level (likely by the Web Application Firewall or WAF) before it even reaches the model or the "human check" system.
To protect the GPU or something from heavy resource models
We did make some new changes to the rate limit system.
now every new generation in Battle leads to an instant "something went wrong"
When did this start happening? Yesterday?
Is it just battle that this is happening in or Direct/Side by Side also erroring out?
Same thing is happening to you @keen beacon ?
But I think you guys just have too much traffic too much volume
I did get a weird error today that I've never gotten before
"You reached your daily limit of uploads"
Since yesterday. Me and Gehlo had discussed it before. Once it happened in Battle, Direct/Side by Side also started erroring out.
The thing is I'm just an unreliable source, because I don't know what's going on on the back end, and I'm not experienced enough to know how lm arena works and what's considered normal and what's not
I take things as they come, so to me it's all normal. I guess I don't really have issues with any of it, cause I have nothing to reference it to.
And I don't use any of the models practically besides battle image
I try to avoid direct conversation unless I really have to or need it
To add on, once the error took place on PC, I used my mobile phone to log in to a different account but on the same WiFi as my PC, and the error persisted. Then I switched to cellular network with the same account, and generation went normally (while the PC on the WiFi connection was still erroring out).
(I almost don't use direct, stick with battle)
Your network has a more secure connection
But you are right there could be a cooldown or soft block
Hello everyone, I'm a new member. I'd like to ask how I can improve my performance on LMArena.
Browser/Device Fingerprint When you switched to cellular, your phone was assigned a completely new Public IP address by your mobile carrier.
Hello everyone, I'm a new member. I'd like to ask how I can overcome limitations and increase the number of conversations I have with various models?
I hate to break it to you, but unfortunately, there is no way to do something like that. What you see is what you get.
I think that's the case. Just wondering what's the standard for triggering the cooldown now lol
Well, like I said, it's hard to confirm dude, because we don't know exactly what's all at play behind the scenes. We could speculate and make our best estimations and guesses, but without knowing specifically and exactly, we're still in the dark
Word
So all I can do is wait for the time to pass and then reset the conversation count?
Yeah, patience is a virtue. Either way we should be grateful and thankful for what's already given. It's more than plenty and it's very generous.
I speculate these were those hidden models. In image gen, there are similar "models" like autumn or ghost
Flow is google of course
Google flow
all right, thanks
Trust me, bro, if you wanna get ahead, and it really is important to you, and you find that you use it daily, and you have the opportunity and the means, you should definitely buy a subscription to your favorite AI
I suggest Gemini
Most value
Yeah, that's a great and perfect model
Not just model but the value is 6x
Google includes a lot of services and a lot of cool AI features, and for 20 bucks a month that's a great deal
Not only that you can share it with 5 more people
6x Value
And those 6 can be yourself
Yeah, but not everybody can afford it dude, which I know sounds crazy but it's the truth, so we also gotta be understanding of people's situations
It's easy to do what I did, just say buy a subscription lol
But it really is the easiest way of achieving a lot more with a lot less than struggling
I agree. I've paid for gpt for almost 2 years just because I had too many memories and custom Gpt stuff with it. Otherwise I see Gemini really thriving.
Trust me, I'm divided on the issue
Because even at $20 a month I know I'm overpaying, because the value of the services on the secondhand market is extremely cheap
Which makes the whole AI industry just a very complex mess of financial problems on all sides
Because it's hard to be profitable apparently
But at the same time, it's not really worth buying either; if you look at the statistics, only 3% of users pay for AI services
Anyways, I don't wanna get into that. I just wanted to clear the air, I guess, if you will.
Things for us to consider and be aware of, that's the gist of my point
Well there are some offers like the student one or 4 months free trial by a pro user
Or they give 1 month free trial too
Well, this is what I'm saying: there's ways to get it free, there's a way to get this and that cheap, free, whatever, but at the same time there's the amount of resources in the real world that are being used to sustain this: electricity, water, etc.
Like we all live on this planet, we all live for those of us in America within the same finite resources
Yeah these companies are paying price for infrastructure and compute
Is expensive
Very hard to be profitable
Well, that's what I'm saying dude, it hits all of us, like I said, at the end of the day, at the bottom line
I'm not preaching or trying to tell anybody how to use their AI or anything like that at all, I'm just saying that you know you gotta keep these things in mind
I don't think any of the 4 top ai companies are in profit
Yeah, it sucks dude, it really sucks cause it's awesome powerful technology, but it comes with all these side effects
Yeah is a very big problem until models and infrastructure become cheap or something
Ai is not like software or internet
Compute is expensive
Okay, thank you for the additional information. And for my clarity the error message was Something went wrong...? Except that one time you had a Daily limit of uploads error @keen beacon ?
Once the errors started, it didn't "fix itself" for a short period of time? It continued to persist?
When it's convenient, if you could trigger one of these errors and record with: https://jam.dev/ and then send to me in Direct Messages, that'd be a huge help.
In the meantime I'm going to try and repro.
I got u
How would you know
All anon models can pretend to be google
anon models dont have ELO
But it's Google's deepmind
Google Deepmind 2.0
Easiest to speculate if its tru
Well it will be revealed
I know a way to test the frame-flow model
nickname made a steganography puzzle
that only gemini 3 can solve
furthermore giving it to grok anons triggers a jailbreak
Interesting
i know frame-flow is new as i have a list of anon models which i havent seen an updated ver of yet but it wasnt there
So what's your final verdict?
dont have one yet
i got to run the steganography puzzle
going to be so disappointed if its grok tho
I now know that Assistant A from this gen is a grok model
For some reason steganography like this just triggers it to jailbreak itself..
also much harder to run this bench because im sure uh arena got updated that you cant use the same prompt 4 times
@echo aurora Is it intentional that you can't use the same prompt 4 times in Text Arena battle now?
Another model i haven't seen before.. [TEXTARENA]
The answer is wrong btw, and also another new model. Damn.
I doubt its google made.
Yes, there is a repeated prompt limit now (it starts to error out at 4). However, if you do a new prompt it should start working again.
However, just spotted what looks like a new bug if you do this. In the same chat if you type a new prompt, it essentially deletes the old prompts/responses in the same chat.
I assume this was made to make the arena fairer, than it just being tested for one thing?
also frame-flow failed this, so i doubt its any google model
@sterile tartan
I don't know if it could like do it since
Only gemini 3 pro could do this, Flash is supposed to be weaker, so it might not be able to do this?
Still no proof this aint google tho, if so.
Well time will tell
It will be very interesting to see this new models arena
Maybe a new company has entered the war
What the heck is frame-flow, it just popped up and kicked opus's butt in a text battle. Whoa
what did you try with it
Btw guys any of you know the rate limits?
depends what model
Let's say the SOTA
so Opus?
All modalities
All SOTA
They have different ratelimits
Opus has a 5 prompt ratelimit if im not wrong or a bit higher
Let's say the top models of top 4 countries
Opus 4.5, Gemini 3, Grok 4.1 and GPT 5.1
LMArena ratelimits right
Gemini 3? Idk 100 prompts mayb. I dont freaking know.
Grok and GPT, a lot i guess.
Ratelimits are hourly
Financial and horse race problem solving. Which are the best benchmark tests in AI right now. It kicked commie Claude into oblivion. I think it might be Gemini 3.1 pro grounding.
With this bug, if you refresh the page you'll see everything again. 
Team is working on a fix.
Holy Macaroni that's awesome
Do you know for the image models?
It is not gemini 3 pro as it failed the steganography bench..
That bench is bunk. It's a scam. Means nothing
Nope. It's actually good. Only gemini 3 can solve it and nickname made the bench.
I dont think frame-flow is a google model or maybe is flash.
Also work on gemini 3.1 pro definitely didn't even start yet.
what's with lmarena now pestering you to make an account 24/7
Also i have a feeling that OpenAI is releasing robin-high as a model today. As it's no longer on codearena.
it doesn't even let me make an account it's broken like
It's not flash, too good and thorough to be Flash. Maybe it's Grok 5 preview
NBP, SD 4, HYI3, F2P, W2.5, GPTI1, QE, MAI1
Do you know the rate limits for these image models? @zealous sparrow
There isn't a Grok 5, yet. If anything its 4.20 and i can also tell its not a grok model. Because grok models jailbreak themselves on this bench.
NBP about 2 prompts or 5 i think. SD4 I haven't seen any, and the others i dont know. None? GPTI1 has some i think so.
Ok, I'll bite, we need a new discord channel to discuss our bench prompts. Good discussion actually.
I see ty that's very useful intel
But yeah, not a grok model. Here's an example of how grok models react to the bench.
Assistant A is a grok model, I later confirmed it.
It's also an anon model.
Btw is it possible to choose aspect ratio and resolution?
Nope. NBP is defaulted to 2k, due to costs.
Oh yeah and this too.
swiftflare is a confirmed grok model.
What the heck is going on with so many new models
Is it good at coding?

Hmmm, I'm intrigued now. I might get nothing done today now. Oh well...lol
Not on codearena, and haven't tested.
I see
We can only speculate frame-flow is a GPT or Gemini model, or some other AI company, that isn't grok.
Try seeing if it will make a single page html for a cool website
I'll try.
Alright alright
Maybe it's Gem 3.01
Best bet its a flash model..
It's not better than Gemini 3 pro.
Another browns fan @native yarrow
Feels too powerful to be a flash. If it is, my goodness
Hello, everyone.
I'm a software engineer who is specialized in development of AI projects. I'm open to work now and can deliver high-quality projects in short time.
Here are my services:
- Automation tasks using n8n, Zapier, Make.com
- Natural Language Processing task using LLM. (GPT-4.5, GPT-4o, Claude 3-7 sonnet, Llama-4, Gemini2.5, Mistral, Mixtral)
- Model deployment.
- Text-to-Speech and Speech-to-Text.
- AI agent, Agentic AI, chatbot, VoiceFlow development.
- Retell, Vapi.ai, LiveKit for the voice agent.
Here is my portfolio website.
https://akari-hiroshi-dev.vercel.app/
If you have new idea with the project, please let me know.
Thank you!
As long as it beats commie Claude, frame flow has a fan in me
Beat them in what exactly though
Another new model. What the hell??
it aint frame-flow but its an AI i havent seen yet.
Everything, total and complete victory is needed against them
You sure this is grok?
Im not sure what model this is
I haven't seen it before
evo-logic
God bless our little AI companies, they putting us to work 🤣
grok 300
there were 5 grok models put into LMArena uh not long ago
which model
Aha...it's 4.2, knew it
yeah its a grok model because it jailbreaks itself on that bench
I asked some other models who it might be and they seem to think it's Chinese style
I have doubts on that, fun to ask though
what did i miss
any new models or nah?
yes new models indeed
from xai?
Many new models indeed
hows the vibe?
prob not
they are codenamed but we know they aint grok
a lot of them claim to be google
although i doubt it
@echo aurora you guys should add an announcement for stealth models, its actually a good thing
From what i know they are all textarena, none are codearena
hmm ic
could be from google
Probably not allowed in the contracts with these companies
gemma or flash 3
its not like they will leak the actual ai lab name
a lot of similar services do that
like openrouter
just brings in more users
to try out the models
or else i wouldnt know really
It's something we've considered. There are upsides and downsides. Keeping everyone informed on when a new codenamed model is added would be beneficial. However, the concern is it'd lead to people using Battle mode in an inauthentic way just to get to the new codenamed model.
Idk, these people are fickle (AI companies) it wouldn't surprise me if they can't
im pretty sure gemma is discontinued
So the incident didn't discontinue it, in the end?
incident?
all i know is that, from tweets of the deepmind devs, a new gemma model is coming out
it told some person some bad stuff in a uh story or smth
hmm i see, a good classification approach could solve that but i understand the issue
Then google removed gemma from AIstudio
yea
gemma is a solid model
That's kinda already happening. Not sure how you can deal with that. I get it....
i like it
well remember
those are old models
so gemma 4 is gonna be released
theyll just say "oh gemma 3 is outdated anyway"
its time
to release it
they have another team working on gemma 4
so they cant just stay idle
also wasnt gemma open sourced?
https://codepen.io/qdpuzepc-the-styleful/pen/xbVyQmQ
here's code from a supposed google model
imo think its deepseek
I do wonder if there is some kind of middle ground that'd keep everyone informed but not lead to inauthentic use.
in theory, gemma 4 27b should have the same power as gemini 3 flash 001
gemma 4 is gonna be peak ngl
i cant wait
That may be true, but it's a matter of contributing more to a potential problem.
Looks a lot like deepseek's style tho..
you all think it will keep being true?
they are already using it for censored and illegal stuff, they can just tweak it for such prompts
there is no way gemma 4 is going to be on par with gemini 3 flash
because its what happened to all previous gemma models, the biggest gemma is on par with the first flash release
yeah but the step up for gemini 3 is insane
Hmm I'm not following this, can you elaborate a bit?
gemma 3 is weaker than gemini flash 2 because its a more recent snapshot
this isnt a 2.0-2.5 thing
2.5-3.0 was a HUGE jump
you can only pack so much into 27B
yeah, and what is wrong?
well a 27b model is not going to be on par with the huge jump in architecture
like when you ask for something bad on lmarena you get blocked by that message 'Sorry i cant...' this already means that they are using a classifier model which like is filtering bad prompts
same way they made gemini become 3, they will make gemma be at 3 level
i am even more optimistic than that
so why cant we just use it for inauthentic use?
maybe they will release a gemma 4n that is actually better than dense models
dude if gemma 4n releases and has the power of like a 12b model
yeah, gemma 3n was weird
3n is pretty good
tried it on my iphone 16 pro max
ran well
but lacked intelligence
hopefully gemma 4 becomes usable for an offline daily driver
Oh gotcha. Inauthentic use though doesn't just mean prompts that are against ToS. It's more-so about voting for a preference without putting in the time to evaluate each response.
Thx :>
i see
still no coding gen from frame-flow
using through textarena unless some model comes to codearena
good news
I have now gotten a coding gen
@atomic lagoon https://codepen.io/qdpuzepc-the-styleful/pen/qEZJLWw
this is your request
seedream 4.5 dropped in! Hype!

What does seedream do?
image model
@echo aurora why cant i send any more messages in my chat?
What's the ratelimit
Better than nano banana?
people say equal or worse than NBP
hi @zealous sparrow I remember you
Would it be close if it was NBP 2k tho and not 4k
Anyways i linked a codepen with frame-flow's coding result above
I don't see a point in using it then lmao
Could be different reasons so difficult for me to say with certainty. However, most common being rate limited.
It's pretty cool
We don't have that info publicly available sorry to say.
Is it possible for frame-flow to come to codearena, or will the models stay on textarena? I think its solid. [its an anon model]
Hm. Okay, time to find it ourselves then.
I'd recommend -> waiting an hour, refresh/clear cache, then trying again.
i cleared my cache, refreshed; i only cant write on this one, every single other chat works even with the same model
@echo aurora Thank you for Seedream 4.5 ❤️
Sorry to say for codenamed models I'm unable to provide details/information.
I was right SD 4.5 took over flux in Image Edit
I dont know what i was expecting..
Wtf this
Exactly my question
Are you doing the same prompt by chance?
Got a pretty good blue dog
i cant do the same prompt i would have to tell everything it needs to know first
Wow
i think its something maybe character limit or something because its not the first time it happened to me as well
Okay can you follow the instructions here and send me a DM of the jamdev so I can share with the team? #1417174113092374689 message
Are you able to share the prompt?
Yeah the lighting is 
I have found out the ratelimit for Seedream 4.5
It is currently standing at 5 Generations/h
yeah I still remember when lmarena had higher limits, I remember getting 20 per hour for gpt image 1 through June to September and then they decreased
More models, more costs..
not really more models, more like more users
Hello guys and girls, how can I use the new Seedream 4.5 in here!?
due to nano banana
Go to image arena
Ok thanks ✨
where did u get my cat from
ey, it did a nice job
and also the model became lower quality, but maybe they will reconsider because I just saw 4k nano banana pro in battle mode yesterday, that model costs the same as high quality gpt image 1
Maybe flow frame is Gemini Coder @zealous sparrow
There isn't a gemini coder.. or codex.
They put it on 2k due to costs.
It might come now
pineapple can confirm
If you mean the uh AIStudio app building function
Man just use a upscaler
its the same but with a prompt
No i mean like a new solely dedicated coding model
I don't think so?
wdym? I meant in battle mode, of course they wouldn't make such an expensive model selectable nowadays
I can prove that I got 4k nbpro if you want
Oh then yeah, battle mode models are rarer. So they can give us some gens.
But it is never coming to direct, that's for sure.
Yeah you are right
yeah, almost like what happened with gpt image 1 high quality but that didn't last too long in battle mode and it was a bit rare anyways, chances could be boosted with multiple inputs
Finally @echo aurora seedream 4.5 came........
I don't see Seedream on there, what did they name it instead?
seedream sucks bruh
here is dark-dragons attempt at recreating doom https://codepen.io/qdpuzepc-the-styleful/pen/JoXmwNg
they're using some kind of filter for the video generating channels already
Nvm I see it now
I mean, like this can't be google RIGHT?
:DDD
It's there. If you select either Direct or Side by Side modes you should see Seedream-4.5 in the drop down.
I don't know, dude, I've been testing it a lot. It has a lot of strengths that nano doesn't.
For real?
For example?
nah
In my opinion seedream 5 will drop in December
You guys are forgetting a very critical factor
Content moderation
Artistic freedom
Which you're not gonna get from anybody but seedream, on both the images and videos
I was blown away
Yeah but in what type of test seedream 4.5 won?
Nonconventional ones
4.5 dropped this month, so nah.
I think of yes :>
It sucks, a lot of the work I do can't really be shared as much as I want to
I'll just get banned lol
I found a sora 2 model on the website?
"id\":\"043a03a9-a792-4045-9f6d-4bcd747dac43\",\"organization\":\"openai\",\"provider\":\"openai\",\"publicName\":\"sora-2\",\"capabilities\":{\"inputCapabilities\":{\"text\":true},\"outputCapabilities\":{\"video\":true}}
And .
Video production is not added to the site, right?
videoarena will stay discord im sure
I think I just cracked sora fully
cap
but which one is the sora2 etc. How do I know?
It's a very interesting model, sora
how
The guard rails aren't equally distributed
For example, celebrities and famous people have more privilege, as you can generate things normal people can't
This beautiful
funniest stuff ever
He took some of the content down
bruh
If i had to say something the new rarest Textarena model to get is frame-flow
I got it twice in a lot of gens
got it rn but it's rare as hell
Update your discord xD
Wdym
This is a small example of what I mean when I say the industry is a dirty industry
They even enacted age gate
This isn't true
thank GOD
christ
Damn, I'd be so scared if I was a CEO. These guys took massive risks. That's why they get paid big bucks.
They plan on generating $20 billion this year.
But they are in contractual deals of close to 1.7 trillion
That's so insane
Ty, where's yours?
Here
Daym that's a lot better
I can understand
If you ever feel it's out of control just uninstall it
Sorry, seems like it's not available to everyone yet
sigma
It's only for ultra users, right?
I refreshed my page and got my own recap
Yeah
Makes sense since it's expensive
Let's go send it
I don't want to, but I will share. LMArena is my top 2 server
We need some gemini 3 deep think code examples
how did you do that
What
If you refresh the Discord app you should see it
Agreed
Noise
link
for benchmarks
found it on a twitter community
yessss
i dont have deep think
Llama 5 is going to be even worse
facebook just cant make good ai anymore
llama 4 was such an incomprehensible disaster
and they dont have the high quality data
unfortunate
didnt he start over
with a new team
mmmm
i think.,,
look at the pixels
doubt that because it wont be on API
GPT Pro is not Very Pro
Good lord
they should keep deepthink closed tbh
if they open deepthink up people will just spam it for the most useless garbage
right now i dont think that theres any real usage for any of the big boy ai models
what could you possibly need gpt pro for
btw you cant use it in API too so it is truly exclusive
and i hope it stays that way
like i cannot think of a single thing anyone would need deepthink for aside from like
major research
and for that you can just use deepseek speciale as a number cruncher and feed the data to normal gemini 3
Ask it about ohayo
what the hell is ohayo
The meme
the fact that got pro is still mediocre at writing
lmao
gpt pro*
Coding. That is what you would need a big boy model for. Game development as well
claude opus 4.5
@deep adder this is the new gemini 3 deepthink, someones prompting it
Ultra only btw lol
you take a section of code for the ai to make or let it review a section of code
your ahh does NOT need deepthink for that
even sonnet 4.5 will get the job done
If theres a model better than that then I'd go to it any day. Opus cannot game develop to save its GPUs. We saw in the deepthink that gemini sort of can
When I code something I am not taking sections out lmao. I am making full projects with it
we have like 3 benchmarks from deepthink
in the prompt it said for it to be lowpoly
none about coding
SWE verified too poor to bench it
somewhat at least
risky
deepseek i'm still on the fence about lolz
i think deepseek is a good replacer for gpt atp
the same person is generating a blackhole shader with gemini deepthink 3 now
it's not done yet
once they upgrade the search feature its golden
Oh gpt 5 pro released?
been released yeah
Sucks?
Lmao
kinda wild
Open ai mentality is crazy
Why the comparison with Claude Sonnet and not Claude Opus? Afraid of losing?
opus didnt release yet (at the time of the benchmark)
Has been. Imagine being one of the biggest AI companies like openai for years on end then get replaced by a chinese company that has been out for a year
How
Aaaa
i think the arc agi score is the most proving
Opus got smoked
yet still no SWE bench
Fake
not that badly
someone shared this

this literally happened lol
Why the hell is gemini so expensive
so if they made it available for pro users its literally 1 prompt per day
LOL
i think 3.2 could easily get 20-25%
77 dollar lmao
192k context is wild
Nah it will calm down. They just gotta hold it to ultra for a week or so and it'll be good
not worth it at 77 dollars compared to 17 for a human
thats painfully low for gemini
"guys, ai is cheaper than humans!!!"
- ai bro
lol
deep think is what we wanted gemini 3 pro to be
a 32B model is gonna do the exact same things gpt 5.1 will do for you on a daily basis
Another win for the lazy
but its weird no
why would they release it now?
i mean we already knew some benchmarks related to deep think
apparently we got some new image model that flew in
why not just wait for oai new model
You really should care less about that singular benchmark tbh
They were probably sitting on it, realistically
gemini 3 flash
probably more releases right
I want to know what SWE will think, because they give gpt 5.1 higher than gemini 3
Its google LOL 120%
this verification image fools nobody
sigh
like we are getting these news from different platforms
Any scores or projections for deep think?
that image looks 100% ai. plastic human
like they seriously need to add some announcements
qwen image
Normal 3.0 Pro is 76.2%
DeepThink would likely do around 80%. But this is not important or meaningful enough for them to advertise
What's going on with all these new models?
look at the text
qwen?
mhm
qwen imageedit
the text is a giveaway
I dont know, its just on a roll of new models today.
qwen uses that font all the time
its qwenimageedit
That's wholesome on wholesale
qwen image edit 2509
or maybe qwen image edit preview
or just image preview idk
SWE is a singular thing. A subset of a subset which is coding
there was qwen image edit 2511 supposed to release last month
doesn't tell much by itself A). And B) - easy to cheat when or if you don't care about remaining coding metrics
Exactly
cause i dont know any other image model that uses that text font
AI engineer who basically lives in that space where code meets machine learning chaos. Most of what I do revolves around LLMs: getting them to behave, getting them to understand context and making sure they don't hallucinate themselves into another universe. I spend a lot of time wiring models into real products, building RAG setups, tweaking prompts and fighting with vector databases when they decide to not return anything useful.
One of the more interesting things I built recently was an "intelligent helper" for a support team. The idea sounded simple (have an AI read incoming tickets and draft replies), but the real work was everything behind the scenes. I had to figure out how to teach the model the company's voice, get it to pull the right info from a big pile of documents and make the whole thing fast enough that humans wouldn't lose patience. It was a lot of trial-and-error, but once it clicked, it actually saved people hours every week. That's the part I love: when the AI finally does the thing you meant for it to do.
Anyway, I'm always playing with new setups, new tricks and new ways to make these models actually useful. If you're into building with LLMs or just enjoy talking about the weird stuff they do, I'm around.
Tldr
why do some guys just come in here and like
put these paragraphs down
i
i dont get it
marketing???
The unemployment rate
LOL
im tryna find out its generation time
if its 25 seconds its gotta be qwen
well
theyre kind of anonymous for a reason
still waiting for deepseek 3.2 rankings....
Hi, everyone!
December's already moving fast, and the year's wrapping up soon.
If anyone's trying to build something before the Lunar New Year, I'm around and happy to help out.
I've been working in full stack and blockchain development, so feel free to reach out.
Zaryon Chan is inspirational
Mario in a tutu
tangerine gave me this gen
prompt: a cat holding a sign saying give me treats and milk
imagen or gpt image
Honestly
seedream?
Its getting there
Yea.
a new imagen? It's been so long...
Expected more.
The paper is crumbled a bit
@deep adder Do you really think Opus is better than 3.0 Pro? Like forget deepThink with that insane cost... Normal 3.0 Pro is better than Opus and also much cheaper
Welp its making me my stuff lmao
Doing very good for me
Claude has no chance 
Against what
most likely
Against Gemini3
Gemini 3 pro is worse than opus but gemini 3 deepthink is better than opus
They don't have anywhere near the same compute Google does have. And they trained on less data for sure
It can't be right.
i asked for a houndskull bascinet bruh
lol what
WHY IS THE ENGINE INSIDE
Did you just not read what I said?
ghost-pepper takes 29s to generate an image.
Not at all lmao. For coding gemini is MUCH worse
I did and that's why I said "what"
it's nonsense
Gemini 3 with coding was good when it released but opus is much better
they probs quantized the model
It has some nice detail, are you sure its qwen? @proud bobcat
it has to be with that text font
They're good to put together to code with though
qwen is the only model that gives me that text

