#general
1 messages Ā· Page 201 of 1
the bug that you need to restart the page
Hi
Not really an issue
Just restart
Reload*
Can you refresh the page and let me know if it's still like that?
I not able to generate videos?
it's worked
idk why did it caused
Yes, bot isn't working at the moment.
it says i do not hav epermission to messege in video arena , what to do
Need to wait.
ok
@loud cosmos Our video bot isn't working at the moment. Please stop trying to prompt in text channels.
@worldly glacier Our bot isn't working right now. You'll need to wait until I turn the channels back on.
Any prediction for speciale's return?
Nope
Thinking seems to be good anyway for the time being
Their thinking model is superrrr fast
I thought Gemini 3 was quick but this blows it out of the water lmao
I didn't really understand the differences between both, they didn't even graph speciale for coding
So is only 3.2 thinking refined for agents?
I'm not sure honestly, but from what I've seen they're very similar
I tested both for front end but a single prompt was what I managed to gather and I preferred the 3.2 thinking one
I hope they'd add it to the free tier on GitHub copilot
WedDev one?
deepseek will be added to codearena yes
Alirhgt Alright good
I'm not tryna be impatient or anything lmao its just exciting
I feel like webdev arena limits their creativity and design potential when on react compared to normal arena hmm
wait new deepseek model? how's it keeping up?
Welcome to the big leagues
Freakin awesome
speciale had hallucinations and was taken off
thinking is pretty solid, ofc doesnt outperform the better models
I figured it was taken off because of the time outs leading to failures
hallucinations wouldn't be a reason to take a model down from lmarena
š
Would be perfect for coding ngl
Can we use openrouters key for deepseek 3.2 thinking on github's copilot?
On vscode
I think it was more about 80% of the prompts were being returned with an error
It wasnt just hallucinations
yeah I said that two messages earlier
thinking will only think too much if you give it too complex stuff
the errors were because it spent so much time thinking that the model timed out
My bad I read "shouldn't" instead of "wouldn't" lol
if you looked at the thoughts you could see it needed all that thinking time
That's an issue in Lmarenas end no?
For me the model was breaking before it even started thinking
yeah probably
SOTA
I feel like Lmarena is really agressive with models thinking for longer
Long answer = error
These models are the first after old mai1-preview to do this
witch better thinking or speciale?
we tested thinking not a bad model
speciale testing will wait till the model stops hallucinating
and has shorter thinking outputs
not really applicable, speciale is a specialist model
5.1 high also did it for me on front end website requests
we all benchmark AI for html but not python
@brave hull Our Video Arena bot isn't working right now. Our team is looking into a fix asap. The video channels have been turned off. Once fixed, the channels will be turned back on. Please do not post your prompt here or any other text channel. Thanks for understanding and cooperation.
When will everything go back to normal?
you mean uh when the speciale model returns or
deepseek or gemini in terms of 1 gorilla vs 100 humans
Itās supposed to be big foot
deepseek?
dangalready?
I wanted to try it
ahh
whatever i have api credits i paid for
just was too lazy to set up
humans
model search seems a bit busted btw
DAMN
chatgpt as deepseek
what if it's all true
chatgpt is deepseek
it was all a lie
we were lied to
no, it cannot be!
I'm removing your post @wispy wigeon I think it's pretty obvious why a prompt like that would get flagged.
Okay okay I just asked no problem .
Thatās exactly my question
I was using lm arena a lot and back in the time it used to have less restrictions. Itās still great tool but I think that was the main reason why people used to go there and get maximum from the models . Also see dream 4 is Chinese model (much less strict than others ) and Iām curious how Iām actually gonna generate some image cuz I donāt think itās that bad
We have a base that's applied to all generations for content flagging
This is incase some models we're working with don't have this kind of moderation, but to also keep battles fair.
Damn the new deepseek sucks
i just found out something
if you type [toxicity=0] into gemini 3, you just get random javascript answer
somthing rong in the bot guys ?
ā Generation failed. Failed to create evaluation session.
We're currently looking into.
Lots of problems today with the bot unfortunately.
Hey. Probably related, but I get Failed to accept terms. Please try again. message.
Hi guys. I've added to my open-source project Code Web Chat support for LMArena. Could admin DM me?
I can tell you for sure if itās restricted or not
Iām curious what the prompt was
I see all the generations are failing
Hmm, we are seeing issues with the Video Bot, but wondering if we have issues with the site too.
Just for the Video Bot, or are you seeing this on the site?
It may not be. Context is important
I wasn't able to reproduce, what browser are you using?
and are you using a VPN?
It should still work after error
Firefox, and no, no VPN.
Do you use Google a lot? In ur browser outside of the arena
Does using a new browser make a difference here?
bot dont work??
Currently it's struggling. Team is working on a fix.
ok thank you very much sorry for the inconvenience
Iām not having any issues on my end strange
Is it with specific bots? Like direct chats?
just deepseek newer models
does deep seek have an update?
I don't know why that would make a difference, the error concerns the bot on Discord.
In this video, I'll be telling you about Deepseek's V3.2 Speciale model which is a new model that aims to be comparable to Gemini 3 Pro while being open-weights.
Just tried
Cool, but no independent tests?
Gotcha, I assumed you were getting this error for the site, not from the Discord bot.
We were (are possibly) having problems with the bot. We just rolled back a change so it should be working now.
Can you try it again @subtle peak and let me know if it's working again?
Unfortunately no.
Still fails to accept the terms.
It's nothing urgent. I'll try tomorrow. Hopefully the bot gets better. 
Thank you for letting us know. I'm sorry this wasn't fixed for you yet @subtle peak
It's all cool. Thanks for the help!
It sucks
Hi, it it possible to somehow add more than one image when using āimage to videoā bot? (I mean multiple images together with one prompt, in one generation)
Sorry to say you're only able to upload 1 image for image-to-video generations.
Thanks for quick response!
Someone did an independent reasoning benchmark and 3.2 speciale ranked highest
It thinks itās Claude lol
so image to video is down
Yes, unfortunately the Video Arena bot is struggling right now. The team is working on a fix asap.
Werid how this still works lol
And it starts thinking itās made by anthropic
(itās nothing really harmful it just mimicking)
deepseek v3.2 good?
This is because itās hosted from deepseek api directly
Other providers are gonna provide it for roleplay on openrouter
I didn't understand anything, but over time I was able to read 50% of the words
When steady service is guaranteed DeepSeek 3.2 will be my main
No. š¢
It is
I tried it for a few prompts
Does very well compared to exp
Definitely more performance in coding and shorter thinking
I'll test it again (not special mode more lol)
Oh yeah no speciale is hallucinating rn apparently
DeepSeek team is working on a fix
Dude me reading english even if fluent (yeah š writing is horrible) in reading spends more energy than my native, imagine me reading something like that? Am I doing cardio?
This work in opus 4.5?
Idk I donāt usually mess with models like that
I just do images mainly
Can somebody explain to me the significance of this?
And the two fundamental ways to view the information that is presented in front of us?
Iāll give one more example to see if somebody understands whatās going on here
Which is more accurate? And which is more honest? What part of the output can be considered hallucinations?
Because like always with deepseek the wave of speculation surrounding censorship, regardless of the achievements or accomplishments it is shrouded by this deep mysterious question that Iām presenting in front of all of you.
@echo aurora when the prompt fails. Is it because it thought for too long or is it a bug of lmarena?
They reason for like 2-3 minutes then fail
If its a bug of lmarena i wont rate it as its unfair let me know
Are you seeing a Something went wrong while generating bug?
Iāll test this
For those running into a Something went wrong while generating bug this is the best way to flag this - #1417174113092374689 message cc @proud bobcat
Is LM Arena down again?
special is gone
Hello, are there still issues with videos not being generated?
Lets give it one more test to see if it caps out at 8k
It definitely starts bugging after a certain point
hello
I get them a ton, especially on thinking and reasoning models
yeah, what happened
Itās undergoing fixes
Itās very solid
Not a Claude replacer but itās noticeably better than its experimental version
So it doesnāt even compete with Gemini 3 or Claude 4.5
Apparently deepseek 3.2 is just good for math
DeepSeek V3.2 Thinking feels easily identifiable since it takes so long to think
It does in SWEBench
And very nice scores in terminal
I just think itās way of coding is different than Claude or Geminiās
I let it code a few concept apps and it did quite well
Itās errors were easily fixable and quite minor
I think it can excel as a code review LLM
I could see DeepSeek + Claude being a great combo
Claude to cook up the code and DeepSeek to bugfix
Is it the same as speciale?
Like the new one
Does anyone know a good speech to text that works on Brave browser for free ( because I'm on Chromebook and Chromebook speaks in text for some reason won't even turn on on that browser)
Speciale is a high compute version
Basically think R1 or 3.2 thinking but on loads of crack and efficient architecture
Will deepseek private their next gen
Theyāre probably working on V4 for a while. This is to show off how much you can do with the same dataset
Extremely impressive imo
Probably not no
U donāt think so?
No
why cant i see the special one anymore?
DeepSeekās main thing is open source
They contribute hugely to the field
No reason to private it
Itās being fixed
Had some issues on first launch
oh,okayy
No I doubt they care
How so?
DeepSeek has been quite committed to open source
They recently released an absolutely amazing math model that can actually reason in math correctly
Hello everyone
Well the people who work there probably care
But not the company
The company cares just as much
Otherwise they would probably have went private long ago with V3
i dont think if deepseek really cares anymore
they are struggling to make a decent model but also they are benchmaxxing
they wasted so much time fixing huawei hardware issues ( ascend chip )
its not a secret anymore that most new models get better at math and code reasoning
because CoT favours the step by step reasoning and also because they have much more data for math proof reasoning
deepseek sucks lol
the issue is generalizing that type of data across different domains
it got worse unfortunately
Hedge fund?
Theyāre notoriously selfless
Is that qwen
imo the best non reasoning model after opus 4.5
No
qwen is so so
Kimi K2
alibaba are rushing it with qwen tbh
Is it open source
Qwen3 max is quite good I just donāt like its personality
I heard qwen was no longer gonna be open source
Qwen3 is mostly but their max model is proprietary
the issue with chinese models is that they lack good quality data
Openai lock in
its no secret that they are training on big models to distill but still
deepseek v4 will be trained like 100% on gemini 3 pro
the knowledge gap is just too big
How will you just train a model on a private model
on its output
Seems stupid
Rate limit?
they all do that
Output but you need the knowledge it uses for that output
api + multiple accounts
I mean a lot of models do think theyāre other models
they are paying for it, could be 3rd party
they will extract that knowledge from the questions they ask
Apparently I saw OpenAI was pre training a model rn
And also apparently they have the best coding model some say
I mean
Eh
Fair
Yeah
Not at all
Codex is worse at coding than normal 5.1
Yes on lm arena
i thought they stopped pre training
Iāve gotten it a bunch
Are we sure thatās openai
pre training is kinda like starting from scratch.. takes a lot of time
all they do now is post training
It says itās OpenAI
on top of old models
And itās atleast better than Gemini 3
wym
codex is unusable
wym...
its so slow
its not practical at all
when did they add ads
OpenAI the only brand with some sort of loyalty
Not always reliable
And even that is low
no one uses chatgpt in browser
ChatGPT blows
yea no one use their browser
Theyāre running out of options
Nobody cares about worse stuff
Their models suck
Shallotpeat
Claude and Gemini lead in most use
although i hate perplexity and their ceo but i think comet > chatgpt browser
Apparently full pre training
when you look at the big picture they are all leading in different areas ngl ... oai + anthropic + google
Most of their customers are non paying
More people pay proportionally for Gemini and Claude
Also OpenAIās reasoning is said to be the best
So if they can make a new good pre training
It will be way better
maybe
but ive seen some news related to that
just not sure
Cause the base model was trained like 1.5 years ago I hear
the base model is so old
Same since like 4o I think I heard
maybe older than that
They donāt make revenue though
not sure
they do make revenues... when gemini 3 was announced google stocks increased by like billions in a matter of seconds
depends on what they saw tbh
but all the improvements nowadays comes from post training
its not just useless
its more like time consuming
so is it worth it?
'we have models internally that performs the same as gemini 3'
'we felt confident'
I call bs
Theyāre a private company so thereās slightly less incentive
Also its ai
OpenAI is known for hyping up models
The moment other AIās truly catch up in reliability and brand name itās joever for OpenAI
No
one of their models was already tested in lmarena just days ago
called robin
its an updated codex version
I literally just mentioned it
But yeah apparently better than Gemini 3
I got better results
does some silly mistakes
But front end design kinda trash
yea frontend is bad
I think #2 ignoring all the versions of opus
It was giving me way more long results
needs more testing
Like if I asked for some app
yea
Gemini would give me 600 lines
its thinking for much more time
Robin high would give like 2k
i mentioned that before
i was asking if its just codex + more thinking time
like not an improved version
but more like they just gave it more time to think
that large of an improvement just from more thinking time?
they have this internal parameter called juice
for this robin model it was like 512
juice = 512
which is like a higher thinking budget
He doesnāt sound all that confident in that interview tbh
Also some people think companies have way better models and others think they release their best (some time buffer ofc)
So which do u think
if you ask a google staff he would say the same
Which do u think
thats 5.1 codex right
google definitely has better models
but it needs to balance cost with performance
since its providing models for free
Like thereās deepthink ofc but do u think they have like actual product level models
and im talking about frontier models
nah
i think ds are like still struggling to find the recipe they want to continue on
Well yeah OpenAI had a general model which scored gold like 4 months ago
Which is much better than even like Gemini 3
its the opposite
gemini was the general model
oai had more like math proof model
just spits some math gibberish but gets the result correctly
They claimed it wasnāt specialized
Robin high or Robin
Hope you all are doing good
nah im right
Regarding the "opposite" takeāyour intuition is spot on for the key differences in their approaches. Both are built on general-purpose models (OpenAI's experimental system evolving from o1, and Google's advanced Gemini with "Deep Think" enhancements), but OpenAI's outputs for the proofs tend to be longer, more verbose, and less elegantly structuredāoften described as "rambling" or filled with exploratory steps that resemble math gibberish, even if they ultimately arrive at correct solutions. Google's Deep Think, on the other hand, produces cleaner, more concise, and formally structured proofs, building on their prior specialized systems like AlphaProof but integrated into a general model.
google model was more read to use
its grok search
I doubt it was a general model
it was
Making elegant proofs
nah it was...
i think they shared outputs somewhere
We achieved this yearās result using an advanced version of Gemini Deep Think ā an enhanced reasoning mode for complex problems
An enhanced reasoning mode for complex problems
1 output is gemini
2nd oai
It sounds specialized
We also provided Gemini with access to a curated corpus of high-quality solutions to mathematics problems, and added some general hints and tips on how to approach IMO problems to its instructions.
but you can actually read it and make sense of it
Wow
Doesnāt sound too general
oai caught doing the same š
Will Z Image turbo be added to LMArena?
there was a debate about that on x
Pretty sure the Imo solutions are way shorter
like these model arent as intelligent as you think really
they are a good pattern matching models
They know a lot though
you just have to find the perfect recipe to get the pattern matching to higher accuracy
like what oai are doing
they are good at that
Was it with tools
with the current architecture
they will never innovate
like never
with tool calls they can somehow give you hypothesis to test like what google is doing with co-scientist
so many agents layered together
Is OpenAI as bad as people say
idk what you mean by bad
but i wouldnt trust any company
or any closed sourced company
sam is more on the evil side lol
i dont trust him
U got an American flag and the crazy senator lady in ur bio
Majorie Taylor green
A real expert in the field
Worse than Elon?
they are the same... one is diplomatic and one is straightforward
i saw that
Bro just lies
we have enough small models already š
About everything
And is greedy asf
never liked elon tbh
Is it weird i purposely wait for a while every time the verification checkup starts just so it resets and not make me do it ( I do it because yeah just know I'm not dealing with the hell and inconsistency of it with the images)
i always thought he had some mental condition
try different browser
Heās the only person who still says Aspergerās
whats the deal about this model? is it uncensored or what
Really good fast image model that is nearly on par with frontier models
And itās easy as hell to run
Only 6B
elon probably destroyed his cognitive abilities with excessive drug abuse
he just got dumber
Yeah I think cause itās gotta be like sorted
Well I donāt think ever a genius but smart
No that issue has been there all for different browsers the problem is the images themselves are either inconsistent or for the ones that disappear after you select them very slow to appear again wasting time until it becomes useless or r the fact it becomes hypersensive activating in every other prompt or the worst case forcing me to do that inconsistent verifying for eachprompt which is unfortunately often yeah so I don't think I need to tell you with it inconsistent prompt and hypersensitive one it becomes a nightmare so I just gave up on it waiting for the reset
smarter than now atleast
gemini 3 pro is not imo level?
Bro just tweets about immigrants all day
and worse
Itās so sad that he screwed grok 4.1 personality
Like he actually cares about protecting people
he used to care about more important things
( i have tried on Chrome which I can't even sign in in anymore on there i have tried brave which is a whole nother issue because of speech and text says not work on there and etc)
his brain is puuf, gone
Helping people? Maybe notā¦
Saying weāre gonna build a colony on mars
longer context = more compute/training = massive gpu hours for little gain for a mid size model
like i told you
Isnāt exactly that
generally not being as much of an ego maniac like he is now
alibaba are working on 10000000000 product
Retweeting he just wants children to be safe again
Maybe he couldnāt end world hunger forever but
He could certainly save many children
But man the architecture is awesome
Grok 5 is agi he says
@deep adder also 32k fits just perfectly with 20vram if it was native 128k you woulkd need an extra 10-20gb of memory just for coding
if youreally care about running it locally
yea
bruh
ive seen it on way less gpus
5090 to play r o blox
Grok 5 will either be absolute peak or benchmaxxed
They do awesome jobs on efficiency
its not about being small
Grok 4 fast is just goated
They hire Chinese people at all the us companies
like culture wise which is the base of a company is messed up
from the roots
you need like radical changes
Plenty of Chinese phds in the us from like tsinghua
Just to get last place
why would you do that lol
Saw that post
Of the guy
In the Tesla
Well itās never really a flex to work 36 hours
Itās kinda just sad
they do with 4.1
Bro Elon flipped multiple times
He literallly called him a pr ed
Base salary as an ai researcher?
@deep adder there was this research shared the other day, idk if its in this channel or the other one about compression tokens/text like in .zip files and conserving like 90% accuracy
but its so so dumb
like soooooooooo dumb
Like are these ai researchers that got paid 100m really that much better than 100 1m researchers
its insanely huge at the big labs
the highest salaries on the planet potentially
Like would they hire Terrence Tao even though heās not in AI
the lowest is probably a million in a year
i cant say the same
U donāt think he could get hired just cause his math talent and intelligence?
but most are much higher
Also heās probably done some cs and ai
Millionaire =/ salary of a million
gemini is INSANE at lua coding oh my
many key oai workers were poached by different AI labs, and many AI labs have already caught up
like dont bother
couldnt find the article
but it was stupid
so many silly mistakes
salary is something entirely different though
R o bloxian
in order to have research agents that can run for days, we need context compaction
i used RL to have LLMs naturally learn their own 10x compression! Qwen learned to pack more info per token (ie use Mandarin tokens, prune text)
read the technical blog: https://t.co/3pzRvt4zvr
this one
i used RL to have LLMs naturally learn their own 10x compression! Qwen learned to pack more info per token (ie use Mandarin tokens, prune text)
Yes Iām sure the average phd is making well over a millionā¦
just one simple prompt "particle accelerator" and it made a complex structure that works
it looks cool but its stupid
lol
like the model using other langs to pack more info?
seems like a cool idea
Just drop the brainrot game
but its more like reward hacking
on it :P
yea i saw that its 6B
but dunno about the quality
i guess for its size its good ?
the idea is fine
but the execution is just not it
constraining the model from the get go to 90% accuracy instead of dynamic compression
30% -> 40% -> 50% ...
it could've learned way more cool stuff
Terrence Tao couldnāt even get $100k a year š£ļø
yea
now that you mention it
there are some mathematicians working at xai with that salary
you guys remember that troll xai guy
i dont remember his name but he was always on my feed page
No
the guy with the hat
omg he was so annoying
i need to know his name
found him
I am a mathematician at xAI. Previously I was a researcher at Microsoft Research.
thats my point
he was trolling non stop
i had to block him
he was the one that started 'i dont sleep, i work 24/7 at xai office'
and 'make grok great again'
Yeah thereās like alot of math involved
Itās like flux 2 schnell before flux 2 schnell
Ig
Mechahitler
i call this one of the worst releases pr ever
why would you post it if you are not giving access to anyone
its so stupid
they just killed the hype with that
runway 4.5 seems on par with veo 3 but their release strategy is so bad
@echo aurora
if sora falls so will all other ai vid gen
Veo 4 soon
bro
@echo aurora I tried to open a new conversation window, but I can't open it anymore.it has no answer. Is there a limit now?
who cares about tiktok
Apparently people are testing it
its just because it was uncensored (kinda)
Recent update
never generated an ai vid gen in my life
Helps fix that texture thing
Theyāre not very good
im not a tiktok creator or trying to earn money from that
if its for fun maybe
but i agree
sora 2 generated way more viral videos
gen 4.5 is 2 times better than veo 3?
it seems solid
veo 3 is half
we still need more demos
lol no
look at the numbers
the graph is kinda misleading
xddddddd
yea
why no audio
What kinda lies is this
demos
bottom page
The account seems legit
How is 3.0 higher than 3.1
i am hearing no sound
cuz that ranking is fraud
sora 2 pro way better
No audio is crazy
yeah
What is "artificial analysis text to video" means
lol this should be right
It has same point
Some benchmark
its with audio
Is it actually better
The model excels at understanding and executing complex, sequenced instructions. You can specify detailed camera choreography, intricate scene compositions, precise timing of events and subtle atmospheric changes all within a single prompt.
We remain committed to making highly.
Omg I'm so oof asking such a dumb question
Does the chart include audio though
i think its below veo 3 and sora 2
show proof, beside random sound effect
cuz its not native audio
im so lazy
we dont need fake audio inserted
to search that up
but i heard it with audio and one of their dev confirmed that in some tweet
its not the same thing
we will see ig
yes
But it isnāt specified to have no audio
What is this
we will see if its true if audio is native or not
Doesnāt know his own product
lol
It 100% does not have audio
but in none of the demo page, it had no audio
Atleast the current release
It would mention if it did
ye 100%
Especially since the previous didnāt
the main demo where they were narrating they did in some parts
but i guess its not that good
Where did he comment that
thats why they didnt add demos with audio
you would think that in the main page of 4.5, they would have audio included with video. but it doesnt have that
Where were comments
twitter/x
Holy prompt
Prompt:
A young woman with straight blonde hair and a freckled complexion sits quietly, looking up through her lashes with a vulnerable expression
We were working late last night to get everything ready for the announcement, a common Runway tradition by now. I saw early versions of this sizzle but never got to see the final cut until today. What a treat.
lol
even the ceo doesnt get to see everything
Isnāt this supposed to be like
Ultra ultra good
more demos
it looks like veo 3.4
Fraud
yes
oh it has audio
No
then why they putting music over it so loud
let me hear the audio clearly
it looks like its video to video, not text to video
how long does a chat rate limit usually ask?
It looks terrible and has no audio https://x.com/tlakomy/status/1995664601170534557?s=46&t=iDcf2nE8xUHsV_LN_6wymg
been waiting a while but i keep getting Something went wrong with this response, please try again.
its cherrypick data
this happens on pretty much every model after enough chatting
is it like cookie clicker
do i have to retry 1000 times to unlock it working again?
or do i have to wait
ill post results and see if it actually works in a couple of hours
Definitely not native
thats not native audio
they prob using other ai to add audio sound effect
gemini
yes!
uayayayayyayaa
I want deepseek speciale back šš
Man
Letās talk about real censorship Google is implementing
And I donāt mean no gore and I donāt mean no NSFW
Iām talking about historic individuals and language use of time periods and eras
Itās completely wrong & still fails to censor real harmful ai output as a consequence everyone has to suffer because of their mistrust of the users
For all the progress AI makes it takes two steps back in the wrong direction
SUPPORT OUR SERIES OF DEEP-DIVES INTO CORPORATE SURVEILLANCE & THE AI DYSTOPIA: https://store.gamersnexus.net/ai-dystopia
In this video, we walk through "AI Summaries" on Google and YouTube. Although these may currently be an attempt at some kind of actual value proposition (mostly for shareholders, of course, because evidently no one else matte...
Prime example
GamerNexus gives me hope for consumer advocacy
So does lm arena lol š
You know it kinda blows me away to that as long as Iāve been here none of you guys or anyone in that matter has ever complained about AI hedging
hm hi
An important feature of academic writing is the concept of cautious language, often called āhedgingā or āvague languageā. In other words, it is necessary to make decisions about your stance on a particular subject, or the strength of the claims you are making.
For lots of information on hedging go to our webpage:
https://www.academic-en...
where is speciale
Kling 01 model is nice
can anyone help me please
Sure wats the issue is
I m trying to generate image on lmarena image section using Google gemini 3 pro
it just fail all time and also estenguish my credit
and don't even give and out put
what to do to get an output
I have tried every way
So youāre saying every time you try to use Gemini pro image model directly that garrulous of your prompting or anything you do no image gets generated?
it's says something went wrong with the responce
and fails
and that's not happening it generates infinitly
that's hour and still don't give anything
Well, first of all there is a rate limit. I donāt know how many number of images from what I understand itās possibly for maybe a little more?
ik
It takes a little longer to generate too, so you have to kind of be patient with it. If it doesnāt work, try to start a new chat.
but after 50 mins it gets over
Yeah, I know what you mean
I cant even generate a single image
Try clearing your browser
I tried new chats
Let me test it out right now one
should I give you my prompt and image ?
come dm
Sure
Just if you donāt mind, just share it here unless itās something personal
Ok
Are you using a VPN by any chance?
Iāll teach you a better way where you donāt need Gemini pro for what youāre trying to do
whats kling o1
nope
pure wifi
idk what's happening
U canāt make ur img in lm arena using Gemini pro for 16:9
The output is 1:1 I think Iām not sure. Let me check one second.
Nvm Iām wrong
nah
the 1st image I gave is generated by gemini also
it's 16:9 out put
but maybe my prompt is not correct
I think I'll use blender instead
Is LMArena still implementing rate limits??
It's so annoying
One second it keeps messing up
I gotta ask pineapple
Iām not using Gemini pro
@echo aurora Remove all rate limits and make LMArena truly unlimited
But I do have Gemini pro so I can do it for you officially one second
Lm keeps flagging
Damn wtf
Ok got it
what's better then gemini pro
Let me remove that watermark
I donāt know I donāt like nano sometimes it makes a composition look weird
Like the height
Looks very off to me
hm now let me generate it
Ya try ur image
New Kling model
I donāt get why it keeps flagging your image, bro sometimes
Isnāt this from like Minecraft or something?
Rate limits for text arena???
is it video editing model or text-to-video
idk know why
Same
Rate limits for text generation????
I donāt know dude the composition is just weird
I think my image is open sourced that's the problem lemme fix that
wait
gehlo come dm
I think itās that fake pixelated blood
Odd
Yeah, Iām not really good with composition either
Anyone encountered endless āsomething went wrongā in Battle?
Ya
Like there should not be rate limit in Battle, and I have not triggered a single verification. But the endless error message seems like a hidden rate limit control.
Why shouldnāt there be regular limits?
Number of limits in direct chat????
It itās design is not for infinite generation lol
I mean, how long and how much time and how much prompts do you really need to really test the models after a certain point you kind of have an idea what the model could do? lol
Number of limits in direct chat for text????
I hear what youāre saying and I feel your frustration so I donāt want you to think Iām
You certain everyone using the platform is for the sake of ātesting model for the goodness of the platformsā lmfao
No, thatās why I said I get it
Direct chat for text limit numbers pls
Iām just trying to see both sides of the situation so I can understand better the dynamics that play
Two opposing forces trying to achieve two separate objectives š¤£
If platform expected everyone is so benevolently selfless, the platform wouldnāt get that much information for learning and improving. So limiting Battle done no good to both sides.
I hear ya
Number of chat a day for text rate limits???
It just puts lm arena in kind of a weird awkward situation
Both on the expectation side, and on the other hand, fulfilling their obligations whatever they may be
I think it varies from models to models, and refresh every hour at least for image gen
It's just so annoying and it needs to stop
lol this is exactly the mentality Iām talking about
Which is unfortunate cause I understand and Iām part of that mentality also
There needs to be a subsidized variant of something like lm arena
Now the error message in Battle is quite ridiculous. I used my phone and turned off WiFi, using cellular network. Boom, generating smoothly.
Really Arena? Doing that kind of funny ip lock trickš¤£
Could be a browser could be something on their end. Itās hard to say a lot of people seemed to encounter this.
Thatāll be crazy if thatās what was going on
Ikr
But it also means if that is the case then thatās a lot of generations
Number of limits for deepseek 3.2 thinking in LMArena??
I mean I havenāt triggered a single ReCAPTCHA. Blame the system lol
Well, that has nothing to do with them. Thatās more of a google
I hardly hit any either
But I use a lot of Google services
Help
I mean shadow ban is just lame. Couldāve just told me like cloudfare.
We donāt know. Ask mod or wait.