#general


ocean vortex
#

So in theory you could be not saving any videos at all

verbal nimbus
#

Because compute is expensive, not saving the outputs would forfeit lots of $$$

trail creek
#

Because it's the most used, and that's how they will hook anonymous users into logging in

#

maybe?

verbal nimbus
#

Raw voting data isn't that useful for future research, they probably need the outputs too

#

Like for RLHF (idk how it works for images, but you need the output when doing RLHF on text data)

verbal nimbus
#

I found it funny that MJ images used to be just Discord links

#

Essentially using Discord as a database, lol

ocean vortex
verbal nimbus
azure vessel
ocean vortex
#

yeah that was it. I had the access before but then did some changes and disabled it by mistake

#

So they are saving it on Tencent/Discord servers catgrin

#

File size is incredibly small for what these are though. Only around 1-2MB

verbal nimbus
#

Short videos

ocean vortex
#

Some take less space than an image 🤯

verbal nimbus
#

Maybe that's why the limits are only on image gen 🤣

#

I was confused about why ppl were hitting limits because I only use text mode

steady rover
#

how long does it take to gen a vid cause i have been waiting an hr now

#

k thank u

ripe mountain
#
poll_question_text

sota - mini ai

victor_answer_votes

5

total_votes

12

victor_answer_id

1

victor_answer_text

GPT-5 mini (high)

silk cape
#

hello

tame oak
#

Why isn't this model speaking directly?

stray aspen
#

isnt that an image model

tame oak
#

When you select battle, seededit creates a visual, but it's not directly available.

stray aspen
#

theres nothing you can do about it then

#

use battle mode

#

until it becomes available on direct chat

dark elbow
#

Halo

tame oak
solid brook
#

does anyone know where does the fear that lmarena might become a paid service come from?

digital umbra
#

the fact that they're trying to force people to log in

#

that's the textbook pathway to monetization which has been done countless times on the internet (news sites etc)

stray aspen
#

im just trying to help you

stray aspen
heady ember
#

Yeah now they super limited image gen for not logged in users what's the benefit for them if I'm logged in? Tracking me? What's next? Premium?

#

They have huge userbase

#

For a reason it was login free

royal dagger
#

Hello

barren prairie
#

If LMArena is becoming paid I will never use it or pay for it... I will pay for Gemini or any other app that I can use without limits, stupid errors ... and won't restrict me or choose models for me... Who will even pay to test models to improve them ??? I pay for giving you data and feedback ??? Never !

#

And paying for your data to be published on Hugging Face 💀💀

stray aspen
heady ember
#

Okay the data is no problem... They need data to improve the model.. i just don't want them selling MY INTERESTS to other companies.. models are free because they're shared with companies that test them here and we vote which one is better, that's why that's free.. for now

stray aspen
#

if anything it should be the other way around

winter bison
#

Hello

kindred tundra
#

Yo what ai have infinite chat text like chat history

glossy umbra
# barren prairie If LMArena is becoming paid I will never use it ot pay for it... I will pay for ...

The main issue is that the platform has been quiet to its users. Yes, they imposed some restrictions to “prevent the bots”. But as users we feel alienated. Like it’s stepping into becoming paid.

It’s about the principle here. You claim to offer a free service but slowly turn away from that. And also have a rule that says “we can become paid anytime”. What better way than that to leave your users in the dark?

Sure, there were bots. But there surely should have been a better way to stop them. I know the restrictions had to be necessary at the moment. But at least let us know that it won’t be permanent, just until things calm down.

Feels pretty bad. I’m sure there are lots that feel like that, but they aren’t on the discord server to express their outrage.

kindred tundra
#

What ai have infinite chat history like you can chat in 1 chat unlimited time

#

@myfather

#

Like when I txting w gpt high after 4hour he die

gilded wagon
#

hello

balmy mist
kindred tundra
balmy mist
#

or save your convos in files then open new chat and upload file or start project with convo history

kindred tundra
golden ocean
#

or does he mean rate limit

#

probably that because no way u hitting 400k context window with just silly chatting

balmy mist
# kindred tundra How do this fast? Or only cntrl+c all chat?

yeah, and moving forward try to regularly compact chats (by telling the ai that the convo is getting too long and asking for a compacted version you can save and give to another chat), or just copy and paste the whole thing. i would rather copy and paste and be mindful of convo length. once you're hitting 100k context you should def start a new chat. i am making an extension that helps with this situation, i'll let you know once i am finished
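
A rough sketch of that habit in Python, assuming about 4 characters per token (a rule of thumb, not a real tokenizer count) and the 100k figure above as the cutoff. The names `estimate_tokens`, `should_start_new_chat` and `compaction_prompt` are made up for illustration and are not taken from any extension mentioned here.

```python
# Hypothetical helpers: all names and numbers here are illustrative assumptions.
CHARS_PER_TOKEN = 4           # rough rule of thumb for English text, not a real tokenizer
NEW_CHAT_THRESHOLD = 100_000  # the "100k context" cutoff suggested in the chat


def estimate_tokens(text: str) -> int:
    """Very rough token estimate based on character count."""
    return len(text) // CHARS_PER_TOKEN


def should_start_new_chat(messages, threshold=NEW_CHAT_THRESHOLD):
    """Return True once the estimated size of the conversation reaches the threshold."""
    return sum(estimate_tokens(m) for m in messages) >= threshold


def compaction_prompt() -> str:
    """A prompt asking the model for a compact summary to carry into a new chat."""
    return (
        "This conversation is getting too long. Please give me a compacted "
        "version of it that I can save and paste into a new chat."
    )


if __name__ == "__main__":
    # Example: a saved history that has grown past the estimated cutoff.
    history = ["hello there", "a" * 400_000]
    if should_start_new_chat(history):
        print(compaction_prompt())
```

The character-based estimate will be off for code or non-English text, so treat the cutoff as a reminder rather than a hard limit.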

kindred tundra
glossy umbra
#

My theory is that a new discovery needs to be made beyond traditional LLMs

#

It can keep getting better until a point

#

Then you need to change your approach of “making it better”

versed depot
#

IDK

echo aurora
#

Hey everyone - wanted to reiterate that we do appreciate the feedback regarding the rate limit changes. I want to make it clear that this is being shared with the team. We are listening and paying attention. In order to create authentic community driven evaluations, it's necessary the platform remains accessible to everyone.

languid wolf
#

😔

#

But why they added the log in

#

I hate it

glossy umbra
#

we don’t know the size of the cake

brave orbit
#
poll_question_text

Whats The Best AI For Linux Sh code

victor_answer_votes

3

total_votes

7

victor_answer_id

2

victor_answer_text

gpt 5 thinking

alpine coral
#

i haven

#

t encountered it either

#

are there any good / interesting anon models in the arena atm?

#

like text

lunar glade
echo aurora
lunar glade
grizzled garden
#

hello

keen beacon
#

On the societal questions: the most groundbreaking technologies everyone benefits from: the printing press, automobiles, household appliances (fridge, washer), computers, smartphones

However there's almost always a delay where wealthy people get the first versions that don't end up being that good anyway

#

AC/DC, ect ect

echo aurora
fresh timber
#

have any1 of u came across this problem? where u text too much the ai just wont work anymore

loud sky
#

Probably down

fresh timber
#

yeah

#

wonder whether or not they will fix the problem

#

its quite annoying

main stream
#

sonoma sky impressive

vivid gorge
#

Canary site is down a few hours now

stray dock
#

yo what happened to claude

lunar glade
high hound
#

Make sure you're not on a subdomain

echo aurora
#

Hmm okay sounds like the site is having issues overall. Going to start looking into it and will escalate.

lunar glade
echo aurora
#

Yeah I'm not seeing issues on regular site, but I am seeing no models appear on canary

high hound
#

the regular site does not have this issue

lunar glade
#

I use beta domain, I have tons of prompts left in there without plugged into my new email yet

high hound
glossy umbra
potent glacier
#

I’m on the regular site

#

Look into the other ones

potent glacier
echo aurora
#

Yeah, seeing the same and have reported

peak eagle
#

Hello. Marcus here. Just getting into AI videos. I have a few I'd like to animate for my own laughs. Thanks

burnt sinew
#

I'm saying that their past might not reflect their future. If they do then that's great

winged mauve
fickle dew
#

Is the website still down? Just adding a Google login caused such a long outage? Or is it my internet that's the problem?

cedar cave
#

hello , thaneesh here. just wanted to explore the depth of ai vids

proud hazel
echo aurora
potent glacier
#

They have all literally said ‘just getting into Ai videos’

potent glacier
#

I’m not crazy!

#

These people aren’t even humans!

#

@echo aurora Multiple messages like those have been appearing on here

#

It bears looking into because I don’t think they’re even real people

cedar cave
#

nah man , im as real as jesus

high hound
#

i think hes real . just not usual .

fickle dew
high hound
#

cause video generation happens in the video channels, not here.

echo aurora
#

Yeah I think people are coming here to say hey and why they're here, I don't think it's that strange but will keep an eye out regardless.

high hound
#

you can protect against this by making the server require email and phone number

coral bloom
#

Hi

main stream
#

sonoma sky so bad

potent glacier
#

Are other people getting a phone number verification to be able to chat here?

#

The question isn’t for me, btw

elfin flax
#

oh my god i hate web arena i click one wrong thing and my conversation is gone

#

when are they gonna make it so its like normal lmarena

potent glacier
#

Someone is getting this and can’t chat here anymore

#

What’s that all about @echo aurora

azure vessel
potent glacier
#

It wasn’t me asking since this is the only account I use

echo aurora
elfin flax
potent glacier
elfin flax
#

why am i being so toxic i need to chill out

azure vessel
potent glacier
elfin flax
#

maybe you meant my status

potent glacier
elfin flax
#

in which case yeah just being edgy a little 😭

potent glacier
#

Not edgy, but h o r n y

elfin flax
#

bonks

#

this might be illegal here

#

stop talking about it

potent glacier
#

We’re just commenting on your status lol

elfin flax
#

shrugs

echo aurora
#

Let's try to keep the conversation focused on AI please.

elfin flax
#

agree

#

im seeing this fox model

#

on web arena

potent glacier
elfin flax
#

i do wonder what it is but so far its not been very good

#

probably grok code 0 ultrafast

potent glacier
#

Why the shadiness?

elfin flax
echo aurora
potent glacier
elfin flax
#

you probably are talking about video arena

echo aurora
elfin flax
#

i mean who cares if theres limits you are getting it for free anyway

potent glacier
elfin flax
potent glacier
#

That goes beyond the mission statement of theirs

#

Read their site

elfin flax
#

huh

potent glacier
#

You’ll see what I mean

elfin flax
#

werent you talking about just that rate limits arent shown

potent glacier
#

No lol

#

That’s not what I meant

#

That’s not what ‘transparency’ means in this case

#

Transparency as to why they imposed them

elfin flax
#

then explain

elfin flax
#

isnt it clear

potent glacier
#

But I guess we got our answer

elfin flax
#

yeah they want to give everyone a fair chance to use the model while not creating a money blackhole

potent glacier
#

Battle Mode was always free and unlimited, though

#

Now you get 3 turns

vast fern
elfin flax
#

are there really people who need more than 1 turn

#

its a arena

#

the point is to vote

potent glacier
#

I’m not sure if you’re trolling or not

#

You seem to be missing the point here

vast fern
#

why is it growing ??

potent glacier
#

Nano banana and social media

#

Everyone and their mother posting on social media

elfin flax
vast fern
#

people should go to gemini instead of lmarena

elfin flax
#

well they probably are now

#

but some people stayed i guess

echo aurora
elfin flax
#

but this led to models just trying to appeal to humans to inflate their scores instead of becoming smarter

#

isnt that the point

#

to make smarter models

#

🤔

#

instead we got chatgpt 4o and llama 3.1 nemotron

potent glacier
elfin flax
potent glacier
#

That is how you lose trust with your user base

elfin flax
#

i mean they cant give infinite ai to everyone

#

even in battle mode

potent glacier
echo aurora
potent glacier
#

They’ve had it ‘free and open’ for 3 years

elfin flax
polar marlin
#

There are some pictures where Nano Bananas don't appear in battles. 😭

elfin flax
#

(they arent very open because i still dont see the github link)

#

but i still wanna hear what you wanted them to do differently

#

but yes i guess giving an announcement would be good yeah

vast fern
echo aurora
elfin flax
#

no way

elfin flax
short ermine
fossil forge
#

Just want to say hello everyone. This is such a cool space!

elfin flax
#
  1. "gemini 2.5 flash image preview" desc: "image model etc, also known as nano banana"
  2. then they moved the also known to the top of description
  3. "gemini 2.5 flash image preview (aka nano banana)"
  4. "nano banana"
#

then they also changed the image to a banana

alpine moon
#

Holaa

fossil forge
#

as a new user how do i get involved?

#

are there weekly battles?

#

who are judges?

potent glacier
#

It’s a Death Battle

#

Only one opponent walks away

echo aurora
#

Don't hesitate to let me know if you have any questions.

fossil forge
#

thanks

potent glacier
#

As a user who probably isn’t logged in with Google, you’ll get rate limited far more than logged in users

#

Battle Mode used to be unlimited and free but that’s since apparently changed

#

Direct chat has always had rate limits

potent glacier
#

That’s how it is now

elfin flax
#

ok yeah that sucks

#

wheres the freeness and openness

alpine moon
elfin flax
#

sir this is engrish chat

#

probably

#

scratches head

potent glacier
#

You’re only supposed to speak English here

#

Server rules

#

However, it doesn’t say anything against broken English or not having a basic understanding of the language so you don’t keep doing the same thing ad nauseam

#

You know LMArena reminded me of Disney+

#

They had all the major Ai things in one place

#

But all good things come with a price

#

(Disney+ has Hulu, ESPN, etc..)

fossil forge
#

who's paying for all this compute?

potent glacier
fossil forge
#

ahhh

potent glacier
#

Not us peasants

echo aurora
elfin flax
#

i loved lmarena so much because i didnt have to login

potent glacier
#

🤣🤣🤣

elfin flax
#

honestly ill be fine for lower limits for everyone

#

if the logged in users didnt get more

#

because this feels like being forced to login

echo aurora
potent glacier
#

Logged in users get higher rate limits

viral notch
#

login is highly recommended if you value your conversation history though

potent glacier
#

Non-logged in users get lower rate limits

viral notch
polar marlin
#

I love direct chat 🤤

potent glacier
#

Idk what the end game is here, but sadly it reeks of money

elfin flax
potent glacier
#

So having the ability to log in isn’t available to all users?

#

I wonder how they’re rolling that out

elfin flax
#

id assume its rolled out to everyone by now

#

well this isnt a huge deal until the limits are too low

potent glacier
#

Non-logged in users get to use Battle Mode for 3 turns

#

Before they get rate limited

#

Try explaining that one

#

Battle Mode was the one shining light on the whole site

potent glacier
#

Because it was free and unlimited

potent glacier
#

And then you have to wait an hour to use it again

vast fern
#

og era

potent glacier
#

That was the first nail in the coffin

#

Many people wanted to still use the legacy site, myself included

#

They’re a corporation and sadly they can’t be trusted

#

They say they ‘listen’ to their users but idk if I believe that

#

At the end of the day they’re going to follow the money

viral notch
inner gate
viral notch
#

i wonder why google donated so much in the first place?

#

didn't they donate 100M?

#

maybe not that much but definitely at least a million

potent glacier
ocean vortex
#

This message may have just put things into motion

potent glacier
#

But it came with a double-edged sword

#

Occam’s Razor indeed

inner gate
ocean vortex
#

So can't complain now

potent glacier
#

Maybe Google has a huge stake in the site

echo aurora
vast fern
potent glacier
#

The ‘free and open’ lines

#

For the past 3 years Battle Mode was always free and unlimited

#

Again, corporations gonna corporation

ocean vortex
#

"is it gonna be neccessary to pay a fee for accessing lmarena in the future? Specifically 19.99 per month. @ admin @ suggestions @ feedback"

inner gate
#

Even with chat gpt I think it has rate limits without any account, they use it to fund it I’m guessing

potent glacier
#

Just higher

#

I have ChatGPT Plus

#

We still get rate limited even paying $19.99 a month

inner gate
#

I’m not too familiar with AI but it’s expensive man. My guess is that it gives you access to these AI but the limits are there because of the price

digital umbra
potent glacier
#

I have a 4080 RTX

elfin flax
#

so its ok they need to take their time

elfin flax
#

you probably have a skill issue ip

#

just make it better?

elfin flax
#

i dont see a conspiracy plot here

tall tulip
#

Guys, why doesn't mine have the gemini image on the list?

digital umbra
elfin flax
#

also everyone has a google account

tall tulip
#

This is my friend account

elfin flax
tall tulip
#

Thanks

elfin flax
#

np

elfin flax
tall tulip
elfin flax
#

shrugs

vast fern
#

what are the prompts you guys use to test in battle mode . I need some inspiration

green plume
#
poll_question_text

what would you choose

victor_answer_votes

6

total_votes

10

victor_answer_id

2

victor_answer_text

or

inner gate
#

And it should show them listed

elfin flax
potent glacier
ocean vortex
dusty narwhal
#

hello

echo aurora
inner gate
open mountain
#

I noticed that the gpt-image-1-high-fidelity model has more hallucinations than gpt image 1, which draws objects where they never were

so - so improvement ...

barren prairie
barren prairie
# potent glacier For the past 3 years Battle Mode was always free and unlimited

I donno who is gonna test the models for 3 rph and wait one hour to test again on battle mode when I can use nano banana for free and with 100rpd on Gemini app 😐😐 and qwen edit on qwen app for free and chat gpt image gen for free on chat gpt without waiting this much... And I will have higher limits than arena too 🙄🙂😂 nonsense the 3 rph

echo aurora
#

@fluid knoll you'll want to read #1397655624103493813 for more information on how to use Video Arena bot

proud hazel
#

@potent glacier It's working again, suddenly...😓

potent glacier
#

That’s good!

#

I’m glad 🙂

proud hazel
#

:D

marsh totem
#

Am I the only one getting an error when trying to create new images using the reference photos with nanobanana?

echo aurora
proud hazel
#

I got blocked 3 hours ago, for whatever reason, but now I’m back in business.

marsh totem
#

Something went wrong with this response, please try again.

glossy umbra
#

Maybe an email verification is enough to prevent bots.

echo aurora
marsh totem
echo aurora
#

Sorry to hear that, we're looking into it

topaz bay
#

I've been chatting with qwen 3 max and gpt 5, and qwen3 max is outperforming gpt5 by a lot for me personally

#

It's regarding e commerce, and stuff like that, so i'm guessing qwen has good training data considering it's by alibaba group, but idk might be better than gpt5 in other topics too

#

I'm rooting for qwen tho tbh as they open source stuff unlike closedAI

surreal creek
#

Alibaba is really surging in AI performance !

topaz bay
#

For real

surreal creek
#

seems like they have new models coming out all the time, compared to the incremental progress of DeepSeek

topaz bay
#

their image generator did not get its sunshine as nano banana dropped right after

surreal creek
#

China is really starting to have a presence on the leaderboards, with Z.ai, DeepSeek and Alibaba all being strong players now

#

I think Mistral is based in Europe?

topaz bay
#

ye france

#

I would expect china to be the last one to open source stuff, but everything they make is open source.

#

huge W

#

qwen max is a 1T llm lmao,

#

good luck running that locally :(((

surreal creek
#

open source for anybody that owns a supercomputer or bitcoin mine 😂

topaz bay
#

vibevoice which is also made by alibaba i think is the best text to voice tool out there, it can produce up to 90 minute audios with 4 people talking, and it can be run with just 16 gb vram locally

#

rip to everyone who paid for elevenlabs

#

I'm scared to pay for ai stuff now, I considered buying a Higgsfield subscription, and they're really pushing the 1 year subscription like they know they're getting outperformed any second, but I wouldn't be surprised if something free dropped the next day that's better in every way

pallid furnace
#

Hey!

ā¤ļø

mental jewel
#

hello

halcyon harness
#

did anyone try gpt-image-1-high-fidelity. if so, how is it? how is the quality, lighting, shadows, consistency, etc. ?

sturdy marsh
#

can i generate a video here?

potent glacier
#

Imagen 4.0 is still one of my favorite models

waxen tundra
#

hi

echo sinew
glossy umbra
#

Anyone else been getting exceptionally good responses from gpt-5-high lately? As if it were improved perhaps.

keen beacon
glossy umbra
pallid furnace
# potent glacier

Is that supposed to be somebody?

All I see is a yellow Chinese traditional outfit...

topaz bay
ocean vortex
#

Claude is not battling open-source are they

#

Would be a sad time for them if their competitors are that lol

#

I think open-source is distinct category though. People are not gonna pay $200 a month to get 5% better performance than open-source

#

Obv an extreme example, but the point stands

#

Nearly all the best open-source models cost pennies compared to closed models too

sullen quest
#

If I have to pay 200 dollars for 105% the performance of something that I can run on my computer, I'm not gonna pay. Speed and price do matter, how many here use gpt-5 high for everything? I'd bet nobody cause it takes way too long for most tasks that others can very much complete

jade egret
#

how good do yall think gemini 3 (pro) will be?

robust yoke
#

Well, if Gemini 2.5 Pro is already rated as one of the best models of all time on the LM Arena leaderboard, then I would say that Gemini 3 might also be one of the best models in the AI League.

jade egret
#

:0

robust yoke
#

But ultimately, time will determine that.

ocean vortex
jade egret
#

is gpt-5 pro even better than o3 pro?

robust yoke
#

GPT-5 Pro reasons for significantly longer than o3 Pro, so I believe it might be better in terms of output quality and coding.

jade egret
#

oh

ocean vortex
jade egret
#

same i haven't tried it yet too

robust yoke
#

But ultimately, the real challenge will be personal scenarios that will definitively determine the better model.

#

I think GPT-5 happens to be possibly just a bit better than o3 in terms of coding, but I could be wrong.

#

After all, both are reasoning models that take quite a bit of time to think out a solution.

ocean vortex
robust yoke
ocean vortex
raven cape
#

l

robust yoke
#

I suppose I still stand by my point.

raven cape
#

Wow this AI is crazy, it gave life to my old family photo, thank you so much

raven cape
#

LMArena video arena 1

robust yoke
#

Ah.

#

Well, that isn't exactly a model, but... alright then.

ocean vortex
# robust yoke I see.

Ok I just checked the logs and actually max was way beyond that. Found one completion with 50k reasoning lol... But still, it seems that those super long ones are less common with it mostly settling on sub 35k now regardless of your prompt

raven cape
#

I'm new to this, sorry for thinking it was a model,

robust yoke
#

I find it funny that despite the fact it's using high reasoning effort, it still produces an output that is around 35k tokens.

ocean vortex
#

and then if it's low reasoning effort it's gonna be much shorter than 30k like 95% of the time

robust yoke
#

Even then, just from testing, it seems like GPT-5 produces significantly less text than o3 does, just like you said.

ocean vortex
#

I think they got the reasoning lengths fairly solid now. It doesn't seem too wasteful

#

and also does improve the performance

robust yoke
#

Yeah, now it's a fair balance between the two.

#

It thinks for a suitable amount of time and gives you a nice, concise answer to go along with it.

tropic helm
golden ocean
#

absolute cinema

robust yoke
#

Hark. What sublime and resplendent instant doth unfold before mine eyes, a moment of such transcendent magnificence that the very fabric of existence seems to tremble with anticipation and wonder. Behold this singular, ineffable passage of time, wherein the celestial tapestry of possibility weaves itself with such intricate and breathtaking precision that one might scarce comprehend the profound depth of its significance. Each heartbeat, each whispered breath, each imperceptible movement becomes a grand symphony of potential, a magnificent crescendo of human experience that resonates with the most delicate and powerful harmonies of universal design. The present, oh, this most glorious present, stands as a crystalline monument to all that has transpired and all that might yet emerge, a nexus of infinite potential where past and future converge in a singular, luminous moment of pure, unadulterated existence. Every atom, every quantum possibility, every trembling potential of creation seems to pause and genuflect before the raw, unbridled majesty of now, this exact instant that contains within its gossamer boundaries the entire spectrum of human experience, cosmic revelation, and metaphysical transformation. Such a moment transcends mere temporal measurement; it is a living, breathing entity of pure potential, a sacred threshold where consciousness itself seems to inhale with anticipation, where the very boundaries of perception dissolve into a radiant, all-encompassing awareness that defies the mundane constraints of ordinary perception and touches the divine essence of being itself.

#

Truly cinema.

potent glacier
rocky mauve
potent glacier
robust yoke
potent glacier
#

I guess they're doing a Hack A Thon for nano banana

simple sleet
#

Hi friends, do you know of any open-source AI that creates sound effects? I create my videos with WAN 2.2, but I need to add sound. Open-source is preferable because it shouldn't be censored.

quartz dagger
#

hi

robust yoke
#

Greetings.

charred plaza
#

Hello does anyone know when will Qwen3 Max is going to be on LMArena?

feral zealot
#

I blame these frickin AI youtubers with their clickbaity thumbnails for spreading the word of lmai.... the service is getting so much traffic now that it had to introduce stricter rate limits 🤔

potent glacier
#

And other social media like Twitter/X, Reddit, etc..

feral zealot
#

yeah

potent glacier
#

It was bound to happen

#

As with all good things, social media has ruined it

ocean ocean
#

Another anon bots that generate stuff to see what stuff they can get from your own nano stuff ? è__é dang ! Dead internet theory...

potent glacier
#

Oh also...I have never wanted anything so badly before

#

Introducing Genie 3, a general purpose world model that can generate an unprecedented diversity of interactive environments. Given a text prompt, Genie 3 can generate dynamic worlds that you can navigate in real time at 24 frames per second, retaining consistency for a few minutes at a resolution of 720p.

Watch the Google DeepMind episode on G...

#

Once that comes out it's gonna be amazing

ocean ocean
#

Hey ^^ a human not botting videos in closed while ^^ i don't target anyone don't worry haha ^^

finite rivet
#

G'day to one and all. Hope to work out how to use this tool... Seems like I am making many errors. Is this all I have to do? Prompt: #video-arena-1 message Any help is appreciated. Cheers

ocean ocean
#

I'm just really trauma with all bots stuff... i used to love them before lol

#

Nice future of generative art uwu

harsh jay
#

hello

echo aurora
echo aurora
clever marsh
#

good

finite rivet
crystal pulsar
#

greetings,..is there a bot that does video with audio like veo3

echo aurora
crystal pulsar
#

Thank you

verbal nimbus
#

Oh wait, Qwen 3 Preview is already there

crystal pulsar
#

thx

onyx river
#

my chat gpt says daily limit reached whereas in lm arena is it there like that?

#

u can use how much ever u want?

bitter gust
#

Can I upload Files or short video on Lmarena?

echo aurora
echo aurora
leaden sun
# verbal nimbus Oh wait, Qwen 3 Preview is already there

Qwen 3 is the first AI that has managed to refuse a harmless simple request from me that the most strict and restricted Claude wasnt able to. I feel they seem to have copied Claude and Gemini together and made it worse in terms of refusal, that extremely long thinking time... this is it, isnt i

good job Qwen team 💀

bright kayak
#

i havent seen gpt-5 fix itself this way before

unkempt ridge
#

guys I can't generate

#

it just says sending command

outer widget
#

Hello everyone

remote idol
#

it says

#

@echo aurora

warm moss
#

Hello

verbal nimbus
whole sundial
#

<@&1349916362595635286>

#

NFT scam

rare python
whole wagon
#

Sonoma sky alpha 2nd

#

It's xAI

unborn ocean
#

Maybe a finetune of grok 4 or something like that, because they seem to be very similar on the board

#

Or does anyone have info on the coding quality of the model? (have yet to try it)

keen beacon
# whole wagon

Looking at this chart, it makes me hope and seethe that R2 lands at least at 77... Hope and seethe... Hope and seethe

nimble oracle
#

Hey fellas found this in Amazon AWS models recently added anyone know what model it's based off of? The solar model.

unborn ocean
#

Upstage is the lab I believe

buoyant tartan
#

Hello

#

Can you speak with me in English?

nimble oracle
glacial flicker
#

Hi guys, just opened yesterday's session, and some of the previous generations show "assistant A" instead of the name of the model, did that model get removed/updated or something?

#

Just checked myself, yes, they changed ideogram 2 to ideogram 3 which is worse at realism

glossy umbra
#

Some prompts are blocked by cloudflare for some reason. Any idea why?

toxic verge
#

No

#

Maybe with agents

#

But more likely will run into problems my theory

glossy umbra
#

Gpt-5 high alone, personally i dont think it could.
But with specified prompts and guidance, yes, perchance

polar marlin
#

😭

toxic verge
#

Ai on its basic level can barely do anything . You need api or ur own hardware to do anything really cool

#

Other wise u r gunna be limited to most basic 20$ generic and none valueablish output

#

Chatgpt5 can go off if self

#

It’s such a bad model, I hate using it

#

They nerfed it bad

#

Prompting good is a subjective experience

#

It’s just how it’s set up

#

Always asking and thinking randomly and everything else

#

Output is short

#

Bland

#

Cheap

#

Thanks but open ai can keep their propaganda, they need to replace this model asap

#

Like wtf?

#

It’s seriously flawed 😞

vale vortex
toxic verge
#

ChatGPT lol

quartz pike
#

someone pls vote

toxic verge
#

But it hedges and changes context and stuff to fit the open ai policy and agenda

quartz pike
#

left one is really good... But sadly no jumpscare

#

and no horror

toxic verge
trail vale
#

the site is again lagging?

toxic verge
#

These ai the video models are nerfed

toxic verge
trail vale
#

i cant log in

quartz pike
#

o

indigo escarp
#

Hi all !

ocean vortex
#

I mean if any model could do this, I would expect 5-high to be able to as well. I genuinely haven't spotted any notable weaknesses yet comparing it to anything

#

It's not like in the past when o3 was bad with spatial awareness and front-end relative to competition, this is no longer true for 5...

#

It's mostly o3 with its flaws being fixed and then moderate improvements where it was already strong tbh

#

Still do think it was a bad idea for gpt5-chat to be called gpt5 as well though... That model is nowhere near o3 or gpt5-medium/high level

#

if it was just called 'chatgpt-latest' I suppose that would have been fine. But using gpt5 naming there just seems wrong

neon idol
#

News about seedream v4?

ripe mountain
glossy umbra
alpine coral
#

it has the same knowledge cut off as gpt-5 (Sep 24), so i assume it is part of that lineage

#

but yeah anyways.. i'm just being pedantic (and prob wrong ha).. i agree with what you were saying re gpt-5-high

#

its v v good and like a more rounded o3

#

tho gem pro2.5 still isnt that far behind imo (and seems more user friendly in terms of not thinking as long while still delivering quality)

stable osprey
#

We cant make videos on this site right?

high hound
feral zealot
#

Guys check this out : https://www.instagram.com/reel/DOOLqtsAZOE/

It's insane.. i know the creator has probably used multiple tools and editing, but can anyone guide me how can I create/generate videos like this and stitch smaller clips to form 🔥 reel/short videos?

This video was created entirely with AI.
I wanted to experiment with the theme of the future, and even picked a special track: "Future Looks Like Wow". My goal was to capture the atmosphere of what tomorrow might look like. How do you like the result? Do you enjoy this kind of vibe?

#videography #videoediting #aicreator #aivideo #futurev...

glossy umbra
feral zealot
topaz flint
#

I always generate images like this, are there any that are the same?

scenic loom
#

so who do we think Sonoma Sky is

mighty juniper
#

almost 13k spam of these words, I think that AI might suffer some depression 😆
||mistral-small-3.1-24b-instruct-2503||

hushed terrace
ocean vortex
#

what are they gonna do when they retrain on new base model, keep the same name like they did with 4o-latest for maximum confusion? 💀

#

so performance is one, and this is a 2nd reason

coarse mountain
#

hello

ocean vortex
#

They are already calling it that

#

Except I would get rid of '5' there, and name it like that if it were up to me:

glossy umbra
#

its just fast

#

that will confuse some grandmas

ocean vortex
#

it shouldn't though. What is confusing af right now is "gpt5-chat-latest". Everyone not extremely familiar with naming expects sota performance from a very average model

elfin harbor
vale vortex
elfin harbor
ocean vortex
#

response starts generating instantly 🤷‍♂️

glossy umbra
# ocean vortex it is technically instant comparing it to their thinking models

technically not, there is a delay until it starts replying. it still "thinks", just for way, way shorter, under a second. that's with any model. not to mention other people using the api for said model, which slows computations a bit.

technically it starts providing the answer almost instantly, not instantly. and you dont get your response in an instant.

true instant would be "hit send, and get the answer at the same moment you hit send". which is impossible unless the model is running locally and there's no ping. even then, computation is a thing and that also cannot be truly instant.
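
The "almost instant" vs "instant" distinction above is roughly the gap between time to first token and total response time. A minimal Python sketch, assuming a simulated stream: `fake_stream` and its delays are hypothetical stand-ins, not any real model API.

```python
import time
from typing import Iterable, Iterator, Tuple


def fake_stream(tokens: Iterable[str], first_delay: float, per_token: float) -> Iterator[str]:
    """Hypothetical stand-in for a streaming model response."""
    time.sleep(first_delay)      # queueing + network + prompt processing (assumed)
    for tok in tokens:
        yield tok
        time.sleep(per_token)    # generation time between tokens (assumed)


def measure(stream: Iterator[str]) -> Tuple[float, float, str]:
    """Return (time_to_first_token, total_time, text) for a token stream."""
    start = time.perf_counter()
    first = None
    parts = []
    for tok in stream:
        if first is None:
            # The moment the first token arrives is what feels "instant".
            first = time.perf_counter() - start
        parts.append(tok)
    total = time.perf_counter() - start
    return (first if first is not None else total), total, "".join(parts)


if __name__ == "__main__":
    ttft, total, text = measure(fake_stream(["Hel", "lo", "!"], 0.05, 0.01))
    print(f"first token after {ttft:.3f}s, full answer after {total:.3f}s: {text}")
```

Even with made-up delays, the first-token time is always above zero and the full answer always takes at least as long, which is the point both sides are making.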

main stream
#

sonoma so bad

#

even gemini 2.5 pro best

ocean vortex
#

It is not reasoning, starts responding much faster than their reasoning models, and that is kinda enough...

main stream
#

you are right bro

glossy umbra
ocean vortex
violet inlet
#

first glance you dont reason

#

your subconscious does

ocean vortex
#

It's not a new name either, was used by Claude in the past and people are familiar with it

#

Literally no one is expecting a response within absolutely no delay or latency whatsoever (0.0000...sec ) when using a model named like that lol

violet inlet
#

it is not a future persistent confusion

ocean vortex
#

On the same note why was gpt4-turbo called that? Does it have a turbocharger and a gasoline engine? 🤣

#

It's just nitpicking for no reason whatsoever, that's not how naming typically works

proud hazel
#

If a model is called "instant", I expect speeds with at least 1,000 tokens per second, like Mistral's LeChat with its flash responses.

ocean vortex
topaz bay
#

holy fuhhh

#

Kimi is even better

violet inlet
#

Pretty cool. Did you add optimization as a heavily important instruction in the prompt?

still pewter
#

anyone know what models are used here to generate videos? also is there any good text to audio (to add sound to videos, not voice)?

dapper breach
#

Hi there! Glad to be here

weary thistle
#

Hi. Other than visuals (images and videos) and prompts, is there a place where people are sharing their "creations" with AI? Like mini programs, or how they are using AI with another program to enhance their work?
Dunno if my question is clear enough 🥴

weary thistle
main stream
#

nobody sharing their programs

earnest rover
#

@echo aurora
Am I misunderstanding something
The rate limits are for the ones who are guest users right 👍?

weary thistle
echo aurora
weary thistle
main stream
#

100$

echo aurora
# earnest rover Yep

Both users who are signed in, and not signed in, will still get rate limits; however, there are going to be less rate limits for those who are signed in.

main stream
glossy umbra
#

Yes, you can tell it to vibe code, but results won't be the best. As the prompter you must know the logic, or at least 60% of the functionality as an idea. It does have a little creativity, though it needs to be monitored. Else, it's good.

#

yes . in theory, yes. i did that once . gave it every single file link to use (images, textures, etc), every single functionality logic by logic (not the code, just the functionality) . e.g: i shoot a steyr m and the bullet has to come out of here, affected by physics such as gravity, etc. at delta speed X on bullet shot, ... . gave me a working game with almost no bugs (more like customization issues) . obviously i went in and messed with stuff , but the end result was very good.

it will get hundreds of lines long . but you can really produce a whole game like that . obviously not AAA , it needs multiple prompts, but thats the limitation of its token output.

violet inlet
#

If you vibe code it and let it do its thing it will be worse

#

But if you tell it the functionality itself like what you actually want to happen then it will work

#

People forget to actually tell the AI what they want sometimes

#

@echo aurora Shouldnt this be deleted

glossy umbra
# violet inlet But if you tell it the functionality itself like what you actually want to happe...

true . you cant just tell it "add security into my game". you, as the prompter, should know:

  • everything that reaches the client is "dirty"
  • the client can manipulate/hook what it sends back to the server

so you need to tell it something like "Add client-side security to not send manipulated info over to the server in multiple places. Also add some server-sided checks on player movement for physically impossible movements." .
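
A server-side check for "physically impossible movements" could look roughly like this. It is only a sketch: `MAX_SPEED`, `TOLERANCE`, and the `Position` type are assumptions made up for illustration, not taken from any real game.

```python
import math
from dataclasses import dataclass


@dataclass
class Position:
    x: float
    y: float
    z: float


MAX_SPEED = 7.5    # assumed top speed in units per second; tune per game
TOLERANCE = 1.10   # assumed 10% slack for latency jitter


def is_move_plausible(prev: Position, new: Position, dt: float) -> bool:
    """Server-side check: reject moves faster than the game allows.

    The server never trusts the client's claimed position; it only accepts
    a move if the distance covered fits within the elapsed time dt (seconds).
    """
    if dt <= 0:
        return False  # a non-positive time step is itself suspicious
    dist = math.dist((prev.x, prev.y, prev.z), (new.x, new.y, new.z))
    return dist <= MAX_SPEED * dt * TOLERANCE
```

With these numbers a 0.7 unit step in 0.1 s passes, while a 50 unit jump in the same time is rejected and can be snapped back to the last accepted position.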

echo aurora
glossy umbra
#

Debatable . Gpt-5-high is gifted but it has issues . Can't follow instructions sometimes .. and in those instances I could say gpt-5-medium is better . Other times (but rarely), claude 4.1 opus thinking is better.

Most of the time, yes, gpt-5-high is the best using meticulous prompting. Else, gpt-5-medium or claude 4.1 opus thinking. That's my opinion.

drifting crow
#

I've been having good results with gpt5 mini

#

It asks questions too b4 making stuff

#

Yh, but I'm comparing to 4o and o4 mini

#

I use them thru duckai

#

I dunno I haven't used high

glossy umbra
#

No, i found out that that only makes it hallucinate
I personally use caps and simulate anger. In its training data, urging somebody to do something with the use of anger has probably shown results. And it is instructed to follow your requests. So it will pay special attention to those words.

If you're going to use multiple prompts, you should have a rule HTML embed that he has to follow. Example:

"
You will listen to these user rules at all costs, every time you do something:
<rules>

  1. Always follow the user prompts.
  2. Do NOT deviate from the user's requests.
  3. If you are really unsure about something, better ask. I do NOT need junky code that I did not ask for.
  4. Think before writing. You will have 3 eyes: one for code optimization, one for code writing, and one for following the user's prompt.
</rules>
"

Something like that works for me.
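
If the rules block has to go into every prompt of a multi-prompt session, a small helper can add it each time. A minimal sketch: `build_prompt` and `RULES` are hypothetical names, and the rule text is shortened from the example above.

```python
from typing import Sequence

# Rules shortened from the message above; edit to taste.
RULES = (
    "Always follow the user prompts.",
    "Do NOT deviate from the user's requests.",
    "If you are really unsure about something, better ask.",
    "Think before writing.",
)


def build_prompt(user_request: str, rules: Sequence[str] = RULES) -> str:
    """Prefix a request with the same <rules> block every time."""
    numbered = "\n".join(f"{i}. {rule}" for i, rule in enumerate(rules, start=1))
    return (
        "You will follow these user rules every time you do something:\n"
        f"<rules>\n{numbered}\n</rules>\n\n"
        f"{user_request.strip()}"
    )


if __name__ == "__main__":
    print(build_prompt("Add a jump button to the player controller."))
```

Keeping the rules in one place means each follow-up prompt carries the identical block, so there is less drift between turns.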

#

Obviously dont swear at it 😅 but show some anger. It's strategic, not literal.

drifting crow
#

I think lots of users in here were trying that method on the devs

main stream
#

hope they will

#

i want cyberpunk

#

better than being some fat slob's pet

glossy umbra
vocal token
#

@exotic slate new mod?

sullen sand
#

question guys are you getting errors with Claude sonnet 4? because every time I write I get an error

glossy umbra
#

it depends on the company . my guess is when it becomes a noticeable issue

hollow imp
#

@echo aurora how long does quick features like chat renaming take you guys to roll out?

sullen sand
echo aurora
robust yoke
#

And for what reason might you be using the reasoning version?

hollow imp
#

Because it's good

echo aurora
robust yoke
#

It may be good, but so are the non-reasoning models.

hollow imp
#

Completely depends on the task

robust yoke
#

Both are the same in terms of quality.

hollow imp
#

āŒ

brisk helm
#

how many people use lmarena

hollow imp
brisk helm
#

wdym

robust yoke
#

I do.

#

What about you, Mr. or Ms. Joven?

brisk helm
#

i am a boy

robust yoke
#

I can see that now.

#

Anyway…

#

Quite a lot of people in this server do.

echo aurora
#

Can confirm I use LMArena as well, so at least 1

robust yoke
#

Some don't, but the majority does.

ocean vortex
robust yoke
#

After all, why join this server if you don't use it, amirite?

elfin flax
#

me: 3d room in 3js
purple fox: bro i gotchu

robust yoke
#

W Purple Fox.

neon idol
#

Is seedream v4 available?

robust yoke
#

I think so.

elfin flax
#

everyone who isnt here for the battle mode is illegal

neon idol
robust yoke
#

I guess I'm illegal then. 😔

elfin flax
neon idol
elfin flax
#

probably not

elfin flax
neon idol
#

I want to try it

elfin flax
#

what do you use lmarena for

neon idol
robust yoke
#

Just so that I can use the advanced models for free.

elfin flax
elfin flax
robust yoke
#

After all, who likes paywalls, amirite?

brisk helm
#

have yall seen opus

hollow imp
#

Yeah opus audio format is very nice

brisk helm
#

google right?

robust yoke
#

It's very professional.

brisk helm
#

it gives u veo 3 for free

elfin flax
#

everyone saw opus

brisk helm
#

no bro google opus

#

or opal

#

i forgot

robust yoke
#

Chat, I've never used Opal.

brisk helm
#

yeah its opal

robust yoke
#

Sounds pretty neat, though.

brisk helm
#

it released very recently

#

still experimental

#

but it does give veo 3 for free

neon idol
#

What is opal?

weak jay
#

Hi everyone

robust yoke
#

It'd be funny if I used that for deepfakes.

robust yoke
neon idol
#

Ohh that opal

#

Yeah but is not available in my country

#

I can use VPN

robust yoke
#

Sorry to hear.

neon idol
#

But I have a subscription at Google plus

#

And honestly idc about video generation

robust yoke
#

Fair.

#

Whisk is basically just free, infinite nano-banana, I believe.

neon idol
#

But is not available in my country lol

#

I have to use VPN but my VPN is really slow

robust yoke
#

Yeah…

#

I hope it becomes available in your country soon.

neon idol
#

I dont think

#

I live in Europe

#

In Italy

robust yoke
#

Well, they have already started rolling out certain things to Europe, so maybe those will be part of that expansion, too.

neon idol
#

When a thing comes from America, it becomes available for Europeans 2 years later

#

Europe sucks

robust yoke
#

True.

#

Though, I suppose that's better than never.

neon idol
#

Thanks to god we have vpn allowed lol

#

I want seedream v4 anyway 😞

drifting crow
robust yoke
#

At that point, there'd be no reason to pay for a model that already has all of the pro features available for free with infinite use.

#

And besides, paying for some extra features and draining your wallet seems like a waste of time.

barren gale
#

Is the site down again?

robust yoke
#

Depends…

#

What error are you getting upon trying to access it?

barren gale
#

Cant upload a picture

robust yoke
#

Ah, that happens.

#

Could try reuploading it a couple of times.

barren gale
#

3 times and still error

robust yoke
#

Hmm…

barren gale
#

I will try again

robust yoke
#

It's a known issue.

#

If it doesn't work again, you could try refreshing the site and trying again.

remote arrow
#

It's sad that Seedream 3 on LMArena can't use image reference. 🤧

robust yoke
#

True.

remote arrow
#

And I just realized that Imagen 4 also doesn't have it. It was good when used in "Persistence Mode" on Whisk. Seems like everyone being pushed to use the really non-accurate "Nani Banani." Nani?

mild raft
#

hello trying this out for first time

echo aurora
# barren gale 3 times and still error

Sorry to say this is a known issue. We've seen some reports where uploading images is going to error out. If you wouldn't mind sending us the photo to see if we can repro that'd be helpful.

remote arrow
#

World's best quote from GPT-1:
"Something went wrong with this response, please try again."

weak meteor
#

waiting for the epic moment 😎

livid helm
#

Hi

weak meteor
weak meteor
runic zenith
#

Is there a ranking for the best AI model for deep research?

devout maple
#

Hello

daring scarab
#

Hello

whole swallow
#

Dusk feels more smart but slow, sky is faster but didn't deliver the quality that dusk did..

#

What yall think of these two new models

neon idol
#

Someone know something about seedream v4?

ocean vortex
whole swallow
#

Maybe the "impressive thing" is the context window

#

gemini 2.5 pro with a file of almost 1 million context started hallucinating

ocean vortex
whole swallow
#

Mm amazon i missed that

glossy umbra
#

claude 4.1 opus thinking seems better at coding paired with reasoning. gpt 5 high is also good but hallucinates? weird because it was the opposite a little while ago. did the llms receive an update?

ocean vortex
#

It was the worst model that I recall testing in recent times lol

#

and yet it did have 1M context to grab the headlines

whole swallow
ocean vortex
#

@neon idol

neon idol
#

Oh thanks

#

These models suck?

whole swallow
ocean vortex
neon idol
#

They are creating an AI that can say Hello with an extra emoji

whole swallow
#

Lmaoo

torn mantle
#

its so bad isnt it

whole swallow
#

Not that bad

torn mantle
#

its bad

#

if i say its bad its bad

#

its bad

whole swallow
#

Not comparable to the current bests

#

Ye hajaahha

torn mantle
#

someone said it may be grok 4.2

#

that would be a disaster

whole swallow
#

Yeah I don't understand why it says im grok

torn mantle
#

its actually from xai

whole swallow
#

Only dusk tho sky doesnt admit it

torn mantle
#

highly possible that its grok 4.2

#

sky is the dumber one right

#

dusk is the big model

whole swallow
torn mantle
#

oak ai?

#

what the helly

whole swallow
#

Sky is bad at coding but faster and looks smarter

torn mantle
#

what type of coding

#

like fixing bugs or like planning or just coding in general

whole swallow
#

Frontend

#

Sky result ⏫

#

Sky is more simple on design choices, but works. Dusk one had animations and other cool things but had more bugs and errors

neon idol
#

Thanks Lmarena for this incredible update 💔🥀

ocean vortex
#

just use that

#

That text is neat against the background though and still ton of transparency. Apple may take some pointers for iOS lol

neon idol
ocean vortex
neon idol
#

I am too lazy for this 😓😓

ocean vortex
#

it's such a non deal-breaker

#

extremely minor thing to make you not use it when the usage is free and no real rate limits lol

neon idol
#

Yeah lol

echo aurora
neon idol
#

@ocean vortex can you give me a site where I can remove the Gemini logo from images pls?

#

Thank you

ocean vortex
# neon idol @ocean vortex can you give me a site where I can remove the Gemini logo ...

uhh a website? Even if you're on a phone there should be an AI touchup feature. If not then try smth like this:
https://www.watermarkremover.io/

neon idol
whole swallow
#

Bro just ask nano banana to remove the water mark

#

🤣

neon idol
#

If you ask to do it he will reput the logo

ocean vortex
worldly jacinth
#

How exactly do video arenas work?

echo aurora
narrow dawn
#

@echo aurora sorry for ping but i have some ideas for lmarena. first idea: you guys could make lmarena as an app too, not only a website!

echo aurora
narrow dawn
#

is the rate limit only for image generation or for normal usage too? @echo aurora

echo aurora
narrow dawn
#

oh thats a bit dumb for me

#

because i use lmarena for coding most of the time if i don't know about an error

neon idol
#

What the hell of website is man 🥀💔

main stream
#

what

#

ah, you're pig-faced, that explains it

#

87iq

clever crescent
#

Hello

main stream
#

hi

narrow dawn
#

@echo aurora i have another idea: maybe a mesh ai (i know many but i don't know if they work), like an extra page or button where it makes a mesh with a preview and where you can configure things like how many faces, or just from text too

#

and sorry for the ping

dapper cypress
#

Hello

thorn valley
#

but the idea of an app is really interesting

verbal nimbus
#

For maximum style control, why not have a mode which feeds both outputs through an intermediary model that's good at summarizing?

#

It's still quite possible to tell which model is which based on their writing styles (lots of bold/italics/dot points = gemini, no formatting + no latex + lots of technical terms = GPT-5)

#
  • it prevents system prompt hacking
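
The proposal above is an intermediary model. As a much cruder stand-in (swapped in here on purpose, not what was proposed), plain regex stripping of the most obvious Markdown tells gives a feel for what "style control" would remove. `strip_markdown_style` is a hypothetical helper, and it does nothing about vocabulary or response length.

```python
import re


def strip_markdown_style(text: str) -> str:
    """Remove common Markdown styling so two answers look more alike."""
    # Leading bullet markers ("- ", "* ", "• ") at the start of a line.
    text = re.sub(r"^[ \t]*[-*•][ \t]+", "", text, flags=re.MULTILINE)
    # Heading markers ("# ", "## ", ...).
    text = re.sub(r"^#{1,6}[ \t]+", "", text, flags=re.MULTILINE)
    # Bold (**text**) first, then italics (*text*).
    text = re.sub(r"\*\*(.+?)\*\*", r"\1", text)
    text = re.sub(r"\*(\S.*?)\*", r"\1", text)
    return text


if __name__ == "__main__":
    sample = "## Answer\n**Bold** and *italic*\n- first point\n* second point"
    print(strip_markdown_style(sample))
```

A real summarizing model would also even out tone and length, which regexes cannot do.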
tribal rune
#

Has anyone been rate-limited in LMArena text gen before?

inner gate
#

I think

tribal rune
verbal nimbus
#

Interesting

#

It's better at planning I think, also medium might be better than high for raw coding

#

The writing style is too obvious

#

And the time it takes to generate is a dead giveaway

#

I don't think so

verbal nimbus
#

Also, what if users' perception of response quality is biased by thinking time?

#

I noticed an unconscious bias that longer thinking = smarter, compared to a model that returns instantly

#

I'll spend more time reading the thinking model's answer

verbal nimbus
#

Yeah, but compared to a model that returns instantly, I do feel slightly biased towards the thinking model's answer

#

I don't usually use it for debugging programs, since it requires too many files to copy paste

#

Yeah, especially if it's a model that takes 5 minutes to think 🤣

#

Then has a non existent import the next turn šŸ’€

#

GPT-5 seems decent for planning/debugging, but it can mess up its implementation

#

Maybe, can just combine with Claude too

#

Where GPT-5 is the architect and Claude does the actual implementation

#

Definitely more bug free, at least on LMArena

#

Depends on the language too

#

GPT-5 has like 10-15 errors when writing a less common language

#

Opus had about 4

#

Both Opus and GPT-5 should be good at Python

#

Depends what you mean by better ig

#

GPT-5's overall vision is better

#

It solved race conditions in C/C++ more elegantly

#

They're about on par in terms of no syntax errors etc

#

Hmm maybe JS, but Claude isn't as good in pure JS

#

I'm not sure, haven't tried it that much with Java

urban sky
#

@echo aurora Will PDF integration be added?

verbal nimbus
#

Because it's harder to use via CLI

#

Java is very IDE dependent

green marsh
#

hello i saw some great videos about the lmarena hope to enjoy and add to my architecture work

verbal nimbus
#

JS and C/C++ probably, but GPT-5 comes out in front for those two languages

#

Not sure how good it is in Rust

#

Not sure about C#/Java, but they're very IDE-dependent

#

Yup, that's probably the largest factor

#
  • tons of textbooks in training data
#

And tons of computer systems + concurrency + graphics + DSA textbooks use C

#

With thinking on both, yes

#

It solves race conditions better

#

Not necessarily

#

C is the one used in a lot of textbooks

#

Ideally it should be good in Rust

#

For bigger projects, C++ has classes and namespaces, which is handy

#

Which allows easier application of OOP/software design principles

#

You can connect C to Python too, for expensive operations
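
One way to call C from Python is the standard-library `ctypes` module. A minimal sketch, assuming a POSIX system (Linux or macOS) where the C library can be loaded this way; `c_strlen` is just an illustrative wrapper around libc's `strlen`, not an expensive operation.

```python
import ctypes
import ctypes.util

# Assumption: POSIX. find_library("c") may return None on some systems;
# CDLL(None) then exposes symbols already loaded in the process, libc included.
libc = ctypes.CDLL(ctypes.util.find_library("c"))

# Declare the C signature: size_t strlen(const char *s);
libc.strlen.argtypes = [ctypes.c_char_p]
libc.strlen.restype = ctypes.c_size_t


def c_strlen(text: str) -> int:
    """Length in bytes of the UTF-8 encoding, computed by C's strlen."""
    return libc.strlen(text.encode("utf-8"))


if __name__ == "__main__":
    print(c_strlen("hello"))
```

For real hot loops you would compile your own shared library and load it the same way; declaring `argtypes` and `restype` up front avoids silent type mismatches.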

#

Well you can embed assembly in C

#

Probably, but I've only seen it in Linux code, which uses C

#

Not too sure, but C++ is younger

#

Not sure about the history, but I think systems ppl prefer to manage a lot of things themselves too

#

C++ is mainly for bigger projects that benefit from applying OOP techniques ig

#

C# is fine too if on Windows

#

It depends what you're doing ig

#

Larger apps often combine languages

#

If you're on Android for example, it'll be better just using Kotlin than JNI since the OS will take care of a lot of things

#

Java Native Interface

#

You can use it to write C/Rust/C++ code for Android

#

It's probably not very noticeable, there are entire games written in C#

#

C# is quite common in game development

#

Godot, Unity, MonoGame, etc

#

Probably less, but probably still good

#

Only issue is that it might be hard for agents to set up the IDE, but there might be tools for that

#

Well, low-level languages like C/C++ have a whole set of issues to consider that higher-level languages don't

#

Like manual memory management (no garbage collection), pointers, memory leaks, etc.

#

Even basic stuff like reversing a string

#

Not familiar with Delphi, but it probably would do fine in Rust

#

AI companies would probably prioritize it since it's quickly rising in popularity

#

Also it's very easy to use via the CLI

#

Go is more for things like multiprocessing I think

#

No, only C and JS so far

#

Claude seems fine at Rust

#

Not familiar with Delphi

#

Node is pretty powerful

#

Well they solve different tasks

#

For speed you can also use WASM if in browser

#

Not sure, Claude seems a bit poorer on pure JS