#general

1 messages · Page 112 of 1

echo aurora
#

It's fine, our site being down like this isn't an okay thing, we are going to do better.

fierce cove
#

Agreed
our feedback is constructive, not a complaint

glossy umbra
glossy umbra
#

They said they would do better. Let’s be respectful atleast. 😅

naive rapids
echo aurora
#

This is an overall issue we're aware on and would like to fix.

Mid-outage though that's going to be much trickier.

viral notch
#

okay nvm the site only looks like it works... i can only read the conversations but not add to them or start new ones

ornate swift
#

Bbb

glossy umbra
#

Well sorry if it bothers you. We don’t deserve what’s free. Just know you could, theoretically, not be able to use it anyday. Obviously that’s not going to happen. But be respectful. 😁

echo aurora
#

Honestly, it's the engineers who are working in this that are the true heroes with these outages.

glossy umbra
#

@wanton vortex

wanton vortex
#

the admin is pissed off ig XD

stuck kindle
#

What time will it come back?

echo aurora
viral notch
#

i suspect the site is going through feature transitions so some code may not cooperate properly

echo aurora
minor burrow
#

@echo aurora Thank you for your response. I think attaching files is one of the fundamental features. I haven’t seen an admin like you replying to every user’s message. Hats off to you!

stuck kindle
fierce cove
#

Google appears free, but its true cost depends on your definition of "free." Lmarena also isn't free; users provide data to improve its models. True free means receiving nothing in return. Nevertheless, I am grateful to Lmarena for this service ♥️

tight oriole
#

is more than 24 hours down?

viral notch
echo aurora
viral notch
#

usually the site isnt out of commission for multiple days

wanton vortex
#

ig yaal have to open a site like status.lmarena.???

stuck kindle
sullen sand
#

What happened

glossy umbra
sullen sand
zealous turtle
#

Hey all! While the engineering team is working on a fix, please keep chat and tone respectful 🙂

Updates will be posted in #announcements.

Thanks for your patience and understanding! ☘️

jovial sapphire
#

hi guys

#

have you seen gpt5 image model?

#

it's pretty bad lol

#

there's a new model being tested on artificial analysis

#

it's not gpt1 image

glossy umbra
jovial sapphire
#

it's "gpt5"

glossy umbra
#

Nano banana is better at that.

jovial sapphire
#

exactly

primal orbit
#

site is up, but no chat history

jovial sapphire
#

or seedit

narrow dawn
#

is lma up again?

glossy umbra
narrow dawn
#

ok i have every chat

hollow ice
#

I actually used it 2 hrs ago but now 😭😭

sullen sand
#

Yeah

#

Baby

echo aurora
#

We're still monitoring

sullen sand
#

Good job bro

#

Lmarena the best

primal orbit
narrow dawn
#

ye

primal orbit
#

I was not logged in and all previous chats are gone 🙁

wanton vortex
#

love yaal

#

💕

echo aurora
narrow dawn
primal orbit
glossy umbra
storm ermine
#

Where do you guys make your videos?

torpid locust
#

what just happen?

echo aurora
narrow dawn
narrow dawn
echo aurora
narrow dawn
#

how do you get the ai's to work is it with api or how if i can ask or allowed to

echo aurora
echo aurora
narrow dawn
#

huh?

#

i don't understand so i could do my own website with ai or what?

glossy umbra
glossy umbra
# narrow dawn that too

Isnt it pretty straightforward? Just call the API for gpt-5-high or whatever and retreive the LLM response.

narrow dawn
#

better question does it cost money

glossy umbra
narrow dawn
#

ohhh

storm ermine
#

What is the best way to make AI videos for YouTube shorts for free?

cedar tide
#

@echo aurora qwen 3 max 😶

neon idol
cedar tide
ashen plaza
#

im still getting rate limited after a single generation. of course im not logged in. is this intentional now or just a bug left over from everything that was going on? as this happened before everything went down originally

neon idol
ashen plaza
neon idol
#

Site is working for me

glossy umbra
#

I don't work for LMArena so I can't tell you exactly

cedar tide
#

it rather competes with Claude opus

narrow dawn
#

for what do you guys use the LMArena?

ashen plaza
unkempt elk
#

where did the google login and my chats go after the website down
nvm i refreshed site 3 times and all back

glossy umbra
glossy umbra
ashen plaza
narrow dawn
#

for what do you guys use the LMArena?

glossy umbra
narrow dawn
#

:0

green plume
#

its back

#

omg

#

wait its back ?

#

or am i trippin ?

glossy umbra
viral notch
#

glad its back

mild garnet
#

/video

ashen plaza
#

@echo aurora sorry for the ping just had to ask. is the rate limit of only a few generations a bug (as it started before everything began to cash), temporary thing while there was issues or permanent now?

gray quartz
#

hello guys

regal cairn
#

gpt-image-1-high-fidelity 🤔

crimson escarp
#

"high fidelity"

past marsh
#

Hi

fossil fable
#

Actually gonna crash out the new release is CLOSED SHUT

#

i had a hunch

#

it was correct

old halo
#

hi

rigid bay
#

cinematic

compact turret
#

Sim

#

Yes

thorny kayak
#

Hi

#

How to change images to video

undone flint
#

/video

turbid verge
#

hello

regal cairn
balmy mist
#

how is the new Kimi K2 update?

formal jungle
#

You guys can't see this but I I am seeing the most adorable bunny rabbit

subtle forum
#

Hola

echo aurora
open mountain
echo aurora
open mountain
#

gpt-image-1-high-fidelity

tight oriole
#

is video working in next 5 -12 hours?

echo aurora
smoky ice
#

/video

echo aurora
echo aurora
mild minnow
#

I hate Claude on LMArena. I can’t even make a proper Doom Slayer story without getting a Terms of Service violation warning. 😭😭😭 And Claude is so damn good at making stories too. My life is over

polar marlin
#

😭

lucid mural
#

/movie

#

I hate Claude on LMArena. I can’t even make a proper Doom Slayer story without getting a Terms of Service violation warning. 😭😭😭 And Claude is so damn good at making stories too. My life is over

open mountain
#

I hate Claude on LMArena. I can’t even make a proper Doom Slayer story without getting a Terms of Service violation warning. 😭😭😭 And Claude is so damn good at making stories too. My life is over

open mountain
willow grail
#

first of all claude sucks at story making and swe.

#

next.

open mountain
barren prairie
open mountain
willow grail
barren prairie
open mountain
#

хорошо

willow grail
open mountain
willow grail
#

are u new ?

#

use openrouter or openais playground api

regal cairn
willow grail
#

and novelcrafter is the software where u write the story

open mountain
open mountain
#

@regal cairn

latent crest
regal cairn
open mountain
#

@echo aurora when will gpt image high be available

tender sphinx
open mountain
#

say hi

mild minnow
regal cairn
latent crest
#

Hold on, do we have to login now? I don’t want to

inner wadi
#

Hi when will bot start again

echo aurora
# regal cairn

Yeah so that's a new model that's only going to be available on Battle mode, you won't be able to find in Direct/Side by Side. cc @open mountain

echo aurora
open mountain
echo aurora
mild minnow
#

Will Arena ever have an App for mobile

echo aurora
open mountain
mild minnow
#

I mean there has to be a way to keep apps alive no?

sage thistle
#

Hello, Just Join Here

mild minnow
#

Welcome

echo aurora
echo aurora
mild minnow
#

Then you would have the most popular app on the appstore

#

Most apps on the appstore shove in-app purchases down your throat which can be expensive as hell

ashen plaza
#

@echo aurora so is this still just a bug to only be bale to generate about 2 images now? or some kind of safe guard brought up when everything was crashing? or possibly a bug left over from the website going under as i was getting this same thing just before it all started

fathom jackal
proud hazel
ripe mountain
#
poll_question_text

sota - local ai

victor_answer_votes

5

total_votes

7

victor_answer_id

1

victor_answer_text

Qwen3 30B 2507

faint torrent
#

Hi everybody!! Why I don't have permission to write in # video-arena-1, 2 or 3? Thanks

green plume
keen ferry
wooden crown
#

/video

orchid orbit
#

Why can't I take photos and videos?

wooden crown
#

/list

#

la

#

ls

#

df

real whale
#

What happen

lusty tinsel
#

cant they add option to login with other email instead only gmail? like outlook, etc...

ornate swift
#

Why lmerna dont working??

ashen plaza
ornate swift
#

@ashen plaza go to nani banana

ashen plaza
devout vault
#

@echo aurora nice typing in annoucements

devout vault
leaden laurel
#

or what was its name

whole sundial
#

btw @echo aurora you should have made some sort of announcement before you guys decided that it was a good idea to force Google login after 3 image generations

#

not everybody wants to make a Google account, you know?

echo aurora
whole sundial
#

just because 99% of the world uses Google doesn't mean that you should ignore the other 1% that doesn't use Google

whole sundial
echo aurora
echo aurora
solemn hedge
#

Noticed that you added the account feature lets goo!

leaden laurel
#

it was called like that

gleaming roost
#

I knew that with the implementation of login would come higher rate limit fees

civic spindle
#

vro is there a private video generator so i can make free unlimited vids for myself, pfft

boreal saddle
#

What is this bastardized word?

timber iris
#

What are detailed limits?

civic spindle
#

don't worry bout it unc

boreal saddle
willow grail
#

i would choose banning all new member who came in in the last few weeks

#

so i can do more usage in here

#

D:

timber iris
#

What are detailed limits?

civic spindle
#

does lmarena have unlimited free uncensored private every single chat, image, video model for free with no paywalls

echo aurora
civic spindle
echo aurora
timber iris
#

ok

echo aurora
civic spindle
potent glacier
#

@echo aurora Are the rate limits for direct chat or Battle Mode now? I was only getting rate limited in direct chat even before the login was implemented.

civic spindle
#

what is a rate limit

echo aurora
echo aurora
potent glacier
#

The other models were always rate limited before

#

Idk why it needed to be implemented if it was already a thing

civic spindle
#

this used to be unlimited

glossy umbra
echo aurora
civic spindle
glossy umbra
#

Don’t take them for granted

potent glacier
glossy umbra
#

Be grateful

echo aurora
potent glacier
#

I’m very confused

#

Rate limits already existed before

#

So what exactly was added or changed?

civic spindle
#

lemme guess theres a subscription plan to bypass the limits

#

49.99 dollars a month

civic spindle
potent glacier
echo aurora
magic stag
potent glacier
civic spindle
#

mind games

potent glacier
#

I can use it unlimited

echo aurora
#

Both will still have limits, but if logged in you're going to have less. Does that make sense?

glossy umbra
#

Why can’t people just be grateful that there’s a place where you can use LLMs for free? even though there are limits?

ashen plaza
potent glacier
#

Battle Mode isn’t supposed to be rate limited though?

civic spindle
potent glacier
#

That was always unlimited

ashen plaza
echo aurora
potent glacier
#

I think a lot of people are about to get angry lol

civic spindle
#

why cant i send a video game character being mauled to a direct chat to gpt with it constantly being flagged

gusty loom
#

guys what is the best web search model currently

remote arrow
#

After force login, now IP banning. LMArena was a kind a hero, until about... A week ago..

echo aurora
civic spindle
#

lads what happened to story telling lads, cant even send a pic of some game screenshot scenario on there without being flagged, certainly not REAL

ashen plaza
gusty loom
#

guys what is the best web search model currently?

gusty loom
#

thx

fathom jackal
civic spindle
#

admin the censorship thing is so geeked rn

glossy umbra
#

The site must have went down for a reason

ashen plaza
whole sundial
#

tbh i think one of the main reasons they did this is because of bots that use lmarena, and also to lower the load so more generations from real people go through. I don't like the Google requirement in particular, but they are coming up with other options, hopefully email login.

glossy umbra
echo aurora
ashen plaza
glossy umbra
#

We don’t know. Regardless, if there were even bots on the site, there should have been a better way of preventing them.

lusty tinsel
#

didnt they have captcha going on for every generation?

glossy umbra
lusty tinsel
glossy umbra
# lusty tinsel ig

Cloudflare captcha is dependent on other factors. It’s not a fixed rule that it should be every generation.

#

It can happen every generation. But it’s not fixed. It depends

lusty tinsel
#

i got that a lot before. guess i was unlucky

whole sundial
ashen plaza
neon idol
#

@echo aurora What is the limit of generated images for people that have made the login?

glossy umbra
glossy umbra
echo aurora
whole sundial
glossy umbra
potent glacier
#

Battle Mode isn’t supposed to be rate limited at all

#

That’s honestly ridiculous and should be reverted to how it was

misty star
#

DH3 is Seedream 4.0

high hound
# potent glacier Battle Mode isn’t supposed to be rate limited at all

It's official. LMArena is slowly becoming paid. The one service that was known to be non-biased and provide a good place to use AI for relatively free (with some restrictions), is now getting more and more restricted. Forcing you to make accounts ...

We all know what comes next. Subscription models. Brute-forcing users to login...

misty star
neon idol
misty star
#

This is 10 days after nano banana

#

Lmaoo

#

💀

#

Idk

neon idol
#

Where can I try it

dawn wharf
#

Seedream 3 was already very cool

potent glacier
glossy umbra
limpid schooner
#

it's SOTA

dawn wharf
#

but it was bad in prompt following

#

and complex scenes

#

had a very good style tho

golden ocean
#

GIVE EXAMPLES

limpid schooner
dawn wharf
limpid schooner
#

left broski

#

dh3

neon idol
#

How can we try this model?

limpid schooner
#

it will come up randomly often

neon idol
#

CHAT WHAT

#

I have found a crazy thing

#

Look at the model

#

GPT 5?????

potent glacier
#

I will say that having Battle Mode be rate limited now is scummy and shouldn’t have been done

inner wadi
#

Guys anyone using piclumen

neon idol
#

Ok

keen beacon
#

goodbye free gpt 5

neon idol
topaz trellis
#

hi

glossy umbra
#

I understand LMArena . After all it needs to be monetized

#

they cant just burn their wallets endlessly

keen beacon
potent glacier
#

@echo aurora All this stuff definitely goes against the mission statement of LMArena

#

They shouldn’t say it’s ‘free and open for everyone’

#

That’s false advertisement

#

I really don’t think the Battle Mode should be rate limited either because that doesn’t even make sense

hard quiver
#

Down again?

potent glacier
#

All because the website got popular

#

That’s the only explanation

#

If you’re rate limiting us, we should get an explanation as to why and also what the rate limits are

#

Like…Gemini 2.5 Pro itself doesn’t really have rate limits to begin with

#

So how does that make sense?

#

There wasn’t an announcement about it or anything, just blindsided your users and expected them to be happy with the changes

neon idol
#

Yeah you are right

echo aurora
potent glacier
#

That is highly unprofessional

#

A proper announcement is what usually happens in situations where drastic changes are occurring

#

Yeah the site got popular, because it’s a great site

#

But the kinds of changes that are trying to be implemented will make a less greater site

#

The whole rate limiting Battle Mode really leaves a bad taste in my mouth

#

I know I’m complaining but if people don’t speak their minds and stand up against these things then people get walked all over

#

And it really does go against what the mission statement is of LMArena

echo aurora
neon idol
potent glacier
# drifting crow Cry about it

Cry about it? No, I’m raising valid complaints against a company that clearly wasn’t transparent about their intentions to their users

#

That’s called ‘holding accountability’

#

Or do you just let people run roughshod all over you?

drifting crow
#

I don’t typically get upset at free services coz I’m not entitled

hard quiver
#

Where i can use seedream 4.0

echo aurora
potent glacier
#

Has nothing to do with entitlement, has everything to do with their mission statement that is clearly on their website

drifting crow
#

No it doesn’t, it has to do with your interpretation of it

potent glacier
#

No

neon idol
hard quiver
#

😔

neon idol
hard quiver
#

And seedream is better than nanobanana or comparable?

neon idol
#

I am very curious to see how good is seedream v4

neon idol
hard quiver
#

I hope the use of artificial intelligence never gets monopolized. All this competition in the market is driving the creation of better and better tools.

drifting crow
#

I hope I get the sole ownership of the monopoly

echo aurora
misty star
hard quiver
#

Please, may a billionaire use all their money to create an artificial intelligence that allows image creation and editing without censorship

empty stump
#

grok sorta

onyx python
#

I think I hit my limit using nano banana when using the website. I did not get a message at all though. When does the limit reset?

echo aurora
#

Generally, we're not going to provided details on if/when specific models are going to be added.

#

So lets keep it mysterious then ghostnod

#

But yeah like I said I won't be sharing details about upcoming models until we're ready to announce.

echo aurora
boreal saddle
#

Another outage.

kind merlin
#

Hello, excited to start this new journey exploring the world of AI.

boreal saddle
#

It keeps disconnecting.

polar marlin
#

😭

gleaming roost
#

and here we go again

terse shuttle
#

what happening?

boreal saddle
#

The website s stability is worse than a DDoS, fr

topaz sun
#

Why do you guys restrict the models so much? Imposing logins? I've been using it since early lm-sys . The restrictions were not that bad. @echo aurora

echo aurora
# boreal saddle

I was seeing the same; however, it's now working. Are you seeing the same?

crimson escarp
#

do we have any ideas about image generation limits? like how many generations am i allowed for gpt-image a day?

echo aurora
neon idol
#

I am the only that can go on lmarena.ai without any problem?

echo aurora
mighty juniper
#

What happened to the Copilot leaderboard? it's 99 days old now
Also any plan to get some 🎶 AI Music generation leaderboard at somepoint? 🤔

potent glacier
#

That was very well-known

languid wolf
#

😫

echo aurora
potent glacier
#

If the site no longer aligns with that, they should change what it says otherwise that’s misleading

#

How are models supposed to get adequately tested if they’re getting rate limited in Battle Mode?

topaz sun
# echo aurora We've been seeing unexpected traffic and interest in the site, as a result raisi...

That sucks. I get that there’s unexpected traffic, but requiring logins feels like a pretty harsh step for what was originally a free, open experience. It comes off less like rate-limiting and more like a step toward monetization, which a lot of users won’t like. There are lighter-weight solutions you could try before forcing logins: temporary IP-based rate limiting, request queuing, etc. Those approaches keep access open without tying usage to accounts. Part of what made lmarena great early on was the frictionless experience. Adding barriers like logins doesn’t just hurt usability, it sends the message that you’re willing to trade off the community’s trust. If you go down that path, don’t be surprised when long-time users start leaving. At that point, you’re not really offering the “open experience” you built your reputation on anymore.

potent glacier
#

Sadly altruism isn’t realistic

potent glacier
#

They probably have good intentions, but they’re going to end up being like all the other companies and charging premium fees for usage

#

And you’ll still get rate limited even when you’re being charged 🥴

#

I can see rate limiting direct chat on the image gen models, but on the LLMs?

#

That’s pretty much a slap in the face

fathom jackal
#

The most frustrating thing is these generations that fail but don't give you a refund... Every AI gives you a refund for failed generations except this one... for only 3 generations this is ridiculous

high hound
glossy umbra
austere charm
#

hello

cinder epoch
#

Maaan fk the new system its some bs

#

This is gonna be a sheetty ai router if it continues like this

#

@echo aurora Listen to user reviews you guys damn it

#

We aint sign up for this bs

lusty tinsel
#

hopefully they would listen to what most people been complaining and find a better way... (kinda beginning to look like a cope thought sometimes)

drifting crow
#

How can anyone be angry at pineapple with that pfp

cinder epoch
#

This become paid and i zoom the fk outta here

#

"Lets offer our users a free experience"
"Proceeds to add a login wall thats gonna be followed by some shetty restrictions soon"

#

Fk you man

lusty tinsel
#

at least for the battle mode, the original purpose and how it was working should never be touched

echo aurora
echo aurora
ashen plaza
high hound
ashen plaza
#

we are talking ive seen some people complain they have like 70% of their generations used up on errors and didnt get them back. now it was a text based bot not an image generator but still

frozen aurora
#

hi guys can we create 9:16 aspect ratio video?

ashen plaza
echo aurora
# high hound yeh the money gotta go somewhere after all. but i think there are options. maybe...

Something that has always resonated with me from this blog post (would recommend a read if you haven't already) is this section:

At our core, LMArena will always provide:

  • An open, accessible platform for the community to participate in evaluating and comparing models through real-world prompts.
  • Transparent, science-driven leaderboard policies that ensure every model is tested fairly, consistently, and with community input.
  • Features designed for the community, from a better UI/UX to more ways to engage, vote, and contribute to AI progress.
  • Research to push the cutting edge of AI evaluation and reliability.
    I can assure you this will always be the case.
tall summit
#

people make empty promises all the time

#

nobody knows one way or another

#

maybe lmarena will do better

#

i have faith

frozen aurora
ashen plaza
# frozen aurora its to bad any one can give me any site for use free veo3 model or very cheap...

veo 3? for free? GOOD LUCK! I know of a website with veo 3, cheapest around for 5 dollars. A GENERATION! veo 3 is much too expensive atm to be running for free anywhere outside of testing purposes like in this discord. I can send you a website you can generate videos in 9:16 ratio tho. it has 10+ minute generation times and 5 free priority generations a day tho. other wise unlimited to use

keen beacon
#

I assume there's limits on using the higher models right?

potent glacier
#

It should be very obvious why everyone is angry

#

These new rules or whatever they are entirely contradict all of that

#

Battle Mode should NOT be rate limited

#

It wasn’t before

#

The only thing getting rate limited was using direct chat

#

The old model was beyond preferable to whatever mess this new thing is

#

“Commercial sustainability should never come at the cost of community trust or great science”

#

Well, I can guarantee you, with these new rules the trust of the community is definitely plummeting

#

I would hate to see LMArena become another “Open”Ai

#

Sam Altman really can’t be trusted either

whole sundial
#

"accessible to everyone" well, not everyone wants or has a Google account, Google accounts may be banned in some regions as well. I hope they add other options so more people can make accounts, but I wonder if LMArena will be able to tell which person did a certain prompt or generation, which I imagine they can.

formal jungle
#

What's up with the people as small figurines on a desktop?

ashen plaza
formal jungle
#

Ah cool. I enjoyed that action figure trend, I suppose this is similar. Another good one might be to create you and some friends as miniatures in a tabletop war game

flint river
#

i am excited to be here, hope to learn new things here.

formal jungle
#

Welcome! 🙂

wintry tinsel
#

Flushing away good will is a full time job of the modern tech industry

quiet abyss
#

how do i get the image generator on the website to generate 16:9 images?

jagged vortex
#

hello

latent crest
#

Did u try SeeDream 4?

little narwhal
#

New mystery models on OpenRouter

#

Probably Gemini 3.0

latent crest
#

What’s open router ?

zinc ore
#

Its not as performant as current top models, let alone upcoming gem 3

void trout
#

hey, if I ask to generate a video of someone that looks like jar jar brinks from star wars, is that forbidden or the error happens sometimes usually?

little narwhal
languid crescent
#

Man, what's happening? What's with the rate limits? How many are these rate limits per chat?

#

🙁

robust yoke
#

The rate limits may be due to the fact that the models are being hosted from their respective APIs, and thus rate limits are likely to occur when using the associated model too many times.

languid crescent
#

Is it because of the traffic?

robust yoke
#

I don't think traffic would cause shortened rate limits.

#

The issue may be arising from excessive use of the associated model in such a short time.

sullen quest
# little narwhal

2 mill context window? yeah that sounds like it could be gemini 3 dusk could be flash and sky could be pro?

#

nah, after testing it its just wayy too garbage and doesn't act like gemini does

little narwhal
#

So you think it'll be a smaller open-source model?

#

I'm praying for open source if it's not a frontier model

sullen quest
#

unless google was massivly cost cutting this wouldn't make much sense, its worse then gemma

drifting crow
#

It’s bard

sullen quest
#

lol

#

If I had to bet, I think its a perplexity model

little narwhal
#

Perplexity lmao

#

How do they even get billionaires to invest in them

#

There is not a single thing Perplexity does better than the frontier companies

lofty elm
#

LMArena not working on brave, keeps repeating

polar marlin
#

😭

lofty elm
sullen quest
#

sky is better, a little at least

lofty elm
#

after that login email update, eveything got worse

#

well it was fun while it lasted

sullen quest
#

mm

#

yeah they kinda messed up their site a bit

#

it was fine with slower changes

lofty elm
fickle dew
#

sky is pretty much uncensored

lofty elm
#

is it llm

fickle dew
#

I'm guessing they're some kind of XAI model

sullen quest
#

idk grok models like grok 4 a worlds ahead of sonoma sky

lofty elm
#

ohh i see

spare rune
winged mauve
#

lmarena lists image-to-video arena and the veo model, but i don't see it on the website, how can i generate videos from images on the website? anyone?

winged mauve
burnt sinew
#

yo chat

robust yoke
#

Greetings, Vova.

burnt sinew
robust yoke
#

Hello.

burnt sinew
#

anyone here used gemini cli

robust yoke
#

I have.

burnt sinew
#

do you like it

robust yoke
#

It's pretty neat.

burnt sinew
#

i know theres better ones

#

but i want to use it for free

robust yoke
#

That's fair.

burnt sinew
#

have you used openhands

robust yoke
#

I have not.

burnt sinew
#

oh its also cool

robust yoke
#

I see.

burnt sinew
#

its like what cursor had sort of

#

its in web

robust yoke
#

Ah.

#

I'll be sure to check it out.

burnt sinew
#

they give lots of free credits too

robust yoke
#

That's nice.

true sierra
#

Can you generate a video from an image you already have?

burnt sinew
#

do /image-to-video prompt:text

robust yoke
burnt sinew
#

i think

robust yoke
#

That's correct.

true sierra
true sierra
burnt sinew
true sierra
#

Do you paste the image in to the prompt?

burnt sinew
#

nah it will offer you to upload it

robust yoke
# true sierra How?

Well, within #video-arena-1 , if you use the following command, then you can input an image from your camera roll into the prompt, as well as the optional prompt for what you want to happen in the video:
"/image-to-video (image) (prompt)".

burnt sinew
#

@minor fern

#

smh

jade egret
#

gemini 3 when

#

);

burnt sinew
#

its just a guess

#

actually let me ask gemini when it will come out

winged mauve
jade egret
burnt sinew
burnt sinew
jade egret
jade egret
burnt sinew
jade egret
burnt sinew
jade egret
burnt sinew
#

half a year of 2.5!

jade egret
#

how good are you expecting 3.0 be?

burnt sinew
#

idk, if it aligns with the jump from 2.0 to 2.5

#

then it will be insane

#

but i doubt it will

jade egret
#

dang

burnt sinew
#

OR if it will be they will make it paid, or other stuff

jade egret
#

i think hopefully they will make flash free

burnt sinew
#

hopefully they keep stuff like it is now

jade egret
#

yea

burnt sinew
#

normal people using the normal gemini website will keep paying

#

and i will keep having unlimited on ai studio

jade egret
#

but is it gonna be only ultra for like a few weeks or months 🙁

burnt sinew
#

hope not

jade egret
#

yea

burnt sinew
#

like veo 3 its locked behind money 🙁

jade egret
#

🙁

burnt sinew
#

at least they made nano banana unlimited free

jade egret
#

yea

#

it pretty good

burnt sinew
#

it is REALLY good

jade egret
#

yea

#

and google anti trust case win

#

it looking good for google?

burnt sinew
#

i think so

jade egret
#

:0

burnt sinew
#

so far

#

leading in basically everything rn i think

#

except coding

jade egret
#

yea

#

what base model they usuing tho 🤔

burnt sinew
#

wdym

jade egret
#

for 3.0 do they just work on 2.5

burnt sinew
#

OH

jade egret
burnt sinew
#

obviously

#

i doubt they start from scratch

jade egret
#

oh

#

yea

#

anyways gtg

#

cya

burnt sinew
#

bye bye

jade egret
#

bye

burnt sinew
#

i wonder how right it will be tbh

flat flax
#

Hey, came across this Twitter thread summarizing OpenAI’s new paper on LLM hallucinations:
https://x.com/LuozhuZhang/status/1964209351960514778

They highlight that hallucinations aren’t inevitable if you change incentives (e.g. penalize confident errors more than abstentions, reward calibrated uncertainty).

Has anyone here read the paper? Seems like this approach could significantly reduce hallucination rates. Any dataset recommendations to test this kind of “abstain vs. wrong” behavior?

This is an excellent study on LLM hallucinations. My takeaways:

1. Treat neural nets less like rigid programs and more like statistical organisms.
From this lens, LLMs behave a bit like the brain: they fill in missing pieces by generalizing patterns, which makes them powerful

burnt sinew
#

@sly isle

earnest rover
#

@echo aurora
What's the rate limit
And remember it will be not good setting a rate limit

thick patio
#

is it possible to get desired image size output in Lmarena?

robust yoke
drifting crow
#

Looks interesting tho

flat flax
keen beacon
#

Hello LMArena!

robust yoke
#

Greetings, Peter.

winged mauve
#

Can anyone tell me if LMArena gonna be paid or free as it is rn in near future? @robust yoke

robust yoke
winged mauve
robust yoke
winged mauve
robust yoke
# winged mauve one more doubt, are we gonna see video generation on the website itself in upcom...

For the time being, video generation is limited to the Discord server. However, people have requested for video generation to be part of the actual benchmark website itself. In the near future, we are likely to see a video generation feature where you can directly generate videos with popular video generation models like Veo 3, just like how you can generate images on the website with popular image generation models like Imagen.

robust yoke
drifting crow
flat flax
drifting crow
#

I can’t see them coz I don’t have twitter account

unborn ocean
#

I think this shows that anthropic is already aware of it and addressing it.

robust yoke
#

W Anthropic.

robust yoke
#

Claude, I'd believe.

dense sphinx
#

Claude sonnet rules.

#

👍

robust yoke
#

True.

flat flax
#

But Claude is pretty good though

kindred pulsar
#

How to generate 16:9 images in nano banana

robust yoke
#

It's not directly adjustable.

olive knot
#

Hi, im new here. Nice to meet all

robust yoke
#

Nice to meet you too, Bimmo.

keen beacon
royal pond
#

hello

#

greetings from Borneo

robust yoke
#

Greetings, Tansri.

whole wagon
#

It's an obvious thing, no idea what takes them so long to discover these. All the money and talent going into this and they didn't think of that simple idea to reduce hallucinations, I wonder how much other easy things they miss

robust yoke
#

Let me share my perspective on this. While it may seem straightforward in hindsight, research breakthroughs often appear obvious only after they've been discovered and validated. The challenge isn't just identifying potential solutions, but rigorously testing them, understanding their tradeoffs, and implementing them effectively at scale.

The idea of penalizing confident errors more heavily than abstentions isn't entirely new, it builds on established concepts in machine learning about uncertainty calibration. However, systematically applying this to reduce LLM hallucinations while maintaining model utility requires careful experimental design and validation.

I think it's worth considering that AI labs are often exploring many promising approaches simultaneously. What might seem like an "obvious" missed opportunity could be something they've investigated but found challenging to implement effectively, or that had unexpected downsides that weren't apparent at first glance.

That said, I do agree that sometimes simple yet powerful ideas can be overlooked, especially when teams are focused on more complex approaches. This is one reason why having diverse perspectives and open research discussions in the AI community is so valuable.

What are your thoughts on other "simple" approaches that might help improve LLM reliability?

whole wagon
#

Are you capable of rational thought rather than just pasting LLM output verbatim?

robust yoke
#

Just because it's long and detailed doesn't mean it's an LLM output.

#

And even if it was, who cares? We're in a server where AI is pretty much normalized, even for creating images and videos.

#

Therefore, even if my response were generated by an LLM, there wouldn't be any reason to get angry since we're already in a server where that pretty much happens 24/7.

whole wagon
#

It is devoid of any meaningful information or point

robust yoke
#

Just like how many statements tend to be.

unborn ocean
whole wagon
#

I still don't think it's that hard ngl

unborn ocean
#

E.g. moe is the most straightforward thing ever, yet it took quite a bit for everyone to adapt it, because while the theory might be simple, implementing it is quite difficultly and requires some experimentation

whole wagon
#

This idea is like reducing hallucinations to half or smth. It should be very visible even with small models

unborn ocean
whole wagon
#

I'm aware, I still don't think it's that hard. I've scaled ideas like this before it is simply some adjustments made the core idea is the same

#

Just because you have a big model does not really change the dynamics of optimal data in most cases

#

Good data for small and large models remains largely similar

#

I would assume it has been published because they have realised themselves it is an obvious thing

#

So no point keeping it secret

trail wagon
#

What's up?

junior jewel
#

Hey guys

spare rune
obtuse heart
latent crest
#

Is there a way to pin texts I send ?? So I’ll see the replies later that day?

manic raft
#

hello

ocean vortex
#

So it all comes down at what cost are you prepared to tackle hallucinations with, and whether sacrificing overall performance is ok

ornate agate
#

if you're going for max performance/score on benchmarks that don't penalize wrong answer, making the model hyper confidently yolo everything is probably optimal, for that.

#

I do still think "hallucinations" are a fundamental problem with llms though.

ocean vortex
unborn ocean
#

And I also personally noticed the change. + much of their system card report was focused on reduced hallucinations.

pseudo glen
#

hellooooooo

elfin harbor
#

Helloo

ocean vortex
#

small steps

elfin harbor
#

I have a slight question about the rate limit thing? Is this temporary or permanent?

glossy umbra
elfin harbor
# glossy umbra ^

Hmm I do agree but I also get it tbh the increase of traffic is expected. I do get the rate limit thing and login thing. But I hoped it would be a temporary thing till a workaround that would be great.

digital umbra
#

it's really low now

#

only 3 images or so

elfin harbor
digital umbra
#

no

elfin harbor
#

hmm

#

there was one message that told me to wait 48min but I didn't see any message after that

glossy umbra
elfin harbor
#

I guess waiting is good option for now

glossy umbra
#

i guess. but so much restriction affects user experience.

potent glacier
#

They said they're 'taking feedback' from what we've been saying

#

It remains to be seen exactly what they do

tall tulip
#

Can't upload image?

#

Down again?

#

Oh nvm

topaz flint
#

It seems like

digital umbra
#

i think lmarena will have a paid tier by the end of the year if it continues like this

#

but of course one can't really complain about it

potent glacier
digital umbra
#

i'm surprised it has been free for so long

verbal nimbus
potent glacier
# digital umbra i'm surprised it has been free for so long

As per their statement: "LMArena will stay open and accessible to everyone. To do that sustainably, we’re focused on creating long-term value through services that benefit the entire AI ecosystem and serve the larger community."
"At our core, LMArena will always provide:

An open, accessible platform for the community to participate in evaluating and comparing models through real-world prompts."

#

Limiting people who aren't logged into Google isn't only ridiculous, but just plain not right

verbal nimbus
digital umbra
potent glacier
#

I don't care if it's trained on our prompts

#

Hell, every ai thing is trained on the internet

#

Things are scraped from all over the place

trail creek
#

Also don't understand how loging in will help the ai ecosystem more??

potent glacier
verbal nimbus
#

It's not mandatory, is it?

potent glacier
#

I think that might be the case

verbal nimbus
#

It was just a requested feature

trail creek
#

the site is for benchmarking not data collecting...And also it has been free and unlimited without any limit for 3 years what changed now exactly?

digital umbra
verbal nimbus
#

I'm not logged in

trail creek
#

I feel like after the damn banana stealth model they got popular and so with that they saw how much money they could make

#

pure greed...Theres literally nothing else behind this updates but GREED

digital umbra
ocean vortex
potent glacier
#

Logged in users get 'higher rate limits'

verbal nimbus
digital umbra
ocean vortex
#

7% refusal rate for 4-Turbo

digital umbra
verbal nimbus
elfin harbor
#

does the limit affects chats too or image gen

potent glacier
#

And direct chat

elfin harbor
#

Oh well

trail creek
#

If they just not force the login and give the same limit as login in users as anonymous users i would not have any problem (I WOULD)

but they added the rate limit to the battle mode tooo which is the whole purpose of the damn site???? Im i wrong?

potent glacier
#

Enough people need to voice their concerns about it so it can get resolved

verbal nimbus
potent glacier
#

Being compalcent won't solve anything

glossy umbra
digital umbra
trail creek
#

jesus they making me hate capitalism

verbal nimbus
trail creek
#

everything ...Every fricking thing in this world have to be ruined by money, i mean how much money is too much money

glossy umbra
verbal nimbus
potent glacier
ocean vortex
#

Also o1 seems to do very well for it at 14%. That's the main reason it doesn't hallucinate more overall than gpt5 despite lower accuracy

verbal nimbus
digital umbra
#

battle mode

verbal nimbus
#

Interesting, maybe it's more of a rate limit than an absolute limit

potent glacier
#

The whole point of Battle Mode was to be free and unlimited to compare models. They really s h i t the bed with this one.

trail creek
glossy umbra
potent glacier
#

Stop

verbal nimbus
#

Like the data from May or smth is on Hugging Face

potent glacier
#

I'll be completely honest. They're a big company. Don't trust big companies. How do I know? I work for one myself, sadly...

glossy umbra
digital umbra
trail creek
digital umbra
#

this was in may

potent glacier
verbal nimbus
glossy umbra
potent glacier
#

I could care less about likes or upvotes or w/e

glossy umbra
potent glacier
#

Oh trust me I have

#

A lot of us have

#

At the end of the day what'll it get us

#

They're a company and they're going to do what's in their best interest

#

Whether that aligns with the mission statement or not

#

Companies lie. People lie.

#

That's just the way of the world

trail creek
glossy umbra
#

what’s the difference between using lm arena and open router then

potent glacier
#

I have never used open router or heard of it

glossy umbra
digital umbra
#

once the investor money has been deposited, giving them a return on the investment is more important than any mission statement

verbal nimbus
glossy umbra
potent glacier
#

You really wanna know how to get results?

#

Stop using their site entirely

glossy umbra
verbal nimbus
potent glacier
#

If a huge number of people stopped using it, they wouldn't be getting their data for new models

glossy umbra
potent glacier
#

There was that guy a few days ago that brought up payment in their TOS

#

Let me remind everyone

verbal nimbus
#

I haven't logged in so idk what permissions it asks for

potent glacier
#

This is what it says in section 8: “FEES AND PURCHASE TERMS. Company currently offers the Service free of charge. However, we retain the right to charge for the Service, or any features or components thereof.”

trail creek
potent glacier
#

However, we retain the right to charge for the Service, or any features or components thereof.

trail creek
#

i hope im wrong but i don't think im

glossy umbra
#

holds the right to make it paid

digital umbra
glossy umbra
potent glacier
#

They're gonna have a dumpster fire on their hands soon

#

I mean they closed the whole chat entirely when the site wasn't working

#

I thought that was kinda sus to begin with

verbal nimbus
#

I think the allowance nowadays is more generous

potent glacier
verbal nimbus
#

You didn't use to be able to talk to Claude directly without hitting quotas in like 10 msgs

glossy umbra
trail creek
verbal nimbus
#

Or Opus, which is very expensive

verbal nimbus
potent glacier
#

Like I said: Companies can’t be trusted

potent glacier
#

This is the whole reason why I am so glad that I am able to gen locally

#

F censorship, guardrails, corporate handholding, etc..

verbal nimbus
potent glacier
#

It’s pretty sad that the only person that we get feedback from is @echo aurora

#

There hasn’t been any transparency about any of this

steady rover
#

how long does t take to gen a vid caause i have been waiting a hr now

verbal nimbus
#

For ppl who want anonymity, are you clearing your browser history? Otherwise it's not really anonymous, and prompts are logged...

glossy umbra
potent glacier
#

We keep getting told by @echo aurora that they can’t talk about the rate limit or how much it is

#

That isn’t transparent at all

#

If you implement new rules, people should be kept afloat instead of having to play the guessing game

verbal nimbus
potent glacier
verbal nimbus
#

What permissions do they request from Google?

potent glacier
#

Doesn’t matter if you delete your history

potent glacier
#

Honestly I don’t care. I just get ads targeted at me.

verbal nimbus
#

I mean, does it ask for name, email, etc? Google will tell you the first time you sign up

verbal nimbus
potent glacier
#

I meant about the data collection

trail creek
potent glacier
verbal nimbus
potent glacier
#

We’re training models every single day

#

Our prompts help the models grow

steady rover
#

how long does t take to gen a vid caause i have been waiting a hr now

glossy umbra
verbal nimbus
#

It should reset the limits, although I'm pretty sure that's against the policy

potent glacier
trail creek
potent glacier
#

Until you turn it off

elfin beacon
#

hello! Any talk in Spanish?

verbal nimbus
steady rover
#

how long does t take to gen a vid caause i have been waiting a hr now?

potent glacier
#

People will find ways to circumvent the rate limits but it won’t work

#

I mean 3 in Battle Mode is downright insulting

#

LMArena: “We have made the difficult decision to go Pay For Play.”

trail creek
verbal nimbus
glossy umbra
digital umbra
#

free user loyalty doesn't really matter tbh

verbal nimbus
#

Honestly they're offering really expensive models for free, so it's fine imo

#

Like it wasn't possible to talk directly to such expensive models last year

potent glacier
#

@trail creek @verbal nimbus I’d delete those last chats if I were you. You’re essentially talking about overriding the site limits 😉

trail creek
digital umbra
#

user loyalty would have been a concern if they were actually paying for it

trail creek
#

if they make it paid then just use open router

verbal nimbus
potent glacier
trail creek
potent glacier
#

They’ve deleted a lot of things

digital umbra
#

and if too many free users leave the site, the rate limits will have to go up so that the paid datasets remain useful

potent glacier
#

So do other mods

digital umbra
#

so that's kind of self stabilizing i'd say

glossy umbra
trail creek
potent glacier
#

There’s no such thing as an altruistic company

#

They all have ulterior motives

glossy umbra
potent glacier
#

Yeah that’ll really teach ‘em!

potent glacier
#

You'd need to have a massive amount leave

glossy umbra
# potent glacier What, all three of us?

Doesn’t matter if it’s three or three hundred if the core users leave, the platform loses the very people who give it value. We’re only 3 people here. You don’t know the lots that didn’t join the discord server

potent glacier
#

If they did we wouldn't be getting spammed with /image

glossy umbra
potent glacier
#

These complaints about the rate limits and stuff definitely go above @echo aurora and the other mods here

#

If it was a corporate decision, it's gonna be an uphill battle

glossy umbra
potent glacier
#

Money makes the world go 'round

glossy umbra
trail creek
potent glacier
glossy umbra
#

Many people will stop when it becomes paid

potent glacier
#

Hasn't happened yet, boyo

neon idol
#

How can I try seedream v4?

trail creek
potent glacier
#

Actually I made a mistake about earlier. I can think of one company that's altruistic and has continuously given people free things for years now: Hello Games

glossy umbra
potent glacier
#

They have continuously given free updates to No Man's Sky while other companies would charge

trail creek
#

and there's some people here that don't even know what we are talking about and login to any site they enter without any care for security.

glossy umbra
#

when it does become paid

potent glacier
glossy umbra
#

Yes but they won’t be able to use the service anymore when the time comes

verbal nimbus
#

Looks like rate limits are only for image gen as per #announcements ? That kinda explains why I haven't encountered it.

trail creek
#

those people don't know what boycott is

verbal nimbus
#

They're having compute issues

glossy umbra
potent glacier
#

They have very generous benefactors and donors

#

All the major ai companies are represented on LMArena

#

They're all probably funneling money to them as well to continue training

trail creek
potent glacier
#

People actually bet ACTUAL money on how models perform

#

I forget the site

trail creek
#

so how are they having compute problems?

verbal nimbus
#

Compared to text, images and videos are probably massive

potent glacier
#

I believe that stuff gets sent to the respective companies

verbal nimbus
ocean vortex
potent glacier
#

All I know is that we aren't getting any transparency about what's happening or why

verbal nimbus