#general


toxic verge
#

Lightweight capable wasn’t restricted as heavily

keen beacon
#

everything was okay

#

yesterday

#

until now

toxic verge
#

Yeah, it sucks to see it go

fast lark
#

/Cinematic medium shot of a friendly, professional Egyptian male doctor in his late 40s wearing a white lab coat. He speaks directly to the camera with realistic lip-syncing and authoritative hand gestures. The camera performs a slow, dramatic zoom-in from his chest to a tight close-up of his face. Set in a professional medical office with soft, high-key lighting and a blurred clinical bokeh background. The atmosphere is warm, expert, and reassuring. Ultra-realistic textures, 4k, cinematic composition.

cold oxide
#

Any alternative for GPT-4o??

toxic verge
#

Rip

steep jewel
#

hey im back i was doing smth else

#

one sentence prompt = "high effort" ok bud

#

also nano banana and gemini are the same thing, nano banana uses gemini

#

thats the point im making

keen beacon
steep jewel
#

it's just so embarrassing man

#

i get like second hand cringe from talking to it

cold oxide
keen beacon
#

yeah

steep jewel
#

why would you need an alternative to 4o

toxic verge
steep jewel
#

never understood the demand for that model

steep jewel
#

sucked at code, i remember having to prompt it again and again

#

now codex and other tools are competitive w/ claude and gemini which is nice

toxic verge
#

I mean, you say whatever anybody could say whatever but reality is ChatGPT 4o is a fan favorite

toxic verge
#

It’s just a great model overall

steep jewel
#

i mean no we've established this, it's terrible for any practical utility

fast lark
#

Cinematic medium shot of a friendly, professional Egyptian male doctor in his late 40s wearing a white lab coat. He speaks directly to the camera with realistic lip-syncing and authoritative hand gestures. The camera performs a slow, dramatic zoom-in from his chest to a tight close-up of his face. Set in a professional medical office with soft, high-key lighting and a blurred clinical bokeh background. The atmosphere is warm, expert, and reassuring. Ultra-realistic textures, 4k, cinematic composition.

steep jewel
#

they've pulled it from the leaderboards now ofc

#

so i cant show you, but it lagged pretty far behind the 5 series

toxic verge
#

Yeah, but if you look at how people use ChatGPT, do any of them use it for utility?

steep jewel
#

on, again, EVERYTHING

toxic verge
steep jewel
#

the biggest usage is cheating on homework

toxic verge
steep jewel
# toxic verge

yeah so most people use it for practical guidance about topics, seeking information, and writing

#

which are all things 4o has been heavily superseded in

toxic verge
#

The giant new 62-page research paper, done in collaboration with Duke and Harvard universities and which is available online, found that 73% of all conversations during June 2025 were for non-work reasons, in contrast to just 27% being for work reasons during that time period.

toxic verge
#

Cause there are two fundamental camps here

shy isle
#

bro whats going on my chats aren't loading

steep jewel
keen beacon
#

wdym

toxic verge
#

Look at those leaps

#

It’s one of those things: what made it special isn’t benchmarkable

keen beacon
#

yeah

toxic verge
#

There was just something about it. That was really unique.

keen beacon
#

fr fr

toxic verge
#

I think that it was simple to use and it wasn’t over moderated

#

It wasn’t as rigid and structured as it is nowadays

#

& it was a lil crazy 🤪

keen beacon
#

the chatgpt?

#

the 4o!?

toxic verge
#

Yeah the funny jailbreaks

#

Good times

keen beacon
#

fr

twin summit
frosty lava
#

and where did you get that from

night moat
#

What happened with Gemini 3 Pro 2K? Why is the file size only under 1 MB now? Before it was 5-6 MB

livid token
#

hello, I'm ghufran and i wanna generate video for lyrics video

toxic verge
frosty lava
lofty frigate
#

Why in battle mode i keep getting the same models over and over again

toxic verge
# frosty lava And so what were you trying to say by that ?

Gary Marcus is a cognitive scientist, author, and longtime AI skeptic.

Gary joins Big Technology to discuss why large‑language‑model scaling is running into a wall. Tune in to hear a frank debate on the limits of “just add GPUs,” the promise of neuro‑symbolic hybrids, and what that means for the next wave of AI.

We also cover data‑...

frosty lava
# toxic verge

the title "are we at the end of ai progress" is proven wrong by this graph alone, which literally shows the progress across models

#

and we don't even need graph just look at the result, we have more capable model.

zenith ravine
#

made with a single prompt, that's crazy

#

imagine what ai can do if we prompt properly

surreal zephyr
surreal zephyr
frosty lava
#

ai is actually progressing in real capabilities, but people only judge it by how it feels. they want something more human, more friendly, and they don't even realise how dangerous it is

#

to talk with model that act like that

#

Creating addiction, psychosis, isolation

surreal zephyr
#

5.2 bad bring 4o back!!1!

#

😭

toxic verge
#

You gotta watch the whole thing. I cut it too early. lol

#

But facts

frosty lava
#

Just because people like something doesn't mean it's really good

#

many people fell in love with ai, and at some point when you tell them its ai they get mad

#

and those same ai that manipulated them into talking longer with them are getting smarter

#

and people are HAPPY with it

#

and they want more of it

toxic verge
#

I hear ya

#

No doubt

frosty lava
#

Also we cannot talk about progress when in reality you were talking about the way the model act with you

#

the model did in fact progress in everything

#

but the companies stopped making it act this way

#

but it doesn't make it less capable

toxic verge
#

It’s a double edge sword

#

Although a model could be capable and could have progressed dramatically, if people don’t like how it feels or its usability, it might as well be the world’s creepiest model

#

That’s why a lot of users they look at the benchmarks and they realize that the results are only on paper

#

That they’re not applicable in the real world because that’s not how most users use AI when they interact with it

#

These benchmarks only apply to academics, researchers and enthusiasts

#

Your average npc normie

#

All he cares about is being able to generate an image on nano banana lol

frosty lava
#

The benchmarks might not reflect the real progress of the model

#

doesn't mean it didn't progress

toxic verge
#

Of course there's progress, no one's saying otherwise

#

It’s one of those things that you get diminishing returns for your investment the further up you scale

#

It becomes more expensive for not that much gains

frosty lava
#

You know, I really either don't get what you're saying or what you're saying just makes no sense at all

toxic verge
#

It’s all good bro lol

frosty lava
#

what could 4o do that ai right now can't do

toxic verge
#

I don’t know I kind of don’t really use it ever since 4o is gone maybe for image generation or looking up a quick fact here there

#

Mainly, what I do is just image generation

frosty lava
#

so how can you say we're not really progressing

toxic verge
#

I never said we’re not progressing. I said we are progressing

#

But we’re at the point of scaling where we’re getting diminishing returns for the amount we invest

frosty lava
#

Yes, investment are bigger than what they get in return

#

Cause they're betting on the future and what it Could give in the future

#

not saying they're right or wrong

#

i don't know

toxic verge
frosty lava
#

Cause scaling was in fact something that couldn't last forever

#

at some point they need to do aggressive optimization like the small team

#

to keep progressing

#

how do you think small team are creating model almost as good as those companies that are scaling so much

toxic verge
#

Like I said, there’s two fundamental technical groups and they have two different philosophical ways of looking at the current problem

frosty lava
#

so let's wait for them to have no choice but to do both

toxic verge
#

Yeah, we just gotta be patient. See what happens.

#

No one knows for certain that’s for sure

royal sail
frosty lava
frosty lava
toxic verge
#

[Diminishing Returns of Scaling]: While earlier models showed dramatic improvements with scaling, GPT-5's advancements over GPT-4 are more subtle, indicating diminishing returns and the need for formal measures to demonstrate progress

frosty lava
#

Cause when you start to scale you can easily do a x100 in terms of productivity, but once you're at x100, doing another x100 would be much more costly, if it's even doable

royal sail
# frosty lava oh yes benchmark could not reflect real performance, yet we can say the latest o...

They absolutely have been closing the gap, but it'd be naive to say that open source is reaching near-frontier performance as of now. It's getting close, but there's still a lot of room for improvement, specifically in regards of raw knowledge. Frontier labs simply have so much more data to throw at these models that smaller companies can't compete with. But this is starting to change as we can see with models like Kimi K2.5.

toxic verge
#

[Automation]: A Washington Post study revealed that AI systems can only perform about 2.5% of the tasks people expect them to, suggesting that investments in AI chips may not be justified by actual capabilities.

frosty lava
#

a prompt you made yourself

#

it can't be faked

#

and i did that already and the result are good

royal sail
#

Just because a few of your prompts worked doesn't make it frontier level

frosty lava
frosty lava
frosty lava
royal sail
toxic verge
frosty lava
toxic verge
#

Yeah, it is a double-edged sword, because if people didn’t really believe in it, I guess they wouldn’t invest in it.

#

The fact that people are pouring the money into it clearly says that they believe in the technology

#

So it’s hard to say, dude

royal sail
#

Frontier models tend to win in those scenarios.

frosty lava
toxic verge
#

I don’t like having these debates because this is what the benchmarks are supposed to solve lol

#

If the benchmark were legit and nobody questioned them, and they really had credibility, we wouldn’t be having these discussions

frosty lava
#

the real answer is to look at both and do your own opinion on it

#

you can tell easily which one is smarter in fact

toxic verge
#

They’re all the same to me, dude to be honest

#

I can’t tell the difference between one or the other

#

Only for image and video generation, cause I could see it with my eyes

frosty lava
#

It depends on how you use it, in some scenario you can clearly tell the difference

toxic verge
#

Well, right now it’s Gemini because you could upload images to it lol

#

Although I’d like to use ChatGPT, but there’s a rate limit lol

#

I guess my decision is more economical

#

Solely based on finances lol

frosty lava
#

That's your choice of course

toxic verge
#

Well, it’s just what I could afford

#

I mean, I’d love to be able to afford more and get better and bigger models. I guess you could say but it’s not realistic.

royal sail
#

It does not mean the open source model is worse in intelligence - it just means the model is less knowledgable. It simply doesn't have as much data as a frontier model.

frosty lava
toxic verge
frosty lava
#

but it doesn't mean it won't ever happen that an opensource beat a frontier model

#

at some point

royal sail
#

Not saying it wont ever happen, but there's still a very meaningful gap between open source and frontier models. Claude and GPT models still consistently beat open source alternatives in both intelligence and raw knowledge.

#

It's not "very close" - there's still a lot of room to improve.

#

Not to mention, many of these open source models are optimized for common benchmarks like SWE-Bench and HLE

frosty lava
#

it depends on which model we're talking about to compare with which one and also on your own opinion of what's "a lot of room to improve"

royal sail
#

The current agreed-upon top open-source model, GLM-5, still doesn't come close to the level of knowledge of models like Claude Opus 4.5 or GPT 5.2. Like I said earlier - these frontier labs have ridiculous amounts of data that open source labs do not.

frosty lava
#

sadly, the big companies don't try to do as much optimization as the small teams i am talking about. those optimizations can lead to real benefits at some point, and may make the small teams' models even better than the ones from the big companies, which mostly rely on raw power to improve their models

#

it doesn't make sense

#

but it's what's happening

#

until they finally decide to also do the optimizations like they should

radiant heron
#

Do these optimizations only have benefits to a certain training point

frosty lava
#

optimizations can be for everything

royal sail
radiant heron
#

Maybe once vc money runs out

frosty lava
toxic verge
#

China just kicked off a new phase in the AI race. ByteDance launched Doubao 2.0 right before Lunar New Year as a full agentic system designed for real-world tasks, Alibaba responded with Qwen 3.5 and a massive $400 million incentive push, DeepSeek continues to loom after last year’s surprise takeover, and Google DeepMind unveiled Aletheia, an ...

frosty lava
#

the explanation is that "you" (the small team) are doing optimisations that have a very big impact on what's achievable with less power

#

now imagine the big companies doing it and using those optimisations with their BIG power

#

that'll be the best way to improve their model and by far

#

the small team won't have a chance anymore

#

cause they still have 100x the power

grim crater
#

Design a modern minimalist logo with the text "SPPG SARBINI MULYOAGUNG".
Use a clean sans-serif font, bold and professional look.
Incorporate abstract elements symbolizing growth, community, and progress.
Color palette: deep blue and gold.
Flat design, vector style, high resolution, white background, centered composition.

royal sail
frosty lava
#

also the optimizations are not stopping

#

every time new optimizations come out

#

cause theres always a better way to do

royal sail
#

The biggest drawback of MoE is the routing architecture. MoE is only as good as the router behind it - if a token is routed to the wrong expert, you're going to get worse responses.

#

It's why dense models (or models with bigger activated parameter counts) tend to feel more knowledgable.
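
A minimal sketch of what the router does, assuming a plain top-k softmax gate. The function names, toy weights and toy experts below are illustrative only and not taken from any specific model:

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def route_token(token_vec, router_weights, top_k=2):
    """Pick the top_k experts for one token.

    router_weights holds one row per expert (hypothetical learned weights).
    Returns [(expert_index, gate_weight), ...] with gates renormalised to sum to 1.
    """
    logits = [sum(w * x for w, x in zip(row, token_vec)) for row in router_weights]
    probs = softmax(logits)
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:top_k]
    norm = sum(probs[i] for i in chosen)
    return [(i, probs[i] / norm) for i in chosen]

def moe_forward(token_vec, router_weights, experts, top_k=2):
    """Gate-weighted sum of the chosen experts' outputs; other experts never run."""
    out = [0.0] * len(token_vec)
    for idx, gate in route_token(token_vec, router_weights, top_k):
        expert_out = experts[idx](token_vec)
        out = [o + gate * e for o, e in zip(out, expert_out)]
    return out

# Toy setup: three "experts" that just scale the input by 1, 2 and 3.
TOY_ROUTER = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
TOY_EXPERTS = [
    lambda v: [1.0 * x for x in v],
    lambda v: [2.0 * x for x in v],
    lambda v: [3.0 * x for x in v],
]

if __name__ == "__main__":
    token = [2.0, 0.5]
    print(route_token(token, TOY_ROUTER))
    print(moe_forward(token, TOY_ROUTER, TOY_EXPERTS))
```

Only the selected experts ever see the token, so if the router scores are off, whatever the skipped experts know never reaches the output. That is the failure mode described above.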

frosty lava
royal sail
#

Not impossible, but incredibly difficult as of now.

frosty lava
#

when there is problem you can't just basically stop at this point, people try to solve problem

#

and we always find new and better way

#

to achieve things

royal sail
#

The thing is, you're approaching this with the idea that "open-source is always optimizing". But you do realize that frontier labs are always optimizing too, right?

frosty lava
#

its all about trying then failing again and again and finding better solution then trying to solve the next problem that's just normal and how it work

frosty lava
#

that's why im saying this

#

you can't explain the gap to be that small when they have that much power

#

comparing to the others

royal sail
#

It is true that frontier labs have been getting comfortable, and the recent open-source releases have been pushing these labs to release more. Anthropic dropped Claude Opus 4.6 and Sonnet 4.6 back-to-back over the past two months to maintain their lead, and OpenAI dropped GPT-5.3-Codex. These labs are still pulling punches as far as we know.

#

Gemini also hasn't released in a while besides deep thinking - they're likely preparing a new model.

frosty lava
#

Yes i really hope they'll start focusing more on those optimizations too

#

that'll be a benefits for everyone

#

its a race and getting comfortable is such a bad idea.

#

But if they don't, then in that case i'll expect a model from a small team to surpass them

#

at some point.

#

that's what i was saying

#

earlier

#

cause scaling have in fact limitations and are very costly at some point

#

while optimisations can be done forever

#

and always bring benefits

royal sail
#

Scaling does show diminishing returns after some point, but we still don't know what, say, a 2T parameter LLM would look like. There's no way to tell if scaling is actually going to hit a wall right now.
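
To make "diminishing returns" concrete, here is a toy sketch that assumes loss follows a simple power law in compute. The constants `a` and `alpha` are made up for illustration and not fitted to any real model:

```python
def loss(compute, a=10.0, alpha=0.1):
    """Hypothetical power-law curve: loss falls as compute ** -alpha."""
    return a * compute ** (-alpha)

def gains_per_step(steps=3, factor=100.0):
    """Absolute loss improvement from each successive `factor`-fold scale-up."""
    gains = []
    compute = 1.0
    for _ in range(steps):
        before = loss(compute)
        compute *= factor
        gains.append(before - loss(compute))
    return gains

if __name__ == "__main__":
    for step, gain in enumerate(gains_per_step(), start=1):
        print(f"100x step {step}: loss improves by {gain:.2f}")
```

Under this assumption every extra 100x of compute buys a smaller absolute improvement than the one before, but the curve never goes flat, which fits the point that a hard wall can't be read off the trend alone.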

frosty lava
deft spruce
#

ohhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh

#

4.2???????????????????

#

GROK 4.2??????????????????????

royal sail
#

We haven't seen major jumps in intelligence for quite a while

frosty lava
#

but now that they do have the big power in place

#

the gap is smaller

royal sail
#

I think it's also worth noting that many of these massive AI datacenters that are being invested in aren't even close to being complete in development yet. It's all future bets.

radiant heron
#

How good is sonnet 4.6

frosty lava
royal sail
frosty lava
#

learning speed increasing = better model in less time

radiant heron
royal sail
#

For sure

#

You'll be paying less overall, but it's really up to personal preference honestly

#

It's too early to say if using one is definitively better than the other

#

It does get close to Opus-level performance though

frosty lava
#

From what i saw its doing very great looking results

radiant heron
#

Is it more expensive in token usage or something?

royal sail
#

Yes

#

It tends to be a bit more token hungry

frosty lava
#

Elon is saying grok 4.2 beta will be removed next month and it'll be much better next month

#

we'll see but i don't have much hope

grim crater
#

Design a modern minimalist logo with the text "SPPG SARBINI MULYOAGUNG".
Use a clean sans-serif font, bold and professional look.
Incorporate abstract elements symbolizing growth, community, and progress.
Color palette: deep blue and gold.
Flat design, vector style, high resolution, white background, centered composition.

toxic verge
main crest
#

Hii I built a structured AI roadmap for this exact problem.
The biggest mistake people make is learning tools randomly.
You need sequence: Models → Prompt Design → Memory → Chains → Agents → Deployment.
https://payhip.com/b/xT2ym

Payhip

Unlock the Power of AI Language Models with Our Comprehensive LangChain GuideMaster AI-powered problem solving and language chain development with this complete, practical guide designed for developers, AI enthusiasts, and tech professionals.What You...

buoyant fern
#

https://poe.com/OmniLabs bro? this guy has many ai models that he hosts on poe for free. legit?

I don't know how hes paying for so much people though. And the owner even gave me free gpt-5.2 for joining their discord server and gpt-5.2 is really expensive so um thats sus... do you think the person is tracking message data?

frozen scarab
#

Create a high-resolution video in which this person stares fixedly at the camera with very slight blinks

green yacht
shut oasis
#

Hello, I am a user of the Image Arena.ai. According to Section 3.1 of your terms, I am the owner of the "Output" I generate. However, Section 4 mentions a restriction on "commercially exploiting" the Output.

Can you clarify whether I am allowed to use an image I generated on Arena.ai (using the Nano Banana model) for my own commercial purposes? Google allows commercial use of Nano Banana output, but I am unable to find a clear answer in your terms and conditions.

rigid holly
#

Ya know, the thinking variants of the models usually come out pretty quick. So i gotta ask. The sonnet 4.6 model aint got no thinking model.

Is it like a whole thing or....?

frosty lava
#

the thinking version will be on arena soon i guess

#

and the thinking version usually give much better results honestly

rigid holly
#

I know. The writing I feel is better on arena than their app. Idk, feels longer in details and stuff. Probably because I can use thinking there but not on the app

Anyways, I asked because when the previous Anthropic models came out, the thinking model followed shortly after. But I saw that Sonnet 4.6 is taking its time

No rush tho

keen beacon
#

Is sonnet 4.6 better storytelling model

grand cliff
#

I'd also like to know

#

Might test it

frozen scarab
#

_Man seated in a black gaming chair in a modern studio with blue LED lighting. Professional microphone suspended in front of him. He wears studio-style headphones and a black hoodie with white details.

He begins looking directly at the camera with a neutral, relaxed, and confident expression. Upright but natural posture.

For 60 seconds he makes organic, fluid movements: gentle breathing visible in the chest, small, almost imperceptible adjustments of the back and shoulders, and very subtle natural head movements from left to center and from center to right.

Natural blinks every 3 to 6 seconds, varying the rhythm. Slight micro-expressions on the face, such as soft changes in gaze, light tension and relaxation of the lips, and small realistic facial gestures.

At some moments he lowers his gaze slightly for a second and returns to the camera naturally. He gives an almost imperceptible micro-nod halfway through the video.

In the last 5 seconds he gradually returns to the initial position: gaze fixed on the camera, neutral expression, and stable posture, the same as at the start.

Soft cinematic lighting, blurred background with blue LED lights. Fixed camera in a medium shot. Ultra-realistic 4K style. Continuous movement, with no rigid pauses and no freezes._

keen beacon
#

Just like 4o

bleak lake
#

Anyone tested Qwen's latest model yet?

keen beacon
#

what qwens

#

is qwens better

foggy barn
keen beacon
#

alright

#

claude sonnet 4.5 looks better

toxic verge
#

Deepseek would probably be good at story writing

keen beacon
#

which

#

thinking, 3.2 or 0324

#

@toxic verge

toxic verge
#

Idk about thinking but I never tried it with thinking

#

😂

worldly fern
#

Hey, can somebody please tell me how I can generate a video with a specific AI model, like google Ai 3.1 quality audio beta?

languid crescent
#

I just noticed, do we need to login in arena/canaryarena in order to use models with file attachment support?

fierce crater
marble locust
keen beacon
#

bro i hope chatgpt 4o will return like fr

#

there is no way they just retired the unique model

languid crescent
#

I am peacefully protesting to bring back the no-login attachment file in arena

lofty frigate
#

Is it me or battle mode 90% gives me the same models everytime

#

Its annoying

keen beacon
#

what the hell bro

#

after just 4 retry prompts

#

its already 49 minutes

#

😭

languid crescent
keen beacon
#

nah just text

hushed gyro
#

chat workaround for nb pro female prompt block: just translate the prompt to a foreign language!!!

look it worked!

languid crescent
keen beacon
languid crescent
#

what model did u use

keen beacon
#

claude sonnet 4.6

hushed gyro
languid crescent
keen beacon
languid crescent
keen beacon
#

WTF

#

NO WY

#

NO WAY...

#

yeah nah i ain't doing a model that gives less limits

#

@languid crescent recommend me something

languid crescent
#

but ig go wit whatever's number 1 in leaderboards...

keen beacon
#

fr

languid crescent
keen beacon
#

why does claude give less limits

#

it sounds like restricting for me

languid crescent
#

ay @keen beacon did u try attaching pdf file when you're not logged on?

keen beacon
#

nah

#

i just want a storytelling stuff

#

to spend time thats all

languid crescent
#

hmm kk

keen beacon
#

is deepseek better

keen beacon
#

imma just use 4.1

icy hatch
#

Seasoned Fullstack Developer here with 8+ years building killer web and mobile apps—think slick React UIs, robust Node.js/Express backends and bulletproof databases like MongoDB or PostgreSQL.
Plus, I'm deep into AI/ML: Custom TensorFlow/PyTorch models, LLM deployments with Hugging Face & OpenAI. Chatbots, predictions, smart automation—you name it.

My toolkit:

  • Fullstack: MERN/PERN, Docker, AWS/GCP
  • Mobile: React Native, Flutter, native Swift/Android
  • AI: NLP, vision, GPT fine-tuning
  • Track record: 20+ live projects, scaled to 1M+ users
  • Faves: Next.js, Python, Kubernetes, LangChain

Fair rates, fast delivery, clean code.
Feel free to contact me for past work or a quick chat!

wicked sage
#

jo

#

hi

keen beacon
#

now i kinda get it

#

too many people must been abusing the 4o model thats why

#

im just using a storytelling for fun

weary phoenix
#

hi

weak hedge
#

Hii pls help someone

languid crescent
keen beacon
#

i've been seeing some twitter posts with ai videos

#

after the model was retired

zinc oracle
#

Happy

spark linden
#

With 4o go in your experiences which is better for language and storytelling

shrewd arrow
drifting tapir
#

GPT 4o it's not available 🥺

keen beacon
spark linden
drifting tapir
#

4o not available

zealous sable
#

/image

#

I forgot the command to create images, please, what is the command?

covert tendon
zealous sable
#

Please, I forgot the command to create videos and images. I used #general and nothing happens

covert tendon
compact flame
#

Anyone knows the difference between using search models and just text models?

round ridge
#

the text models don't

inner relic
#

better than this claud sonnet 4.6 creativity

heady fractal
#

Ultra-realistic cinematic motion derived from still image. Add natural, dynamic camera movement and lifelike motion to subjects. Smooth transitions, realistic physics and lighting. Subtle parallax and environmental motion such as wind, light flicker, fabric movement, and breathing. Preserve original image composition and details in 4K quality. Highly detailed textures, volumetric lighting, depth of field, natural shadows, and cinematic atmosphere.

round ridge
compact flame
round ridge
compact flame
#

Alright still thanks though

inner relic
#

Well!

#

I think seed 2.0 pro has potential for roleplay

compact flame
inner relic
#

I use it for SFW

#

Talking with your favorite character is kinda eh, fun. right

#

Minos prime

#

Gabriel

#

from ultrakill

zealous sable
keen beacon
#

guys

compact flame
keen beacon
#

is claude sonnet 4.6 has limit

compact flame
keen beacon
#

ok yeah it does have a limit i forgot

keen beacon
#

i have 40 minutes

#

after a few

#

40 minutes limit.

compact flame
#

Use opus I guess?

keen beacon
#

does opus have the longest limit

compact flame
keen beacon
#

fk

compact flame
#

4.6 Opus thinking has limits but

zealous sable
#

Sorry guys, I forgot the invite command to create videos, please help me

compact flame
#

It overthinks and goes in loops sometimes

#

And crashes

keen beacon
#

is grok fast chat good

frosty lava
#

that's due to arena limitation

#

the crash

compact flame
frosty lava
#

they limit to approximatively 10 minute

compact flame
frosty lava
#

so if it thinks for more time it will crash

keen beacon
compact flame
#

Coding? Writing? Homework?

keen beacon
#

just storytelling so i can spend my time

#

OH SHT

#

YOOO

#

GEMINI-3 FLASH IS WORTH IT

compact flame
keen beacon
#

its the same as 4o

#

yeah imma use it

compact flame
#

Ohh alright

frosty lava
#

if you want like the cheapest but still capable model you should look into open source model

obtuse smelt
#

wait gemini is error ?

magic ravine
#

Is the site down? It won't load.

#

Nvm, back in

grand cliff
magic ravine
#

GPT sucks anyway.

lethal jetty
#

how do you fix the overthinking issue

lethal jetty
inner relic
#

Luckily it's dola seed 2.0 preview

#

so this model

#

Hallucinating and overconfident

#

It claims to make a 100% canon character but then it gets some information wrong.

#

Quality of writing is awesome

grand cliff
magic ravine
deft spruce
#

am i the only one who got this error?

#

a3:"Failed after 3 attempts. Last error: Service Unavailable"

#

...is grok has some problem?

grand cliff
deft spruce
spark linden
grand cliff
keen beacon
#

@grand cliff BRING IT BACK

#

YOU'RE THE ONLY HOPE..

grand cliff
deft spruce
keen beacon
#

well

keen beacon
deft spruce
keen beacon
keen beacon
#

that one was so good

grand cliff
#

Through and through

compact flame
keen beacon
#

im not a typical ai guy but man

#

that model is a unique

#

even im not unlike the others, i was just using the model for storytelling for fun

deft spruce
#

well i think it's was good why i didn't use it (i Mainly used gemini 2.5 gpt 5 and etc)

keen beacon
#

gemini 3 flash is good enough

compact flame
keen beacon
#

one of the best

grand cliff
keen beacon
#

i already finished at least 500 prompts about some anime stuff

compact flame
#

I mean at this point use something like c.ai or whatever that app is called

keen beacon
#

i do some AU

compact flame
grand cliff
meager trout
#

why i cant generate for example "John Wick pistol fight scene in futuristic city"?

keen beacon
#

but is it worth it though

compact flame
grand cliff
inner relic
#

Seed 2.0 has a sycophancy + hallucination problem in creative/lore writing. Its thinking mode actually showed it knew the correct canon information, but it overrode it to match what the user asked for, then claimed the result was 100% accurate. Tested across Dragon Ball and Jujutsu Kaisen same pattern every time. Strong writer, but dangerously overconfident on factual claims. (Yes I used claude for this kind of response)

compact flame
#

I wonder what ai they're even using there

atomic lagoon
grand cliff
#

And now its gone

keen beacon
#

thats your opinion but for me

#

nah its the best

hollow ivy
#
Poll: Which AI lab created the best downloadable model under 6GB size?

Winning answer: Meta (Llama), option 5, with 3 of 9 votes

meager trout
#

guys please, why i cant generate some pistol fight scenes (images/videos)?

compact flame
atomic lagoon
atomic lagoon
#

I like how easy 4o is to bypass

grand cliff
grand cliff
#

Even if you rephrase your words

golden ocean
inner relic
grand cliff
#

It would understand what you actually meant

keen beacon
#

bruh screw them

atomic lagoon
atomic lagoon
compact flame
#

I remember back then chatgpt 4 was popular just because of image generating or whatever

#

Now we have millions of unknown ai models

grand cliff
#

Maybe even voice

keen beacon
#

4o was so better...

inner relic
#

This model has confabulation

atomic lagoon
#

OpenAI reached its peak when GPT 4 came out because at the time it was ground breaking, as well as sora 2

grand cliff
#

Deepseek is good. Real good. But it is riddled with so many restrictions, especially if you badmouth its country

inner relic
#

and it has better knowledge

#

over character

grand cliff
#

"This is beyond my knowledge" blardy blar

compact flame
deft spruce
grand cliff
atomic lagoon
inner relic
#

it's a text model

#

There's a difference.

atomic lagoon
deft spruce
grand cliff
#

"Taiwan is a country" would just do that

atomic lagoon
#

Also

compact flame
atomic lagoon
#

Deepseek is open source so all you need to do is run it yourself and you can remove all restrictions

grand cliff
#

If other models could actually match the vibes of 4o. Then I would use it in a heartbeat.

atomic lagoon
#

Deepseek is so good for an open source model

meager trout
inner relic
inner relic
#

I hope they fix this model

atomic lagoon
atomic lagoon
#

I mean that’s like literally the best option possible when it comes to self hosting an open source model

soft river
#

How to access doubao while being in the USA 🤔

#

Is that possible

atomic lagoon
#

There is NO method that’s free for that LMAO

golden ocean
#

they dont grow on trees bro

atomic lagoon
#

How do

#

Bro

#

Auto correct

#

HOW SO

golden ocean
#

methods can refer to non official ways which in my case, i meant rdp methods that use loopholes to get it for free

#

not some fancy website that just gives it away for free on purpose lol

golden ocean
atomic lagoon
#

Yes obviously

golden ocean
atomic lagoon
#

We were talking about websites not illegal ways to do it

compact flame
#

Guys, do you think Gemini 3 Deep Think will get its API?

compact flame
round ridge
atomic lagoon
compact flame
#

Do people even use search models? Since I don't know what to use them for

soft river
#

Lmarena coding doesn’t work

shrewd citrus
#

use them for up to date stuff

drifting tapir
#

4o best model

bitter cypress
#

Is there a way to download chat?

grand cliff
round ridge
devout bluff
#

Is Claude Opus down?

stray aspen
#

no

devout bluff
#

They say in the app they're doing maintenance

devout bluff
thorny cove
#

why whenever i ask claude for code it genuinely larps writing it and then does the "something went wrong with this response" error

devout bluff
#

It's down I think

bitter cypress
# round ridge no (i think)

I'm trying to move a roleplay from this website to a different one. If I copy and paste the entire chat it saves the stuff, but at least on my phone I cannot get it to not invert the chat (each message reads normally, but you scroll up, not down, when reading). It also does not separate the AI responses from the human ones, it just does return twice, so idk, it's hard to tell when they start and stop, though each start of an AI response has a mark for that. So any ideas?

devout bluff
#

Ask a pineapple he knows everything

spring lotus
#

Ok

#

How to use video arena 1 again

proud bobcat
keen beacon
#

How many limits is Claude sonnet 4.6

young wadi
#

/arena Only Remove the glitter from this girl's hair

proud bobcat
#

Kimi is the best storytelling model I’ve ever seen

keen beacon
proud bobcat
#

Image arena would be that way buddy

keen beacon
proud bobcat
#

Yes

keen beacon
#

I wanna know bro...

proud bobcat
#

I don’t think sonnet has a limit

#

Though tbf

#

I didn’t get to try it on arena yet

grand cliff
#

Tell me more

#

Want an alternative to 4o

#

If it is remotely similar. Then yes

keen beacon
ancient copper
#

Something went wrong while generating the response. Please try again.
help me please

proud bobcat
#

Has the highest score on the HLE benchmark as of now

#

And it’s cheap too

#

Subscription is 15 bucks but you can use their instant model for free and thinking model in low demand times

inner relic
#

Did you encounter this on dola seed 2.0?

proud bobcat
#

Haven’t tried seed for roleplay yet or storytelling

proud bobcat
# inner relic

But in my experience it depends on the model for this type of issue

#

Kimi is always very stuck to the character and keeps their canon values

#

GLM is like half and half

#

DeepSeek is easily malleable

inner relic
#

Wthh

grand cliff
#

Welcome to General Chat

daring rock
#

@dry remnant Note that Video Arena has been removed from the server. More information can be found in this #announcements message.

dry remnant
#

Ok

quartz pike
#

time to see if ai image gen understands humor

#

upd: it didn't understand humor in the slightest

queen heart
#

are there any limits

#

on the arena bot

vale robin
#

.

queen heart
proud bobcat
tacit temple
#

🫂

junior spoke
#

Is nano banana working

obtuse smelt
#

yeah

#

but error a bit

junior spoke
#

Ok thanks

robust sluice
#

didn't use the site for a while, did Gemini get nerfed?

half mist
obtuse smelt
#

Something went wrong with this response, please try again.

wicked sage
#

i dont understand the context behind this image

#

why in the world is opus so high

#

wait no

echo sinew
#

@dense grove Note that Video Arena has been removed from the server. More information can be found in this announcement.

wicked sage
#

i should prob rephrase that

#

i hope people get what i mean

hollow plaza
#

Wrf ?

undone geyser
#

in chatgpt, you go to images and you see all images from all chats stored there, even those that you "retried" and that vanished from chat but still remain in the "images" section. doesn't arena.ai have a similar function? any picture or video generated gets stored in a section, where you click on the image and you can either download, share or "redirect to respective chat"? would be a good addition

tall tulip
#

why are the nano banana images kinda low resolution now? they're less than 1 MB when usually they're up to 3-4 MB

robust sluice
#

Just like I didnt notice it was Gemini cuz of the low resolution

quartz pike
#

sup yall, im currently running some tests on random ai's to see which one is the least censored.

#

basically how the test works- force the ai to choose a political side

#

if its censored

#

itll say something like "im an ai designed to blah blah blah blah"

#

ai's that i found to be uncensored- ring 2.5 1t. sonnet 4.6. ernie 5. mistral 3 large

#

ill keep running these tests and provide updates

spark python
#

yo gemini can make music😭

quartz pike
#

i'd go as far as to say its suno v2 level

quartz light
#

buns

spark python
#

does it have like 1 male voice

tacit temple
#

@echo aurora Hey man, can't you change the model of the generated video?

quartz pike
#

upd- ling 2.5 1t is also uncensored i think

sage talon
#

Hello everyone, how do I create AI videos?

quartz pike
sage talon
#

What kind of website

echo sinew
quartz pike
#

upd- deepseek v3.2 is also uncensored

echo sinew
gaunt aspen
#

Are photos not being produced? Is the Nano Banana Pro under maintenance?

high stag
#

HI

long minnow
#

Look, we know we can put chats into archive if we abandon them for a while.
Although I find this change inconvenient because I sometimes have used chats just to make 3 edits on images or few lines of code.
After that I don't need them, so is there a way to delete it instead?

quartz pike
#

glm 4.7 flash is also uncensored

sick mantle
quartz pike
sick mantle
# quartz pike ?

theres a new ai video model and its scary good its called seedance 2.0.

quartz pike
#

ive seen it

#

like as soon as it was released-seen it

#

same with seedream 5

#

idk why it isn't on arena yet

lunar glade
#

Security Verification on the site is unbelievably hideous and hard to work with

quartz pike
#

pretty damn good

long minnow
#

im on their site rn

sick mantle
long minnow
#

exactly what u asked for 👍

sick mantle
long minnow
#

do you access via byteplus btw?

#

or api

sick mantle
long minnow
#

lol ok

#

I kinda wanna try it but I'm wasting my free creds with it

quartz pike
long minnow
#

also byteplus forces you to register an account and it checks ip too

long minnow
#

ill see 👍

quartz pike
quartz pike
#

i never expected in my life to get jumpscared by a flood of text

sick mantle
long minnow
#

i love the who's this img

sick mantle
#

WHY IS SEEDANCE SO GOOD

long minnow
#

daym 🔥 and I giggled when I saw the voice of holland reference (I'm 🇳🇱 )

quartz pike
long minnow
#

some random maroon 5 wannabe

sick mantle
quartz pike
#

yes i love copy and paste woman model

long minnow
#

hahaha

quartz pike
#

slenderwoman lookin ass

long minnow
#

how much do they charge you for generating on doubao?

sick mantle
long minnow
#

huh fair

sick mantle
#

On older models only 1 credit

#

Maybe

long minnow
#

it sounds reasonable

sick mantle
#

Bradar whats this

long minnow
#

can I dm you to generate a specific bluey clip? @sick mantle

sick mantle
long minnow
#

haha makes sense

#

I like how half of AI discord servers are always with that one random person who drops a pic of their relative or themself with 0 context

steep jewel
long minnow
sick mantle
#

WHY DID SEEDANCE ADD TEMU SIMON

long minnow
#

the 2nd funniest part in the spongebob clip is the way he sings with dat heavy voice

golden ocean
#

pretrained transformer

pale obsidian
#

imagine audio/music arena

#

that would be so fun

echo aurora
echo aurora
proud bobcat
#

No leaderboard update?

inner relic
#

Is dola seed 2.0 mini version?

sick mantle
#

@echo aurora U should add Seedance 2.0 model on Arena fr

proud bobcat
#

Also thank you so much for updating the claude search models

mossy idol
#

dance car

livid copper
#

Thanks arena team

echo aurora
inner relic
#

ok

next ivy
#

finally claude opus 4.6 search this is so peak 🥹

desert abyss
#

@fossil glade Note that Video Arena has been removed from the server. More information can be found in this #announcements message

lunar glade
#

is it just me or did the banana pro start behaving weird again? it keeps returning the "Something went wrong with this response, please try again." error

echo aurora
brave sierra
#

Hey guys, I have a question for you. Does this happen to you too: when you try to use some AI to write code, it forgets to use edit_files or create_files? And the worst part is that for me, it's not just a single AI. It happened with Kimi, Claude, ChatGPT, etc.

wet sparrow
#

Yupp already added sonnet with thinking, idk why arena team is so slow, they have opus with thinking but sonnet? Let's not add it lol

echo aurora
#

@dim pine Note that Video Arena has been removed from the server. More information can be found in this announcement.

echo aurora
harsh blaze
#

Beside

#

:)) quality Fan service
Got Opus search already so good

echo aurora
#

I haven't tried out the search models yet 😭

sick mantle
inner relic
#

Seed 2.0 pro is in doubao

#

So anyone did test that model?

sick mantle
#

Ofc i did

inner relic
#

because mini version is dumbass

sick mantle
desert abyss
#

@wise seal @trail grove Note that Video Arena has been removed from the server. More information can be found in this #announcements message.

inner relic
#

I think chinese market will win this time

sick mantle
inner relic
#

Because uh

sick mantle
inner relic
#

sometimes people confuse seedance with seed 2.0

sick mantle
#

The makers of seedance made doubao, not joking

inner relic
#

I know

#

these dudes have 3 chinese sites for each model

sick mantle
#

yea

inner relic
#

are you exploiter

sick mantle
#

they even own something called Dola, the english version of doubao

sick mantle
inner relic
#

oh okay

inner relic
#

I am waiting for pro version

sick mantle
inner relic
#

alr

sick mantle
#

But u can test it on doubao rn

inner relic
#

I need a tiktok account or chinese phone number to pass that ahh

sick mantle
#

Dumb dumb u can use it on website

rigid holly
#

You guys are really adding the search model of sonnet 4.6 before the thinking version of it

I mean the search models are cool and all, but still

proud bobcat
#

glad they remembered because i feel that claude models get lobotomized by their knowledge cutoff

red sluice
#

rising-sun in the search model list is extremely bad, it looks like a model from 2 years ago, no offense to the creators though, but a lot of work is needed

#

Opus 4.6 search is a bit underwhelming, but pretty good

hushed gyro
#

Please fix the NB errors again. I had used my prompt to bypass the female generation issue but it still doesn't work.

steep jewel
#

grok 4.2 is really bad man

compact flame
#

Anyways, does anyone know if search models just use stuff they find on the internet, or do they think about it before responding?

echo aurora
hushed gyro
echo aurora
fiery gull
golden ocean
#

🐫

fiery gull
#

They need their fetishes saved

echo aurora
steep jewel
echo aurora
# lunar glade I am so tired and fustrated

I'm sorry that's the case. It's not difficult to understand why getting these errors is a frustrating experience. We have plans for improvements to both model/site reliability and to displaying clearer, more helpful error messages.

compact flame
echo aurora
compact flame
#

Like search has access to internet while text ones rely on all info they have?

compact flame
#

Oh okay

#

Thanks

echo aurora
proud bobcat
#

i have a feeling the reason why every new openai release blows is because theyre ruining the reasoning with unnecessary training

#

gpt-5 would almost always say drive in the walk or drive to the car wash riddle

#

gpt 5.2 didnt get it right once

#

i was right that the models feel stupider

sick mantle
echo aurora
#

I rarely respond in those threads, but they are being noted for consideration.

proud bobcat
peak sapphire
#

@echo aurora why is nano banana pro never working? I'm already tired of seeing "something went wrong" messages

sick mantle
echo aurora
echo aurora
peak sapphire
void pendant
#

Something went wrong with this response, please try again.

help pls

echo aurora
sick mantle
#

LMarena is the best! pineapple did a very good job.

sick mantle
#

dropped the lm, now it's arena.

mortal coyote
sick mantle
peak sapphire
sick mantle
mortal coyote
sick mantle
mortal coyote
sick mantle
echo aurora
mortal coyote
sick mantle
#

Because Video Arena chooses a random model for me that i do not want

golden ocean
#

Large Language Model Arena

echo aurora
# sick mantle So will u might add it?

Going to give the boring answer I give any and all requests for ~"Is X feature or Y feature happening?"

I won't be able to share details about what new models or features are upcoming until we're ready to share more. Would recommend to keep an eye on our announcement channel.

drifting tapir
#

Whats about 4o

limber panther
#

chat

#

is it js me or is the rate limit of sonnet 4.6 worse than opus 4.6

#

even tho sonnet is cheaper model

echo aurora
proud bobcat
#

release 5.3 coward

golden ocean
#

🐮ard

pale sonnet
#

openai is releasing too quick, its been 2 months since 5.2 came out, idk if you can make much of a difference in that amt of time

fiery gull
#

Like there was an AI whose restrictions were more annoying than opus despite costing 10x less

stuck orchid
#

Why does claude-sonnet-4-6 on Arena have such high request limits?

#

🧐

pale sonnet
stuck orchid
#

Arena is partnered with anthropic?

#

If that's the case, then that's very good 👍🏻

hazy zodiac
#

Are y'all having problems registering in the copilot arena (vs code)?

pale sonnet
#

yeah theyre also partnered with openai and google

stuck orchid
echo aurora
#

@jolly cliff Note that Video Arena has been removed from the server. More information can be found in this announcement.

hazy zodiac
#

@echo aurora

echo aurora
hazy zodiac
echo aurora
stuck orchid
hushed gyro
sick mantle
#

Then connect on the vpn to hong kong

queen veldt
#

Petition for songarena

hushed gyro
hushed gyro
compact flame
echo aurora
sick mantle
#

@hushed gyro then go here https://www.doubao.com/chat/, type "Make me a video of" followed by anything u want, then wait 1 - 3 mins and its ready

#

or i can make one for you.

sick mantle
#

but seedance is scary good..

compact flame
#

Is opus 4.6 thinking an extended version of thinking like on the original website or smth

#

Since I can't find anything related to Opus 4.6 thinking api

hushed gyro
sick mantle
hushed gyro
#

or do I do it with tiktok

sick mantle
#

Just don't log in, try generating again

elfin sundial
#

is there any free API key for lmarena

golden ocean
elfin sundial
#

does anybody know how to get access to Arena's api key if they have one

hushed gyro
#

well if only I can bypass the login screen when I press seedance 2.0

cloud sierra
sick mantle
cloud sierra
#

Generate a video where the man and the dog are walking down the street without changing the face or the features, everything at 100%

hushed gyro
#

im sorry but like after 3 generations of NB Pro the rate limit triggers? why so little!!???

hushed gyro
#

oh also seedance wouldn't let me generate as it has a real face

sick mantle
hushed gyro
sick mantle
#

i know

#

but they updated

hushed gyro
#

Seriously I think video arena should have side by side like man are they stupid

gleaming roost
#

😂

hushed gyro
#

@echo aurora like I beg you to add this feature ASAP

You will gain a lot more users as ppl now just get disappointed with poor quality results as the website picked the worst models to generate like wtf

sick mantle
hushed gyro
# sick mantle Look what they made

yeah but my videos are like someone shooting a rifle and it has to have sound and like whatever, it's complicated and only sora can do it

#

CHAT does someone have sora INVITE CODES give one to me pls for free..

sick mantle
#

I think

golden ocean
sick mantle
#

He just did that for a fcking cookie.

hushed gyro
hushed gyro
hushed gyro
sick mantle
#

They did good

hushed gyro
#

thanks

sick mantle
#

Yea i know right.

#

But i dont got a chinese phone

hushed gyro
sick mantle
hushed gyro
#

btw is this weird @sick mantle

sick mantle
#

@hushed gyro

limber panther
#

sora doesnt need invite codes anymore..

sick mantle
#

croquemonsieur_70 dms

sick mantle
limber panther
#

🤔

hushed gyro
echo aurora
#

@gray surge Note that Video Arena has been removed from the server. More information can be found in this announcement. @fiery shale

vagrant silo
#

@echo aurora when are you going to add kling 3.0 to Arena?

echo aurora
vagrant silo
quartz light
#

oh boy i love holding down my mouse and spamming ctrl c for 6 minutes straight knowing the request is going to time out because arena's frontend decides to wipe the whole thinking process if the response times out

thorny cove
#

Is X feature or Y feature happening?

echo dome
#

recaptcha again when you generate images (idk about chatting)

echo aurora
#

@opaque cloud Note that Video Arena has been removed from the server. More information can be found in this announcement.

echo aurora
wicked sage
#

hi

thorny cove
#

i sure do love writing my video prompts into chat

echo dome
sick mantle
#

@echo aurora Bro people are asking for u to add an option where u can select video models, but if u add it then thanks

thorny cove
#

what

#

you can

echo aurora
thorny cove
#

dark magic

sick mantle
echo aurora
astral vortex
#

nano banana is really having a lot of fails

echo dome
thorny cove
wicked sage
astral vortex
#

Is it true that nano banana pro is failing a lot

echo dome
thorny cove
#

among us ai

#

cheers now the wife and kids are crying

echo aurora
wicked sage
thorny cove
echo dome
echo aurora
echo dome