#general | Arena | Page 342

shrewd citrus Apr 23, 2026, 10:36 PM

#

and since 5.5 is only on codex rn I think the exact same thing is happening

frosty lava Apr 23, 2026, 10:38 PM

#

wasn't it like days ?

#

i don't remember honestly

#

but what matters is that it will go public

#

at some point

loud verge Apr 23, 2026, 10:59 PM

#

https://cunnyx.com/i/status/2047382400112660608

leo 🐾 (@synthwavedd)

Perspective helps!

GPT-5.5 underperforms Mythos on:
- SWE-Bench Pro
- HLE

It is basically on-par on:
- GPQA Diamond
- BrowseComp
- OSWorld-Verified

It is better on:
- Terminal-Bench 2.0

All while being more token efficient, smaller and cheaper than Mythos (and actually available!)

Quoting leo 🐾 (@synthwavedd)

GPT-5.5 benchmarks are out

…

main nexus Apr 23, 2026, 11:18 PM

#

Gpt 5.5 out?

indigo knoll Apr 23, 2026, 11:25 PM

#

Gpt 5.5 is available on free plan Chatgpt?

frosty lava Apr 23, 2026, 11:27 PM

#

indigo knoll Gpt 5.5 is available on free plan Chatgpt?

No

frosty lava Apr 23, 2026, 11:27 PM

#

main nexus Gpt 5.5 out?

yes its out

grand raft Apr 23, 2026, 11:33 PM

#

but not on free plan

grand raft Apr 23, 2026, 11:52 PM

#

i need the 5.5

loud herald Apr 24, 2026, 12:00 AM

#

Kimiiiiiiiii yessss

zenith steppe Apr 24, 2026, 12:03 AM

#

loud herald Kimiiiiiiiii yessss

Kimi yes kimi no , is it jailbroken or just like that?

loud herald Apr 24, 2026, 12:03 AM

#

zenith steppe Kimi yes kimi no , is it jailbroken or just like that?

Its jailbroken but its so easy to do so

#

Thats the reason I like chinese models

#

They always have low guardrails

zenith steppe Apr 24, 2026, 12:07 AM

#

loud herald They always have low guardrails

They are like that naturally not even by design , lol

#

Isnt kimi a steal from claude?

vale quest Apr 24, 2026, 12:51 AM

#

loud herald Kimiiiiiiiii yessss

Kimi is also disabled on arena.ai

#

Because everyone spent their balance

whole sundial Apr 24, 2026, 12:53 AM

#

vale quest Kimi is also disabled on arena.ai

it's not?

#

vale quest Apr 24, 2026, 12:54 AM

#

whole sundial

Now try to use it

whole sundial Apr 24, 2026, 12:55 AM

#

no kimi models work

#

@echo aurora here is a trace id for you:
:19f5165f-0c6b-

#

also btw i was able to get the reason why it failed, i guess arena is broke

Your account org-3768766e50c242e2ade5fc3b3b783831 <ak-f4h9btz5i7s111b3pub1> is suspended due to insufficient balance, please recharge your account or check your plan and billing details
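
The error text above is what separates a balance problem from some other failure, and that check can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `classify_upstream_error` and its labels are hypothetical names, the "rate limit" wording is an assumed example of a different failure, and the only string taken from the chat is the "suspended due to insufficient balance" fragment of the quoted error.

```python
import re

# Fragment copied from the error message quoted in the chat.
BALANCE_PATTERN = re.compile(r"suspended due to insufficient balance", re.IGNORECASE)


def classify_upstream_error(message: str) -> str:
    """Return a coarse, hypothetical label for an upstream provider error string."""
    if BALANCE_PATTERN.search(message):
        return "insufficient_balance"
    # Assumed wording for a different failure mode; real messages may differ.
    if "rate limit" in message.lower():
        return "rate_limit"
    return "unknown"


# Placeholder IDs instead of the real ones from the chat.
sample = (
    "Your account org-PLACEHOLDER <ak-PLACEHOLDER> is suspended due to "
    "insufficient balance, please recharge your account"
)
print(classify_upstream_error(sample))
```

A label like this would only tell you what the provider said at the time of the request; it cannot tell you whether the account has been topped up since.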

#

i can donate my moonshot ai key to you guys if you need it /s

primal orbit Apr 24, 2026, 1:01 AM

#

is there still any chance to get opus 4.7 thinking in battle mode?

whole sundial Apr 24, 2026, 1:06 AM

#

whole sundial also btw i was able to get the reason why it failed, i guess arena is broke > Yo...

i think we can blame moonshot ai here for not having an auto-topup function

#

i looked and i didn't find one, unless i am missing it

whole sundial Apr 24, 2026, 1:09 AM

#

whole sundial i think we can blame moonshot ai here for not having an auto-topup function

also btw @echo aurora if you want a simple message to say to the devs, just say "Kimi models are failing due to Arena's Moonshot AI platform account being out of balance, you have to top it up to fix it. I don't believe Moonshot has an auto topup function, so you'll have to check on it often."

#

also btw i really do have a moonshot ai api key, it still has some balance on it and i have some experience with the platform

void shore Apr 24, 2026, 1:10 AM

#

i just have a feeling that the people using kimi are gonna drain account balance really fast

#

so it makes sense that it would go broke

whole sundial Apr 24, 2026, 1:11 AM

#

void shore i just have a feeling that the people using kimi are gonna drain account balance...

yeah they've made it more expensive this time around, probably to make up for the price cuts and speed improvements made during kimi k2.5's time

void shore Apr 24, 2026, 1:11 AM

#

whole sundial yeah they've made it more expensive this time around, probably to make up for th...

and its also open source

whole sundial Apr 24, 2026, 1:11 AM

#

but once again i blame moonshot for not offering automatic topups

void shore Apr 24, 2026, 1:11 AM

#

if anyone has that much gpu power

#

then download it and run it locally for others to use

loud herald Apr 24, 2026, 1:13 AM

#

zenith steppe They are like that naturally not even by design , lol

Well they wont focus on putting guardrails up thats why and I approve haha

sly cedar Apr 24, 2026, 1:13 AM

#

whole sundial also btw i was able to get the reason why it failed, i guess arena is broke > Yo...

Is it still able to use or no?

loud herald Apr 24, 2026, 1:14 AM

#

zenith steppe Isnt kimi a steal from claude?

Chinese models distill from anything so yes

whole sundial Apr 24, 2026, 1:14 AM

#

sly cedar Is it still able to use or no?

you'll have to wait on arena to refill their moonshot account

sly cedar Apr 24, 2026, 1:14 AM

#

whole sundial you'll have to wait on arena to refill their moonshot account

I mean, if i use it, what notification it will be?

#

An error?

whole sundial Apr 24, 2026, 1:14 AM

#

sly cedar I mean, if i use it, what notification it will be?

it will be a something went wrong error

void shore Apr 24, 2026, 1:15 AM

#

sly cedar Is it still able to use or no?

long story short, no.

#

their account balance is empty at the moment, so no requests can be sent through

#

until it gets refilled, you'll just get an error message

#

:3

whole sundial Apr 24, 2026, 1:15 AM

#

the message i sent was from arena now sending partial trace to users, I extracted it and got the message

sly cedar Apr 24, 2026, 1:16 AM

#

Does anyone have thoughts on openmythos?

urban herald Apr 24, 2026, 1:23 AM

#

whole sundial i can donate my moonshot ai key to you guys if you need it /s

i need it :3

loud herald Apr 24, 2026, 1:45 AM

#

whole sundial you'll have to wait on arena to refill their moonshot account

Seems fine to me

#

Any company doing this though would 100% be doing a pay as you go

#

Not credit system where they have set numbers

sullen sable Apr 24, 2026, 1:46 AM

#

.

inner relic Apr 24, 2026, 2:55 AM

#

guys

#

deepseek v4 is here

#

https://api-docs.deepseek.com/

Your First API Call | DeepSeek API Docs

The DeepSeek API uses an API format compatible with OpenAI/Anthropic. By modifying the configuration, you can use the OpenAI/Anthropic SDK or softwares compatible with the OpenAI/Anthropic API to access the DeepSeek API.

#

there's deepseek v4 pro and

#

uh

#

flash

#

grand raft Apr 24, 2026, 2:57 AM

#

#1372229840131985540

vale quest Apr 24, 2026, 2:57 AM

#

grand raft <#1372229840131985540>

Bro is not a mod

grand raft Apr 24, 2026, 3:01 AM

#

yeah

echo aurora Apr 24, 2026, 3:05 AM

#

whole sundial also btw @echo aurora if you want a simple message to say to the devs, ...

Okay thank you for the heads up, looking into and flagging blobthanks

#

@whole sundial are you sure this is the case? I'm not getting any issues with Kimi models.

limber crag Apr 24, 2026, 3:07 AM

#

Hey weren't there more tape models?

#

What happened to the rest

#

@pineapple

echo aurora Apr 24, 2026, 3:09 AM

#

limber crag Hey weren't there more tape models?

Sorry to say I can't go into details about codenamed models

limber crag Apr 24, 2026, 3:09 AM

#

No issues!

#

Can i suggest a feature?

whole sundial Apr 24, 2026, 3:09 AM

#

echo aurora @whole sundial are you sure this is the case? I'm not getting any issues ...

they may have already fixed it

limber crag Apr 24, 2026, 3:10 AM

#

How about we can like vote on other's battle mode generations?

whole sundial Apr 24, 2026, 3:10 AM

#

are you getting responses?

limber crag Apr 24, 2026, 3:10 AM

#

Like it can be a scrollable thing

echo aurora Apr 24, 2026, 3:10 AM

#

Yeah, have tried out a bunch of them and they all seem good 👍 What makes you think it was balance related and not some other error problem?

echo aurora Apr 24, 2026, 3:11 AM

#

limber crag How about we can like vote on other's battle mode generations?

That would be super cool. Some kind of social aspect to it where others can vote. Would be really interesting to see those leaderboards too.

#

This idea has been something we've kicked around a bit.

limber crag Apr 24, 2026, 3:12 AM

#

Ohh any plans on working on it then or you cant talk about it

inner relic Apr 24, 2026, 3:12 AM

#

how these dudes are not excited about deepseek v4

#

bro

#

ok i am posting nothing here

main nexus Apr 24, 2026, 3:13 AM

#

inner relic

This real???

#

Dude

inner relic Apr 24, 2026, 3:13 AM

#

echo aurora Apr 24, 2026, 3:13 AM

#

limber crag Ohh any plans on working on it then or you cant talk about it

Nothing that I'm able to share.

inner relic Apr 24, 2026, 3:13 AM

#

main nexus This real???

yes it's real

main nexus Apr 24, 2026, 3:14 AM

#

Deepseek v4 better cook my meal or else this model sucks

#

The hype better be good

#

Gotta be better than opus 5

#

And gemini 3.5

whole sundial Apr 24, 2026, 3:17 AM

#

echo aurora Yeah, have tried out a bunch of them and they all seem good 👍 What makes you think i...

arena returned an error from the moonshot api to my console that said the account was out of balance

obtuse smelt Apr 24, 2026, 3:20 AM

#

hmm well is max generating image is 3 not much more ?

inner relic Apr 24, 2026, 3:22 AM

#

https://x.com/ValsAI/status/2047513613750202452

Vals AI (@ValsAI)

DeepSeek v4 is now the #1 open-weight model on our Vibe Code Benchmark, and it’s not close.

It leaves the #2 (Kimi K2.6) in the dust, and even beats out frontier closed source models like Gemini 3.1 Pro.

obsidian cargo Apr 24, 2026, 3:24 AM

#

It's already on direct and side by side mode!

#

I hope it stays

minor bloom Apr 24, 2026, 3:25 AM

#

DEEPSEEK!!!!!!

wicked talon Apr 24, 2026, 3:27 AM

#

Wtfff

#

No one told me about this

wicked talon Apr 24, 2026, 3:27 AM

#

inner relic https://x.com/ValsAI/status/2047513613750202452

Bruhhhh

grand raft Apr 24, 2026, 3:27 AM

#

but the announcement

echo aurora Apr 24, 2026, 3:27 AM

#

🐳

echo aurora Apr 24, 2026, 3:27 AM

#

grand raft but the announcement

incoming

minor bloom Apr 24, 2026, 3:27 AM

#

Wait

stray aspen Apr 24, 2026, 3:28 AM

#

minor bloom Apr 24, 2026, 3:28 AM

#

Why can't I upload images?

#

Or files?

#

To deepseek

stray aspen Apr 24, 2026, 3:28 AM

#

Deepseek is out

vernal raft Apr 24, 2026, 3:28 AM

#

I'm legit confused

#

I don't see it on arena

#

Can't tell if those are all ai from gpt 2

stray aspen Apr 24, 2026, 3:29 AM

#

vernal raft Can't tell if those are all ai from gpt 2

https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro

deepseek-ai/DeepSeek-V4-Pro · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

vernal raft Apr 24, 2026, 3:29 AM

#

Omfg can't distinguish ai from real

rigid copper Apr 24, 2026, 3:30 AM

#

awwww.... :/

wicked talon Apr 24, 2026, 3:30 AM

#

Wait did it literally just come out?

minor bloom Apr 24, 2026, 3:31 AM

#

Yes

wicked talon Apr 24, 2026, 3:31 AM

#

Oh

minor bloom Apr 24, 2026, 3:31 AM

#

Like 10 minutes ago

wicked talon Apr 24, 2026, 3:31 AM

#

I didn't even realise

wicked talon Apr 24, 2026, 3:31 AM

#

minor bloom Like 10 minutes ago

30*

#

I was wondering why it wasn't on deepseek app

#

Probably will be released later today 🙂

#

For now gotta use arena.ai

minor bloom Apr 24, 2026, 3:32 AM

#

I wanna see benchmark results

#

Its probably on the same level as Gemini 3.1

wicked talon Apr 24, 2026, 3:32 AM

#

minor bloom I wanna see benchmark results

It's gonna benchmaxx

minor bloom Apr 24, 2026, 3:32 AM

#

But worse than opus

wicked talon Apr 24, 2026, 3:33 AM

#

Wouldn't surprise me if it's close to Claude

minor bloom Apr 24, 2026, 3:33 AM

#

And 5.5

wicked talon Apr 24, 2026, 3:33 AM

#

minor bloom But worse than opus

Nah

vernal raft Apr 24, 2026, 3:33 AM

#

echo aurora incoming

Please tell me this model can kick anthropic from the throne

stray aspen Apr 24, 2026, 3:33 AM

#

minor bloom And 5.5

5.5 sucks

grand raft Apr 24, 2026, 3:33 AM

#

lets see if this model is good

minor bloom Apr 24, 2026, 3:33 AM

#

Need to compare it to kimi

stray aspen Apr 24, 2026, 3:33 AM

#

Wait for mythos to crush it

rigid copper Apr 24, 2026, 3:33 AM

#

rigid copper awwww.... :/

@echo aurora what was that actually mean? like maximum of 10 attachment per chat?

stray aspen Apr 24, 2026, 3:33 AM

#

Im still waiting for the spud

minor bloom Apr 24, 2026, 3:33 AM

#

stray aspen Wait for mythos to crush it

Only 5 people have access to that

echo aurora Apr 24, 2026, 3:33 AM

#

rigid copper @echo aurora what was that actually mean? like maximum of 10 attachment...

Correct

vernal raft Apr 24, 2026, 3:33 AM

#

stray aspen Wait for mythos to crush it

Yeah 100€ per 1m can stay there overpriced for enterprises

wicked talon Apr 24, 2026, 3:34 AM

#

Why is flash actually fast asf

wicked talon Apr 24, 2026, 3:34 AM

#

stray aspen Wait for mythos to crush it

Mythos is ass bro

#

Claude is definitely over hyping it

#

Glazing

vernal raft Apr 24, 2026, 3:34 AM

#

Please someone tell me that this model is actually running on Huawei chips

frosty lava Apr 24, 2026, 3:34 AM

#

wicked talon Mythos is ass bro

AND overpriced

minor bloom Apr 24, 2026, 3:35 AM

#

wicked talon Mythos is ass bro

There is literally no way to know this

frosty lava Apr 24, 2026, 3:35 AM

#

but we already know about the price

wicked talon Apr 24, 2026, 3:35 AM

#

minor bloom There is literally no way to know this

Correct but I believe it's going to be good but everyone is over hyping it so much

#

I mean wasn't 4.7 a downgrade from 4.6?

frosty lava Apr 24, 2026, 3:35 AM

#

who want a good model that you can only use 2 time a month !

wicked talon Apr 24, 2026, 3:35 AM

#

frosty lava who want a good model that you can only use 2 time a month !

True

frosty lava Apr 24, 2026, 3:35 AM

#

cause of the price and usage

wicked talon Apr 24, 2026, 3:36 AM

#

Deepseek will probably smash qwen

#

And Kimi

#

Kimi doesn't have image generation

minor bloom Apr 24, 2026, 3:36 AM

#

Claude is unusable without paying

inner relic Apr 24, 2026, 3:36 AM

#

https://x.com/deepseek_ai/status/2047516922263285776

DeepSeek (@deepseek_ai)

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length.

🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.
🔹 DeepSeek-V4-Flash: 284B total / 13B active params.
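
As a quick sanity check on the figures in the announcement above, the ratios can be worked out directly. This is a rough sketch: it assumes "active params" means the parameters used per token (the post does not spell this out), and the helper names are made up for illustration.

```python
# Parameter counts quoted in the announcement, in billions (1.6T = 1600B).
MODELS = {
    "DeepSeek-V4-Pro": {"total_b": 1600, "active_b": 49},
    "DeepSeek-V4-Flash": {"total_b": 284, "active_b": 13},
}


def active_fraction(total_b: float, active_b: float) -> float:
    """Share of the total parameters that are listed as active."""
    return active_b / total_b


def size_ratio(bigger: str, smaller: str) -> float:
    """Ratio of total parameter counts between two models in MODELS."""
    return MODELS[bigger]["total_b"] / MODELS[smaller]["total_b"]


for name, params in MODELS.items():
    print(f"{name}: {active_fraction(**params):.1%} of parameters active")

ratio = size_ratio("DeepSeek-V4-Pro", "DeepSeek-V4-Flash")
print(f"Pro is about {ratio:.1f}x the total size of Flash")
```

On these numbers Pro lists roughly 3.1% of its parameters as active and Flash roughly 4.6%, with Pro at about 5.6 times the total size of Flash.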

minor bloom Apr 24, 2026, 3:36 AM

#

Even sonnet

#

1.6 T??????

#

Holy

wicked talon Apr 24, 2026, 3:36 AM

#

minor bloom 1.6 T??????

Bigger than Gemini lmao

#

Grok is 2T though I think

minor bloom Apr 24, 2026, 3:37 AM

#

No

#

Its 0.5 T

rigid copper Apr 24, 2026, 3:38 AM

#

echo aurora Correct

pretty sure once i hit it, I have to create another new chat :)

minor bloom Apr 24, 2026, 3:38 AM

#

Grok 4.4 will be 1T when it comes out according to Elon

wicked talon Apr 24, 2026, 3:38 AM

#

Bro why is deepseeks knowledge cut off may 2025

#

Fake V4?

minor bloom Apr 24, 2026, 3:39 AM

#

I knew my blue whale wouldn't disappoint

wicked talon Apr 24, 2026, 3:39 AM

#

No way deepseek kept this model from us for this long

wicked talon Apr 24, 2026, 3:39 AM

#

minor bloom I knew my blue whale wouldn't disappoint

It's disappointing me now

#

I gotta make some random bull htmls then try to make it code a speedtest server

pseudo magnet Apr 24, 2026, 3:42 AM

#

inner relic https://x.com/deepseek_ai/status/2047516922263285776

damn

fickle venture Apr 24, 2026, 3:42 AM

#

DEEPSEEK IS back hell yeah

echo aurora Apr 24, 2026, 3:42 AM

#

rigid copper pretty sure once i hit it, I have to create another new chat :)

That's the case if you'd like to continue uploading more files. But you should still be able to prompt.

vernal raft Apr 24, 2026, 3:42 AM

#

Nsh

pallid crypt Apr 24, 2026, 3:42 AM

#

deep seek v4 lets go

vernal raft Apr 24, 2026, 3:43 AM

#

Idk

wicked sage Apr 24, 2026, 3:43 AM

#

DEEPSEEEEEEEEEEEEEEEEEEEEK

#

YESSSSS!!!!!!!!!!

stray aspen Apr 24, 2026, 3:43 AM

#

Deepseek v4 is so cool

pallid crypt Apr 24, 2026, 3:43 AM

#

they scored worse than kimi tho sadly 😢

wicked talon Apr 24, 2026, 3:43 AM

#

pallid crypt they scored worse than kimi tho sadly 😢

Never speak again please

#

NEVER

#

Leave this server

inner relic Apr 24, 2026, 3:43 AM

#

nah bro

minor bloom Apr 24, 2026, 3:43 AM

#

Its worse than kimi in coding

inner relic Apr 24, 2026, 3:43 AM

#

i think deepseek v4 is master at roleplay

#

check

wicked talon Apr 24, 2026, 3:43 AM

#

inner relic i think deepseek v4 is master at roleplay

Roleplay 😛

fickle venture Apr 24, 2026, 3:43 AM

#

@echo aurora what's the context limit is it 1M?

stray aspen Apr 24, 2026, 3:43 AM

#

pallid crypt they scored worse than kimi tho sadly 😢

Nah it isnt

minor bloom Apr 24, 2026, 3:43 AM

#

But it has a lot more knowledge than kimi

#

It doesn't look like its reasoning improved

wicked talon Apr 24, 2026, 3:44 AM

#

Yesss

minor bloom Apr 24, 2026, 3:44 AM

#

From 3.2

wicked talon Apr 24, 2026, 3:44 AM

#

They updated their website

empty stump Apr 24, 2026, 3:44 AM

#

I wish it was multimodal but this is impressive

wicked talon Apr 24, 2026, 3:44 AM

#

empty stump I wish it was multimodal but this is impressive

It is ain't it?

stray aspen Apr 24, 2026, 3:45 AM

#

wicked talon It is ain't it?

No we got scammed

#

But deepseek is so cool

wicked talon Apr 24, 2026, 3:45 AM

#

stray aspen No we got scammed

what

#

what

rigid copper Apr 24, 2026, 3:45 AM

#

echo aurora That's the case if you'd like to continue uploading more files. But you should s...

It will more likely to hit if I do image generation in side-by-side mode

wicked talon Apr 24, 2026, 3:45 AM

#

am I in rage bait right now

stray aspen Apr 24, 2026, 3:45 AM

#

Deepseek is so cool

night moat Apr 24, 2026, 3:45 AM

#

Screenshot_2026-04-24-10-45-27-607_com.android.chrome.png

echo aurora Apr 24, 2026, 3:46 AM

#

fickle venture @echo aurora what's the context limit is it 1M?

We typically will do w/e the default API setting is.

heady kite Apr 24, 2026, 3:46 AM

#

How is Gemma 4 31B ahead of Kimi 2.5?? Big model size difference

stray aspen Apr 24, 2026, 3:46 AM

#

Kimi 2.5 sucks

echo aurora Apr 24, 2026, 3:46 AM

#

night moat

Guessing this is rate limit, but will check this Trace and keep you updated.

stray aspen Apr 24, 2026, 3:46 AM

#

And its old

wicked talon Apr 24, 2026, 3:47 AM

#

stray aspen And its old

Kimi 2.6 out now

inland quest Apr 24, 2026, 3:47 AM

#

Non thinking pro version over the thinking in leaderboard looks like mistake, especially if you look at it rating at all, cuz it looks lower than expected for new models.

heady kite Apr 24, 2026, 3:47 AM

#

Also why is gpt-oss-120b reasoning: high not on there

#

at least for code arena

wicked talon Apr 24, 2026, 3:48 AM

#

I'm gonna cry myself to sleep

#

Goodnight chat

echo aurora Apr 24, 2026, 3:48 AM

#

night moat

I take that back, this is caused by a bug that was flagged to the team earlier today.

echo aurora Apr 24, 2026, 3:49 AM

#

wicked talon I'm gonna cry myself to sleep

🫂

inland quest Apr 24, 2026, 3:49 AM

#

Deepseek v4 Pro on leaderboard performs nearly as non thinking GPT-5.4 LOL.
Wtf
And thinking version as gemini 3 flash?

whole sundial Apr 24, 2026, 3:50 AM

#

i'm already disappointed by v4 pro, it fails some of my world knowledge test questions
one of them both glm 5.1 and kimi k2.6 gets right, another one can be correctly answered by grok, claude, gpt, gemini, old and new glm and kimi models, and even hy3, a brand new model from tencent that is the same size as v4 flash but yet it gets questions right that v4 pro (5 times the size) can't

desert pendant Apr 24, 2026, 3:50 AM

#

wicked talon Yesss

gng what

inland quest Apr 24, 2026, 3:50 AM

#

inland quest Deepseek v4 Pro on leaderboard performs nearly as non thinking GPT-5.4 LOL. Wtf ...

Having probably x3-4 size of parameters

fickle venture Apr 24, 2026, 3:50 AM

#

It beats opus 4.6 Max even Opus 4.7 cuz it suck

#

https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro

deepseek-ai/DeepSeek-V4-Pro · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

vernal raft Apr 24, 2026, 3:51 AM

#

I was expecting much more NGL

stray aspen Apr 24, 2026, 3:51 AM

#

Deepseek v4 is so ass

#

And no vision

desert pendant Apr 24, 2026, 3:51 AM

#

BRO

#

YALL ARE NOT LYING

#

#

😭 I JUST WOKE UP RN

stray aspen Apr 24, 2026, 3:52 AM

#

It sucks anyway

balmy mist Apr 24, 2026, 3:52 AM

#

is deepseek out?

stray aspen Apr 24, 2026, 3:52 AM

#

Even qwen is better

whole sundial Apr 24, 2026, 3:52 AM

#

stray aspen Deepseek v4 is so ass

i agree, i would rather use hy3 preview than either v4 flash or pro

inner relic Apr 24, 2026, 3:52 AM

#

stray aspen Even qwen is better

ok but I think deepseek v4 peaks at roleplay

#

creative writing

feral kernel Apr 24, 2026, 3:53 AM

#

what up with this?

desert pendant Apr 24, 2026, 3:53 AM

#

gng

#

i know what i will use it

desert pendant Apr 24, 2026, 3:54 AM

#

feral kernel what up with this?

arena gang when they see a new model for 0.1 sec:

feral kernel Apr 24, 2026, 3:54 AM

#

damn

whole sundial Apr 24, 2026, 3:54 AM

#

tencent hunyuan completely re-did everything with a guy from openai and their hy3 model is pretty good for its size (around the same size as v4-flash), same company that made the awful hunyuan dense models, the 4b has the world knowledge of smollm2-360m with awful reasoning and tool-calling, it came out in the middle of last year
if they made a 1t+ model it would smoke v4-pro and every other open model

balmy mist Apr 24, 2026, 3:55 AM

#

ughhh dont tell me its another bust

whole sundial Apr 24, 2026, 3:55 AM

#

whole sundial tencent hunyuan completely re-did everything with a guy from openai and their hy...

i really hope arena adds that model, it would probably be top 10 open

desert pendant Apr 24, 2026, 3:55 AM

#

gng i feel so happy

#

time to test deep seek

dusk dragon Apr 24, 2026, 3:56 AM

#

So uh what model are we looking at right now that's the best for world knowledge. Thinking Gemini 3.1 still?

stray aspen Apr 24, 2026, 3:57 AM

#

what in the disappointment is deepseek doing

dusk dragon Apr 24, 2026, 3:57 AM

#

Yeah, why did deepseek drop v4

#

Isn't it kind of bad timing

whole sundial Apr 24, 2026, 3:58 AM

#

dusk dragon So uh what model are we looking at right now that's the best for world knowledge...

yes

inner relic Apr 24, 2026, 3:58 AM

#

they do lot of experiment

#

deepseek v4

gleaming wraith Apr 24, 2026, 3:58 AM

#

What was deepseek v4 called before it was revealed what it is?

dusk dragon Apr 24, 2026, 3:58 AM

#

Deepseek v4 though is not like v4 performance

inner relic Apr 24, 2026, 3:58 AM

#

so eh they're focused on writing and code

dusk dragon Apr 24, 2026, 3:59 AM

#

It's like 3.5

#

Also anyone know the best place to get really good usage with Gemini 3.1 pro anymore

desert pendant Apr 24, 2026, 3:59 AM

#

inner relic so eh they're focused on writing and code

OH YEAH

velvet furnace Apr 24, 2026, 3:59 AM

#

gpt5.5 is good?

desert pendant Apr 24, 2026, 3:59 AM

#

happ

feral kernel Apr 24, 2026, 3:59 AM

#

yep

soft river Apr 24, 2026, 4:00 AM

#

Is DeepSeek on the app?

stray aspen Apr 24, 2026, 4:00 AM

#

guys deepseek is NOT back at it

#

woke up and turned on my pc at 10 pm just to be disappointed

#

great

inner relic Apr 24, 2026, 4:01 AM

#

stray aspen guys deepseek is NOT back at it

bro

#

are you only focused on code?

desert pendant Apr 24, 2026, 4:01 AM

#

stray aspen woke up and turned on my pc at 10 pm just to be disappointed

gngImtired

dusk dragon Apr 24, 2026, 4:01 AM

#

Gemini just needs to drop Gemini 3.5 and just destroy every model like the goat it is

stray aspen Apr 24, 2026, 4:01 AM

#

yeah and math

desert pendant Apr 24, 2026, 4:01 AM

#

if i get a good code bro

inner relic Apr 24, 2026, 4:02 AM

#

stray aspen yeah and math

ask deepseek to solve a impossible math

#

chinese are good at mathh

stray aspen Apr 24, 2026, 4:02 AM

#

it just butchered my code bruh

desert pendant Apr 24, 2026, 4:03 AM

#

stray aspen it just butchered my code bruh

someone who actually use roblox to also test deep seek or another AI

#

great

#

; D

#

im risking a good code for deep seek v4 code guys

balmy mist Apr 24, 2026, 4:03 AM

#

how is deepseek?

#

was it worth the wait?

barren sable Apr 24, 2026, 4:04 AM

#

dusk dragon Gemini just needs to drop Gemini 3.5 and just destroy every model like the goat ...

there were some rumors/drama that the deepmind guys use claude at work rather than any internal models/checkpoints. Doesn't seem great for google.

stray aspen Apr 24, 2026, 4:04 AM

#

no

#

it sucks

balmy mist Apr 24, 2026, 4:04 AM

#

lol

inner relic Apr 24, 2026, 4:04 AM

#

balmy mist was it worth the wait?

i think uh

#

it's decent

#

writing is good

stray aspen Apr 24, 2026, 4:04 AM

#

i wanna see where artificial analysis will place it on their benchmark

desert pendant Apr 24, 2026, 4:04 AM

#

2 errors

#

ehh

inner relic Apr 24, 2026, 4:04 AM

#

and this guy think deepseek v4 sucks bc It can't do a lua code perfect

#

but yeh i agree with him

#

it sucks at code

stray aspen Apr 24, 2026, 4:05 AM

#

inner relic and this guy think deepseek v4 sucks bc It can't do a lua code perfect

nah its just bad

inner relic Apr 24, 2026, 4:05 AM

#

sometime

#

Mimo v2.5 did better at one shot prompt

desert pendant Apr 24, 2026, 4:05 AM

#

inner relic and this guy think deepseek v4 sucks bc It can't do a lua code perfect

if Deep seek suck at the code now

stray aspen Apr 24, 2026, 4:05 AM

#

mimo is actually decent

#

but nothing crazy

balmy mist Apr 24, 2026, 4:05 AM

#

im going back to sleep smh, so its not the best open source?

stray aspen Apr 24, 2026, 4:05 AM

#

balmy mist im going back to sleep smh, so its not the best open source?

no

heady kite Apr 24, 2026, 4:05 AM

#

Did you guys mention that Deepseek is okay at writing or no?

inner relic Apr 24, 2026, 4:05 AM

#

balmy mist im going back to sleep smh, so its not the best open source?

idk you go check for yourself

stray aspen Apr 24, 2026, 4:05 AM

#

i just woke up in the middle of night to a trash release

desert pendant Apr 24, 2026, 4:06 AM

#

should i use mimo if deep seek fails guys?

stray aspen Apr 24, 2026, 4:06 AM

#

desert pendant should i use mimo if deep seek fails guys?

no use claude 4.7

desert pendant Apr 24, 2026, 4:06 AM

#

stray aspen no use claude 4.7

happ

inner relic Apr 24, 2026, 4:06 AM

#

wth bro claude 4.7 is so expensive

#

yeh

#

use mimo v2.5

#

i think cheape

stray aspen Apr 24, 2026, 4:06 AM

#

gemini 3.1 pro then

desert pendant Apr 24, 2026, 4:06 AM

#

HOLY MOLY BRO

inner relic Apr 24, 2026, 4:06 AM

#

rok

balmy mist Apr 24, 2026, 4:06 AM

#

bruhh

desert pendant Apr 24, 2026, 4:06 AM

#

I AIN'T RICH

inner relic Apr 24, 2026, 4:06 AM

#

balmy mist bruhh

dont be disappointed

#

i think it's good at creative

#

and writing

balmy mist Apr 24, 2026, 4:07 AM

#

thats not impressive tho

desert pendant Apr 24, 2026, 4:07 AM

#

balmy mist bruhh

if deep seek fail me bro

inner relic Apr 24, 2026, 4:07 AM

#

I already told you guys, deepseek is focused on

stray aspen Apr 24, 2026, 4:07 AM

#

theres no way it was gapped by glm 5.1

inner relic Apr 24, 2026, 4:07 AM

#

writing and code

stray aspen Apr 24, 2026, 4:07 AM

#

thats like all the way down in artificial analysis

balmy mist Apr 24, 2026, 4:07 AM

#

we have models that are good at that for free

inner relic Apr 24, 2026, 4:07 AM

#

everything is code sop now?

desert pendant Apr 24, 2026, 4:07 AM

#

HE DID IT

stray aspen Apr 24, 2026, 4:07 AM

#

at least we got 1 million context

desert pendant Apr 24, 2026, 4:07 AM

#

IN THE THIRD ATTEMPT

balmy mist Apr 24, 2026, 4:07 AM

#

i just dont see the point of this launch

desert pendant Apr 24, 2026, 4:07 AM

#

LETS GOOO

balmy mist Apr 24, 2026, 4:07 AM

#

desert pendant HE DID IT

??

desert pendant Apr 24, 2026, 4:07 AM

#

deep seek did it

#

in the third attempt

inner relic Apr 24, 2026, 4:08 AM

#

does this mean, deepseek adapt to error

#

each attempt

desert pendant Apr 24, 2026, 4:08 AM

#

inner relic does this mean, deepseek adapt to error

he always

#

did that

#

now i will use roblox (cuz literally im using godot right now)

stray aspen Apr 24, 2026, 4:09 AM

#

deepseek front end is so bad

inner relic Apr 24, 2026, 4:09 AM

#

stray aspen deepseek front end is so bad

try in three attempt

balmy mist Apr 24, 2026, 4:09 AM

#

i guess its just the 1 mill context that we care about?

earnest rover Apr 24, 2026, 4:09 AM

#

deepseek v4 is one of the best **||overhyped ||**model

stray aspen Apr 24, 2026, 4:10 AM

#

earnest rover deepseek v4 is one of the best **||overhyped ||**model

cap

#

its 5.5

inner relic Apr 24, 2026, 4:10 AM

#

wth it's Deepseek v4

balmy mist Apr 24, 2026, 4:10 AM

#

earnest rover deepseek v4 is one of the best **||overhyped ||**model

fr they waited so long for this lol

stray aspen Apr 24, 2026, 4:10 AM

#

im gonna try mimo

#

2.5 pro

#

seems like its insanely decent

earnest rover Apr 24, 2026, 4:10 AM

#

stray aspen Apr 24, 2026, 4:11 AM

#

earnest rover

the spud

#

they tricked us into thinking it was the spud

#

they love marketing campaigns bruh

balmy mist Apr 24, 2026, 4:11 AM

#

this is interesting: https://x.com/ValsAI/status/2047513613750202452

Vals AI (@ValsAI)

DeepSeek v4 is now the #1 open-weight model on our Vibe Code Benchmark, and it’s not close.

It leaves the #2 (Kimi K2.6) in the dust, and even beats out frontier closed source models like Gemini 3.1 Pro.

#

thats a bigg gap tbh

desert pendant Apr 24, 2026, 4:12 AM

#

to be honest

stray aspen Apr 24, 2026, 4:12 AM

#

balmy mist this is interesting: https://x.com/ValsAI/status/2047513613750202452

slopmark

desert pendant Apr 24, 2026, 4:12 AM

#

for me deep seek is doing great (yet)

vernal raft Apr 24, 2026, 4:12 AM

#

earnest rover

Mythos

balmy mist Apr 24, 2026, 4:12 AM

#

stray aspen slopmark

you dont trust that team?

stray aspen Apr 24, 2026, 4:13 AM

#

not bad for a first shot

#

needs some follow up prompts

#

its better than gemini

#

and i like that little detail of adding the server location

heady kite Apr 24, 2026, 4:14 AM

#

How many tokens did it output for that?

desert pendant Apr 24, 2026, 4:15 AM

#

trollface

#

to be honest im kinda disappointed with some deep 4.4/4v stuff

stray aspen Apr 24, 2026, 4:16 AM

#

is this thing fr

#

im never using it again

#

it gave me html

desert pendant Apr 24, 2026, 4:16 AM

#

stray aspen im never using it again

be specific dude

inner relic Apr 24, 2026, 4:19 AM

#

stray aspen is this thing fr

can claude do this

#

#

claude sonnet 4.6 dumb as hea

stray aspen Apr 24, 2026, 4:19 AM

#

guys did mimo cook

desert pendant Apr 24, 2026, 4:19 AM

#

ok this is great

#

it just need some adjustments

stray aspen Apr 24, 2026, 4:20 AM

#

mimo is better than deepseek lmao

balmy mist Apr 24, 2026, 4:20 AM

#

stray aspen mimo is better than deepseek lmao

dont you dare say that

#

deepseek is my friend

stray aspen Apr 24, 2026, 4:21 AM

#

its better for frontend

balmy mist Apr 24, 2026, 4:21 AM

#

what is deepseeek better for?

stray aspen Apr 24, 2026, 4:21 AM

#

for feeling disappointed about interrupting your sleep for a slop release

balmy mist Apr 24, 2026, 4:22 AM

#

stray aspen for feeling disappointed about interrupting your sleep for a slop release

lmaoo that was a good one, imm go to sleep on that note

inner relic Apr 24, 2026, 4:22 AM

#

stray aspen its better for frontend

ask mimo to mae

#

make

#

advanced npc ai shooter

#

#

and i said

#

can claude solve this

#

br

#

bro

desert pendant Apr 24, 2026, 4:25 AM

#

mimo easily goes past 300 lines of code

#

dawg

#

happ

thick pawn Apr 24, 2026, 4:27 AM

#

Anyone else having trouble with gpt image 2 not completing jobs at the moment?

desert pendant Apr 24, 2026, 4:28 AM

#

ok mimo is kinda great

wicked talon Apr 24, 2026, 4:29 AM

#

Reddit should definitely make an ai app

night moat Apr 24, 2026, 4:32 AM

#

echo aurora Guessing this is rate limit, but will check this Trace and keep you updated.

It's been more than six hours

thick pawn Apr 24, 2026, 4:32 AM

#

Yup, image 2 is definitely playing up at the moment

sly cedar Apr 24, 2026, 4:34 AM

#

inner relic

Deepseek seems on logic god mode

#

I used this model, and its cooking

#

Far better than the previous model i used

wicked talon Apr 24, 2026, 4:36 AM

#

Deepseek is taking over?

vale quest Apr 24, 2026, 4:36 AM

#

God damn deepseek

#

Deepseek basically pulled a meta

#

Wdym

inner relic Apr 24, 2026, 4:36 AM

#

lmarena users

#

can you stop

#

yapping

wicked talon Apr 24, 2026, 4:36 AM

#

Deepseek v4 is literally benchmaxxing

inner relic Apr 24, 2026, 4:36 AM

#

and focus on someone else

wicked talon Apr 24, 2026, 4:36 AM

#

inner relic lmarena users

First of all we ain't lmarena bro

#

Second of all leave

inner relic Apr 24, 2026, 4:37 AM

#

wicked talon Second of all leave

Rude

wicked talon Apr 24, 2026, 4:37 AM

#

inner relic Rude

You were rude

inner relic Apr 24, 2026, 4:37 AM

#

though buddy you're too focused on code

wicked talon Apr 24, 2026, 4:37 AM

#

inner relic though buddy you're too focused on code

Mate leave no one wants you here

#

We are here to talk about ai

inner relic Apr 24, 2026, 4:37 AM

#

uh ok

#

this is general

#

i dont want to argue with you eh

wicked talon Apr 24, 2026, 4:38 AM

#

inner relic this is general

Bruh what's this servers main purpose

inner relic Apr 24, 2026, 4:38 AM

#

just yall to acknowledge that deepseek is smart at some task

#

not just code

wicked talon Apr 24, 2026, 4:38 AM

#

inner relic not just code

It's good at code too lol

inner relic Apr 24, 2026, 4:38 AM

#

ok ok

inner relic Apr 24, 2026, 4:39 AM

#

wicked talon It's good at code too lol

yeah but mimo 2.5 looks better at code than deepseek

wicked talon Apr 24, 2026, 4:39 AM

#

inner relic yeah but mimo 2.5 looks better at code than deepseek

Lmao that's like saying Kimi is better than opus at code

inner relic Apr 24, 2026, 4:40 AM

#

wicked talon Lmao that's like saying Kimi is better than opus at code

yeah bro stop being ignorant

#

and you're still rude even though I didnt want to argue with you

wicked talon Apr 24, 2026, 4:40 AM

#

If you have a problem tell a mod

vale quest Apr 24, 2026, 4:49 AM

#

wicked talon Lmao that's like saying Kimi is better than opus at code

Well idk abt that

#

Deepseek is better at debugging and deep tasks

#

Mimo is good at structured small tasks

#

And being fast

loud herald Apr 24, 2026, 5:08 AM

#

wicked talon Lmao that's like saying Kimi is better than opus at code

It is

velvet furnace Apr 24, 2026, 5:09 AM

#

can we use gpt5.5 in battle?

surreal zephyr Apr 24, 2026, 5:09 AM

#

5.5 mogs all

sly cedar Apr 24, 2026, 5:09 AM

#

vale quest Deepseek is better at debugging and deep tasks

For me deepseek is better at following instruction context and debugging which is quite amazing

loud herald Apr 24, 2026, 5:09 AM

#

velvet furnace can we use gpt5.5 in battle?

If 5.5 was here we'd have a ping

vale quest Apr 24, 2026, 5:10 AM

#

sly cedar For me deepseek is better at following instruction context and debugging which i...

Never used deepseek i just see reviews

#

Glad it worked out for you

surreal zephyr Apr 24, 2026, 5:10 AM

#

At code security?
5.5>5.4>5.3>5.2>opus 4.5> opus 4.6 >>> opus 4.7 >>>>> gemini 3.1

surreal zephyr Apr 24, 2026, 5:11 AM

#

surreal zephyr At code security? 5.5>5.4>5.3>5.2>opus 4.5> opus 4.6 >>> opus 4.7 >>>>> gemini 3...

(Actual tests, prepared& reviewed by all 3 models together, anonymously)

sly cedar Apr 24, 2026, 5:12 AM

#

vale quest Never used deepseek i just see reviews

I reviewed it, its like gemini 3.1 pro but on steroids on context & debugging or even creating elements, i'm glad i can use gemini 3.1 pro but on steroids

#

I used it for roblox project

vale quest Apr 24, 2026, 5:13 AM

#

sly cedar I used it for roblox project

Nicee

sly cedar Apr 24, 2026, 5:15 AM

#

Honestly i used to make roblox game with old deepseek model, but one of the generated scripts actually creates bold ui design, and i like it, but back then deepseek was infant

#

it couldn't even handle code well back then imo

#

Deepseek had big glow up rn

sullen creek Apr 24, 2026, 5:24 AM

#

yo do u guys think they are adding gpt 5.5 to direct

sly cedar Apr 24, 2026, 5:28 AM

#

sullen creek yo do u guys think they are adding gpt 5.5 to direct

If its cheap = yes, elseif its expensive = yes > later remove

river moat Apr 24, 2026, 5:29 AM

#

How now how to use Gemini 3.1 pro for free

sly cedar Apr 24, 2026, 5:29 AM

#

river moat How now how to use Gemini 3.1 pro for free

Deepseek v4 for now

#

Imo

#

or Kimi 2.6 atleast

vale shell Apr 24, 2026, 5:34 AM

#

hello. Will Gemini 3.1 pro be available on arena ai?

sullen creek Apr 24, 2026, 5:35 AM

#

sly cedar or Kimi 2.6 atleast

glm 5.1 is closest to opus rn

velvet furnace Apr 24, 2026, 5:38 AM

#

sly cedar Apr 24, 2026, 5:46 AM

#

velvet furnace

https://tenor.com/view/dj-gif-24173332


surreal zephyr Apr 24, 2026, 5:49 AM

#

Hows deepshit v4

sly cedar Apr 24, 2026, 5:51 AM

#

surreal zephyr Hows deepshit v4

Deepshit v4 cooked my code pretty well

wicked talon Apr 24, 2026, 5:51 AM

#

river moat How now how to use Gemini 3.1 pro for free

Ai studio

wicked talon Apr 24, 2026, 5:51 AM

#

surreal zephyr Hows deepshit v4

I've cried over it 🙂

sly cedar Apr 24, 2026, 5:51 AM

#

Deepshit is now deepreal

wicked talon Apr 24, 2026, 5:56 AM

#

sly cedar Deepshit is now deepreal

Yesss

grave peak Apr 24, 2026, 5:56 AM

#

How its deep?

sly cedar Apr 24, 2026, 5:58 AM

#

grave peak How its deep?

Because it can think deep

#

https://tenor.com/view/thinking-of-you-christopher-walken-considering-it-deep-thinking-deep-thoughts-gif-4077808712528826392


toxic whale Apr 24, 2026, 6:10 AM

#

Deepseek v4 pro performs about the same as Sonnet 4.6 Thinking, or Opus 4.6 on like Low in my testing

karmic temple Apr 24, 2026, 6:14 AM

#

river moat Apr 24, 2026, 6:14 AM

#

How to use DeepSeek v4?

karmic temple Apr 24, 2026, 6:15 AM

#

Hi 👋

river moat Apr 24, 2026, 6:16 AM

#

Hi

karmic temple Apr 24, 2026, 6:16 AM

#

Can you make a video of the picture I uploaded above?

river moat Apr 24, 2026, 6:16 AM

#

karmic temple

This?

shrewd citrus Apr 24, 2026, 6:17 AM

#

woah v4 is here

karmic temple Apr 24, 2026, 6:17 AM

#

river moat This?

Yes

river moat Apr 24, 2026, 6:18 AM

#

Okay

#

What do you need?

#

For a video

karmic temple Apr 24, 2026, 6:19 AM

#

Yes, I really need the view of our village. This is the school.

river moat Apr 24, 2026, 6:20 AM

#

Describe what is needed?

karmic temple Apr 24, 2026, 6:21 AM

#

The video will be drone style slow motion.

river moat Apr 24, 2026, 6:21 AM

#

Ok bro 1 second

karmic temple Apr 24, 2026, 6:22 AM

#

You don't understand Bengali.

river moat Apr 24, 2026, 6:23 AM

#

Sorry no

karmic temple Apr 24, 2026, 6:24 AM

#

When will my video be made?

#

No, you won't be able to make it

whole sundial Apr 24, 2026, 6:26 AM

#

karmic temple When will my video be made?

Video Arena is only available on https://arena.ai/video, if that is what you are here for.

river moat Apr 24, 2026, 6:26 AM

#

karmic temple No, you won't be able to make it

It’s generating

#

Sorry

#

I can’t

toxic prawn Apr 24, 2026, 6:37 AM

#

/ image-to-video I want this movie to run and play

brisk turret Apr 24, 2026, 7:07 AM

#

Deepseek launch looks like a dud but when you factor in cost, it's a killer

wicked talon Apr 24, 2026, 7:15 AM

#

shrewd citrus woah v4 is here

It's bad

wary nacelle Apr 24, 2026, 7:29 AM

#

toxic prawn Apr 24, 2026, 7:55 AM

#

/cinematic slow zoom, 4 friends watching movie in dark theatre, screen light flickering on faces, dramatic mood, realistic camera movement generate this video

half veldt Apr 24, 2026, 7:55 AM

#

toxic prawn /cinematic slow zoom, 4 friends watching movie in dark theatre, screen light fli...

is this a bot?

#

goofy ah bot

#

INSANECAT

spring oar Apr 24, 2026, 8:33 AM

#

GPT 5.5 better than opus 4.7 ?

knotty fable Apr 24, 2026, 8:41 AM

#

And no new version number on Seedream, but they've done something, more responsive prompting and better results.
My bet that it's a hidden update to counter GPT2.

#

Which one is better? Matter of taste - but it really need a goal photo to tell.

#

Tudi & Seedream left, and GPT2 right.

#

Funny thing is that while Seedream have added fake noise to make images more "photographic", it's seen on the GPT2 image.
While the same noise now is much smaller on Seedream at left - and I've done a dozen in the last hour to get her pose and dress right so it is consistent.

jaunty dawn Apr 24, 2026, 8:54 AM

#

Can a model's name be changed? I noticed in a chat that a model previously called deepseek-v3.2 was renamed to deepseek-v4-pro

civic plaza Apr 24, 2026, 8:57 AM

#

Is the website down? It’s using forever to load

wicked talon Apr 24, 2026, 9:01 AM

#

jaunty dawn Can a model's name be changed? I noticed in a chat that a model previously calle...

V4 is the new model

#

They took 3.2 away

compact flame Apr 24, 2026, 9:12 AM

#

Hey chat

#

How good is gpt 5.5 after testing

#

For me it seems great

surreal zephyr Apr 24, 2026, 9:14 AM

#

wicked talon V4 is the new model

haha i wish

#

atleast those were removed and replaced with

sterile tartan Apr 24, 2026, 9:14 AM

#

surreal zephyr haha i wish

What's this

surreal zephyr Apr 24, 2026, 9:15 AM

#

sterile tartan What's this

deepseek v4 is actually deepseek v3.2-experimental

#

🤣

sterile tartan Apr 24, 2026, 9:15 AM

#

💀 💀 💀

#

U Sure

surreal zephyr Apr 24, 2026, 9:15 AM

#

yeah i am

#

v4 flash is v3.2 exp

#

literally says

sterile tartan Apr 24, 2026, 9:15 AM

#

It says that for API Replacement doesn't it?

compact flame Apr 24, 2026, 9:16 AM

#

surreal zephyr deepseek v4 is actually deepseek v3.2-experimental

Hey pro

surreal zephyr Apr 24, 2026, 9:16 AM

#

sterile tartan It says that for API Replacement doesn't it?

it says renamed 3.2-exp to 4-flash

#

but 4-pro is new

compact flame Apr 24, 2026, 9:16 AM

#

How good is gpt 5.5?

surreal zephyr Apr 24, 2026, 9:16 AM

#

compact flame How good is gpt 5.5?

best

#

🔥

sterile tartan Apr 24, 2026, 9:16 AM

#

surreal zephyr but 4-pro is new

Well atleast pro is new

sterile tartan Apr 24, 2026, 9:16 AM

#

compact flame How good is gpt 5.5?

Really Good

compact flame Apr 24, 2026, 9:16 AM

#

surreal zephyr best

Seriously?

surreal zephyr Apr 24, 2026, 9:17 AM

#

compact flame Seriously?

yes basically they were testing 4 flash under name of 3.2 exp

#

or

#

pr move

#

not me to know

#

¯_(ツ)_/¯

compact flame Apr 24, 2026, 9:17 AM

#

I guess chatgpt finally beaten Claude after all these months

surreal zephyr Apr 24, 2026, 9:18 AM

#

compact flame I guess chatgpt finally beaten Claude after all these months

it was always better at thinking just needed more specific prompts, and it wasnt best at ui

orchid olive Apr 24, 2026, 9:19 AM

#

when can I see the V4 or 5.5

compact flame Apr 24, 2026, 9:19 AM

#

orchid olive when can I see the V4 or 5.5

Well when a crab whistles on the mountain

#

But deepseek v4 is there tho

#

sterile tartan Apr 24, 2026, 9:20 AM

#

@surreal zephyr exactly which models are available on Deepseek Web/App?

sterile tartan Apr 24, 2026, 9:21 AM

#

compact flame But deepseek v4 is there tho

Because it's cheaper

compact flame Apr 24, 2026, 9:21 AM

#

sterile tartan Because it's cheaper

Why do you correct me

#

I know it's cheap

sterile tartan Apr 24, 2026, 9:23 AM

#

compact flame I know it's cheap

Well u didn't say that before

brisk turret Apr 24, 2026, 9:23 AM

#

"Price is blended using a 3:1 output-to-input ratio: (3 × output price + 1 × input price) ÷ 4. This reflects typical usage where output tokens cost more and are generated in higher volume."

Petition to add a slider to the pareto graph for input:output ratio
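
For reference, the quoted blending rule is just a weighted average, so a slider would only need to vary the weights. A minimal sketch in Python, assuming made-up prices (the function name `blended_price` and the example numbers are illustrative and not taken from the leaderboard):

```python
def blended_price(input_price: float, output_price: float,
                  output_weight: float = 3.0, input_weight: float = 1.0) -> float:
    """Weighted average of input and output prices.

    The default weights reproduce the quoted 3:1 output-to-input blend:
    (3 * output price + 1 * input price) / 4.
    """
    total_weight = output_weight + input_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return (output_weight * output_price + input_weight * input_price) / total_weight


# Hypothetical prices per million tokens, chosen only for illustration.
example_input, example_output = 2.0, 10.0

default_blend = blended_price(example_input, example_output)         # 3:1 blend
even_blend = blended_price(example_input, example_output, 1.0, 1.0)  # 1:1 blend
print(default_blend, even_blend)
```

Shifting the weights toward input lowers the blended figure whenever output tokens are the pricier side, which is why the chosen ratio can change how models rank on a price axis.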

sterile tartan Apr 24, 2026, 9:23 AM

#

Just in case you didn't know

compact flame Apr 24, 2026, 9:23 AM

#

sterile tartan Just in case you didn't know

Kk

sterile tartan Apr 24, 2026, 9:23 AM

#

K

surreal zephyr Apr 24, 2026, 9:31 AM

#

sterile tartan @surreal zephyr exactly which models are available on Deepseek Web/App?

no idea i dont use deepshit

#

i need multimodality

sterile tartan Apr 24, 2026, 9:32 AM

#

Ufff

surreal zephyr Apr 24, 2026, 9:33 AM

#

models without proper multimodal reasoning are unreliable imo

#

total cost efficiency 5.5 vs 5.4

#

5.5 up to 7x more token efficient is wild

#

so up to 3.5x cheaper

sterile tartan Apr 24, 2026, 9:35 AM

#

No

#

Because it's more expensive

surreal zephyr Apr 24, 2026, 9:36 AM

#

sterile tartan Because it's more expensive

2x higher cost per token yes
but uses 7x less tokens

sterile tartan Apr 24, 2026, 9:36 AM

#

Is doubled

surreal zephyr Apr 24, 2026, 9:36 AM

#

so 3.5x cheaper total
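
The arithmetic in this exchange can be written out as a quick check. A minimal sketch in Python, taking the chat's figures at face value (2x the per-token price, up to 7x fewer tokens); `savings_factor` is an illustrative name, and the inputs are the claims made here, not measured values:

```python
def savings_factor(price_multiplier: float, token_reduction: float) -> float:
    """How many times cheaper a task becomes overall.

    price_multiplier: new per-token price divided by the old one (2.0 = twice as expensive).
    token_reduction:  old token count divided by the new one (7.0 = seven times fewer tokens).
    A result above 1 means the newer model is cheaper per task; below 1 means pricier.
    """
    if price_multiplier <= 0 or token_reduction <= 0:
        raise ValueError("both factors must be positive")
    return token_reduction / price_multiplier


# Figures as claimed in the chat: 2x price per token, up to 7x fewer tokens.
claimed = savings_factor(price_multiplier=2.0, token_reduction=7.0)
print(claimed)
```

Because the 7x figure is an "up to" number, 3.5x is a best case; at the same 2x price, any task where the token reduction falls below 2x would end up costing more.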

sterile tartan Apr 24, 2026, 9:36 AM

#

Wait

surreal zephyr Apr 24, 2026, 9:36 AM

#

(just dont spam xhigh when medium and high do fine then you can save 3x the quota)

sterile tartan Apr 24, 2026, 9:36 AM

#

You are Absolutely Right

#

Very Sigma Calculation Bro

brisk turret Apr 24, 2026, 9:55 AM

#

where 5.5

vernal raft Apr 24, 2026, 9:56 AM

#

In battle mode

river moat Apr 24, 2026, 9:58 AM

#

What ai is the best for a school

#

What is this

light sleet Apr 24, 2026, 10:01 AM

#

sterile tartan Very Sigma Calculation Bro

pineapple will get u 😈

light sleet Apr 24, 2026, 10:02 AM

#

surreal zephyr (just dont spam xhigh when medium and high do fine then you can save 3x the quot...

bro is roblox exploiter 👍🏼 👍🏼

surreal zephyr Apr 24, 2026, 10:03 AM

#

light sleet bro is roblox exploiter 👍🏼 👍🏼

uhhh

light sleet Apr 24, 2026, 10:03 AM

#

or wait

light sleet Apr 24, 2026, 10:03 AM

#

surreal zephyr uhhh

u in punkteam because of vpn 😡

#

same bro same

sterile tartan Apr 24, 2026, 10:06 AM

#

light sleet pineapple will get u 😈

https://tenor.com/view/shut-up-dont-talk-close-mouth-grab-lips-mad-family-gif-21732788


surreal zephyr Apr 24, 2026, 10:12 AM

#

light sleet u in punkteam because of vpn 😡

uhh yeah totally

#

🤔

river moat Apr 24, 2026, 10:17 AM

#

Who know what ai is the best for school

shrewd citrus Apr 24, 2026, 10:19 AM

#

river moat Who know what ai is the best for school

gpt 5.2 search

river moat Apr 24, 2026, 10:19 AM

#

Thank

river moat Apr 24, 2026, 10:20 AM

#

shrewd citrus gpt 5.2 search

And coding

#

I forgot say

vital mantle Apr 24, 2026, 10:25 AM

#

restive charm Apr 24, 2026, 10:25 AM

#

This problem can be solved
This session has reached its token usage limit. Please start a new chat to continue.
Trace ID: 76f18173-373d

tidal sierra Apr 24, 2026, 10:29 AM

#

vital mantle

from what point in tim

#

can an ai

flint sandal Apr 24, 2026, 10:29 AM

#

lets go

tidal sierra Apr 24, 2026, 10:29 AM

#

just cancel a chat

tranquil burrow Apr 24, 2026, 10:29 AM

#

restive charm This problem can be solved This session has reached its token usage limit. Pleas...

your

robust marsh Apr 24, 2026, 10:30 AM

#

pls fix ur fckin captcha😭

flint sandal Apr 24, 2026, 10:31 AM

#

flint sandal lets go

lets see if the $200 pro subscription was worth it, and yeah im releasing this game on steam to get my $200 + tax back😭

#

but i heard 5.5 pro is really good at game-making

#

i will use like meshy ai and add real 3d models to the game

#

and see what will happen then

#

half way there

astral cobalt Apr 24, 2026, 10:34 AM

#

arena ai actually crash

flint sandal Apr 24, 2026, 10:36 AM

#

extended pro😭

restive charm Apr 24, 2026, 10:39 AM

#

Look at this problem

Screenshot_------_com.android.chrome.jpg

flint sandal Apr 24, 2026, 10:41 AM

#

#

the results are interesting but does someone have a great pc to run it? because on my m2 mac it runs at 5fps😭 please

waxen seal Apr 24, 2026, 10:43 AM

#

https://youtu.be/K6eSZdl6w7Q?si=aVtXBFnq8f8Lm45O

YouTube

FUTRIXX

130 Years of American Fighter Aircraft Evolution (1918–2050)

Help me reach my first 1000 subs, thanks legends

👉 If you enjoyed this breakdown, hit that Hype button to show some love and help push this video further.

This video takes viewers on a fast-paced journey through the complete evolution of American fighter aircraft—from fragile World War I biplanes to cutting-edge stealth jets and future 7t...


light sleet Apr 24, 2026, 10:56 AM

#

flint sandal the results are interesting but does someone have a great pc to run it? because ...

Send

surreal zephyr Apr 24, 2026, 10:59 AM

#

vital mantle

thats why i use gpt

#

gpt has no "end conversation"

#

so it actually listens to you instead of ending himself when hes lazy

flint sandal Apr 24, 2026, 11:00 AM

#

light sleet Send

check dm

#

or

#

📎 neon_veil_fps.html

#

anyone?

rose tendon Apr 24, 2026, 11:09 AM

#

gemini 3.1 flash lite preview .. is it a temporary issue or ?

light sleet Apr 24, 2026, 11:11 AM

#

flint sandal anyone?

nvm I'm lagging hard too

bronze abyss Apr 24, 2026, 11:11 AM

#

guys wasn't claude opus 4.6 available in the LLM chat what happened to it?

flint sandal Apr 24, 2026, 11:12 AM

#

bronze abyss guys wasn't claude opus 4.6 available in the LLM chat what happened to it?

its too expensive to give free access to opus to everyone with generous limits

#

its still in battle tho

bronze abyss Apr 24, 2026, 11:12 AM

#

flint sandal its still in battle tho

ooooh
thanks

#

also have anyone tried chinese models like kimi?
if so what's your review about it

flint sandal Apr 24, 2026, 11:14 AM

#

bronze abyss also have anyone tried chinese models like kimi? if so what's your review about...

new deepseek is trash, kimi-k2.6 is the smartest one in my opinion in how it reasons in every domain, and glm-5.1 is okay but i dont like glm cause it has the same problem that gemini has like if you tell it to change one button it will additionaly change the full page, even if i ask it to not do that.

bronze abyss Apr 24, 2026, 11:15 AM

#

flint sandal new deepseek is trash, kimi-k2.6 is the smartest one in my opinion in how it rea...

thx

flint sandal Apr 24, 2026, 11:16 AM

#

bronze abyss thx

but i recommend to pay for claude or chatgpt, they are way better

#

i just bought cgpt pro for 200 bucks u know😭

#

flex

#

because paying for glm or for kimi that arent SoTA is i think a waste of money

bronze abyss Apr 24, 2026, 11:17 AM

#

flint sandal but i recommend to pay for claude or chatgpt, they are way better

i dont do complex tasks I'm not even into coding I'm a med student

flint sandal Apr 24, 2026, 11:17 AM

#

bronze abyss i dont do complex tasks I'm not even into coding I'm a med student

gpt-5.5/gemini 3.1 will be great for you

strong ferry Apr 24, 2026, 11:18 AM

#

Ngl, so far GPT 2 has been pretty impressive. I asked for this prompt:

Create an illustration showcasing details about the differences between Bigfoot and the Abominable Snowman. On Bigfoot's side, it describes it as being either male or female, brown fur and more man-like in its face, looking almost like a Neanderthal. It is aggressive only when provoked and can be found in the woods of America. On the Abominable Snowman's side, it is mostly a male species with white fur and a more ape-like face, bipedal with large feet like Bigfoot, and is less aggressive. It is a creature that prefers solitude and is known to save some of those who wander in the blizzard in the Himalayas. Some theories suggest it may be a Tulpa created by the Tibetians.

And it's shockingly good with the text. Even Gemini struggled when you asked for too much. This is consistent.

#

And here's a map I asked for my fictional island

bronze abyss Apr 24, 2026, 11:19 AM

#

flint sandal gpt-5.5/gemini 3.1 will be great for you

yes and no
i like to use claude analysis level for the deep content and also his special commands like oods and L99 actually make difference for me

flint sandal Apr 24, 2026, 11:20 AM

#

bronze abyss yes and no i like to use claude analysis level for the deep content and also hi...

yeahh i recommend you to buy claude max for 10 bucks monthly, look at the promo codes online and u will be happy with that

flint sandal Apr 24, 2026, 11:21 AM

#

strong ferry And here's a map I asked for my fictional island

YEAAHHH NOW PUT THAT TO 5.5 PRO AND MAKE AN AAA GAME OUT OF IT😋

#

wtf

surreal zephyr Apr 24, 2026, 11:21 AM

#

codex made factorio copy and now playing it

bronze abyss Apr 24, 2026, 11:21 AM

#

flint sandal yeahh i recommend you to buy claude max for 10 bucks monthly, look at the promo ...

transactions here are complicated in africa
also 10 bucks is waaaay more than it worth here

strong ferry Apr 24, 2026, 11:21 AM

#

flint sandal YEAAHHH NOW PUT THAT TO 5.5 PRO AND MAKE AN AAA GAME OUT OF IT😋

I wish lmao

flint sandal Apr 24, 2026, 11:22 AM

#

bronze abyss transactions here are complicated in africa also 10 bucks is waaaay more than i...

nahh 10 bucks for this much usage is really good, in api you would pay like 150 bucks for this much usage

flint sandal Apr 24, 2026, 11:22 AM

#

flint sandal nahh 10 bucks for this much usage is really good, in api you would pay like 150 ...

plus you have all the special features

#

can someone mute him please?

bronze abyss Apr 24, 2026, 11:22 AM

#

flint sandal nahh 10 bucks for this much usage is really good, in api you would pay like 150 ...

I'll look into that

surreal zephyr Apr 24, 2026, 11:22 AM

#

<@&1349916362595635286>

#

thanks

flint sandal Apr 24, 2026, 11:22 AM

#

wowww pretty fast moderation here

surreal zephyr Apr 24, 2026, 11:23 AM

#

flint sandal wowww pretty fast moderation here

🤖

#

faster than ai

#

🔥

grave peak Apr 24, 2026, 11:23 AM

#

Fair enough

surreal zephyr Apr 24, 2026, 11:23 AM

#

tbh ai moderation here would be peak

#

auto delete scams & video requests

rose tendon Apr 24, 2026, 11:25 AM

#

rose tendon gemini 3.1 flash lite preview .. is it a temporary issue or ?

any takes on this ? it's really hard for me to start a new chat at this point

flint sandal Apr 24, 2026, 11:28 AM

#

AND BTW WHERE IS SORA I BOUGHT PRO AND NO SORA HERE? ://

tidal sluice Apr 24, 2026, 11:30 AM

#

Been using Deepseek V4 for a while and it doesn’t improve. After a few exchanges, it loses memory. When I point it out, it doesn’t even remember forgetting, so things get messy. Eventually it only recalls the very first question, so it’ll hit me with “So what you meant is this!” — bringing up ancient history even though we’ve moved on.

stray aspen Apr 24, 2026, 11:40 AM

#

tidal sluice Been using Deepseek V4 for a while and it doesn’t improve. After a few exchanges...

It sucks

#

I guess ill have to switch to mimo 2.5 pro

knotty fable Apr 24, 2026, 11:41 AM

#

https://youtu.be/_193U2aNaeE
This is bloody nutz.

YouTube

Tom Bibby

OpenAI Just Went Full Supervillain

OpenAI are backing a bill that would shield them from liability if 100 people are killed by their AI. At least they're making it obvious that they are the villains.

Take 1 minute to contact your representatives about this - https://controlai.com/take-action

Join PauseAI - https://pauseai.info/
Sign the statement on superintelligence - https://...


surreal zephyr Apr 24, 2026, 11:47 AM

#

Deepseek v4 being worse than kimi2.6, gpt 5.4 is just funny

split topaz Apr 24, 2026, 11:47 AM

#

Hey..

surreal zephyr Apr 24, 2026, 11:48 AM

#

split topaz Hey..

Deepseek v4 is actually v3.2 exp

#

#

😂

light sleet Apr 24, 2026, 11:49 AM

#

💀

#

yet they said it's gonna beat 5.5

#

lol

grand raft Apr 24, 2026, 11:49 AM

#

what??????????????????????????????????????????????????????/

light sleet Apr 24, 2026, 11:49 AM

#

where are the deepsleepers?

grand raft Apr 24, 2026, 11:49 AM

#

https://tenor.com/view/spongebob-you-what-what-did-you-do-spongebob-meme-spongebob-squarepants-gif-19389913


split topaz Apr 24, 2026, 11:49 AM

#

surreal zephyr

API of V 3.2 got recently updated so they might have been shadow releasing for a while.

#

But yeah it's not that useful looking at the benchmarks. I can only hope that this being a preview would signal improvements later on

earnest rover Apr 24, 2026, 11:52 AM

#

so anyone knows whats the rl for gpt image 2 in chatgpt for free users (OFC)

storm dust Apr 24, 2026, 11:52 AM

#

yo guys

#

did you witness the kimi logo redesign?

#

it looks different now

compact flame Apr 24, 2026, 12:09 PM

#

Why do you ask

#

The API is not even out yet bro

tulip parcel Apr 24, 2026, 12:16 PM

#

How’s gpt 5.5?

flint sandal Apr 24, 2026, 12:17 PM

#

tulip parcel How’s gpt 5.5?

i am testing 5.5 pro and im quite pleased

#

but nothing revolutionary

#

just like a gpt-5.3/5.4 situation

proud bobcat Apr 24, 2026, 12:19 PM

#

The whale has awoken.

proud bobcat Apr 24, 2026, 12:19 PM

#

surreal zephyr Deepseek v4 being worse than kimi2.6, gpt 5.4 is just funny

Well the point is is that it’s a reliable workhorse

#

It’s not supposed to be super duper ultra intelligent

#

And for what it is it’s an extremely competitive model

flint sandal Apr 24, 2026, 12:21 PM

#

i would rather use qwen 27b than the new deepseek

#

tbh

proud bobcat Apr 24, 2026, 12:21 PM

#

How come

flint sandal Apr 24, 2026, 12:22 PM

#

deepseek seems to have the gemini issues

#

and glm issues

#

qwen 27b doesnt

proud bobcat Apr 24, 2026, 12:22 PM

#

That’s

#

That’s a very broad statement

#

What are these issues

flint sandal Apr 24, 2026, 12:22 PM

#

flint sandal new deepseek is trash, kimi-k2.6 is the smartest one in my opinion in how it rea...

.

dusky hedge Apr 24, 2026, 12:23 PM

#

Have you guys found a way to use claude opus for free?

proud bobcat Apr 24, 2026, 12:23 PM

#

Again DeepSeek is meant for good, fast intelligence

#

It’s not supposed to be SOTA

flint sandal Apr 24, 2026, 12:23 PM

#

proud bobcat Again DeepSeek is meant for good, fast intelligence

its not good or fast

#

faster than other open chinese models ye

proud bobcat Apr 24, 2026, 12:23 PM

#

I’d beg to differ?

It’s faster than Gemini 3 flash for me and Claude sonnet 4.6

flint sandal Apr 24, 2026, 12:24 PM

#

whats ur provider

proud bobcat Apr 24, 2026, 12:24 PM

#

I use the app and openrouter

#

It’s been quite nice

#

DeepSeek has NEVER let me down any time I’ve asked it something

#

The one time it did was because I didn’t describe something correctly

#

Which is insanely impressive for a lower tier model

flint sandal Apr 24, 2026, 12:25 PM

#

i mean flash is good as the fast cheap model

#

but pro is supposed to be good and SoTA like thats the point of pro

proud bobcat Apr 24, 2026, 12:26 PM

#

We will have to see its intelligence score

#

It’ll probably be equal to muse spark

flint sandal Apr 24, 2026, 12:26 PM

#

i would rather have a really slow model that is SoTA and is good

proud bobcat Apr 24, 2026, 12:26 PM

#

That’s your preference then

#

Nothing wrong with that at all

#

I personally fave Kimi K2.5 and K2.6

flint sandal Apr 24, 2026, 12:27 PM

#

but still with fast models that arent that good you spend more time fixing and iterating so slower models are actually faster to work with

#

from my experience

proud bobcat Apr 24, 2026, 12:29 PM

#

proud bobcat Apr 24, 2026, 12:29 PM

#

flint sandal but still with fast models that arent that good you spend more time fixing and i...

It’s more about how it thinks

#

For example I can tell you DeepSeek will always provide you decent code

#

It may not be opus quality

#

But it will work

wispy light Apr 24, 2026, 12:29 PM

#

why am i not able to login in lm arena website

proud bobcat Apr 24, 2026, 12:29 PM

#

Every time

surreal zephyr Apr 24, 2026, 12:38 PM

#

proud bobcat

Deepseek v3.2-exp*

#

Renaming a model is wild

#

Deepseek genuinely has the most overhyped open source models while having the worst ones

#

Qwen has best models by far from open

#

Qwen 3.6 27b solos deepseek v3.2exp aka v4

#

And if you need price to perf then gpt 5.5 is still best

indigo knoll Apr 24, 2026, 12:43 PM

#

Is Deepseek 4 all that? Or just overrated?

limber hound Apr 24, 2026, 12:44 PM

#

indigo knoll Is Deepseek 4 all that? Or just overrated?

we'll see actual results in few weeks just as it always happens

void shore Apr 24, 2026, 12:44 PM

#

indigo knoll Is Deepseek 4 all that? Or just overrated?

They benchmark maxed it

#

So it seems good on paper

#

But people are saying it isn’t the greatest when it comes to programming tasks

limber hound Apr 24, 2026, 12:44 PM

#

tbh not a total disappointment, wanna test it on a few hundred thousand tokens of context

pastel ember Apr 24, 2026, 12:44 PM

#

The only benchmark worth trusting is arc-agi, the rest is just benchmaxxing and pattern matching. If DeepSeek doesn’t at least hit gemini 3 flash level on arc-agi, it’s a flop. At that price, nobody’s gonna want it.

void shore Apr 24, 2026, 12:45 PM

#

I’ll test it

limber hound Apr 24, 2026, 12:45 PM

#

Engram sounded so promising

void shore Apr 24, 2026, 12:45 PM

#

And see what happens

indigo knoll Apr 24, 2026, 12:46 PM

#

Is Gemini 3 Flash still the best non thinking model rn?

limber hound Apr 24, 2026, 12:47 PM

#

pastel ember The only benchmark worth trusting is arc-agi, the rest is just benchmaxxing and ...

Due to constraints in high-end compute capacity, the current service capacity for Pro is very limited. After the 950 supernodes are launched at scale in the second half of this year, the price of Pro is expected to be reduced significantly

pastel ember Apr 24, 2026, 12:47 PM

#

indigo knoll Is Gemini 3 Flash still the best non thinking model rn?

Yep. Especially in terms of price and intelligence.

limber hound Apr 24, 2026, 12:47 PM

#

limber hound `Due to constraints in high-end compute capacity, the current service capacity f...

hopefully that's true

indigo knoll Apr 24, 2026, 12:47 PM

#

pastel ember Yep. Especially in terms of price and intelligence.

Second is GPT 5.3 or what?

pastel ember Apr 24, 2026, 12:48 PM

#

indigo knoll Second is GPT 5.3 or what?

I’d say GPT-5.2, but that’s already much more expensive than Gemini 3 Flash.

indigo knoll Apr 24, 2026, 12:48 PM

#

What, so 5.3 which is a newer version is worse than 5.2?

pastel ember Apr 24, 2026, 12:49 PM

#

Maybe GPT 5.4 Mini, but I haven’t tried it yet.

tranquil burrow Apr 24, 2026, 12:51 PM

#

Guys am i able to ask y'all a question?? When will you be able to use ChatGPT Image 2.0 ? I know its in the Leaderboard but we cannot use it yet (as of my knowledge)

stray aspen Apr 24, 2026, 12:51 PM

#

mimo 2.5 pro is so good

pastel ember Apr 24, 2026, 12:52 PM

#

indigo knoll What, so 5.3 which is a newer version is worse than 5.2?

Just open https://arcprize.org/leaderboard and check out the price/intelligence. It shows how well a model can actually think. Chinese models, as expected, are all at the bottom.

ARC Prize

ARC Prize - Leaderboard

The ARC-AGI Leaderboard.

stray aspen Apr 24, 2026, 12:56 PM

#

mimo 2.5 pro is great for front end

proud bobcat Apr 24, 2026, 12:57 PM

#

surreal zephyr Deepseek v3.2-exp*

Respectfully

I have actually never seen a lower iq argument here

#

You do realize V4 is a completely new dataset

stray aspen Apr 24, 2026, 12:57 PM

#

proud bobcat You do realize V4 is a completely new dataset

it sucks tho

proud bobcat Apr 24, 2026, 12:57 PM

#

They updated the models with the new weights

#

And removed the old ones

stray aspen Apr 24, 2026, 12:57 PM

#

mimo is way better

proud bobcat Apr 24, 2026, 12:58 PM

#

Oh my god bruh for the last time DeepSeek isn’t supposed to be SOTA

#

It’s the reliable workhorse

stray aspen Apr 24, 2026, 12:58 PM

#

proud bobcat Oh my god bruh for the last time DeepSeek isn’t supposed to be SOTA

its supposed to be the poor man's 5.5

proud bobcat Apr 24, 2026, 12:58 PM

#

In benchmarks DeepSeek V4 outperforms 5.4 xhigh pretty often

#

I don’t know why that means it sucks

stray aspen Apr 24, 2026, 12:59 PM

#

proud bobcat In benchmarks DeepSeek V4 outperforms 5.4 xhigh pretty often

it wont outperform the spud tho

proud bobcat Apr 24, 2026, 12:59 PM

#

ITS NOT SUPPOSED TO 😭

#

If you want SOTA you go for Kimi, Claude, GLM

#

DeepSeek is for rapid deployment

stray aspen Apr 24, 2026, 12:59 PM

#

proud bobcat If you want SOTA you go for Kimi, Claude, GLM

kimi is NOT SotA gang

proud bobcat Apr 24, 2026, 1:00 PM

#

My guy what.

stray aspen Apr 24, 2026, 1:00 PM

#

mimo 2.5 pro is

proud bobcat Apr 24, 2026, 1:00 PM

#

???????

#

You used it for one prompt and said like

#

“Yeah this is SOTA”

stray aspen Apr 24, 2026, 1:00 PM

#

nah

#

i tested it yesterday

proud bobcat Apr 24, 2026, 1:00 PM

#

Mimo is the exact same philosophy as DeepSeek

stray aspen Apr 24, 2026, 1:00 PM

#

proud bobcat Mimo is the exact same philosophy as DeepSeek

but its smarter

#

and does stuff correctly

proud bobcat Apr 24, 2026, 1:00 PM

#

Ehhh in my testing not really

stray aspen Apr 24, 2026, 1:01 PM

#

and we get 1 million context

#

but its actually smart

#

and gives you complete coding projects

#

not just a 100 line template like gemini n stuff

proud bobcat Apr 24, 2026, 1:01 PM

#

Well in Gemini’s defense here it’s always been a pretty ass model

#

I just like DeepSeek because it’s reliable

stray aspen Apr 24, 2026, 1:02 PM

#

deepseek needs vision

proud bobcat Apr 24, 2026, 1:02 PM

#

Yeah

#

Multimodal coming soon

#

As per their post

stray aspen Apr 24, 2026, 1:02 PM

#

proud bobcat As per their post

send me

proud bobcat Apr 24, 2026, 1:02 PM

#

Hold

stray aspen Apr 24, 2026, 1:03 PM

#

if it gets vision its better than mimo

proud bobcat Apr 24, 2026, 1:03 PM

#

stray aspen Apr 24, 2026, 1:03 PM

#

great

proud bobcat Apr 24, 2026, 1:03 PM

#

Again you can prefer what you want

#

But I think a lot of people conflate that new model must be SOTA

#

I personally love V4

ionic vigil Apr 24, 2026, 1:05 PM

#

I love that it doesn't run inference on nvidia

compact comet Apr 24, 2026, 1:06 PM

#

it's insane how deepseek just keeps being nerfed intentionally and it still manages to perform near SOTA

frosty lava Apr 24, 2026, 1:06 PM

#

pastel ember Yep. Especially in terms of price and intelligence.

then no, if you're looking at price to performance, open source models or small team models are the best

stray aspen Apr 24, 2026, 1:07 PM

#

compact comet it's insane how deepseek just keeps being nerfed intentionally and it still mana...

how do you know they are nerfing it

frosty lava Apr 24, 2026, 1:08 PM

#

Deepseek is working the hardest on architecture improvement

#

that's definitely true

compact comet Apr 24, 2026, 1:08 PM

#

they literally are not allowed to use nvidia

frosty lava Apr 24, 2026, 1:08 PM

#

and they say what they achieved and innovated publicly

#

so everyone can technically replicate it

compact comet Apr 24, 2026, 1:09 PM

#

they have the best engineers in the world probably

#

no questions

stray aspen Apr 24, 2026, 1:09 PM

#

compact comet they have the best engineers in the world probably

why

frosty lava Apr 24, 2026, 1:09 PM

#

it's also profitable for other ai companies, they will just use those techniques

stray aspen Apr 24, 2026, 1:09 PM

#

i think claude does

compact comet Apr 24, 2026, 1:09 PM

#

blud asked why

proud bobcat Apr 24, 2026, 1:09 PM

#

Wait

frosty lava Apr 24, 2026, 1:09 PM

#

compact comet they have the best engineers in the world probably

i don't think they're the best in the world but i think other ai companies just don't care and are just scaling

#

with compute power

proud bobcat Apr 24, 2026, 1:09 PM

#

DeepSeek v4 pro BEATS Kimi K2.6 in swebench verified???

stray aspen Apr 24, 2026, 1:09 PM

#

proud bobcat DeepSeek v4 pro BEATS Kimi K2.6 in swebench verified???

kimi is ass bro

#

anything beats it

proud bobcat Apr 24, 2026, 1:10 PM

#

All your opinions are dogwater bro

stray aspen Apr 24, 2026, 1:10 PM

#

they aint

#

im just saying the truth

proud bobcat Apr 24, 2026, 1:10 PM

#

Kimi wipes the floor with Claude opus 4.7

#

It’s not always about benchmarks

#

Bros laughing while opus 4.7 won’t even read documents, listen to instructions, and takes shortcuts always

#

Not to mention the stealth price hike with the new tokenizer leading to 35% higher costs for an already expensive ahh model

#

So you’re getting worse performance with Claude while paying more premium

proud bobcat Apr 24, 2026, 1:11 PM

#

frosty lava Deepseek is working the hardest on architecture improvement

DeepSeek’s architecture is actually insane

#

Running a 1.6T parameter model at such a fast speed?

#

Holy

frosty lava Apr 24, 2026, 1:12 PM

#

honestly they will keep going like that and keep reducing compute requirements, speeding up training, and it'll just be profitable for everyone

stray aspen Apr 24, 2026, 1:12 PM

#

@proud bobcat are you running deepseek v4 locally

frosty lava Apr 24, 2026, 1:12 PM

#

other ai companies will just steal the idea to implement in their own models but its normal honestly

sonic wigeon Apr 24, 2026, 1:12 PM

#

how's deepseek guys
anyone tried it

stray aspen Apr 24, 2026, 1:12 PM

#

sonic wigeon how's deepseek guys anyone tried it

its mid

#

bad for frontend

sonic wigeon Apr 24, 2026, 1:13 PM

#

hmm

stray aspen Apr 24, 2026, 1:13 PM

#

and its bad for Lua coding

#

but its great for math

frosty lava Apr 24, 2026, 1:13 PM

#

but deepseek doing the dirty work for architectural improvement

sonic wigeon Apr 24, 2026, 1:13 PM

#

its not multimodal either eh?

stray aspen Apr 24, 2026, 1:13 PM

#

not yet

#

but they will add vision later

sonic wigeon Apr 24, 2026, 1:13 PM

#

stray aspen but they will add vision later

interesting

proud bobcat Apr 24, 2026, 1:13 PM

#

stray aspen @proud bobcat are you running deepseek v4 locally

Oh yeah dude I have like 3 servers laying around just for this

#

Totally

stray aspen Apr 24, 2026, 1:13 PM

#

proud bobcat Oh yeah dude I have like 3 servers laying around just for this

its not that expensive lmao

proud bobcat Apr 24, 2026, 1:14 PM

#

I’m not wasting money to run local ai

#

I don’t need it

#

I just like keeping up with releases and benchmarks

frosty lava Apr 24, 2026, 1:14 PM

#

maybe if at some point we will be able to compress the model so much (like 99%) without losing quality we will be able to run T models locally lol

frosty lava Apr 24, 2026, 1:14 PM

#

stray aspen its not that expensive lmao

to run a 1.6t model locally and at decent speed ?

#

it's expensive bro
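
For a rough sense of scale on the point above, here is a minimal sketch that estimates the memory needed just to hold the weights of a model at different precisions. It assumes the 1.6T parameter count quoted in the chat (not independently verified), counts weights only (no KV cache, activations, or runtime overhead), and uses a hypothetical helper name `weight_memory_gb`.

```python
def weight_memory_gb(n_params: float, bits_per_param: float) -> float:
    """Approximate memory for the weights alone, in gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bits_per_param / 8 / 1e9


# Assumption: the 1.6T figure mentioned in the chat, taken at face value.
N_PARAMS = 1.6e12

# Compare a few common weight precisions (weights only, no overhead).
for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"{label}: ~{weight_memory_gb(N_PARAMS, bits):,.0f} GB")
```

Under these assumptions even 4-bit weights come to roughly 800 GB, which is why the hardware bill dominates the argument here; real-world requirements would be higher once cache and overhead are included.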

rocky geyser Apr 24, 2026, 1:14 PM

#

stray aspen its not that expensive lmao

;-;

stray aspen Apr 24, 2026, 1:14 PM

#

not more than 15 k tho

frosty lava Apr 24, 2026, 1:14 PM

#

stray aspen not more than 15 k tho

yes so its expensive for most people

rocky geyser Apr 24, 2026, 1:15 PM

#

stray aspen not more than 15 k tho

15k if you're lucky and want slow speeds and thats still expensive- 😭

stray aspen Apr 24, 2026, 1:15 PM

#

wdym thats like just 10 months of work

rocky geyser Apr 24, 2026, 1:15 PM

#

stray aspen wdym thats like just 10 months of work

without food, rent or anything else

frosty lava Apr 24, 2026, 1:15 PM

#

stray aspen wdym thats like just 10 months of work

bro you're ragebaiting honestly

proud bobcat Apr 24, 2026, 1:15 PM

#

It’s only 15k guys

stray aspen Apr 24, 2026, 1:15 PM

#

rocky geyser without food, rent or anything else

yes

frosty lava Apr 24, 2026, 1:15 PM

#

you can't save 100% of what you get

sonic wigeon Apr 24, 2026, 1:15 PM

#

stray aspen not more than 15 k tho

15k is the avg yearly salary of the top 5% middle class people where i live

frosty lava Apr 24, 2026, 1:15 PM

#

anyway

stray aspen Apr 24, 2026, 1:15 PM

#

frosty lava bro you're ragebaiting honestly

im not dude

#

its the truth

#

unless you live in some third world country

proud bobcat Apr 24, 2026, 1:16 PM

#

I’d rather buy me and my future husband a cottage somewhere than use that money for ai slop

stray aspen Apr 24, 2026, 1:16 PM

#

proud bobcat I’d rather buy me and my future husband a cottage somewhere than use that money ...

wdym ai slop

sonic wigeon Apr 24, 2026, 1:16 PM

#

stray aspen unless you live in some third world country

"some third world country"
that is literally the entire world bro.
wake up.

stray aspen Apr 24, 2026, 1:16 PM

#

sonic wigeon "some third world country" that is literally the entire world bro. wake up.

its not

sonic wigeon Apr 24, 2026, 1:16 PM

#

europe and NA is just a small part

#

to dismiss 4-5 billion people like that is a crime

stray aspen Apr 24, 2026, 1:17 PM

#

working at mcdonalds in canada can get you more money than other countries

frosty lava Apr 24, 2026, 1:17 PM

#

stray aspen im not dude

you're ragebaiting cause you know its impossible to save 100% of the money you get every month

#

so it won't be 10 month

proud bobcat Apr 24, 2026, 1:17 PM

#

stray aspen wdym ai slop

As much as I nerd out about AI I’m never using that for anything serious

frosty lava Apr 24, 2026, 1:17 PM

#

but much more

proud bobcat Apr 24, 2026, 1:17 PM

#

I’d rather spend 15K on something useful

#

A used car I can drive

sonic wigeon Apr 24, 2026, 1:18 PM

#

stray aspen working at mcdonalds in canada can get you more money than other countries

97% of people cannot do that.
the ones that could and are are getting kicked out or constantly racist-ed

proud bobcat Apr 24, 2026, 1:18 PM

#

An audio setup

sonic wigeon Apr 24, 2026, 1:18 PM

#

either way we're getting off topic

stray aspen Apr 24, 2026, 1:18 PM

#

guys lets stop talking about this before the night fury warns us

proud bobcat Apr 24, 2026, 1:18 PM

#

Point is that llama 4 maverick is the best model and you’re all wrong

#

😎

soft river Apr 24, 2026, 1:18 PM

#

So sad that the new model isn’t in the web/app yet

proud bobcat Apr 24, 2026, 1:18 PM

#

DeepSeek?

stray aspen Apr 24, 2026, 1:19 PM

#

soft river So sad that the new model isn’t in the web/app yet

it is wdym

proud bobcat Apr 24, 2026, 1:19 PM

#

It’s been out for a good month I’d reckon

soft river Apr 24, 2026, 1:19 PM

#

stray aspen it is wdym

No it’s not

stray aspen Apr 24, 2026, 1:19 PM

#

what model are you talking about

soft river Apr 24, 2026, 1:19 PM

#

It’s DeepSeek v3.2

stray aspen Apr 24, 2026, 1:19 PM

#

soft river It’s DeepSeek v3.2

we have v4 gang

soft river Apr 24, 2026, 1:19 PM

#

Not 4 yet

proud bobcat Apr 24, 2026, 1:19 PM

#

soft river No it’s not

https://tenor.com/view/oh-my-god-bruh-jjk-jujutsu-kaisen-higuruma-higuruma-hiromi-gif-6772381213627884903

#

Dude.

#

It’s been V4

soft river Apr 24, 2026, 1:19 PM

#

Bruh it’s not 😂

proud bobcat Apr 24, 2026, 1:19 PM

#

For a month

soft river Apr 24, 2026, 1:19 PM

#

Not at all

proud bobcat Apr 24, 2026, 1:19 PM

#

Jesus Christ.

stray aspen Apr 24, 2026, 1:20 PM

#

soft river Bruh it’s not 😂

what in the ragebait are you doing

proud bobcat Apr 24, 2026, 1:20 PM

#

I think ai might be giving us brain atrophy

#

Genuinely

#

People will open the DeepSeek app and see “instant” and “expert” modes and still say ts

#

Just ask DeepSeek what its knowledge cutoff is

soft river Apr 24, 2026, 1:21 PM

#

“Glazer” and you can’t differentiate them

#

Crazy

proud bobcat Apr 24, 2026, 1:21 PM

#

Ts is ragebait

#

#

The first one is V4 flash

#

The second is V4 pro

#

What is there not to get

#

It’s been like this for a month my dude

#

Today it got released for api access

#

The ragebait is INSANE

#

fiery gull Apr 24, 2026, 1:23 PM

#

proud bobcat Ts is ragebait

v4 is good, but the qwen 3.6 27b bruh, how this run in my pc?

proud bobcat Apr 24, 2026, 1:23 PM

#

The power of dense models

soft river Apr 24, 2026, 1:23 PM

#

proud bobcat It’s been like this for a month my dude

Nuh uh

proud bobcat Apr 24, 2026, 1:23 PM

#

I can’t wait till we get a good 32B dense model from Qwen

fiery gull Apr 24, 2026, 1:24 PM

#

proud bobcat The power of dense models

waiting my 3.6 2b

proud bobcat Apr 24, 2026, 1:24 PM

#

soft river Nuh uh

https://tenor.com/view/off-with-his-head-cat-cat-meme-gif-2379878183170633577

soft river Apr 24, 2026, 1:24 PM

#

That was only a change in the inference

fiery gull Apr 24, 2026, 1:24 PM

#

my pc only run the 2b in 15t/s ;-;

tired mantle Apr 24, 2026, 1:24 PM

#

Excuse me, where is GPT 5.5 on Arena? What's the name of the model there?

fiery gull Apr 24, 2026, 1:24 PM

#

the 27b is only 4t/s ;-;

proud bobcat Apr 24, 2026, 1:24 PM

#

fiery gull my pc only run the 2b in 15t/s ;-;

Deadass?

surreal zephyr Apr 24, 2026, 1:25 PM

#

fiery gull v4 is good, but the qwen 3.6 27b bruh, how this run in my pc?

v4 is same model as v3.2 it was renamed lmao

#

proud bobcat Apr 24, 2026, 1:25 PM

#

surreal zephyr

They changed the weights my dude

#

They deprecated 3.2

surreal zephyr Apr 24, 2026, 1:25 PM

#

proud bobcat They changed the weights my dude

nah they made v4 pro (thats actual new one)

fiery gull Apr 24, 2026, 1:25 PM

#

surreal zephyr v4 is same model as v3.2 it was renamed lmao

I tested the v4, is more fast and has more context

surreal zephyr Apr 24, 2026, 1:25 PM

#

but they renamed 3.2 to 4flash

proud bobcat Apr 24, 2026, 1:25 PM

#

4 flash is a completely diff model

#

It’s 285B parameters

fiery gull Apr 24, 2026, 1:26 PM

#

surreal zephyr but they renamed 3.2 to 4flash

flash v4 is another model

proud bobcat Apr 24, 2026, 1:26 PM

#

3.2 exp was 671B

surreal zephyr Apr 24, 2026, 1:26 PM

#

fiery gull flash v4 is another model

bro it kept the score on arena

fiery gull Apr 24, 2026, 1:26 PM

#

but really like the v3.2

surreal zephyr Apr 24, 2026, 1:26 PM

#

its NOT a new model

fiery gull Apr 24, 2026, 1:26 PM

#

surreal zephyr bro it kept the score on arena

bruh, Idk about lmarena

#

I'm talking about my experience on deepseek.com

proud bobcat Apr 24, 2026, 1:27 PM

#

Yeah

#

Also

#

split topaz Apr 24, 2026, 1:27 PM

#

According to official benchmarks Deepseek V4 Pro scores 154 points MORE in comparison to Claude Mythos in codeforces rating. Only 3.5 points behind Mythos in BrowseComp, strange.

proud bobcat Apr 24, 2026, 1:27 PM

#

It’s right here cuh

grim cliff Apr 24, 2026, 1:27 PM

#

Is that a new AI?

stray aspen Apr 24, 2026, 1:27 PM

#

@proud bobcat why does mimo 2.5 pro thinking process look similar to deepseek's

surreal zephyr Apr 24, 2026, 1:28 PM

#

proud bobcat

ctrl+shift+r

proud bobcat Apr 24, 2026, 1:28 PM

#

stray aspen @proud bobcat why does mimo 2.5 pro thinking process look similar to deep...

They learn from the best

#

😎✌️

proud bobcat Apr 24, 2026, 1:28 PM

#

surreal zephyr ctrl+shift+r

I just checked this

#

Right now

split topaz Apr 24, 2026, 1:28 PM

#

That is my source.

surreal zephyr Apr 24, 2026, 1:28 PM

#

worse than qwen 3.6 27b

#

xD

proud bobcat Apr 24, 2026, 1:28 PM

#

It’s not SOTA but I’m loving it for math work and number crunching

#

It’s so peak

fiery gull Apr 24, 2026, 1:29 PM

#

surreal zephyr worse than qwen 3.6 27b

yep ;-;

stray aspen Apr 24, 2026, 1:29 PM

#

split topaz That is my source.

mythos is cooking

#

they will destroy the spud

fiery gull Apr 24, 2026, 1:29 PM

#

proud bobcat It’s not SOTA but I’m loving it for math work and number crunching

qwen 3.6 27b > v4 pro

#

in my docs tests

#

27b > max

#

lol

proud bobcat Apr 24, 2026, 1:29 PM

#

fiery gull qwen 3.6 27b > v4 pro

How so?

grim cliff Apr 24, 2026, 1:29 PM

#

Why is it so bad?

surreal zephyr Apr 24, 2026, 1:29 PM

#

stray aspen mythos is cooking

is the mythos in the room with us?

fiery gull Apr 24, 2026, 1:29 PM

#

27b is better than max 3.6 🤣

grim cliff Apr 24, 2026, 1:29 PM

#

I mean what model is good in like Science and creativity

fiery gull Apr 24, 2026, 1:29 PM

#

qwen is horrible making big models

proud bobcat Apr 24, 2026, 1:30 PM

#

Mythos glazers when they don’t even have access to the model and still hype it

stray aspen Apr 24, 2026, 1:30 PM

#

surreal zephyr is the mythos in the room with us?

it will be soon

#

and it will crush the spud

split topaz Apr 24, 2026, 1:30 PM

#

grim cliff I mean what model is good in like Science and creativity

Probably yeah, also research.

proud bobcat Apr 24, 2026, 1:30 PM

#

I

grim cliff Apr 24, 2026, 1:30 PM

#

split topaz Probably yeah, also research.

?

surreal zephyr Apr 24, 2026, 1:30 PM

#

grim cliff Apr 24, 2026, 1:30 PM

#

Can you name some models maybe

proud bobcat Apr 24, 2026, 1:30 PM

#

What?

#

I was referencing the other dude