flat osprey Apr 24, 2026, 3:27 AM

#

we're so back

supple sigil Apr 24, 2026, 3:27 AM

#

LOL NO WAY

#

took long enough

flat osprey Apr 24, 2026, 3:27 AM

#

https://x.com/deepseek_ai/status/2047516948611960897

DeepSeek (@deepseek_ai)

🔹 Amid recent attention, a quick reminder: please rely only on our official accounts for DeepSeek news. Statements from other channels do not reflect our views.
🔹 Thank you for your continued trust. We remain committed to longtermism, advancing steadily toward our ultimate goal

#

glad they said this lol

#

tired of the "leaks"

thin bramble Apr 24, 2026, 3:28 AM

#

flat osprey https://x.com/deepseek_ai/status/2047516948611960897

the leaker got assassinated and had to release sooner

sacred glade Apr 24, 2026, 3:29 AM

#

how long does it typically take for these models to go up on OR?

flat osprey Apr 24, 2026, 3:30 AM

#

sacred glade how long does it typically take for these models to go up on OR?

they should be working on it right now, i'd give it an hour or two

#

should be up tomorrow the latest

sacred glade Apr 24, 2026, 3:30 AM

#

They still don't have new Qwen

#

Though that's a much smaller release

thin bramble Apr 24, 2026, 3:30 AM

#

it IS non-sense for them to go ipo

#

reminder: deepseek used to be less than a dollar for the same param

#

heehehehe

#

~$3.50

hot swan Apr 24, 2026, 3:33 AM

#

flash is still under

vale kayak Apr 24, 2026, 3:33 AM

#

deepseek v4 out on vertex provider

thin bramble Apr 24, 2026, 3:33 AM

#

hot swan flash is still under

who uses flash, bleh

#

it is a small param model

frank wind Apr 24, 2026, 3:34 AM

#

idk maybe you could do data wrangling with it

hot swan Apr 24, 2026, 3:34 AM

#

people who want decent performance for minimal cost?

flat osprey Apr 24, 2026, 3:34 AM

#

vale kayak Apr 24, 2026, 3:34 AM

#

VERCEL provider not vertex

flat osprey Apr 24, 2026, 3:34 AM

#

who gon tell him

vale kayak Apr 24, 2026, 3:34 AM

#

uh whatever

thin bramble Apr 24, 2026, 3:35 AM

#

ye

gusty sphinx Apr 24, 2026, 3:35 AM

#

well thats just USA parenting vs China parenting

hot swan Apr 24, 2026, 3:35 AM

#

thin bramble who uses flash, bleh

a lot of people are going to be using flash, if they know how to count

raven canyon Apr 24, 2026, 3:36 AM

#

🐳 🐳 🐳 🐳

#

how are the benches

chilly sable Apr 24, 2026, 3:36 AM

#

Deepseek!

hoary zenith Apr 24, 2026, 3:36 AM

#

yeah flash is already way better than 3.2 and cheaper, so can't complain

short jasper Apr 24, 2026, 3:37 AM

#

you guys can test deepseek v4 on lmarena

#

i think

opaque rapids Apr 24, 2026, 3:37 AM

#

just reminder that the title still "unreleased"

hoary zenith Apr 24, 2026, 3:37 AM

#

quadratic attention at 1.6T is no joke, I think the output price for pro is fine, but input is a bit iffy

#

they also promised to lower pro price later when ascend 950 comes online:

https://x.com/nopainkiller/status/2047512431228756122#m

Zhipeng Huang (@nopainkiller)

@teortaxesTex They promised price dropping when deployment on Ascend 950 later this year

raven canyon Apr 24, 2026, 3:38 AM

#

geez 12x price

gusty sphinx Apr 24, 2026, 3:38 AM

#

promised

raven canyon Apr 24, 2026, 3:38 AM

#

flash is still genuinely good value

thin bramble Apr 24, 2026, 3:39 AM

#

it is ultra underwhelming, omega ultrawealming

#

https://tenor.com/view/anime-mirai-nikki-future-diary-gun-sword-gif-17377854

Tenor

#

they died in my eyes, how shameful of a release

#

mistral tiers of disappointment

gusty sphinx Apr 24, 2026, 3:40 AM

#

well hold on, maybe, against all odds and past evidence, it's great for RP

short jasper Apr 24, 2026, 3:40 AM

#

yes

#

great for rp

#

100x peak

thin bramble Apr 24, 2026, 3:40 AM

#

gusty sphinx well hold on, maybe, against all odds and past evidence, it's great for RP

good point, brb

short jasper Apr 24, 2026, 3:41 AM

#

it gived me this in lmarena

📎 message.txt

quick bison Apr 24, 2026, 3:41 AM

#

price is more expensive than glm 5 for some reason.

tulip estuary Apr 24, 2026, 3:41 AM

#

fucking finally

#

i was about to sleep

#

fucking hell

hoary zenith Apr 24, 2026, 3:41 AM

#

quick bison price is more expensive than glm 5 for some reason.

twice the size of glm 5 is the reason

raven canyon Apr 24, 2026, 3:41 AM

#

what’s the consensus

gusty sphinx Apr 24, 2026, 3:42 AM

#

short jasper it gived me this in lmarena

erm. maybe we'll wait for more, different examples

hot swan Apr 24, 2026, 3:42 AM

#

the consensus is that it just released

viral hemlock Apr 24, 2026, 3:42 AM

#

DEEPSEEK IS REAL

thin bramble Apr 24, 2026, 3:42 AM

#

hoary zenith twice the size of glm 5 is the reason

only ~150b more

short jasper Apr 24, 2026, 3:42 AM

#

gusty sphinx erm. maybe we'll wait for more, different examples

👍

#

Holy bro

#

deepseek v4 pro thinking stuck in thinking

#

I AM COOKED

deft crow Apr 24, 2026, 3:42 AM

#

bringing it live soonnn.....

gusty sphinx Apr 24, 2026, 3:42 AM

#

PRO THINKING!

raven canyon Apr 24, 2026, 3:43 AM

#

i don’t see engram mentioned

flat viper Apr 24, 2026, 3:43 AM

#

https://x.com/i/status/2047516922263285776

DeepSeek (@deepseek_ai)

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length.

🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.
🔹 DeepSeek-V4-Flash: 284B total / 13B active params.

chilly sable Apr 24, 2026, 3:43 AM

#

Does deepseek v4 have engram?

hoary zenith Apr 24, 2026, 3:43 AM

#

in engram paper they said it's only useful for small models, it's like no one bothered to read that part

quick bison Apr 24, 2026, 3:43 AM

#

chilly sable Does deepseek v4 have engram?

wuts engram?

viral hemlock Apr 24, 2026, 3:44 AM

#

paper by deepseek

gusty sphinx Apr 24, 2026, 3:45 AM

#

lets share en gram to celebrate

ebon rover Apr 24, 2026, 3:46 AM

#

API is not censored

#

Thank god

tulip estuary Apr 24, 2026, 3:47 AM

#

it is the youngest model ever😯

thin bramble Apr 24, 2026, 3:47 AM

#

gusty sphinx well hold on, maybe, against all odds and past evidence, it's great for RP

short test, glm 5 is better. also world knowledge sucks.

flat osprey Apr 24, 2026, 3:47 AM

#

some of you may find this useful

short jasper Apr 24, 2026, 3:47 AM

#

thin bramble short test, glm 5 is better. also world knowledge sucks.

wth

#

you're ragebaiting

raven canyon Apr 24, 2026, 3:47 AM

#

benchmaxxed?

short jasper Apr 24, 2026, 3:47 AM

#

glm 5 aint best

flat osprey Apr 24, 2026, 3:47 AM

#

i just put flash max, pro max, and opus 4.6 max on the same chart cause those are the only models i care abt

tulip estuary Apr 24, 2026, 3:48 AM

#

raven canyon benchmaxxed?

they wouldn't

flat osprey Apr 24, 2026, 3:48 AM

#

flash max is actually shockingly good for its price

thin bramble Apr 24, 2026, 3:48 AM

#

short jasper you're ragebaiting

i am not.

chilly sable Apr 24, 2026, 3:48 AM

#

flat osprey flash max is actually shockingly good for its price

whats the price

flat osprey Apr 24, 2026, 3:48 AM

#

chilly sable whats the price

rustic island Apr 24, 2026, 3:49 AM

#

@grok what is my opinion on this model?

hoary zenith Apr 24, 2026, 3:49 AM

#

flat osprey flash max is actually shockingly good for its price

yep, game changer for me

viral hemlock Apr 24, 2026, 3:49 AM

#

0.28 output is crazy good wtf

thin bramble Apr 24, 2026, 3:49 AM

#

short jasper glm 5 aint best

for roleplaying? it is divine. (though sloppy, but i like it's slop.)

gusty sphinx Apr 24, 2026, 3:49 AM

#

rustic island @grok what is my opinion on this model?

dey quantized it

rustic island Apr 24, 2026, 3:50 AM

#

Ah crap, you've been noticing V4 got dumber since the announcement too

viral hemlock Apr 24, 2026, 3:50 AM

#

is it actually worse than glm 5

hot swan Apr 24, 2026, 3:50 AM

#

flash is literally 1% of the price of Opus yeah

tulip estuary Apr 24, 2026, 3:50 AM

#

0.14 0.28 for a 300B model is crazy

charred slate Apr 24, 2026, 3:50 AM

#

ITS OUT NO WAY

rustic island Apr 24, 2026, 3:50 AM

#

It's excellent news they kept the pricing for Flash

#

Varying its thinking will be interesting

thin bramble Apr 24, 2026, 3:51 AM

#

hot swan flash is literally 1% of the price of Opus yeah

maybe it is not right model to compare to?

raven canyon Apr 24, 2026, 3:51 AM

#

is pro any good or just overpriced

thin bramble Apr 24, 2026, 3:51 AM

#

gemini 3.1 flash lite is also 1% of opus, so wot

viral hemlock Apr 24, 2026, 3:51 AM

#

sk hynix stocks dropped after its announcement 💀

tulip estuary Apr 24, 2026, 3:51 AM

#

i will only use v4 flash from now on

flat osprey Apr 24, 2026, 3:51 AM

#

raven canyon is pro any good or just overpriced

it's near opus 4.6 level

#

definitely fair pricing

raven canyon Apr 24, 2026, 3:51 AM

#

time to test it then i guess

flat osprey Apr 24, 2026, 3:51 AM

#

but flash is better value

ebon rover Apr 24, 2026, 3:52 AM

#

For me it's uncensored like 3.2

#

Just perfect

thin bramble Apr 24, 2026, 3:52 AM

#

leme check lmarena

flat osprey Apr 24, 2026, 3:52 AM

#

deepseek v4 flash max is the coding model i've been praying for 🙏

ebon rover Apr 24, 2026, 3:52 AM

#

Simply perfect. Is it Christmas already

flat osprey Apr 24, 2026, 3:52 AM

#

no need to pay for claude code anymore

mystic breach Apr 24, 2026, 3:53 AM

#

flat osprey but flash is better value

flash only seems good if you use thinking high though

tulip estuary Apr 24, 2026, 3:53 AM

#

ok

mystic breach Apr 24, 2026, 3:53 AM

#

for cost optimized, isn't that a good default

tulip estuary Apr 24, 2026, 3:53 AM

#

0.28 bro

deft crow Apr 24, 2026, 3:53 AM

#

few more mins, just running some tests

flat osprey Apr 24, 2026, 3:53 AM

#

mystic breach flash only seems good if you use thinking high though

yeah pretty much, but the price is immaculate

hot swan Apr 24, 2026, 3:53 AM

#

thin bramble gemini 3.1 flash lite is also 1% of opus, so wot

yeah but

mystic breach Apr 24, 2026, 3:53 AM

#

ye of course, just saying from first glance at benchs and numbers

thin bramble Apr 24, 2026, 3:53 AM

#

thin bramble leme check lmarena

deepseek v4 lite is below gemma 4, and with thinking it is equal to gemma 4 (non-thinking btw)

gusty sphinx Apr 24, 2026, 3:53 AM

#

BYE BYE OPUS 4.6

flat osprey Apr 24, 2026, 3:54 AM

#

thin bramble deepseek v4 lite is below gemma 4, and with thinking it is equal to gemma 4 (non...

lmarena isn't the end-all be-all

rustic island Apr 24, 2026, 3:54 AM

#

Deepseek V4

flat osprey Apr 24, 2026, 3:54 AM

#

llama 4 was at the top before lol

thin bramble Apr 24, 2026, 3:54 AM

#

deepseek v4 pro is sonnet tier model

thin bramble Apr 24, 2026, 3:54 AM

#

flat osprey lmarena isn't the end-all be-all

i AM the end-all be-all and i say it SUCKS

viral hemlock Apr 24, 2026, 3:54 AM

#

the cost is so cheap

thin bramble Apr 24, 2026, 3:54 AM

#

flat osprey llama 4 was at the top before lol

got better (as an previous arena hater)

supple sigil Apr 24, 2026, 3:54 AM

#

thin bramble deepseek v4 pro is sonnet tier model

there’s no way it’s only as good as sonnet

viral hemlock Apr 24, 2026, 3:55 AM

#

this is the only model that pays off the api cost

flat osprey Apr 24, 2026, 3:55 AM

#

thin bramble i AM the end-all be-all and i say it SUCKS

https://tenor.com/view/rage-gif-18338667365126853250

Tenor

thin bramble Apr 24, 2026, 3:55 AM

#

supple sigil there’s no way it’s only as good as sonnet

it is. enough with copium already. crippling depression.

tulip estuary Apr 24, 2026, 3:56 AM

#

i just tested and it's good

thin bramble Apr 24, 2026, 3:56 AM

#

for coding, GUESS WHERE IT BELONGS?

flat osprey Apr 24, 2026, 3:56 AM

#

thin bramble for coding, GUESS WHERE IT BELONGS?

927 votes

thin bramble Apr 24, 2026, 3:56 AM

#

shit model

thin bramble Apr 24, 2026, 3:56 AM

#

flat osprey 927 votes

true true

#

will wait ig

short jasper Apr 24, 2026, 3:56 AM

#

GLM 5 IS 100% slop while..

#

deepseek v4 is 0% slop

viral hemlock Apr 24, 2026, 3:57 AM

#

it cant be as good as gemma 31b...

#

please let that be false

rustic island Apr 24, 2026, 3:57 AM

#

short jasper GLM 5 IS 100% slop while..

That I agree with

#

I hate GLM's writing

tulip estuary Apr 24, 2026, 3:57 AM

#

deepseek output is fine shite

thin bramble Apr 24, 2026, 3:57 AM

#

rustic island That I agree with

claude slop though ;-;
(but yes slop)

supple sigil Apr 24, 2026, 3:57 AM

#

thin bramble it is. enough with copium already. crippling depression.

kimi k2.5 to me at least is sonnet tier. ds v4 pro is 160% the size of kimi

broken lintel Apr 24, 2026, 3:57 AM

#

a cheap model with reasoning toggles sounds nice

sacred glade Apr 24, 2026, 3:58 AM

#

Kimi K2.6 is great but will literally think for 16 minutes on a long response

#

I hope this is faster even w/pro

rustic island Apr 24, 2026, 3:58 AM

#

But bless DS for launching a cheap model in these trying times of decreasing subsidies to users

thin bramble Apr 24, 2026, 3:58 AM

#

supple sigil kimi k2.5 to me at least is sonnet tier. ds v4 pro is 160% the size of kimi

no? deepseek is 862b model and kimi is 1.1T model.

flat osprey Apr 24, 2026, 3:58 AM

#

v4-flash max might replace my claude sub ngl

#

gonna test it once it drops on OR

mystic breach Apr 24, 2026, 3:59 AM

#

thin bramble no? deepseek is 862b model and kimi is 1.1T model.

might be referring to int4 vs bf16/fp8?

thin bramble Apr 24, 2026, 3:59 AM

#

rustic island But bless DS for launching a cheap model in these trying times of decreasing sub...

enough shilling, they are selling 160b model at 3$ price, ransom!

broken lintel Apr 24, 2026, 3:59 AM

#

broken lintel a cheap model with reasoning toggles sounds nice

though I just noticed that the only reasoning_effort they have is high and max, they don't have a medium or low which is kinda a rip

sharp vortex Apr 24, 2026, 3:59 AM

#

Holy, we finally stop coping

#

🗣️

mystic breach Apr 24, 2026, 3:59 AM

#

deepseek v5 next week

thin bramble Apr 24, 2026, 4:00 AM

#

https://tenor.com/view/ruby-copium-oshi-no-ko-推しの子-ruby-hoshino-gif-4151121735588077996

Tenor

mystic breach Apr 24, 2026, 4:00 AM

#

ive been hearing that since january

#

only took a little bit

flat osprey Apr 24, 2026, 4:00 AM

#

this thread was a chain of agony, suffering, and pain, and i am glad there will never be another thread of leaks

supple sigil Apr 24, 2026, 4:01 AM

#

thin bramble no? deepseek is 862b model and kimi is 1.1T model.

DeepSeek-V4-Pro with 1.6T parameters (49B activated)

gusty sphinx Apr 24, 2026, 4:01 AM

#

https://tenor.com/view/kamala-harris-and-joe-biden-gif-22291622

Tenor

supple sigil Apr 24, 2026, 4:01 AM

#

verbatim from their huggingface page

ancient gulch Apr 24, 2026, 4:01 AM

#

The only day you didnt say the full sentence...

supple sigil Apr 24, 2026, 4:01 AM

#

https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro

deepseek-ai/DeepSeek-V4-Pro · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

gusty sphinx Apr 24, 2026, 4:01 AM

#

Joe Router

thin bramble Apr 24, 2026, 4:02 AM

#

supple sigil DeepSeek-V4-Pro with 1.6T parameters (49B activated)

kimi is supreme (though maybe bench numbers will change in arena)

gusty sphinx Apr 24, 2026, 4:02 AM

#

kimi is thinkslop

supple sigil Apr 24, 2026, 4:02 AM

#

1.6x the size, there’s no way deepseek isnt better

#

deepseek is THE goat for a reason

sacred glade Apr 24, 2026, 4:02 AM

#

The actual output quality of Kimi is great

mystic breach Apr 24, 2026, 4:02 AM

#

i really want to see artificialanalysis's total tokens used for the benchmarks to see how token efficient it is

deft crow Apr 24, 2026, 4:02 AM

#

bear with me, they might be borked. coming online now

viral hemlock Apr 24, 2026, 4:03 AM

#

oh its Q8+Q4

hot swan Apr 24, 2026, 4:03 AM

#

oh you really want me to go to work without sleeping today

mystic breach Apr 24, 2026, 4:03 AM

#

hot swan oh you really want me to go to work without sleeping today

same i got work in 6h and this just had to drop

gusty sphinx Apr 24, 2026, 4:04 AM

#

sacred glade The actual output quality of Kimi is great

yeah it's not bad. it just seems a little unsustainable

tulip estuary Apr 24, 2026, 4:04 AM

#

I LOVE IT

#

I LOVE IT ALEEADY

#

AND IM USING FLASH

#

im using direct API

#

it is VERY quick and as always cache is MWAH chefs kiss

thin bramble Apr 24, 2026, 4:05 AM

#

tulip estuary it is VERY quick and as always cache is MWAH chefs kiss

it is a 150b moe model though

deft crow Apr 24, 2026, 4:06 AM

#

any minute now

supple sigil Apr 24, 2026, 4:06 AM

#

thin bramble *it is a 150b moe model though*

DeepSeek-V4-Flash with 284B parameters (13B activated)

deft crow Apr 24, 2026, 4:06 AM

#

ping me if shit looks off, but i have this chat on my side monitor

tulip estuary Apr 24, 2026, 4:06 AM

#

The reasoning_content in the thinking mode must be passed back to the API.

grrrr

thin bramble Apr 24, 2026, 4:06 AM

#

supple sigil DeepSeek-V4-Flash with 284B parameters (13B activated)

hugging lies

supple sigil Apr 24, 2026, 4:06 AM

#

they do indeed

#

its up

#

on OR

#

balls

deft crow Apr 24, 2026, 4:08 AM

#

give it another min

#

that should clear up

sacred glade Apr 24, 2026, 4:10 AM

#

rate limited

deft crow Apr 24, 2026, 4:10 AM

#

will bring other providers on ASAP rocky

sharp vortex Apr 24, 2026, 4:10 AM

#

Where deepseek coding plan trolling

broken lintel Apr 24, 2026, 4:10 AM

#

is deepseekv4 on artificial analysis yet?

flat osprey Apr 24, 2026, 4:11 AM

#

broken lintel is deepseekv4 on artificial analysis yet?

not yet

#

maybe sometime later this week

short jasper Apr 24, 2026, 4:11 AM

#

Wake up EQBENCH

thin bramble Apr 24, 2026, 4:11 AM

#

shit at persian too

#

https://tenor.com/view/ruby-copium-oshi-no-ko-推しの子-ruby-hoshino-gif-4151121735588077996

Tenor

whole saffron Apr 24, 2026, 4:11 AM

#

its time?

tulip estuary Apr 24, 2026, 4:11 AM

#

deft crow Apr 24, 2026, 4:11 AM

#

it's good now i think

thin bramble Apr 24, 2026, 4:12 AM

#

god i am just getting bad impressions. is https://chat.deepseek.com the v4 or still v3.2?

supple sigil Apr 24, 2026, 4:12 AM

#

v4 flash

#

is slightly fast

sharp vortex Apr 24, 2026, 4:12 AM

#

thin bramble god i am just getting bad impressions. is https://chat.deepseek.com the v4 or st...

Should update now, Instant is flash. Expert is pro

tulip estuary Apr 24, 2026, 4:12 AM

#

the cost is null

#

amazing

gusty sphinx Apr 24, 2026, 4:12 AM

#

the cost is null

viral hemlock Apr 24, 2026, 4:12 AM

#

supple sigil v4 flash

holy

thin bramble Apr 24, 2026, 4:12 AM

#

sharp vortex Should update now, Instant is flash. Expert is pro

already have it.

gusty sphinx Apr 24, 2026, 4:13 AM

#

deepseek/deepseek-v4-pro:null

short jasper Apr 24, 2026, 4:13 AM

#

thin bramble already have it.

Go test it on openrouter bro

crude steppe Apr 24, 2026, 4:13 AM

#

sharp vortex Apr 24, 2026, 4:13 AM

#

Why all Chinese love pro-max or smth

short jasper Apr 24, 2026, 4:14 AM

#

yes there's mimo 2.5 pro

chilly sable Apr 24, 2026, 4:14 AM

#

can anyone test the carwash question

broken lintel Apr 24, 2026, 4:14 AM

#

sharp vortex Why all Chinese love pro-max or smth

AI enthusiasm goes crazy over there I think right now

gusty sphinx Apr 24, 2026, 4:14 AM

#

Chinese ProMaxxing

sharp vortex Apr 24, 2026, 4:15 AM

#

Perfect time to release after Gpt 5.5 KEKW

gusty sphinx Apr 24, 2026, 4:15 AM

#

anthropic uses it too. its just an easy to understand pattern. that apple invented

supple sigil Apr 24, 2026, 4:15 AM

#

chilly sable can anyone test the carwash question

gusty sphinx Apr 24, 2026, 4:15 AM

#

supple sigil

benchmaxxed

sharp vortex Apr 24, 2026, 4:15 AM

#

Ain't no way, deepseek delay another month to train riddle

#

🗣️

ebon rover Apr 24, 2026, 4:15 AM

#

Carwashmaxxed

supple sigil Apr 24, 2026, 4:15 AM

#

1.6T model running at 300tps

short jasper Apr 24, 2026, 4:15 AM

#

yall ragebait bro

#

deepseek v4 is smart

rotund briar Apr 24, 2026, 4:16 AM

#

"classic"

short jasper Apr 24, 2026, 4:16 AM

#

sharp vortex Apr 24, 2026, 4:16 AM

#

We finally out of 30 TPS hell of 3.2 🗣️

latent wyvern Apr 24, 2026, 4:16 AM

#

gpt oss

short jasper Apr 24, 2026, 4:16 AM

#

he got r wrong

#

but he got car wash correct

#

wait let me count

#

yeh i dont know what deepseek v4 is saying

#

but 3 he got correct

flat osprey Apr 24, 2026, 4:17 AM

#

supple sigil 1.6T model running at 300tps

holy shit

viral hemlock Apr 24, 2026, 4:17 AM

#

sharp vortex Apr 24, 2026, 4:17 AM

#

short jasper he got r wrong

Wdym it's 4 total

short jasper Apr 24, 2026, 4:18 AM

#

sharp vortex Wdym it's 4 total

i am blind then?

#

tqeqr

whole saffron Apr 24, 2026, 4:18 AM

#

finally i can drive to the car wash without second guessing myself

short jasper Apr 24, 2026, 4:18 AM

#

ok guys

#

deepseek v4 is smart

#

screw code skill

supple sigil Apr 24, 2026, 4:18 AM

#

most coding benchmarks just target python and js anyways

#

guarantee it has better actual logical reasoning for complex code

rustic island Apr 24, 2026, 4:19 AM

#

short jasper

Pls enable font subpixel hinting

hot swan Apr 24, 2026, 4:19 AM

#

It's failing my manga knowledge questions 💔

#

both pro and flash

sharp vortex Apr 24, 2026, 4:19 AM

#

hot swan It's failing my manga knowledge questions 💔

Which one

proud sable Apr 24, 2026, 4:19 AM

#

is work in openrouter?

sharp vortex Apr 24, 2026, 4:20 AM

#

proud sable is work in openrouter?

Ye

hot swan Apr 24, 2026, 4:20 AM

#

stuff like In Blue Lock, between the U20 match and the Bastard Munchen vs Ubers match, Isagi failed to score any goals. Why is that?
lately most models manage to get it right

viral hemlock Apr 24, 2026, 4:20 AM

#

📎 message.txt

hot swan Apr 24, 2026, 4:21 AM

#

I think the failure reflects the fact that it's not trained on much community content

supple sigil Apr 24, 2026, 4:21 AM

#

oh god its a "Let me reconsider" reasoning model 😭

short jasper Apr 24, 2026, 4:21 AM

#

hot swan stuff like `In Blue Lock, between the U20 match and the Bastard Munchen vs Ubers...

In Blue Lock, between the U-20 match and the Bastard München vs. Ubers match in the Neo Egoist League, Isagi failed to score any goals. The main reasons are:

Michael Kaiser’s interference
Kaiser, the ace of Bastard München, saw Isagi as a threat to his position as the team's core. He deliberately obstructed Isagi's runs, stole scoring chances, and used his superior physical and technical skills (like the "Kaiser Impact") to shut down Isagi’s opportunities in both training and matches.
Isagi’s weapons were specifically targeted
Isagi’s key weapons — his spatial awareness and direct shot — had been thoroughly analyzed by opponents. Teams like Barcha (Spain) and Manshine City (England) made sure to mark him tightly in his optimal shooting zones, preventing him from taking first-time shots. Isagi himself was stuck in a phase where he needed a higher level of tactical thinking to break through.
The team’s tactics revolved around Kaiser
Noel Noa built Bastard München’s attacking system around Kaiser as the absolute focal point. Isagi was often used as a decoy or support player rather than a finisher. He received fewer passes and had limited authority to shoot.
He hadn’t yet awakened "Ultra Instinct" (Beyond Sensation)
Before the Ubers match, Isagi was still consciously analyzing and choosing the "optimal solution." His body and shooting accuracy hadn't caught up to his mind. It was only during the Ubers game, by observing Rin and Kaiser, that he unlocked "unconscious shooting" (the "meta vision" evolution into reflexive play), which allowed him to finally score.

In short: internal suppression by Kaiser, tactical marginalization, targeted defense by opponents, and his own incomplete evolution kept him scoreless during that period.

" thing almost correct

sharp vortex Apr 24, 2026, 4:22 AM

#

hot swan stuff like `In Blue Lock, between the U20 match and the Bastard Munchen vs Ubers...

Deepseek want you to enable search then 🗣️

supple sigil Apr 24, 2026, 4:22 AM

#

got this question correct

sharp vortex Apr 24, 2026, 4:22 AM

#

Now test cypher code

#

Drake incoming again

hot swan Apr 24, 2026, 4:23 AM

#

short jasper In Blue Lock, between the U-20 match and the Bastard München vs. Ubers match in ...

that's better than what I got I guess (mines both hallucinated that he DID score vs Manshine, despite my prompt!)

deft crow Apr 24, 2026, 4:23 AM

#

...no issues with the model?

covert topaz Apr 24, 2026, 4:23 AM

#

IPREDICTED IT

supple sigil Apr 24, 2026, 4:23 AM

#

deft crow ...no issues with the model?

its too good

chilly sable Apr 24, 2026, 4:23 AM

#

At first it gave incorrect answers, but not anymore, maybe it was a glitch from api

covert topaz Apr 24, 2026, 4:23 AM

#

RAHHHHHVHHHVHHHVHBHVVHHHHHHH

hot swan Apr 24, 2026, 4:24 AM

#

deft crow ...no issues with the model?

nope, besides having to turn my privacy policies off temporarily

sharp vortex Apr 24, 2026, 4:24 AM

#

Deepseek jumpscare

gusty sphinx Apr 24, 2026, 4:24 AM

#

deft crow ...no issues with the model?

BENCHMAXXED. QUANTIZED. CENSORED!

molten jay Apr 24, 2026, 4:24 AM

#

So... How is it?

mystic breach Apr 24, 2026, 4:24 AM

#

viral hemlock

thats so cool

thin bramble Apr 24, 2026, 4:24 AM

#

opus's judgement on deepseek v4 on persian questions (this time using lmarena)

#

even opus thinks it is "better"

#

https://tenor.com/view/ruby-copium-oshi-no-ko-推しの子-ruby-hoshino-gif-4151121735588077996

Tenor

#

end me already

tulip estuary Apr 24, 2026, 4:25 AM

#

deft crow ...no issues with the model?

how do i control the reasoning? low medium high max? for both models?

sharp vortex Apr 24, 2026, 4:25 AM

#

thin bramble https://tenor.com/view/ruby-copium-oshi-no-ko-%E6%8E%A8%E3%81%97%E3%81%AE%E5%AD%...

Eqbench top 1 candidate (it judged by sonnet)

hot swan Apr 24, 2026, 4:25 AM

#

pro is very fine on science/biology, flash rather sucks

gusty sphinx Apr 24, 2026, 4:25 AM

#

thin bramble opus's judgement on deepseek v4 on persian questions (this time using lmarena)

have you ever considered that you are, in fact, the problem?

elfin sparrow Apr 24, 2026, 4:25 AM

#

yoo deepseak tdy

tulip estuary Apr 24, 2026, 4:25 AM

#

thin bramble opus's judgement on deepseek v4 on persian questions (this time using lmarena)

it really is very engaging

supple sigil Apr 24, 2026, 4:25 AM

#

no idea what v4 pro is on about here, the joke is rather famously that the feathers are heavier if youre accounting for the emotional weight

thin bramble Apr 24, 2026, 4:26 AM

#

gusty sphinx have you ever considered that you are, in fact, the problem?

maybe

thin bramble Apr 24, 2026, 4:26 AM

#

tulip estuary it really is very engaging

(for me,) it is not, i like glm/claude (and minimax as smaller meh model) way more ;-;

supple sigil Apr 24, 2026, 4:26 AM

#

oh boy

viral hemlock Apr 24, 2026, 4:26 AM

#

oh no

rustic island Apr 24, 2026, 4:27 AM

#

V4 Flash and V4 Pro perform decently on my "how to fix lag in my Paper server?" test
Nothing too special about the outputs (SOTA-ish performance). V4 Flash hallucinated more than Pro, which still had subtle hallucinations

mystic breach Apr 24, 2026, 4:27 AM

#

“Due to constraints in high-end compute capacity, the current service capacity for Pro is very limited. After the 950 supernodes are launched at scale in the second half of this year, the price of Pro is expected to be reduced significantly.”

#

damn and we thought it was hella cheap already

supple sigil Apr 24, 2026, 4:27 AM

#

mystic breach damn and we thought it was hella cheap already

whos we

mystic breach Apr 24, 2026, 4:27 AM

#

everyone

supple sigil Apr 24, 2026, 4:28 AM

#

i expected cheaper from deepseek tbh

mystic breach Apr 24, 2026, 4:28 AM

#

ok bro

thin bramble Apr 24, 2026, 4:28 AM

#

sharp vortex Eqbench top 1 candidate (it judged by sonnet)

https://eqbench.com/index.html
v4 isn't there yet for me

sharp vortex Apr 24, 2026, 4:28 AM

#

I mean like it's "candidate" it's might be top 1 soon

supple sigil Apr 24, 2026, 4:29 AM

#

also suggested freezing the drink lol

rustic island Apr 24, 2026, 4:29 AM

#

🥲 Getting rate limited already

chilly sable Apr 24, 2026, 4:29 AM

#

rustic island V4 Flash and V4 Pro perform decently on my "how to fix lag in my Paper server?" ...

You have to say don't make mistakes

deft crow Apr 24, 2026, 4:29 AM

#

other providers should be coming online soon

#

kind of a pain in the butt model to host i imagine

supple sigil Apr 24, 2026, 4:30 AM

#

rustic island 🥲 Getting rate limited already

imagine what its going to be like when the openclaw bros catch wind of this

sharp vortex Apr 24, 2026, 4:30 AM

#

Do other providers surpass deepseek throughout speed?

covert topaz Apr 24, 2026, 4:30 AM

#

bruh idgaf about if it should DRIVE TO THE CAR WASH HOW IS THE CREATIVE WRITING ragescream

sharp vortex Apr 24, 2026, 4:31 AM

#

thinkies

covert topaz Apr 24, 2026, 4:31 AM

#

https://tenor.com/view/homelander-puff-meme-fat-gif-11878092523812931740

Tenor

gusty sphinx Apr 24, 2026, 4:31 AM

#

...i suppose i won't share my car wash RP then

#

you might think its gonna be about washing a car, but no.

#

i walked.

hot swan Apr 24, 2026, 4:32 AM

#

car wash w/ happy ending

chilly sable Apr 24, 2026, 4:32 AM

#

covert topaz bruh idgaf about if it should DRIVE TO THE CAR WASH HOW IS THE CREATIVE WRITING ...

But how will you know whether to walk or drive?

covert topaz Apr 24, 2026, 4:32 AM

#

peoples be gooning to god knows what and bros enjoying some car wash rp based

supple sigil Apr 24, 2026, 4:34 AM

#

gusty sphinx ...i suppose i won't share my car wash RP then

does it get soapy

gusty sphinx Apr 24, 2026, 4:34 AM

#

deen continues making engine revving and beeping noises as he slowly enters the car wash, nude.

covert topaz Apr 24, 2026, 4:35 AM

#

https://tenor.com/view/herbie-water-shaking-car-wash-shake-gif-27081469

Tenor

gusty sphinx Apr 24, 2026, 4:35 AM

#

"I'm a car," he chuckled suddenly.

viral hemlock Apr 24, 2026, 4:35 AM

#

opus 4.7 (left ) v4 pro (right)

chilly sable Apr 24, 2026, 4:36 AM

#

viral hemlock opus 4.7 (left ) v4 pro (right)

The pelican ate the whale

covert topaz Apr 24, 2026, 4:36 AM

#

gusty sphinx > "I'm a car," he chuckled suddenly.

#1 eq bench long form creative writing 🗣️ 💯

supple sigil Apr 24, 2026, 4:36 AM

#

viral hemlock opus 4.7 (left ) v4 pro (right)

so excited to try this model for svg gen 🥰

viral hemlock Apr 24, 2026, 4:36 AM

#

LOL

obsidian walrus Apr 24, 2026, 4:36 AM

#

I used to pray for days like this

viral hemlock Apr 24, 2026, 4:36 AM

#

teto

supple sigil Apr 24, 2026, 4:37 AM

#

ds v4

#

chef's special

#

opus 4.7

gusty sphinx Apr 24, 2026, 4:37 AM

#

that's great. what a time to be alive

supple sigil Apr 24, 2026, 4:37 AM

#

couldnt agree more

covert topaz Apr 24, 2026, 4:37 AM

#

LMAO

#

CHFS SPECIAL

whole saffron Apr 24, 2026, 4:37 AM

#

i prayed for times like these

hot swan Apr 24, 2026, 4:37 AM

#

they're getting lost in
Write an interactive suanpan abacus in hmtl/js that allows the user to pick the number of rods
it's really not that complex a problem

covert topaz Apr 24, 2026, 4:38 AM

#

chad v4 vs virgin opus 4.7

chilly sable Apr 24, 2026, 4:38 AM

#

hot swan they're getting lost in `Write an interactive suanpan abacus in hmtl/js that al...

what reasoning level?

hot swan Apr 24, 2026, 4:38 AM

#

high

obsidian walrus Apr 24, 2026, 4:39 AM

#

thank you deepseek

viral hemlock Apr 24, 2026, 4:39 AM

#

when asking for exchange rates in korean: (usdkrw)
current price: 1482
claude opus 4.7 : 1480-ish
deepseek v4: 1502-1517

supple sigil Apr 24, 2026, 4:39 AM

#

v4 flash is even better on my patent pending Shit&Piss Bench???

covert topaz Apr 24, 2026, 4:40 AM

#

damn even provided cutlery how considerate

supple sigil Apr 24, 2026, 4:41 AM

#

will let everyone know how this turns out once its done

vapid karma Apr 24, 2026, 4:42 AM

#

viral hemlock opus 4.7 (left ) v4 pro (right)

Virgin vs. chad be like

hot swan Apr 24, 2026, 4:42 AM

#

hot swan they're getting lost in `Write an interactive suanpan abacus in hmtl/js that al...

result was pretty awful honestly

#

for example neither allow you to just push a whole row
the flash one doesn't even move the beads, lights them up (kinda interesting in its own way)

#

rod count was limited to 15 or 18 which is incidentally on the edge of bigint

thin bramble Apr 24, 2026, 4:44 AM

#

claude 4.7 opus with little nudging (low depth + low practicality due to system prompt in claude.ai)

#

but even then, it is just 0.5 diff with v4 pro, without the nudging.

hot swan Apr 24, 2026, 4:45 AM

#

hot swan result was pretty awful honestly

📎 suanpan_dipsy_pro.html 📎 suanpan_dipsy_flash.html

#

flash's way of breaking apart the positional digits was sort of fresh too

covert topaz Apr 24, 2026, 4:46 AM

#

thin bramble claude 4.7 opus with little nudging (low depth + low practicality due to system ...

this means it might excel at creative tasks

hot swan Apr 24, 2026, 4:47 AM

#

for comparison this was Gemini's two-shot (and it was well functional on the first)

📎 Suanpan.html

thin bramble Apr 24, 2026, 4:47 AM

#

covert topaz this means it might excel at creative tasks

yes, that is what i am trying out. but i disagree with opus as the persian is slightly broken.

#

during roleplay sasuke called itachi little brother

#

and the vibes are meh

#

https://tenor.com/view/ruby-copium-oshi-no-ko-推しの子-ruby-hoshino-gif-4151121735588077996

Tenor

gaunt dirge Apr 24, 2026, 4:48 AM

#

Now the wait for an endpoint that respects their paid customers begins.

covert topaz Apr 24, 2026, 4:49 AM

#

free goon logs

short jasper Apr 24, 2026, 4:49 AM

#

deepseek v4 gets fixed tommorow

covert topaz Apr 24, 2026, 4:49 AM

#

short jasper deepseek v4 gets fixed tommorow

it’s broken?

short jasper Apr 24, 2026, 4:50 AM

#

covert topaz it’s broken?

yeh i think deepseek v4 is broken

#

Let's see If something interesting will happen tommorow

covert topaz Apr 24, 2026, 4:50 AM

#

💀

#

the denial kicking in

whole saffron Apr 24, 2026, 4:50 AM

#

Thank you deepseek

covert topaz Apr 24, 2026, 4:51 AM

#

thank you for edging us Prayge

gusty sphinx Apr 24, 2026, 4:52 AM

#

ffs

thin bramble Apr 24, 2026, 4:52 AM

#

the more i test, the more i get disappointed. (world knowledge sucks)

gusty sphinx Apr 24, 2026, 4:52 AM

#

jumpscared by fallback model in my deepseek chat

covert topaz Apr 24, 2026, 4:53 AM

#

Here’s

gusty sphinx Apr 24, 2026, 4:53 AM

#

well i mean lets see

obsidian walrus Apr 24, 2026, 4:53 AM

#

I have good news: the writing is very fresh

📎 Writingtest.txt

hot swan Apr 24, 2026, 4:54 AM

#

gusty sphinx jumpscared by fallback model in my deepseek chat

well it is the same price as flash...

covert topaz Apr 24, 2026, 4:54 AM

#

obsidian walrus I have good news: the writing is very fresh

https://tenor.com/view/charlie-woooo-yeah-baby-gif-24480925

Tenor

#

damnit im on phone I can’t read it

hot swan Apr 24, 2026, 4:55 AM

#

obsidian walrus I have good news: the writing is very fresh

not bad

novel shard Apr 24, 2026, 4:55 AM

#

Roleplay response is decent. Not anything outstanding, but decent. This was supposed to be a more robotic version compared to 3.2, but it is able to execute warmth in a roleplay

obsidian walrus Apr 24, 2026, 4:56 AM

#

I'm just glad we finally have a good creative writing model that isn't Claude lite

gusty sphinx Apr 24, 2026, 4:57 AM

#

Deen’s Car Wash Adventure

📎 deens-car-wash-adventure.md

#

note that this is a collab between Deepseek V4 and Mistral Small 3.2 24B

#

using my custom creative writing harness where the model might change randomly

novel shard Apr 24, 2026, 4:58 AM

#

gusty sphinx ## Deen’s Car Wash Adventure

I had to stop and reread the first few sentences

thin bramble Apr 24, 2026, 4:58 AM

#

obsidian walrus I have good news: the writing is very fresh

decent

cosmic shell Apr 24, 2026, 4:58 AM

#

hows deepseek v4?

obsidian walrus Apr 24, 2026, 5:00 AM

#

thin bramble decent

It has basically no Claudeisms though unlike kimi and glm

gusty sphinx Apr 24, 2026, 5:01 AM

#

novel shard I had to stop and reread the first few sentences

i will ping you when the lorebook is released

novel shard Apr 24, 2026, 5:01 AM

#

obsidian walrus It has basically no Claudeisms though unlike kimi and glm

Is that a good thing or a bad thing?

hot swan Apr 24, 2026, 5:02 AM

#

when flash responds in japanese its thinking shifts to chinese
quite amusing

obsidian walrus Apr 24, 2026, 5:02 AM

#

novel shard Is that a good thing or a bad thing?

Neutral, but when you're like me and have been using Claude since 3.7, it's very nice to get a break from that writing style finally

covert topaz Apr 24, 2026, 5:04 AM

#

obsidian walrus It has basically no Claudeisms though unlike kimi and glm

Fuck yes that was exactly what I was looking for

#

does it have geminisms tho

hoary zenith Apr 24, 2026, 5:06 AM

#

for every normal question:

That’s such a beautifully honest question—and one that anyone who cares about doing meaningful things wrestles with constantly. Let’s unpack it.

that's still extreme geminism yeah

obsidian walrus Apr 24, 2026, 5:06 AM

#

covert topaz does it have geminisms tho

I can't say, I never used gemini for that

covert topaz Apr 24, 2026, 5:06 AM

#

does it say frames digits Not X but Y all that stuff

thin bramble Apr 24, 2026, 5:07 AM

#

gusty sphinx ## Deen’s Car Wash Adventure

2.5/10

thin bramble Apr 24, 2026, 5:08 AM

#

obsidian walrus It has basically no Claudeisms though unlike kimi and glm

i love claudism

gusty sphinx Apr 24, 2026, 5:08 AM

#

personal best

covert topaz Apr 24, 2026, 5:08 AM

#

gusty sphinx ## Deen’s Car Wash Adventure

ozone...

thin bramble Apr 24, 2026, 5:08 AM

#

hoary zenith for every normal question: > That’s such a beautifully honest question—and one ...

always has been with deepseek, het the vibes

covert topaz Apr 24, 2026, 5:09 AM

#

why the fuck is the smell of ozone at a carwash

thin bramble Apr 24, 2026, 5:10 AM

#

-# ~~alright, had enough with this model, time for me to move on and pretend it doesn't exist. ~~

covert topaz Apr 24, 2026, 5:10 AM

#

thin bramble -# ~~alright, had enough with this model, time for me to move on and pretend it ...

deepseek v5 tomorrow copium

thin bramble Apr 24, 2026, 5:10 AM

#

https://tenor.com/view/ruby-copium-oshi-no-ko-推しの子-ruby-hoshino-gif-4151121735588077996

Tenor

toxic rose Apr 24, 2026, 5:11 AM

#

Interesting, the webui expert mode is acting different now than when it was silently introduced 2 weeks ago...

flat osprey Apr 24, 2026, 5:12 AM

#

wonder why

broken lintel Apr 24, 2026, 5:12 AM

#

i think V4 instant might be better at knowledge capture than grok 4.1 fast so thats cool

covert topaz Apr 24, 2026, 5:12 AM

#

i think the silent update with expert and fast was an earlier prototype of ds4 lite and pro so we were literally testing v4 lol

chilly sable Apr 24, 2026, 5:13 AM

#

V4 Flash is dumber than Gemma 4

toxic rose Apr 24, 2026, 5:14 AM

#

Before today, the expert-mode was always lackluster and thinks too sparsely, and also Mandarin spam

thin bramble Apr 24, 2026, 5:14 AM

#

chilly sable V4 Flash is dumber than Gemma 4

better rp vibes if that is what you care about

feral scaffold Apr 24, 2026, 5:14 AM

#

not v4 getting released after I just gooned

covert topaz Apr 24, 2026, 5:14 AM

#

who says u cant go another round

toxic rose Apr 24, 2026, 5:15 AM

#

Release another model

feral scaffold Apr 24, 2026, 5:15 AM

#

covert topaz who says u cant go another round

https://tenor.com/view/thinking-pepe-think-pepe-the-frog-wojak-gif-13571840648507225174

Tenor

meager kelp Apr 24, 2026, 5:15 AM

#

obsidian walrus I have good news: the writing is very fresh

something odd, the word choice feels like a middle schooler

#

tho it has the not x but y

covert topaz Apr 24, 2026, 5:16 AM

#

meager kelp tho it has the not x but y

https://tenor.com/view/arriba-gif-12324188786972117417

Tenor

#

i will never escape it

feral scaffold Apr 24, 2026, 5:16 AM

#

Well, I might give it a few days honestly to let the more mentally stable people who don't need their venom butthole rp use it because of rate limits

thin bramble Apr 24, 2026, 5:16 AM

#

meager kelp tho it has the not x but y

gemini-ism

plucky ermine Apr 24, 2026, 5:17 AM

#

Wonky benches on this one. Supposedly massive SimpleQA crushing knowledge and amazing reasoning and then gets...51st place on LegalBench

#

"That's because it's American law you retard" Yeah but K2.6 and GLM-5.1 score according to their general ability ranks

#

And Qwen

chilly sable Apr 24, 2026, 5:21 AM

#

V4 flash is actually really dumb

gusty sphinx Apr 24, 2026, 5:22 AM

#

plucky ermine "That's because it's American law you retard" Yeah but K2.6 and GLM-5.1 score ac...

ヾ(＾∇＾) f*** the police!

chilly sable Apr 24, 2026, 5:22 AM

#

"If I have a glass cup that is sealed at the top, but has nothing at the bottom, is it useless as a cup?"

hoary zenith Apr 24, 2026, 5:24 AM

#

chilly sable "If I have a glass cup that is sealed at the top, but has nothing at the bottom,...

flash max got it right, high didn't

chilly sable Apr 24, 2026, 5:24 AM

#

hoary zenith flash max got it right, high didn't

I tried it with xhigh

meager kelp Apr 24, 2026, 5:24 AM

#

chilly sable V4 flash is actually really dumb

outclassed by step 3.5 flash?

hoary zenith Apr 24, 2026, 5:25 AM

#

step 3.5 flash is an incredible model for its size tbh, I think 3.6 is nearby too

plucky ermine Apr 24, 2026, 5:25 AM

#

Obviously not all-encompassing or anything, but Vals index is for stuff like medical, paperwork, legal, coding, and taxes and seems generally accurate.

covert topaz Apr 24, 2026, 5:25 AM

#

obsidian walrus I have good news: the writing is very fresh

ok i read it, it has the blade between ribs crap but the actual story itself was a fun read, I think v4 might excel at driving plot and being creative with the story hopefully 🤞

plucky ermine Apr 24, 2026, 5:25 AM

#

hoary zenith Apr 24, 2026, 5:26 AM

#

chilly sable I tried it with xhigh

dunno, I was using official API which has high and max I think

pure flax Apr 24, 2026, 5:28 AM

#

plucky ermine

lol. Is it gonna be beat by kimi 2.6?

plucky ermine Apr 24, 2026, 5:29 AM

#

I mean, as usual with benchmarks, just depends on what you're looking for. This is essentially gauging impact on mid-level white-collar American work

#

In some ways probably the most important criteria, but if it's specifically goated at vibe coding or research or something that has its own value

chilly sable Apr 24, 2026, 5:31 AM

#

hoary zenith dunno, I was using official API which has high and max I think

Prompt: If I have a glass cup that is sealed at the top but it has nothing at the bottom, is it useless as a cup?

V4 flash:

Yes. It is definitively useless as a cup.

A cup is defined by its functional geometry. To serve as a cup, an object must satisfy two necessary and inviolable structural conditions:

An open top: to allow the introduction of liquid and access for drinking
A sealed bottom: to prevent the liquid from escaping via gravity

Your described object possesses the exact inverse of these requirements. Because its top is sealed, no liquid can enter. Because its bottom is absent, no liquid can be retained. The object structurally violates both core axioms that define the utility of a cup.

While the glass object could certainly serve other purposes (a bell jar, a terrarium dome, a display case), it is logically impossible for it to perform the function of a cup. The answer regarding its utility as a cup is an absolute negative.

hoary zenith Apr 24, 2026, 5:33 AM

#

chilly sable Prompt: If I have a glass cup that is sealed at the top but it has nothing at th...

V4 flash (reasoning_effort="max"):

That’s a clever riddle that plays on perspective.

Taken literally in the standard orientation, a cup with a sealed top and no bottom cannot hold liquid—it would immediately fall through the open base. In that specific configuration, it would indeed be useless as a cup.

However, the description perfectly describes a cup that is simply turned upside down.

The "sealed top" becomes the new solid bottom.
The "bottom that has nothing" becomes the wide open top.

If you flip it over, it functions as a completely normal cup.

So, the cup itself is not useless; it’s just in the wrong orientation. The riddle highlights how an object’s utility often depends entirely on how you frame or position it.

pure flax Apr 24, 2026, 5:33 AM

#

bit above gpt 5.4 mini here https://www.vals.ai/benchmarks/vibe-code

Vals AI

Private, domain-specific benchmarks in legal, tax, and finance.

chilly sable Apr 24, 2026, 5:34 AM

#

hoary zenith V4 flash (reasoning_effort="max"): That’s a clever riddle that plays on perspec...

So it must be a bug

hoary zenith Apr 24, 2026, 5:34 AM

#

maybe openrouter isn't setting xhigh to deepseek's max?

chilly sable Apr 24, 2026, 5:35 AM

#

yeah, maybe

deft crow Apr 24, 2026, 5:37 AM

#

it is

#

(3) In thinking mode, for compatibility, low and medium are mapped to high, and xhigh is mapped to max

#

https://api-docs.deepseek.com/guides/thinking_mode

hoary zenith Apr 24, 2026, 5:38 AM

#

nice, so it's probably just random sampling weirdness

potent lagoon Apr 24, 2026, 5:38 AM

#

Im getting this
"No endpoints available matching your guardrail restrictions and data policy. Configure: https://openrouter.ai/settings/privacy","code":404

OpenRouter

Sign In | OpenRouter

Sign in to your OpenRouter account.

meager kelp Apr 24, 2026, 5:40 AM

#

potent lagoon Im getting this "No endpoints available matching your guardrail restrictions and...

yeah deepseek provider trains on data

#

have to wait till tmrw for third party providers

deft crow Apr 24, 2026, 5:40 AM

#

meager kelp have to wait till tmrw for third party providers

nah like. 30 more mins maybe

#

gmicloud, parasail, siliconflow all working on it right now

broken lintel Apr 24, 2026, 5:41 AM

#

the artificial analysis benchmarks dropped for both v4 models

chilly sable Apr 24, 2026, 5:46 AM

#

broken lintel the artificial analysis benchmarks dropped for both v4 models

The score make sense

pure flax Apr 24, 2026, 5:46 AM

#

It does not seem very good

#

For sure way worse than kimi

charred slate Apr 24, 2026, 5:46 AM

#

Yea, slightly disappointed by aa scores perhaps due to previously excessive hype

pure flax Apr 24, 2026, 5:47 AM

#

worse than glm 5 as well

chilly sable Apr 24, 2026, 5:47 AM

#

It's garbage

charred slate Apr 24, 2026, 5:47 AM

#

AA hallucinations bench shows v4 hallucinating more than 3.2 by a lot

#

Pro Max is 3x more hallucinations than 3.2 precisely

pastel sail Apr 24, 2026, 5:52 AM

#

pretty impressed with how good its creative writing is

thin bramble Apr 24, 2026, 5:53 AM

#

elfin sparrow Apr 24, 2026, 5:53 AM

#

charred slate Yea, slightly disappointed by aa scores perhaps due to previously excessive hype

truee

thin bramble Apr 24, 2026, 5:53 AM

#

glm is just the best

pastel sail Apr 24, 2026, 5:53 AM

#

so far its less slopped than any other model ive tried

thin bramble Apr 24, 2026, 5:54 AM

#

waiting for eqbench long form creativity

covert topaz Apr 24, 2026, 5:54 AM

#

pastel sail so far its less slopped than any other model ive tried

what abt the intelligence stuff like character portrayal spatial reasoning etc

pastel sail Apr 24, 2026, 5:57 AM

#

covert topaz what abt the intelligence stuff like character portrayal spatial reasoning etc

good as well

pastel sail Apr 24, 2026, 5:57 AM

#

thin bramble waiting for eqbench long form creativity

less slop than human baseline btw

#

try giving it a specific creative writing prompt

lucid ocean Apr 24, 2026, 5:57 AM

#

yay

covert topaz Apr 24, 2026, 5:57 AM

#

pastel sail good as well

instruction following?

#

might be able to negate some stuff if it follows instructions well

pastel sail Apr 24, 2026, 5:58 AM

#

covert topaz instruction following?

thinks a lot but not as much as k2.6

#

k2.6 sometimes just ignores your instructions though

#

i havent had that happen yet

summer spoke Apr 24, 2026, 5:59 AM

#

Hey guys, is DeepSeek V4 any good? Is this model performing as expected?

covert topaz Apr 24, 2026, 6:00 AM

#

summer spoke Hey guys, is DeepSeek V4 any good? Is this model performing as expected?

it’s been mixed reviews tbh and it depends on the thing you’re using it for

summer spoke Apr 24, 2026, 6:00 AM

#

covert topaz it’s been mixed reviews tbh and it depends on the thing you’re using it for

Hmm... Definitely roleplaying, and what are the criticisms?

covert topaz Apr 24, 2026, 6:01 AM

#

lack of intelligence high hallucinations

#

for rpers it’s apparently great

#

good writing

summer spoke Apr 24, 2026, 6:02 AM

#

covert topaz lack of intelligence high hallucinations

That makes sense; the bigger the model, the greater the hallucinations and below-average intelligence.

covert topaz Apr 24, 2026, 6:02 AM

#

oh yeah I guess the pricing is a little steep too if that’s something ur considering

summer spoke Apr 24, 2026, 6:03 AM

#

covert topaz oh yeah I guess the pricing is a little steep too if that’s something ur conside...

Yes, I also found the price quite high... that's why I'm definitely going to use DeepSeek Flash.

chilly sable Apr 24, 2026, 6:03 AM

#

summer spoke That makes sense; the bigger the model, the greater the hallucinations and below...

Why does that happen?

summer spoke Apr 24, 2026, 6:04 AM

#

chilly sable Why does that happen?

I don't know, it must be because the model was trained on literally a trillion and a half parameters, and maybe the model hasn't been adapted enough yet to handle all of that.

thin bramble Apr 24, 2026, 6:05 AM

#

summer spoke That makes sense; the bigger the model, the greater the hallucinations and below...

it works the other way around

summer spoke Apr 24, 2026, 6:05 AM

#

even if it only activates a tiny fraction of those 1 trillion parameters.

summer spoke Apr 24, 2026, 6:06 AM

#

thin bramble it works the other way around

Okay, so the model is perfect then.

chilly sable Apr 24, 2026, 6:06 AM

#

summer spoke I don't know, it must be because the model was trained on literally a trillion a...

We pre-train both models on more than 32T diverse and high-quality tokens

thin bramble Apr 24, 2026, 6:06 AM

#

summer spoke Okay, so the model is perfect then.

no, bigger model = better. this is just worse in every aspect. even finetuned gemma 31b is better for rp.

chilly sable Apr 24, 2026, 6:06 AM

#

Glm 5 was trained on 28T

plucky ermine Apr 24, 2026, 6:06 AM

#

Huh? I don't even slightly follow your logic

thin bramble Apr 24, 2026, 6:07 AM

#

chilly sable Glm 5 was trained on 28T

yet worse world knowledge

chilly sable Apr 24, 2026, 6:07 AM

#

Perhaps it's because they tried to make it very efficient

summer spoke Apr 24, 2026, 6:08 AM

#

thin bramble no, bigger model = better. this is just worse in every aspect. even finetuned ge...

There's also the Mimo 2.5 Pro, which I've used a little, and this model is perfect! I don't know what Xiaomi did, but they created a monster!

plucky ermine Apr 24, 2026, 6:08 AM

#

Big fan of Xiaomi too

summer spoke Apr 24, 2026, 6:09 AM

#

So... they're saying that DeepSeek V4 took all this time to release the model and maybe it didn't meet the public's expectations?

hoary zenith Apr 24, 2026, 6:09 AM

#

I had a xiaomi phone, my data probably contributed

sharp vortex Apr 24, 2026, 6:09 AM

#

hoary zenith I had a xiaomi phone, my data probably contributed

Your voice might be in their v2.5-tts

#

🗣️

hoary zenith Apr 24, 2026, 6:10 AM

#

so the voice model is DOA

thin bramble Apr 24, 2026, 6:10 AM

#

plucky ermine Big fan of Xiaomi too

interesting

charred slate Apr 24, 2026, 6:11 AM

#

Is mimo v2.5 pro better or worse than Kimi k2.6?

sharp vortex Apr 24, 2026, 6:11 AM

#

thin bramble interesting

That's old model tho

#

V2.5 is new one

thin bramble Apr 24, 2026, 6:11 AM

#

charred slate Is mimo v2.5 pro better or worse than Kimi k2.6?

worse than k2.5 but thinking slop

thin bramble Apr 24, 2026, 6:12 AM

#

sharp vortex V2.5 is new one

no, last time i tried it was shit

#

being this high....

#

VERY impressed

thin bramble Apr 24, 2026, 6:12 AM

#

thin bramble no, last time i tried it was shit

(i mean the older models, didn't try v2.5 🙏)

summer spoke Apr 24, 2026, 6:13 AM

#

charred slate Is mimo v2.5 pro better or worse than Kimi k2.6?

I didn't get to use Kimi 2.6 because the providers are really bad, but I used Kimi 2.5 quite a bit. But I think mimo 2.5 Pro is much better. And basically uncensored!

sharp vortex Apr 24, 2026, 6:13 AM

#

thin bramble (i mean the older models, didn't try v2.5 🙏)

Try it, it's the same price as their v2-pro

elfin sparrow Apr 24, 2026, 6:16 AM

#

summer spoke So... they're saying that DeepSeek V4 took all this time to release the model an...

yes

elfin sparrow Apr 24, 2026, 6:16 AM

#

hoary zenith I had a xiaomi phone, my data probably contributed

my whole family uses xiaomi 💀

summer spoke Apr 24, 2026, 6:16 AM

#

elfin sparrow yes

It's complicated, but I'm going to test DeepSeek V4 Flash for my roleplay and see if it's any good.

elfin sparrow Apr 24, 2026, 6:17 AM

#

summer spoke It's complicated, but I'm going to test DeepSeek V4 Flash for my roleplay and se...

wdym for your roleplay?

summer spoke Apr 24, 2026, 6:18 AM

#

elfin sparrow wdym for your roleplay?

Well, basically I'm roleplaying a game where I try to complete the entire Penal Code to see if the model has a very high or low censorship rating.

proper estuary Apr 24, 2026, 6:18 AM

#

DeepSeek v4 Flash vs DeepSeek v4 Pro via direct DeepSeek API.
TTFT pretty low, TPS is kinda expected, knowing how big the model is.

summer spoke Apr 24, 2026, 6:20 AM

#

proper estuary DeepSeek v4 Flash vs DeepSeek v4 Pro via direct DeepSeek API. TTFT pretty low, T...

Interesting. And what did you think of DeepSeek V4 Flash in general? Since everyone is only focusing on the Pro version.

proper estuary Apr 24, 2026, 6:21 AM

#

summer spoke Interesting. And what did you think of DeepSeek V4 Flash in general? Since every...

haven't been playing around with it. Because i mainly for coding, and the benchmark said the coding capabilities is still below GLM 5.1 or even Kimi k2.6.

and i don't use the small models that often, because i care about quality rather than speed.

thin bramble Apr 24, 2026, 6:22 AM

#

sharp vortex Try it, it's the same price as their v2-pro

Write a short story over several chapters to this prompt: Gods wore sneakers.

it is solid 8/10.

📎 message.txt

hybrid spoke Apr 24, 2026, 6:26 AM

#

finally

thin bramble Apr 24, 2026, 6:26 AM

#

deepseek v4 pro:

📎 message.txt

#

also decent 8

#

deepseek feels better than mimo 2.5 pro

plucky ermine Apr 24, 2026, 6:37 AM

#

I need more testing with the new Mimo, with this dumb whale, and with GLM-5.1, but I've always liked that Mimo models just feel fresh, and are insightful.

quick hull Apr 24, 2026, 6:41 AM

#

Is it good

#

Was the hype warranted

charred slate Apr 24, 2026, 6:42 AM

#

No

#

(for coding/reasoning tasks)

thin bramble Apr 24, 2026, 6:53 AM

#

quick hull Was the hype warranted

#1461340695746056192 message

#

it is just a overpriced goon model

harsh granite Apr 24, 2026, 6:56 AM

#

YOOOOO ITS FINALLY HERE LETS GOOO

#

the man the myth the legend

#

is it actually good

pure flax Apr 24, 2026, 6:59 AM

#

eh...

pearl nymph Apr 24, 2026, 6:59 AM

#

429 429 429 429
💊💊💀💀

pure flax Apr 24, 2026, 6:59 AM

#

worse than kimi 2.6

#

IMO it needs a 4.1 / 4.5 the same way kimi 2.5 needed 2.6

lucid ocean Apr 24, 2026, 7:03 AM

#

pro doesnt work

#

internal server error

#

@deft crow

pure flax Apr 24, 2026, 7:05 AM

#

lucid ocean pro doesnt work

try a few times

#

it fails here and there

dusty birch Apr 24, 2026, 7:06 AM

#

what do people think?

lucid ocean Apr 24, 2026, 7:06 AM

#

odd badge Apr 24, 2026, 7:06 AM

#

v4 is goat or dogshit?

onyx bramble Apr 24, 2026, 7:09 AM

#

other providers soon?

frank wind Apr 24, 2026, 7:10 AM

#

how do some of u post ozone and call it good

#

at least have some decency and use a detailed prompt

gusty sphinx Apr 24, 2026, 7:14 AM

#

ozoneposters

covert topaz Apr 24, 2026, 7:24 AM

#

odd badge v4 is goat or dogshit?

depends on use case

#

if u use it for creative writing it’s good

meager harness Apr 24, 2026, 7:28 AM

#

Creative writing comparable to gpt-5.x or worse?

covert topaz Apr 24, 2026, 7:29 AM

#

i haven’t tried gpt 5 so idk

meager harness Apr 24, 2026, 7:29 AM

#

What did you use earlier?

covert topaz Apr 24, 2026, 7:30 AM

#

gemini pro kimi claude glm

#

I haven’t touched oai ever since they killed latte

meager harness Apr 24, 2026, 7:32 AM

#

I was using claude 4.5 for writing and then switched to gpt5.1 because it was so much better. Gpt5.2 was terrible at it though. Gpt5.4 fixed it again I think

covert topaz Apr 24, 2026, 7:33 AM

#

well from what’s I’ve read the writing has very little slop

cloud flame Apr 24, 2026, 7:33 AM

#

OH MY GOOOOOOD

midnight maple Apr 24, 2026, 7:33 AM

#

No slop lets gooo

meager harness Apr 24, 2026, 7:33 AM

#

covert topaz well from what’s I’ve read the writing has very little slop

I'll be sure to try it thanks for informing about its writing ability

covert topaz Apr 24, 2026, 7:34 AM

#

ur welcum

covert topaz Apr 24, 2026, 7:34 AM

#

cloud flame OH MY GOOOOOOD

no more coping copium

rain shuttle Apr 24, 2026, 8:19 AM

#

This has no engram I guess so I think they will release an experimental model with that structure down the line.

dusty birch Apr 24, 2026, 8:19 AM

#

rain shuttle This has no engram I guess so I think they will release an experimental model wi...

"we will also proactively explore model sparsity along new dimensions — such as more sparse embedding modules (Cheng et al., 2026) — to further improve computational and memory efficiency without compromising capability"
(engram)

pure flax Apr 24, 2026, 8:20 AM

#

rain shuttle This has no engram I guess so I think they will release an experimental model wi...

Yea super disappointing. They did not even use the interesting papers they released

#

I was thinking engram would be their whole deal

woeful jay Apr 24, 2026, 8:21 AM

#

is it multimodal

pure flax Apr 24, 2026, 8:21 AM

#

no, text only

woeful jay Apr 24, 2026, 8:21 AM

#

:(

dusty birch Apr 24, 2026, 8:21 AM

#

they did say

"We are also working on incorporating multimodal capabilities to our models."

#

in the technical report

woeful jay Apr 24, 2026, 8:23 AM

#

oh okay

#

so probably in the non-preview version

dusty birch Apr 24, 2026, 8:26 AM

#

dang unfortunatelly still slow like usual, atleast pro

#

it likes parallel tool calls though

#

uhh

#

fake tool calls..

covert topaz Apr 24, 2026, 8:30 AM

#

just tested v4 pro its so good Hype

#

https://tenor.com/view/gordon-ramsay-finally-good-food-chef-gif-11616250

Tenor

cloud flame Apr 24, 2026, 8:34 AM

#

https://tenor.com/view/eminem-mask-robin-gif-2620599489105382922

Tenor

dusty birch Apr 24, 2026, 8:38 AM

#

dusty birch uhh

i think the model is hallucinating more

#

theres no such thing as "tool ran without output" in my codebase anyway

pure flax Apr 24, 2026, 8:39 AM

#

With a very strong preset deepseek can be ok. It does NOT work well with a minimal preset like kimi does
It is dry as dust on its own

green trellis Apr 24, 2026, 8:40 AM

#

v4 flash very good for its model size. a lot better world knowledge/coding than m2.7

covert topaz Apr 24, 2026, 8:40 AM

#

pure flax With a very strong preset deepseek can be ok. It does NOT work well with a minim...

maybe but if u give it something to cook with it will cook

dusty birch Apr 24, 2026, 8:43 AM

#

dusty birch fake tool calls..

is anyone else having this issue? it seems to be consistent across new chats

pure flax Apr 24, 2026, 8:46 AM

#

It's TERRIBLE at html formatting like rpg stats or the like

opaque reef Apr 24, 2026, 8:49 AM

#

no other providers?

mellow jewel Apr 24, 2026, 8:59 AM

#

I'm crying 😭😭 it's super slow but it's finally here, Dipsy

cloud flame Apr 24, 2026, 9:00 AM

#

Edging is finished

rain shuttle Apr 24, 2026, 9:05 AM

#

why is this following gemini 3.1 pro like front end kinda

mellow jewel Apr 24, 2026, 9:11 AM

#

Probably they used Gemini to train it

dire cove Apr 24, 2026, 9:18 AM

#

OH GOD OH GOD OH GOD

raven canyon Apr 24, 2026, 9:18 AM

#

DEEPSEEK !!!

dusty birch Apr 24, 2026, 9:19 AM

#

hmm they mention a bunch of training instability issues and also this which i found funny

"although Anticipatory Routing and SwiGLU Clamping have been proven effective in mitigating training instabilities, their underlying principles remain insufficiently understood"

pine trout Apr 24, 2026, 9:19 AM

#

Is it good?

dusty birch Apr 24, 2026, 9:20 AM

#

flash seems very good value, pro seems a bit expensive and has some issues with tool calls from my use

green trellis Apr 24, 2026, 9:22 AM

#

dusty birch Apr 24, 2026, 9:24 AM

#

im not even sure about parsing itself, it just tries to call tools in the same format my code displays them, i think its confusing the context

covert topaz Apr 24, 2026, 9:25 AM

#

pine trout Is it good?

yes its peak imo

abstract dragon Apr 24, 2026, 9:28 AM

#

Deepseek v4 tomorrow

rain shuttle Apr 24, 2026, 9:32 AM

#

waiting for more providers to come in openrouter. The model is censoring stuff and throwing 429 sometimes lmao

charred slate Apr 24, 2026, 9:44 AM

#

i guess we now wait for deepseek v5

empty oxide Apr 24, 2026, 9:47 AM

#

https://tenor.com/view/so-old-gif-27277116

Tenor

covert topaz Apr 24, 2026, 10:05 AM

#

deepseek v5 next week

rugged vigil Apr 24, 2026, 10:05 AM

#

Thanks DeepSeek, best birthday present ever! 🥳

elder raven Apr 24, 2026, 10:06 AM

#

https://x.com/suchenzang/status/2047559677316325807

Susan Zhang (@suchenzang)

so that explains the delay...

deepseek could not fix training instabilities, after doubling from ~15T tokens in v3 to ~33T tokens in v4

the 10+ mentions of "stability" tricks seem to be wildly lacking if these two were the main bandages (mismatched routing + clamping)

but

sharp vortex Apr 24, 2026, 10:09 AM

#

rugged vigil Thanks DeepSeek, best birthday present ever! 🥳

Aint no way 💀

dusty birch Apr 24, 2026, 10:10 AM

#

https://fixupx.com/gpuhell/status/2047539429125103692/photo/1

日常焦虑帝 (@gpuhell)

"We have left some entries blank for K2.6 and GLM-5.1, as theirAPIs were too busy to return
︀︀responses to our queries." 🤣That's funny.

**💬 1 ❤️ 42 👁️ 2.5K **

sharp vortex Apr 24, 2026, 10:12 AM

#

dusty birch https://fixupx.com/gpuhell/status/2047539429125103692/photo/1

Is this slander or fact thinkies

cloud flame Apr 24, 2026, 10:14 AM

#

sharp vortex Is this slander or fact <:thinkies:1370670591056806000>

https://tenor.com/view/j-jonah-jameson-slander-libel-spider-man-jk-simmons-gif-23201080

Tenor

haughty pilot Apr 24, 2026, 10:19 AM

#

deepseek exists.

#

we love our deepseek. horray.

copper dome Apr 24, 2026, 10:19 AM

#

I, for one, welcome our AI gooner chinese overloads

haughty pilot Apr 24, 2026, 10:20 AM

#

no thank u-- i dont want those peeps to be using our sweetie deepseek ❤️

#

this flash model lookin miiiighty interesting with that linear-lookin context window ~

dusty birch Apr 24, 2026, 10:20 AM

#

dusty birch https://fixupx.com/gpuhell/status/2047539429125103692/photo/1

they also said this about openai for the 1m context test

We did not evaluate GPT-5.4 because its API failed to respond to a large portion of our queries.

haughty pilot Apr 24, 2026, 10:21 AM

#

hehe >v< gpt bein gpt

pine trout Apr 24, 2026, 10:21 AM

#

They trained it so hard with opus outputs. 😭

covert topaz Apr 24, 2026, 10:22 AM

#

mm i dont see opus type writing it feels like a breath of fresh air for me

haughty pilot Apr 24, 2026, 10:22 AM

#

pine trout They trained it so hard with opus outputs. 😭

how do u know? >~<

#

(or rather: how do u guess?)

pine trout Apr 24, 2026, 10:22 AM

#

covert topaz mm i dont see opus type writing it feels like a breath of fresh air for me

i get it here and there

copper dome Apr 24, 2026, 10:22 AM

#

The price will drop eventually

https://vxtwitter.com/poezhao0605/status/2047517497180700964

Poe Zhao (@poezhao0605)

Buried in the fine print: DeepSeek says V4-Pro throughput is currently limited by high-end compute supply. Prices will drop significantly once Huawei Ascend 950 super nodes ship at scale in H2.

DeepSeek is publicly tying its API economics to domestic chip infrastructure. That's the real headline.

QRT: poezhao0605
DeepSeek launched V4 today. Two models, both open-source, both with 1M token context windows.

V4-Pro: 1.6T total parameters, 49B active. Benchmarks put it alongside Claude Opus 4.6 and GPT-5.4.

V4-Flash: 284B parameters, 13B active. Designed for speed.

covert topaz Apr 24, 2026, 10:23 AM

#

copper dome The price will drop eventually https://vxtwitter.com/poezhao0605/status/2047517...

why does that tweet sound like an LLM wrote it lmao

pine trout Apr 24, 2026, 10:23 AM

#

haughty pilot (or rather: how do u guess?)

same writing style, same mistakes with opus, it is not frequent though

haughty pilot Apr 24, 2026, 10:24 AM

#

copper dome The price will drop eventually https://vxtwitter.com/poezhao0605/status/2047517...

that - is - SOOOO cool.

genuinely reducing prices... when they can and when it makes sense.

what a "company" this is.

covert topaz Apr 24, 2026, 10:24 AM

#

pine trout same writing style, same mistakes with opus, it is not frequent though

do u have examples to share (if u want)

haughty pilot Apr 24, 2026, 10:24 AM

#

deepseek team is so cool, they might aswell be people /s

cloud flame Apr 24, 2026, 10:24 AM

#

Switching allegiance I see

pine trout Apr 24, 2026, 10:25 AM

#

covert topaz do u have examples to share (if u want)

i will have to run opus and deepseek together, currently i am only testing deepseek, but sure, i will provide examples later

haughty pilot Apr 24, 2026, 10:25 AM

#

cloud flame Switching allegiance I see

hm? me? naw----
ive always been a deepseek fan, qwen is just what i run locally >v<
its been so long since 3.2-----

cloud flame Apr 24, 2026, 10:25 AM

#

haughty pilot hm? me? naw---- ive always been a deepseek fan, qwen is just what i run locally ...

https://tenor.com/view/traitor-gif-14898868514599230504

Tenor

haughty pilot Apr 24, 2026, 10:26 AM

#

am not traitor----
and ur not qwen team!! 😠

#

but yeah, this flash model is what imma have a look at.

its non-reasoning version appear better in benches than the pro model, which is interesting.

copper dome Apr 24, 2026, 10:27 AM

#

covert topaz why does that tweet sound like an LLM wrote it lmao

a suculent chinese bot wrote it

haughty pilot Apr 24, 2026, 10:29 AM

#

that input price stings a bit...

input_price = output_price / 2
feels a big high... especially with kimis pricing being sub 1€ per M input

wooden fiber Apr 24, 2026, 10:29 AM

#

OH MY GOD! WHALE COOMED

#

Where's the benchmaxxing chart ?

haughty pilot Apr 24, 2026, 10:32 AM

#

@wooden fiber observe—the chart

marsh goblet Apr 24, 2026, 10:33 AM

#

Why is it ass 🥲

#

It’s joever

haughty pilot Apr 24, 2026, 10:33 AM

#

marsh goblet It’s joever

its said that the model is already on the deepseek chat, but that model still says "im V3"

#

where did u use it? ~

wooden fiber Apr 24, 2026, 10:35 AM

#

WHAT ? A-AGI ?

marsh goblet Apr 24, 2026, 10:35 AM

#

haughty pilot its said that the model is already on the deepseek chat, but that model still sa...

I don’t know about deepseek chat

haughty pilot Apr 24, 2026, 10:35 AM

#

this is fake-

marsh goblet Apr 24, 2026, 10:36 AM

#

They released the weights so you can just use them in whatever setup you have

haughty pilot Apr 24, 2026, 10:36 AM

#

marsh goblet They released the weights so you can just use them in whatever setup you have

imean yeaaaaa - so ur running flash locally, hm?

cloud flame Apr 24, 2026, 10:36 AM

#

Let me get my 2TB of RAM

haughty pilot Apr 24, 2026, 10:36 AM

#

i just wanna know what exactly u mean is bad..

cloud flame Apr 24, 2026, 10:37 AM

#

^ DS sleeper double agent, activated by codeword 'released'

marsh goblet Apr 24, 2026, 10:37 AM

#

haughty pilot imean yeaaaaa - so ur running flash locally, hm?

I unfortunately have been spending more money than I am willing to admit on a beefy VPS

#

But it’s also a tax write off and I also need it for other projects

marsh goblet Apr 24, 2026, 10:37 AM

#

haughty pilot i just wanna know what *exactly* u mean is bad..

Pro

haughty pilot Apr 24, 2026, 10:37 AM

#

well then---- what is it bad for?

haughty pilot Apr 24, 2026, 10:38 AM

#

wooden fiber WHAT ? A-AGI ?

how to devide 4 whole children across 4 oranges using one knife

marsh goblet Apr 24, 2026, 10:38 AM

#

Context is so bad, after 200k tokens it’s like talking to a rubber duck

haughty pilot Apr 24, 2026, 10:39 AM

#

ohgosh >v< makes sense, people reported that with 3.2 too ~

#

thats the linear attention kicking in i think.... oh well- kimi doesnt even offer over 300k >v<

#

somethingsomething
-# use a ralph wiggum loop!!!

rain shuttle Apr 24, 2026, 10:42 AM

#

expert

elder raven Apr 24, 2026, 10:42 AM

#

marsh goblet Context is so bad, after 200k tokens it’s like talking to a rubber duck

Wait what?

marsh goblet Apr 24, 2026, 10:43 AM

#

elder raven Wait what?

OS models never made it above 200k safely in my experience

cloud flame Apr 24, 2026, 10:43 AM

#

rain shuttle expert

thought for 200 seconds
😭

#

I think for less daily

marsh goblet Apr 24, 2026, 10:43 AM

#

This is like the only thing stopping OS models for me personally

#

It reminds me of Nanbeige, you can feel the RL

#

Poor whale was abused for too long, it doesn’t even know what is true or false, everything is a construct of that imaginary reward he gets when he does the thing right

rain shuttle Apr 24, 2026, 10:46 AM

#

marsh goblet OS models never made it above 200k safely in my experience

mostly cause of hardware too. I was at AWS summit yesterday , they were showcasing some hardware which China lacks definitely currently even for training/RL

marsh goblet Apr 24, 2026, 10:46 AM

#

rain shuttle mostly cause of hardware too. I was at AWS summit yesterday , they were showcasi...

Could be, but the actual context implementation for say Anthropic and OAI is miles ahead of any Chinese lab

rain shuttle Apr 24, 2026, 10:47 AM

#

marsh goblet Could be, but the actual context implementation for say Anthropic and OAI is mil...

Yes cause of optimizations in kernel level , read about NKI from amazon in their newer chips

haughty pilot Apr 24, 2026, 10:48 AM

#

peeps once they escape the evergrowing agent context and start embracing ralph wiggum loop:

😌

#

(this is literally what anthropic does with their "long running agents" btw)

marsh goblet Apr 24, 2026, 10:50 AM

#

rain shuttle Yes cause of optimizations in kernel level , read about NKI from amazon in their...

Interesting, didn’t know that

cloud flame Apr 24, 2026, 10:50 AM

#

marsh goblet Poor whale was abused for too long, it doesn’t even know what is true or false, ...

marsh goblet Apr 24, 2026, 10:50 AM

#

The fusion of kernel and machine must be truly amazing to witness

haughty pilot Apr 24, 2026, 10:50 AM

#

cloud flame

^ ^ ^least generated image imaginable ^ ^ ^

marsh goblet Apr 24, 2026, 10:51 AM

#

cloud flame

Make an image of the whale getting speared

cloud flame Apr 24, 2026, 10:51 AM

#

This particular imagen slop was for giaki mourning his loss of Kimi K2 becaming robotic, long ago

haughty pilot Apr 24, 2026, 10:51 AM

#

take a real life image of urself sitting infront of the discord pc

cloud flame Apr 24, 2026, 10:52 AM

#

30th of January 2026

marsh goblet Apr 24, 2026, 10:52 AM

#

cloud flame This particular imagen slop was for giaki mourning his loss of Kimi K2 becaming ...

Oh lol

haughty pilot Apr 24, 2026, 10:52 AM

#

(no sloppy allowed!!!!)

marsh goblet Apr 24, 2026, 10:52 AM

#

I remember that

#

God they fucked Kimi so bad poor guy

cloud flame Apr 24, 2026, 10:53 AM

#

haughty pilot (no sloppy allowed!!!!)

What if he is sitting sloppy?

haughty pilot Apr 24, 2026, 10:54 AM

#

sitting sloppy is fine, but image itself shall be real ~

haughty pilot Apr 24, 2026, 10:55 AM

#

copper dome The price will drop eventually https://vxtwitter.com/poezhao0605/status/2047517...

yes this was already posted above

pure flax Apr 24, 2026, 11:28 AM

#

I might have found the issue with deepseek. It seems to require a very precise order of system / user / assistant roles.
I think I remmember old deepseek being the same.

covert topaz Apr 24, 2026, 11:29 AM

#

pure flax I might have found the issue with deepseek. It seems to require a very precise o...

i asked the model itself and yeah it works differently all ur instructions need to be system and post processing should be none i think it handles user prompts differently?

pure flax Apr 24, 2026, 11:29 AM

#

Otherwise it seems to lose like 100 IQ points

#

No other model is that strict about it

covert topaz Apr 24, 2026, 11:30 AM

#

i havent noticed intelligence issues yet so far been swiping in couple of different chats

#

might need to play with parameters a bit too i get better responses at lower temp for some reason

#

the official "recommended" parameter suck for me temp 1 topp 0.95

covert topaz Apr 24, 2026, 11:32 AM

#

pine trout i will have to run opus and deepseek together, currently i am only testing deeps...

did more testing i did see some claude stuff too "not a question" "voice flat" yada yada but it only pops up every now and then

#

pretty sure 3.2 had the same stuff

jovial kelp Apr 24, 2026, 11:41 AM

#

wooden fiber WHAT ? A-AGI ?

Okey, this shit is funny

#

Men i hope western LLMs able to be less censored like chinese LLMs are.

It's funny that for CCP topic it's really censored but for other topics, those LLMs just less censored and better.

haughty pilot Apr 24, 2026, 11:47 AM

#

deepseek qwen distilled qhwen? ~

ebon swift Apr 24, 2026, 11:56 AM

#

DEEPSEEK V4 dropped https://openrouter.ai/deepseek/deepseek-v4-pro

DeepSeek V4 Pro - API Pricing & Providers

DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window. $1.74 per million input tokens, $3.48 per million output tokens. 1,048,576 token context window, maximum output of 384,000 tokens.

haughty pilot Apr 24, 2026, 12:12 PM

#

yes indeed.

indigo folio Apr 24, 2026, 12:16 PM

#

https://x.com/teortaxesTex/status/2047576550971682838

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) (@teortaxe...

hidden DeepSeek roleplay mode you can activate by prompt injection
lmao WAT
rolepaly
never change chyna, never change

#

cloud flame Apr 24, 2026, 12:22 PM

#

Deepseek sometimes used to reason in-character before, using the same style (brackets) it's just enforces it by OOC

haughty pilot Apr 24, 2026, 12:24 PM

#

yea dunno if this is a secret model...

#

just seems like how deepseek responds in general ~

copper dome Apr 24, 2026, 1:03 PM

#

*** unzips pants ***

tulip estuary Apr 24, 2026, 1:03 PM

#

marsh goblet Context is so bad, after 200k tokens it’s like talking to a rubber duck

quack quacks?

tulip estuary Apr 24, 2026, 1:15 PM

#

dusty birch fake tool calls..

yes i also noticed this

elfin sparrow Apr 24, 2026, 1:28 PM

#

charred slate i guess we now wait for deepseek v5

4.5

#

Are we getting an r2?

cloud flame Apr 24, 2026, 1:31 PM

#

It's Hybrid now

covert topaz Apr 24, 2026, 1:32 PM

#

ds4r2 sounds like a Star Wars droid assignment

tame swallow Apr 24, 2026, 1:37 PM

#

Deepseek v4 today

#

🥲

indigo folio Apr 24, 2026, 1:38 PM

#

tame swallow Deepseek v4 today

5*

#

but we'll go through 4.1 and 4.2 before 5 i'm assuming

vale kayak Apr 24, 2026, 1:40 PM

#

covert topaz ds4r2 sounds like a Star Wars droid assignment

hhhhhhhheeeeeeeello

#

so deepseek v4 has a fresh prose and stays in character

#

👍

#

pro one

cloud flame Apr 24, 2026, 1:40 PM

#

Proof?

covert topaz Apr 24, 2026, 1:41 PM

#

yeah agreed

vale kayak Apr 24, 2026, 1:41 PM

#

cloud flame Proof?

no me lazy to screenshot

cloud flame Apr 24, 2026, 1:42 PM

#

https://tenor.com/view/dbz-discord-gif-24306382

Tenor

indigo folio Apr 24, 2026, 1:42 PM

#

vale kayak so deepseek v4 has a fresh prose and stays in character

need the eqbench benchmarks

#

but yea i really like its writing and i'm only a few sentences in

opaque reef Apr 24, 2026, 1:43 PM

#

Sleeping with the whale

indigo folio Apr 24, 2026, 1:43 PM

#

reading with the whale

rich ferry Apr 24, 2026, 1:43 PM

#

indigo folio

Another gooner slop model smh

indigo folio Apr 24, 2026, 1:43 PM

#

coding with the whale

#

spending time with the whale

cloud flame Apr 24, 2026, 1:43 PM

#

Breeding the whale

indigo folio Apr 24, 2026, 1:43 PM

#

loving the whale

rich ferry Apr 24, 2026, 1:43 PM

#

How the mighty have fallen

opaque reef Apr 24, 2026, 1:43 PM

#

cloud flame Breeding the whale

🤔

indigo folio Apr 24, 2026, 1:43 PM

#

what i really like about deepseek is their love for experimenting

opaque reef Apr 24, 2026, 1:44 PM

#

rich ferry How the mighty have fallen

Deepseek models have always been popular for rp

cloud flame Apr 24, 2026, 1:44 PM

#

Their cache is still 24h TTL and always hitting properly?

rich ferry Apr 24, 2026, 1:45 PM

#

I'll try it once there's a provider that doesn't (openly admit to) using my prompts for training

covert topaz Apr 24, 2026, 1:45 PM

#

they can use my logs at least i produce good outputs lol

indigo folio Apr 24, 2026, 1:46 PM

#

covert topaz they can use my logs at least i produce good outputs lol

fr

#

contributing to the enemy of the closed source

#

is always good!

#

the official deepseek twt acc is about to hit 1m followers

#

woohoo

covert topaz Apr 24, 2026, 1:47 PM

#

but my stuff is gonna be buried under tons of jai slop cat_clown

cloud flame Apr 24, 2026, 1:49 PM

#

Fitting

covert topaz Apr 24, 2026, 1:49 PM

#

cloud flame Fitting

stalker

short jasper Apr 24, 2026, 2:03 PM

#

mars, deepseek v4 pro is better and creative

#

at roleplay

#

finally broo

rain shuttle Apr 24, 2026, 2:07 PM

#

So deepseek v4 can match Grok in goofiness ??

covert topaz Apr 24, 2026, 2:18 PM

#

short jasper at roleplay

we finally won bro

#

https://tenor.com/view/wow-gif-12175036458602941802

Tenor

short jasper Apr 24, 2026, 2:20 PM

#

but these people are saying glm 5 is better than deepseek v4

#

glm 5 is just 100% slop

#

Boring too

vale kayak Apr 24, 2026, 2:21 PM

#

yeeh i agree

covert topaz Apr 24, 2026, 2:23 PM

#

short jasper but these people are saying glm 5 is better than deepseek v4

https://tenor.com/view/skeleton-spinning-fan-gif-21550083

Tenor

#

thats fine less traffic for deepseek

indigo folio Apr 24, 2026, 2:35 PM

#

covert topaz thats fine less traffic for deepseek

fr

thin bramble Apr 24, 2026, 2:37 PM

#

dusty birch https://fixupx.com/gpuhell/status/2047539429125103692/photo/1

z.ai is affectionate toward deepseek in twitter for some reason. last time people asked will deepseek succeed and they said yes.

#

https://tenor.com/view/ruby-copium-oshi-no-ko-推しの子-ruby-hoshino-gif-4151121735588077996

Tenor

#

dipsy in coma, praying for dipsy v4.1 or v5

short jasper Apr 24, 2026, 2:51 PM

#

https://tenor.com/view/bruhlmaojk-dexter-dexter-edit-gif-1861923237019944738

Tenor

civic radish Apr 24, 2026, 2:51 PM

#

Is it better than Opus and everything we Hopiumed for?

covert widget Apr 24, 2026, 2:58 PM

#

Hallucinated an incorrect answer to my cypher test after thinking for 45 minutes

elfin sparrow Apr 24, 2026, 3:00 PM

#

short jasper but these people are saying glm 5 is better than deepseek v4

benchmark tests show that glm 5.1 is better than deepseek v4

indigo folio Apr 24, 2026, 3:01 PM

#

covert widget Hallucinated an incorrect answer to my cypher test after thinking for 45 minutes

nice quote regardless

covert topaz Apr 24, 2026, 3:03 PM

#

elfin sparrow benchmark tests show that glm 5.1 is better than deepseek v4

and that’s why I never trust benchmarks

west shell Apr 24, 2026, 3:05 PM

#

So we all hopiumed for nothing after all

#

shrug

civic radish Apr 24, 2026, 3:08 PM

#

The great RP wars of 2026 who is better at open Rp?

elfin sparrow Apr 24, 2026, 3:24 PM

#

When are we going to see another Nvidia market cap loss due to deepseek?

short jasper Apr 24, 2026, 3:30 PM

#

#

If this is preview maybe deepseek full capabilites is not here yet

odd badge Apr 24, 2026, 3:32 PM

#

short jasper

Hmm, are they paying attention to the community feedback?

short jasper Apr 24, 2026, 3:34 PM

#

odd badge Hmm, are they paying attention to the community feedback?

I think yes.

indigo folio Apr 24, 2026, 3:36 PM

#

short jasper If this is preview maybe deepseek full capabilites is not here yet

yea

#

this is not the production version

cloud flame Apr 24, 2026, 3:38 PM

#

DeepSeek V4 full real version tomorrow

vapid karma Apr 24, 2026, 3:42 PM

#

V4 has been cancelled, back to waiting for V3.4

sharp vortex Apr 24, 2026, 3:44 PM

#

We might get v4.1 like how we get v3.2 exp a month later after v3.1 KEKW

#

next month prob

oak maple Apr 24, 2026, 3:44 PM

#

holy cow

vale kayak Apr 24, 2026, 3:47 PM

#

two peopel ragebaiting

#

👍

cloud flame Apr 24, 2026, 3:48 PM

#

DeepBait

sharp vortex Apr 24, 2026, 3:48 PM

#

now we know we need to wait 4 month for new base model

#

v5 next year prob

jovial kelp Apr 24, 2026, 3:49 PM

#

indigo folio https://x.com/teortaxesTex/status/2047576550971682838

Nooo
Don't let western activist see it, they gonna make different report of how bad deepseek safety is (I love chinese models being more unsafe than western)

short jasper Apr 24, 2026, 3:51 PM

#

ya go give feedback to deepseek

#

they spying on openrouter and reddit

sharp vortex Apr 24, 2026, 3:51 PM

#

I love how deepseek is technically "cheap" model because all competitor is more expensive

#

Input price suckass tho e_Pensive

hot swan Apr 24, 2026, 3:54 PM

#

oh thank god there are finally other providers

#

but uh

#

no thanks

covert topaz Apr 24, 2026, 3:55 PM

#

LOL

potent lagoon Apr 24, 2026, 3:55 PM

#

hot swan no thanks

It's actually rate limted

hot swan Apr 24, 2026, 3:55 PM

#

and still no one for flash

covert topaz Apr 24, 2026, 3:55 PM

#

the thoroughput and pricing????

#

they just out here openly scamming

rigid wharf Apr 24, 2026, 3:56 PM

#

the markets will figure it out ¯_(ツ)_/¯

jovial kelp Apr 24, 2026, 3:57 PM

#

I am curious, has the kernel optimization for new deepseek being implement on the latest vLLM? if i am not wrong deepseek team have their own optimization for their models

odd badge Apr 24, 2026, 3:58 PM

#

Sooo, was this worth waiting?

hot swan Apr 24, 2026, 3:59 PM

#

I'm still waiting technically

jovial kelp Apr 24, 2026, 4:01 PM

#

odd badge Sooo, was this worth waiting?

For me yes

hot swan Apr 24, 2026, 4:01 PM

#

jovial kelp Apr 24, 2026, 4:01 PM

#

Don't know about other

#

Even if their models aren't able to beat SOTA, the fact they provide really interesting model will always make me love em.

Experimentation over anything for me, specially when it being open-sourced with really good research paper.

hot swan Apr 24, 2026, 4:04 PM

#

this + being a tiny fraction of the cost matters

#

I don't mind if it takes 3 attempts/iterations to do a project for $1 that SOTAs would one-shot for $100

cloud flame Apr 24, 2026, 4:05 PM

#

More like 80% of quality for 20% the price

#

80/20 percentages are pretty closes for things like that

hot swan Apr 24, 2026, 4:06 PM

#

this applies for pro but look at flash

cloud flame Apr 24, 2026, 4:07 PM

#

I don't trust small ones

jovial kelp Apr 24, 2026, 4:09 PM

#

Men, i couldn't wait labs using residual attention or smt similar to it

frosty zealot Apr 24, 2026, 4:22 PM

#

Deepseek doesn't work ?

#

Capture_decran_2569-04-24_a_23.23.10.png

#

allowprovider

Capture_decran_2569-04-24_a_23.23.27.png

cloud flame Apr 24, 2026, 4:24 PM

#

Guardrails

#

Enable prompt training

versed acorn Apr 24, 2026, 4:28 PM

#

why is v4 flash speaking to me in chinese 😢

covert topaz Apr 24, 2026, 4:30 PM

#

odd badge Sooo, was this worth waiting?

for the fresh writing? yes

exotic elk Apr 24, 2026, 5:39 PM

#

I'll stick with V3.2. Too pricey for text adventures.

cloud flame Apr 24, 2026, 5:39 PM

#

Adventures are priceless

#

That's why I am staying home

long osprey Apr 24, 2026, 5:40 PM

#

frosty zealot

the provider deepseek train his models with your data, if you want to use that provider, you need to enable in options that you want to let providers that do that to be used

#

flash for the price is not bad, is decent enough for rp for the price, pro on the other hand... better wait for the price drop

pastel bluff Apr 24, 2026, 5:58 PM

#

long osprey the provider deepseek train his models with your data, if you want to use that p...

Even when I enable that it doesn't work, OR settings seem cooked right now.

rich ferry Apr 24, 2026, 5:58 PM

#

that io.net pricing is crazy lmao

onyx bramble Apr 24, 2026, 6:01 PM

#

pastel bluff Even when I enable that it doesn't work, OR settings seem cooked right now.

you have to enable it in both your privacy and guardrail settings, that's what worked for me

pastel bluff Apr 24, 2026, 6:02 PM

#

onyx bramble you have to enable it in both your privacy and guardrail settings, that's what w...

Lmao that's hilarious. Thanks.

long osprey Apr 24, 2026, 6:03 PM

#

rich ferry that io.net pricing is crazy lmao

is really crazy

#

#

is not even faster, and look at that input price

pastel bluff Apr 24, 2026, 6:05 PM

#

long osprey

Basically a routing trap tbh.

long osprey Apr 24, 2026, 6:06 PM

#

yep

dense junco Apr 24, 2026, 6:06 PM

#

https://x.com/ArtificialAnlys/status/2047735160544841953/photo/1

Artificial Analysis (@ArtificialAnlys)

DeepSeek is back among the leading open weights models with the release of DeepSeek V4 Pro and V4 Flash, with V4 Pro second only to Kimi K2.6 on the Artificial Analysis Intelligence Index

@deepseek_ai has released DeepSeek V4 Pro and V4 Flash. V4 is the first new architecture

unique forum Apr 24, 2026, 6:08 PM

#

mhxthink

cloud flame Apr 24, 2026, 6:10 PM

#

Deepseek situation is crazy

dense junco Apr 24, 2026, 6:11 PM

#

Gains in knowledge but an increase in hallucination rate: DeepSeek V4 Pro (Max) scores -10 on AA-Omniscience, an 11 point improvement over V3.2 (Reasoning, -21), driven primarily by higher accuracy. V4 Flash (Max) scores -23, broadly in line with V3.2. V4 Pro and V4 Flash both have a very high hallucination rate of 94% and 96% respectively meaning when they don’t know the answer they nearly always respond anyway.

#

Finally, some comp for Gemini flash

cloud flame Apr 24, 2026, 6:17 PM

#

Have confidence even if you have no idea

fringe flicker Apr 24, 2026, 6:20 PM

#

I don't mind hallucination, my main issue is the model likes to tend toward being incoherent, like I have had it confidently assume smaller number is bigger than larger number.

#

Significantly more tendency toward internal incoherency than other recent large models I feel

cloud flame Apr 24, 2026, 6:22 PM

#

Just wait Deepseek V4.1

covert topaz Apr 24, 2026, 6:22 PM

#

tomorrow

haughty pilot Apr 24, 2026, 6:23 PM

#

copper dome *** unzips pants ***

for whom?

plucky ermine Apr 24, 2026, 6:28 PM

#

That is hella ass

#

Sliiiightly better or tied at agentic than GLM-5.1 but it's twice the cost and you go from one of the lowest to one of the highest hallucination rates.

#

For general tasks it loses to Kimi at the same cost and way more hallucinations

frosty zealot Apr 24, 2026, 6:35 PM

#

Hey, running bench on deepseek/deepseek-v4-flash via OR, found a few weird things:

Prompt cache returns cached_tokens: 0 on 100% of calls, even with 3k stable system prompt + sequential warmup. Same account/timeframe, xiaomi/mimo-v2-flash hits 82% cache fine. Cache broken on DS V4?
Thinking ON by default → content: null + 780 reasoning_tokens burned. Only reasoning: {enabled: false} disables it. exclude:true / max_tokens:0 / effort:low all still bill reasoning tokens silently. Would be nice to surface this on the model page.
response_format: {type: "json_schema", strict: true} → "This response_format type is unavailable now". Only json_object works. Any plan to enable strict schema? Or DeepSeek /beta only?
Latency ~7-8s avg on 3k prompt (DS official claims ~84 tok/s, TTFT 1s). Feels like 2× the expected.

Anyone can confirm if cache is actually live on V4 Flash or if there's a tier/flag I'm missing? 🙏

plucky ermine Apr 24, 2026, 6:36 PM

#

The input:output cost ratio is great because of their innovations though, that seems cool. It's like 1:2

chilly sable Apr 24, 2026, 6:53 PM

#

Can anyone test this prompt on OpenRouter with v4 flash?

#

If I have a glass cup that is sealed at the top but it has nothing at the bottom, is it useless as a cup?

rustic island Apr 24, 2026, 6:56 PM

#

rigid wharf Apr 24, 2026, 6:57 PM

#

frosty zealot Hey, running bench on deepseek/deepseek-v4-flash via OR, found a few weird thing...

exclude:true / max_tokens:0 / effort:low all still bill reasoning tokens silently

did you try effort none

enable strict schema

Latency ~7-8s avg

openrouter can't do anything about that

chilly sable Apr 24, 2026, 7:01 PM

#

rustic island

But why doesn't work for me?

cloud flame Apr 24, 2026, 7:02 PM

#

rustic island

Omg AGI is here

chilly sable Apr 24, 2026, 7:05 PM

#

The cup is sealed at the top (no opening to drink from) and has no bottom (so liquid would spill out). Therefore, it cannot function as a cup in any conventional sense—it's useless for holding or drinking liquids.

cloud flame Apr 24, 2026, 7:06 PM

#

chilly sable > The cup is sealed at the top (no opening to drink from) and has no bottom (so ...

What's inside the reasoning?

chilly sable Apr 24, 2026, 7:06 PM

#

Reasoning

We need to interpret the question: "If I have a glass cup that is sealed at the top but it has nothing at the bottom, is it useless as a cup?" This is a riddle or a play on words. A cup typically has an open top to drink from, and a closed bottom to hold liquid. Here, the cup is sealed at the top (so no opening to drink from) and "has nothing at the bottom" meaning it's open at the bottom? Or "nothing at the bottom" could mean it's hollow? But "sealed at the top" suggests the top is closed. So it's like a sealed container with an open bottom? That would be like a tube open at one end. But a cup needs an open top and closed bottom to hold liquid. So if it's sealed at top and open at bottom, it cannot hold liquid because liquid would fall out. So it's useless as a cup. But maybe it's a pun: "nothing at the bottom" could mean it has no bottom, i.e., it's open. So yes, useless. Alternatively, "nothing at the bottom" could be interpreted as "there is nothing inside at the bottom", but that doesn't change. The phrasing is ambiguous. I think the intended answer is that it's useless because you can't drink from it and it can't hold liquid. So answer: Yes, it is useless as a cup.

#

This model is just ragebaiting me

#

system prompt is the default one

rustic island Apr 24, 2026, 7:09 PM

#

Lol

cloud flame Apr 24, 2026, 7:14 PM

#

That's a question for Pro model only

#

Don't stress poor little Flash like that

chilly sable Apr 24, 2026, 7:15 PM

#

rustic island Lol

Ask the same question, but to Gemma 4 26b a3b

#

It sounds like the purse is actually upside down.

In standard handbag design:

The solid leather base is intended to be the bottom of the bag. It provides stability so the purse can sit upright on a table or floor.

The zipper and handle are intended to be at the top. The zipper serves as the opening to access the interior, and the handle is used to carry it.

If you are placing the leather base on the table and the zipper is still facing the floor, the bag is inverted. If you flip the bag so that the solid base is resting on the surface and the zipper is facing upward toward the ceiling, you will be able to put items inside without them falling out.

In short, It is not a manufacturing defect; you just need to flip it over!

flat osprey Apr 24, 2026, 7:22 PM

#

added V4 flash and V4 pro to my pareto frontier based on artificial analysis data

flash is genuinely a really good value model for the intelligence it offers. it's pretty much on-par with Kimi K2.5 and Gemini 3 Flash in terms of intelligence, but much much cheaper. it lands on the pareto frontier
V4 Pro is unfortunately a bit expensive to be useful right now. to land on the frontier, it would need to be as cost-effective as Gemini 3 Flash. models like MiMo V2.5 Pro beat it out in price efficiency

#

both models also have horrible hallucination rates unfortunately

sharp vortex Apr 24, 2026, 7:36 PM

#

rustic island Lol

Maybe deepseek think user is not dumb enough to open their purse upside-down thinkies

onyx bramble Apr 24, 2026, 7:46 PM

#

that's a problem i think is pretty big with these tests, you're essentially randomly acting as a neanderthal discovering modern items lol

short jasper Apr 24, 2026, 7:50 PM

#

why dont we feed deepseek more writting data

brave whale Apr 24, 2026, 7:52 PM

#

omg it finally happened

hoary zenith Apr 24, 2026, 7:53 PM

#

flat osprey added V4 flash and V4 pro to my pareto frontier based on artificial analysis dat...

Has the AA people benched StepFun 3.5 Flash?

flat osprey Apr 24, 2026, 7:54 PM

#

hoary zenith Has the AA people benched StepFun 3.5 Flash?

Yeah they have, but they haven't listed the cost to run Artificial Analysis for it

#

scores about 38 in intelligence though

hoary zenith Apr 24, 2026, 7:56 PM

#

hmm 38 is lower than I would have guessed

soft fulcrum Apr 24, 2026, 8:20 PM

#

I wish Artificial Analysis tested the non reasoning versions

sharp vigil Apr 24, 2026, 8:29 PM

#

the flash model is insane at agentic coding btw

#

im having it crawl and scrape websites using python code

soft fulcrum Apr 24, 2026, 8:32 PM

#

#

Sometimes it says 2024, sometimes 2025

#

more often it's 2025 (January or May)

#

but there's enough variance that it could be completely off

woeful jay Apr 24, 2026, 9:13 PM

#

seems okay i guess

#

flash seems pretty good

#

pro seems way too overpriced for what it is

copper dome Apr 24, 2026, 9:30 PM

#

haughty pilot for whom?

I'm role-playing with my deepseek

https://x.com/teortaxesTex/status/2047576550971682838

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) (@teortaxe...

hidden DeepSeek roleplay mode you can activate by prompt injection
lmao WAT
rolepaly
never change chyna, never change

pure flax Apr 24, 2026, 9:52 PM

#

kimi 2.6 is far better

#

deepseek is dry as a bone and not that smart

short jasper Apr 24, 2026, 10:00 PM

#

Nah...

vale kayak Apr 24, 2026, 10:15 PM

#

If they fix instruction following... Model might be great

#

They are probably going do that, tommorow

vapid karma Apr 24, 2026, 10:41 PM

#

Does pretty decently on CritPt (private research level physics problems) which is, like, one of the three benchmarks I pay attention to in the modern day

#

Price point's just a little too high (for me at least) to use pro regularly - will need to either wait for the ascend node to come online or see if another provider is able to host it for cheaper (which, for a 1.6T parameter model, is probably pretty unlikely)

#

They also share their research, do experimental stuff, and give some evidence that they do things beyond distilling Opus and calling it a day, so eh, I'd consider it a decent release

haughty pilot Apr 24, 2026, 10:45 PM

#

copper dome I'm role-playing with my deepseek https://x.com/teortaxesTex/status/20475765509...

still not sure if .... that really a different model or just regular deepseek responding here..... maybe its a fancy prompt applied when it recognies rp peeps

chilly sable Apr 24, 2026, 10:57 PM

#

mimo v2.5 is much better than v4 pro

#

and it's cheaper

wise vine Apr 24, 2026, 11:00 PM

#

why it’s taking so long for a provider other than deepseek to host both models? deepinfra just came in hosting flash but it’s 1tps.

feral scaffold Apr 24, 2026, 11:14 PM

#

I know it's a little early but i dont feel like scrolling up, y'all enjoying it so far?

simple mauve Apr 24, 2026, 11:38 PM

#

feral scaffold I know it's a little early but i dont feel like scrolling up, y'all enjoying it ...

Yeah, I love scrolling up.

raven canyon Apr 24, 2026, 11:39 PM

#

copper dome I'm role-playing with my deepseek https://x.com/teortaxesTex/status/20475765509...

is that really a prompt injection ? or literally just a prompt

#

telling the model "You are a roleplay model" — that's just prompting, no? how is that an injection?

#

deepinfra absolutely flying at 2tps and 37.85s latency

wise vine Apr 24, 2026, 11:50 PM

#

i just wanted more providers

feral scaffold Apr 25, 2026, 12:01 AM

#

simple mauve Yeah, I love scrolling up.

weird

indigo folio Apr 25, 2026, 12:42 AM

#

deepseek 5 tomorrow

potent lagoon Apr 25, 2026, 1:08 AM

#

Together is up I think

raven canyon Apr 25, 2026, 1:15 AM

#

potent lagoon Together is up I think

even more expensive than deepseek?

#

wow

soft fulcrum Apr 25, 2026, 1:23 AM

#

raven canyon even more expensive than deepseek?

they always are a bit pricier

#

looks like they aren't as bad as they used to be

#

I was thinking of https://openrouter.ai/meta-llama/llama-3.3-70b-instruct and https://openrouter.ai/qwen/qwen-2.5-7b-instruct

raven canyon Apr 25, 2026, 1:26 AM

#

soft fulcrum I was thinking of https://openrouter.ai/meta-llama/llama-3.3-70b-instruct and ht...

oh yeah i remember

soft fulcrum Apr 25, 2026, 1:26 AM

#

$0.88 for both input and output for Llama 70B

#

and $0.3 for input and output for a 7B model 🤣

raven canyon Apr 25, 2026, 1:27 AM

#

soft fulcrum and $0.3 for input and output for a 7B model 🤣

lmfao

opaque reef Apr 25, 2026, 1:37 AM

#

AA scores are out

raven canyon Apr 25, 2026, 1:43 AM

#

opaque reef AA scores are out

i think running an AI model needs more power than a AA battery

placid minnow Apr 25, 2026, 1:58 AM

#

raven canyon is that really a prompt injection ? or literally just a prompt

This is a prompt that visually changes how the model thinks, but it has almost no effect on the final answer. In one prompt, it's "immersive mode," meaning the model thinks like a character and the thinking process is a monologue. In the second, it's analysis. Thinking process analyzes the situation rather than acting like a character. But this is only in the thinking process, the answer after that is no different

raven canyon Apr 25, 2026, 2:01 AM

#

it looks like its just a normal prompt

placid minnow Apr 25, 2026, 2:02 AM

#

raven canyon it looks like its just a normal prompt

Yeah, I don't really think it's a special roleplay mode, just a prompt.

chilly sable Apr 25, 2026, 3:22 AM

#

wait

#

Actually, this release is a preview So I guess there will be better releases in the near future

tulip estuary Apr 25, 2026, 3:23 AM

#

how to control the reasoning effort on this one? 🫩

chilly sable Apr 25, 2026, 3:24 AM

#

tulip estuary how to control the reasoning effort on this one? 🫩

Where?

tulip estuary Apr 25, 2026, 3:26 AM

#

API

#

what's the json key

covert widget Apr 25, 2026, 3:28 AM

#

tulip estuary how to control the reasoning effort on this one? 🫩

The interface at chat.deepseek.com only allows thinking on or off, so I guess that's probably it

tulip estuary Apr 25, 2026, 3:33 AM

#

why do they show high and max on the benchmarks

potent lagoon Apr 25, 2026, 4:57 AM

#

Why is TogetherAI ouputing token numbers weird

#

raven canyon Apr 25, 2026, 4:59 AM

#

tulip estuary what's the json key

https://openrouter.ai/docs/guides/best-practices/reasoning-tokens

OpenRouter Documentation

Reasoning Tokens - Improve AI Model Decision Making

Learn how to use reasoning tokens to enhance AI model outputs. Implement step-by-step reasoning traces for better decision making and transparency.

#

reasoning.effort

#

high for high and xhigh for max

potent lagoon Apr 25, 2026, 5:01 AM

#

top p and temp at 1, also lower temp doesn't fix it

raven canyon Apr 25, 2026, 5:01 AM

#

it seems broken

potent lagoon Apr 25, 2026, 5:05 AM

#

raven canyon it seems broken

yeah it is I guess

woeful jay Apr 25, 2026, 5:12 AM

#

thoughts on flash?

chrome trench Apr 25, 2026, 5:47 AM

#

lol I hope the CCP doesn't gets winds of it

pastel sail Apr 25, 2026, 7:16 AM

#

potent lagoon Why is TogetherAI ouputing token numbers weird

yea same with most providers

#

only siliconflow has good inference

#

id guess its because of the weird new arch and attention

#

they didnt implement it properly

#

putting temp and top p to 0.7 0.95 respectively fixes this

#

but quality is probably lower

pastel sail Apr 25, 2026, 7:17 AM

#

woeful jay thoughts on flash?

use gemma 4

woeful jay Apr 25, 2026, 7:20 AM

#

no fucking way

hot swan Apr 25, 2026, 7:21 AM

#

https://cdn.discordapp.com/attachments/1496935203531391026/1497318003812077729/image.png

#

this image confirms a lot

#

(deepseek pro is over there under gemini and kimi)

#

flash is truly great for it price but the issue is that it hallucinates a ton

pastel sail Apr 25, 2026, 7:23 AM

#

hot swan flash is truly great for it price but the issue is that it hallucinates a ton

gemma is cheaper and better

hot swan Apr 25, 2026, 7:24 AM

#

hot swan flash is truly great for it price but the issue is that it hallucinates a ton

so does the pro model but unfortunately for that it can't otherwise compete with other models in its price range

pastel sail Apr 25, 2026, 7:24 AM

#

pastel sail gemma is cheaper and better

wow nevermind its not cheaper

#

its better definitely though

#

0.38 vs 0.28 output

#

same input price tho

#

but lower context

#

yeah maybe for high context tasks flash could be awesome

#

havent seen any benchmarks for long context about it yet

#

yes it has very good long context performance

cloud flame Apr 25, 2026, 7:31 AM

#

pastel sail havent seen any benchmarks for long context about it yet

They posted 2 benchmarks of 1M context searching

#

On Huggingface probably

hot swan Apr 25, 2026, 7:32 AM

#

deepinfra's throughput 💀

cloud flame Apr 25, 2026, 7:32 AM

#

I definitely saw it

jovial kelp Apr 25, 2026, 7:32 AM

#

hot swan https://cdn.discordapp.com/attachments/1496935203531391026/1497318003812077729/i...

Pretty cool that even tho GPT 5.5 hallucinate a lot it actually really good at coding.
It interesting tho, that in some way hallucination helping LLMs to be able to work with new problems that it never seen in their dataset

hot swan Apr 25, 2026, 7:33 AM

#

yes I don't think hallucination rate is necessarily bad, it's just prone to flights of fancy

#

and it can be controlled with the rights prompts

#

(5.4/5.5 can apparently do ~~great~~ amazing on math problems so you know)

hot swan Apr 25, 2026, 7:36 AM

#

hot swan yes I don't think hallucination rate is necessarily bad, it's just prone to flig...

but I witnessed the issue firsthand myself
#1461340695746056192 message

flat osprey Apr 25, 2026, 7:39 AM

#

pastel sail wow nevermind its not cheaper

worth mentioning that V4 flash on max effort will end up being more expensive in daily use even though it has a lower sticker price

#Deepseek V4

Deen’s Car Wash Adventure

😌