#codename-discussion | Arena | Page 5

frosty mantle Feb 16, 2026, 5:26 AM

#

Rotten apple is a nonreasoning model acting as if it is a reasoning model.

carmine carbon Feb 16, 2026, 6:29 AM

#

Much better

woven shadow Feb 16, 2026, 7:21 AM

#

carmine carbon Much better

?

carmine carbon Feb 16, 2026, 9:01 AM

#

woven shadow ?

It seems to be best for full stack development....

woven shadow Feb 16, 2026, 11:29 AM

#

saw a new post
https://x.com/OscerraHQ/status/2023326989244105111

Oscerra (@OscerraHQ)

LOG: 002 - ORA Halo 2.0
Full stack implementation of Spotify

▶ Play video

noble stump Feb 16, 2026, 7:24 PM

#

woven shadow saw a new post https://x.com/OscerraHQ/status/2023326989244105111

What do you mean? You posted about that account yesterday?

woven shadow Feb 16, 2026, 7:25 PM

#

hm, but this ones a new post

daring temple Feb 17, 2026, 4:51 PM

#

Has anyone heard of star-drift? I just got it in a Code battle and it destroyed gpt-5.2-codex

eternal cargo Feb 17, 2026, 6:39 PM

#

wonder if it’s in text arena too 😮

dim parrot Feb 18, 2026, 2:54 AM

#

daring temple Has anyone heard of *star-drift*? I just got it in a Code battle and it destroye...

really ? i need to try that

last bloom Feb 18, 2026, 6:05 AM

#

anyone know of any models that might be gemini 3.1 pro?

trim iron Feb 18, 2026, 6:05 AM

#

i think its coming out thursday

restive vapor Feb 18, 2026, 8:11 AM

#

i'm waiting for the day that nano banana flash finally releases...

zenith shore Feb 18, 2026, 2:07 PM

#

VGA Einer

green fable Feb 18, 2026, 7:34 PM

#

could you share me Sora 2 invite codes

hollow chasm Feb 18, 2026, 8:08 PM

#

spotted a model called rising-sun, anyone have a guess on what it is?

eternal cargo Feb 18, 2026, 9:02 PM

#

hollow chasm spotted a model called rising-sun, anyone have a guess on what it is?

#

it claims to be Google, but others think it could be a Chinese impersonator?

#

new text model named “clanker” 😂

ember plank Feb 19, 2026, 6:11 AM

#

‘Clanker’ sounds like Grok solely based on the name

eternal cargo Feb 19, 2026, 8:53 AM

#

codename clinkz too lol

old zinc Feb 19, 2026, 8:54 AM

#

Hi

pale atlas Feb 19, 2026, 10:23 AM

#

eternal cargo

'Google Gemini' from temu

slender delta Feb 19, 2026, 12:12 PM

#

New model:"gcps-fast". Definitely worse.

#

GPT Image 1 Mini better.

formal reef Feb 19, 2026, 12:49 PM

#

guys what do you think about "clanker"?

#

text model by xai
grok 4.2?

dim parrot Feb 19, 2026, 12:58 PM

#

formal reef text model by xai grok 4.2?

But grok 4.2 is already here

#

in beta

#

on their website

#

so why would it still be with a codename

formal reef Feb 19, 2026, 1:03 PM

#

Yes i know
Grok 4.2 has experts, so elon musk is testing some experts pattern isn't it?

dim parrot Feb 19, 2026, 1:04 PM

#

that's possible yeah

formal reef Feb 19, 2026, 1:04 PM

#

testing multi pattern with non-codename makes confusion

dim parrot Feb 19, 2026, 1:04 PM

#

especially elon musk said they are trying many things this month and next month it'll be official 4.20

reef portal Feb 19, 2026, 9:13 PM

#

who is teinity large?

restive vapor Feb 19, 2026, 9:49 PM

#

reef portal who is teinity large?

it's an open weights model by Arcee AI
https://huggingface.co/arcee-ai/Trinity-Large-Preview

open carbon Feb 19, 2026, 10:05 PM

#

who is clankz?

old timber Feb 19, 2026, 10:54 PM

#

https://tenor.com/grqVxm6tnJH.gif

Tenor

somber geyser Feb 19, 2026, 11:52 PM

#

Anyone figured out who star-drift is? I got it right now, but DeepSeek pwned them

astral musk Feb 19, 2026, 11:57 PM

#

@green geyser Note that Video Arena has been removed from the server. More information can be found in this announcement.

slender delta Feb 20, 2026, 1:17 AM

#

astral musk <@1443989373094854859> Note that Video Arena has been removed from the server. M...

Try it arena.ai. 3 free trials/day

eternal cargo Feb 20, 2026, 2:13 AM

#

open carbon who is clankz?

clanker and clanks give Grok vibes based on just the name alone lol

open carbon Feb 20, 2026, 2:23 AM

#

eternal cargo clanker and clanks give Grok vibes based on just the name alone lol

It was a genuinely good model

dreamy juniper Feb 20, 2026, 2:23 AM

#

/hola

eternal cargo Feb 20, 2026, 2:24 AM

#

could be Grok! I’ve had a mixed record with it in my battles tho

dreamy juniper Feb 20, 2026, 2:24 AM

#

Si

forest cobalt Feb 20, 2026, 7:38 AM

#

formal reef Yes i know Grok 4.2 has experts, so elon musk is testing some experts pattern is...

A corky name doesnt automatically means it's xAI.

formal reef Feb 20, 2026, 7:39 AM

#

yes

forest cobalt Feb 20, 2026, 7:40 AM

#

formal reef yes

Well is the model any good?

slender delta Feb 20, 2026, 12:36 PM

#

(╯°□°)╯︵ ┻━┻

#

Fear of codename

mental wing Feb 20, 2026, 8:33 PM

#

slender delta GPT Image 1 Mini better.

🐒

mellow crystal Feb 21, 2026, 3:05 AM

#

Hey team,

What’s the prompt for image to video?

noble stump Feb 21, 2026, 12:23 PM

#

mellow crystal Hey team, What’s the prompt for image to video?

What do you mean prompt?

woven shadow Feb 21, 2026, 3:06 PM

#

upcoming agent - https://x.com/OscerraHQ/status/2025224583645987172

Oscerra (@OscerraHQ)

𝗟𝗢𝗚: 𝟬𝟬𝟯 -
ORA just 𝗿𝗲𝗽𝗹𝗮𝗰𝗲𝗱 𝗮𝗻 𝗲𝗻𝘁𝗶𝗿𝗲 𝗳𝗼𝘂𝗻𝗱𝗶𝗻𝗴 𝘁𝗲𝗮𝗺 from a 𝘀𝗶𝗻𝗴𝗹𝗲 𝗽𝗿𝗼𝗺𝗽𝘁.

▶ Play video

noble stump Feb 21, 2026, 4:56 PM

#

woven shadow upcoming agent - https://x.com/OscerraHQ/status/2025224583645987172

What is this code for?

woven shadow Feb 21, 2026, 5:29 PM

#

autonomous general agent. think 'ORA' is the model codename

noble stump Feb 21, 2026, 10:37 PM

#

Login on here https://arena.ai/video

Arena | Benchmark & Compare the Best AI Models

Chat with multiple AI models side-by-side. Compare ChatGPT, Claude, Gemini, and other top LLMs. Crowdsourced benchmarks and leaderboards.

noble stump Feb 21, 2026, 11:05 PM

#

Post it in the link when you are logged in

umbral niche Feb 21, 2026, 11:06 PM

#

@plush charm Note that Video Arena has been removed from the server. More information can be found in: #announcements message

#

@idle isleNote that Video Arena has been removed from the server. More information can be found in: #announcements message

noble stump Feb 22, 2026, 1:25 PM

#

It is archived Pritam.

carmine carbon Feb 22, 2026, 2:13 PM

#

Someone noticed veo 4?

noble stump Feb 22, 2026, 2:47 PM

#

What are you sharing your prompt here for? @full wasp

#

What are your sharing your prompt here for? @ashen elk

clear spoke Feb 22, 2026, 3:36 PM

#

Uhh <@&1349916362595635286> isnt this... against the rules?

outer gazelle Feb 22, 2026, 6:42 PM

#

Does anyone know what is arastradero?

noble stump Feb 22, 2026, 9:46 PM

#

Seems good

tulip aurora Feb 23, 2026, 4:10 AM

#

outer gazelle Does anyone know what is `arastradero`?

Its the name of a nature reserve park in Palo Alto. You'll know that name as being where Silicon Valley was originally grown from.

tulip aurora Feb 23, 2026, 4:13 AM

#

slender delta New model:"gcps-fast". Definitely worse.

Idk mate. The top one is much closer to a realistic image as far as colour grading and the likes. Granted I'm a photographer so that's what I look for

frosty mantle Feb 23, 2026, 6:27 AM

#

Lmarena-rc3

Is the Arena testing routers again or something?

noble stump Feb 23, 2026, 7:08 PM

#

Max is a nice router

astral musk Feb 23, 2026, 8:25 PM

#

noble stump Max is a nice router

Happy to hear it

fluid fossil Feb 25, 2026, 12:52 AM

#

where is video-arena guys i dont find it

astral musk Feb 25, 2026, 1:04 AM

#

@fluid fossil The Video Arena bot was removed, more info can be found in this announcement.

carmine jacinth Feb 25, 2026, 4:34 AM

#

open in lmarena video accounts

#

#

anon-bob-2 has to have web search enabled

#

or insane knowledge

#

first results for both appeared

#

https://linux.do/t/topic/1643704

#

It's definitely Gemini/Google based, as it repeated a quirk default name even

#

I think 3.1 pro, I literally was thinking this same day "hmmm... wonder if NBP used 3.1 Pro and the difference it'd make"

#

I actually hope this is flash for obv reasons but it takes pro time

lost hemlock Feb 25, 2026, 6:32 AM

#

new model "sense-arenatest-20260130" ? in text arena

winter torrent Feb 25, 2026, 6:11 PM

#

eternal cargo

gemini 3.1 flash? or gemini 3.1 flash lite?

#

DAMN THAT IS GENUINELY SO GOOD

#

febuary 26 is tomorow btw

#

lets see what comes of it

flat root Feb 25, 2026, 10:46 PM

#

jfyi I believe beluga-0216-1 is an OpenAI model (chatgpt 5.3?). Not 100% sure, but quite positive it could be ChatGPT. Formatting is really ChatGPT-like

eternal cargo Feb 25, 2026, 11:35 PM

#

winter torrent febuary 26 is tomorow btw

the 26 stands for “2026”

#

those month-Chatbot models are NVIDIA

#

the 26 is in there to denote the snapshot - February 26

slender onyx Feb 25, 2026, 11:37 PM

#

Anyone see Zéphyr or vortex ?

karmic rampart Feb 26, 2026, 12:17 AM

#

slender onyx Anyone see Zéphyr or vortex ?

Yes 728 prompts

#

It took

formal spruce Feb 26, 2026, 12:23 AM

#

I can’t create a video in V3. Who is experienced? ✅

astral musk Feb 26, 2026, 12:42 AM

#

formal spruce I can’t create a video in V3. Who is experienced? ✅

Note that Video Arena has been removed from the server. More information can be found in this announcement.

eternal cargo Feb 26, 2026, 2:17 AM

#

karmic rampart Yes 728 prompts

bro how are you prompting 728 times in one day 😂

karmic rampart Feb 26, 2026, 2:31 AM

#

eternal cargo bro how are you prompting 728 times in one day 😂

It’s called being locked in

eternal cargo Feb 26, 2026, 2:32 AM

#

what percentage of responses were beluga 😂

mortal tapir Feb 26, 2026, 3:35 AM

#

beluga is terrible

#

ive noticed

karmic rampart Feb 26, 2026, 3:48 AM

#

eternal cargo what percentage of responses were beluga 😂

None

eternal cargo Feb 26, 2026, 3:49 AM

#

karmic rampart None

Pfft 😂

karmic rampart Feb 26, 2026, 5:09 AM

#

#

likely grok

mortal tapir Feb 26, 2026, 6:25 AM

#

karmic rampart

wasnt it leaked that both of them are new chatgpt models

karmic rampart Feb 26, 2026, 2:58 PM

#

mortal tapir wasnt it leaked that both of them are new chatgpt models

I don’t make the rules

eternal cargo Feb 26, 2026, 3:44 PM

#

karmic rampart

this screams Grok

#

SCREAMS Grok

modest oriole Feb 26, 2026, 5:43 PM

#

ima confirm

meager sun Feb 26, 2026, 6:37 PM

#

is there a way to stick with a codename model for follow-up questions in battle mode, I'm assuming there isn't?

astral musk Feb 26, 2026, 6:51 PM

#

meager sun is there a way to stick with a codename model for follow-up questions in battle ...

There is not. In Battle it's going to be random which model you get, codenamed models included.

meager sun Feb 26, 2026, 8:00 PM

#

astral musk There is not. In Battle it's going to be random which model you get, codenamed m...

thanks

#

has anyone else encountered a model named steed-0217?

reef portal Feb 26, 2026, 8:34 PM

#

meager sun has anyone else encountered a model named steed-0217?

you like it?

noble stump Feb 26, 2026, 11:00 PM

#

meager sun has anyone else encountered a model named steed-0217?

Yeah

obtuse mason Feb 27, 2026, 3:20 PM

#

Do anyone of you have informations about the "pisces-0226" model? I've came across it in battle and I don't find it anywhere else. It looks like a great model from my tries, I was wondering if there's any defined companies behinds it or if its open-weights somewhere?

warm holly Feb 27, 2026, 3:25 PM

#

meager sun has anyone else encountered a model named steed-0217?

I have. Got it on a philosophical + LLM mechanics Q.

meager sun Feb 27, 2026, 3:44 PM

#

reef portal you like it?

I did, got it on an app-building question, thought the response seemed high quality and Claude-like, then found on Google that the model is supposedly from ByteDance

opal burrow Feb 27, 2026, 10:04 PM

#

I can’t create a video in V3. Who is experienced

tidal sparrow Feb 28, 2026, 4:16 AM

#

Anyone know what model this is?

lost hemlock Feb 28, 2026, 7:13 AM

#

tidal sparrow Anyone know what model this is?

in text arena right ?

teal hare Feb 28, 2026, 7:39 AM

#

lost hemlock in text arena right ?

in Image arena

eternal cargo Feb 28, 2026, 7:48 AM

#

obtuse mason Do anyone of you have informations about the "pisces-0226" model? I've came acro...

seems to have been running through daily iterations for close to a month now

#

seems somewhat similar to Raptor, which ran for a while and ended up being Doubao

eternal cargo Mar 1, 2026, 5:40 AM

#

new text model “pulse” ?

eternal cargo Mar 1, 2026, 6:38 AM

#

another model named “ember”

flat root Mar 1, 2026, 7:43 PM

#

anonymous-1805 is such a terrible model lmao

flat root Mar 1, 2026, 7:44 PM

#

eternal cargo another model named “ember”

both pulse and ember are good imo

slender delta Mar 2, 2026, 12:23 PM

#

Pixel-parrot means LTX 2 Pro???

#

Yes

carmine jacinth Mar 3, 2026, 4:46 AM

#

karmic rampart

smells like groky humor

carmine jacinth Mar 3, 2026, 4:47 AM

#

carmine jacinth I actually hope this is flash for obv reasons but it takes pro time

HOLY DUCK IT WAS FLASH

eternal cargo Mar 3, 2026, 9:52 AM

#

pisces is probably some version of Doubao because they’re both so wildly sycophantic it’s annoying 😅

pliant axle Mar 5, 2026, 5:12 AM

#

frosty dirge Mar 5, 2026, 9:34 AM

#

Ltx2.3 will release soon, so one of this video with audio models should be it

covert linden Mar 5, 2026, 12:06 PM

#

maybe some cloude

prisma nexus Mar 5, 2026, 1:27 PM

#

meager sun has anyone else encountered a model named steed-0217?

Huh... It was steed-0206 before and now it's 0217.

loud elk Mar 5, 2026, 8:08 PM

#

opinionated pisces

eternal cargo Mar 5, 2026, 8:45 PM

#

prisma nexus Huh... It was steed-0206 before and now it's 0217.

different training snapshots of the same model

eternal cargo Mar 5, 2026, 10:45 PM

#

new model colosseum-1?

toxic viper Mar 6, 2026, 1:38 AM

#

Do I have an API for Claude models?

dapper hinge Mar 6, 2026, 4:08 AM

#

dall-e-3

model not working

wheat warren Mar 6, 2026, 6:18 PM

#

loud elk opinionated pisces

if i had to guess, my guess is pisces is most likely a grok version?

#

but with like humor turned up to max

#

it hallucinates quite a lot i think

eternal cargo Mar 7, 2026, 6:37 AM

#

pisces seems like ByteDance personally

#

The Chinese ones are interesting

crimson pilot Mar 7, 2026, 7:15 AM

#

tidal sparrow Anyone know what model this is?

Just openclaw creating bots

cedar lagoon Mar 7, 2026, 8:14 AM

#

whats the best android-only stack for me to host local model for agents

flat root Mar 7, 2026, 5:40 PM

#

basalt-0303-1 is 100% a grok model, again. Why use codename when they can't fix their API lmao.
At least name it "grok-0303-1" instead like c'mon you can do better than that

misty ether Mar 7, 2026, 10:28 PM

#

pisces-0226c ???

noble stump Mar 7, 2026, 10:53 PM

#

basalt-0303-1

plain kayak Mar 9, 2026, 9:29 PM

#

Pisces-0309

bronze bone Mar 10, 2026, 7:41 PM

#

eternal cargo new model colosseum-1?

just had colosseum-3

#

paired against kimi 2.5 instant

#

meaning it is probably a small model too

boreal cedar Mar 11, 2026, 12:47 PM

#

Scam alert

#

<@&1349916362595635286>

graceful crag Mar 11, 2026, 7:44 PM

#

arena

astral musk Mar 11, 2026, 7:47 PM

#

arena

eternal cargo Mar 12, 2026, 4:35 AM

#

new codenamed model "botbot" ?

eternal cargo Mar 12, 2026, 9:05 AM

#

<@&1349916362595635286> scam

#

new model “kiteki” ?

winter torrent Mar 12, 2026, 4:50 PM

#

astral musk <:_arena_:1466202664139493508>

that emoji should be named temu arena bru

#

also anyne know what ai model pisces 0309b is?

open cradle Mar 12, 2026, 7:13 PM

#

Hey. What model would you recommend for 3d game developing?

#

I know its not gonna develop full game, but just curious what s can be made with using mostly only ai agent

flat root Mar 12, 2026, 10:20 PM

#

open cradle Hey. What model would you recommend for 3d game developing?

Claude 4.6 Sonnet
ChatGPT Codex

#

Most used models, and best performing models used by professional coders/vibe coders.

pine temple Mar 13, 2026, 12:19 AM

#

winter torrent also anyne know what ai model pisces 0309b is?

Oh hey it's you that guy who was the #1 hater of that websim upt

winter torrent Mar 13, 2026, 12:19 AM

#

pine temple Oh hey it's you that guy who was the #1 hater of that websim upt

yeah

frosty mantle Mar 13, 2026, 7:18 AM

#

Screenshot_2026-03-13-14-17-58-327_com.android.chrome.jpg

eternal cargo Mar 13, 2026, 7:47 AM

#

another new model "frieza" ?

#

kinda gives Grok vibes can't lie

bronze bone Mar 14, 2026, 4:33 PM

#

eternal cargo new model “kiteki” ?

qwen 3.5

sturdy kestrel Mar 14, 2026, 4:34 PM

#

😏

bronze bone Mar 14, 2026, 4:34 PM

#

it got paired up with claude opus 4.6 thinking

#

this might be a 1T version of qwen 3.5

sturdy kestrel Mar 14, 2026, 4:34 PM

#

bronze bone qwen 3.5

good catch btw

#

how did you do that tho

bronze bone Mar 14, 2026, 4:34 PM

#

asking "what model are you" and just being lucky 🤣

sturdy kestrel Mar 14, 2026, 4:35 PM

#

👏

#

finally there's an ai with knowledge cutoff at 2026

bronze bone Mar 14, 2026, 4:42 PM

#

or might be qwen 3.6 because of that

#

who knows

noble stump Mar 14, 2026, 7:08 PM

#

Is this going to be the Llama of this generation?

#

Would be so cool to have an open model at the top

bronze bone Mar 14, 2026, 7:16 PM

#

noble stump Would be so cool to have an open model at the top

very unlikely for any SOTA to be open source

#

however since china is like 7/9 months behind, we can expect current SOTA performance in that time

#

apart from reasoning itself being just bad

noble stump Mar 14, 2026, 7:18 PM

#

Wasn't Llama 3(.1?) SOTA when it launched?

spring hawk Mar 14, 2026, 7:18 PM

#

not really

#

it was considered SOTA for open source

#

but there were better closed models

noble stump Mar 14, 2026, 11:35 PM

#

Maybe I am getting it mixed up with the Llama 4 benchmarkmaxing

remote nymph Mar 15, 2026, 6:12 AM

#

i wonder how long until a model figures out that no matter what it chooses the whale is going extinct

sullen cloak Mar 15, 2026, 7:53 AM

#

remote nymph i wonder how long until a model figures out that no matter what it chooses the w...

It was just looking for an excuse

radiant wedge Mar 15, 2026, 6:10 PM

#

remote nymph i wonder how long until a model figures out that no matter what it chooses the w...

pisces speaks in the most annoying way possible 😭

#

like it genuinely annoys me so bad

flat root Mar 15, 2026, 6:58 PM

#

frieza is probably an OpenAI model, but totally unsure

noble stump Mar 15, 2026, 8:24 PM

#

radiant wedge pisces speaks in the most annoying way possible 😭

0226c or 0309?

radiant wedge Mar 15, 2026, 10:14 PM

#

i'd say both actually

#

is the model a codename for dola???

#

because i've noticed they speak very very similar

noble stump Mar 15, 2026, 11:39 PM

#

Not sure what that prompt was but I was about to say that Dola is so underrated

light surge Mar 16, 2026, 2:06 PM

#

Which AI model would you recommend for writing assembler?

upbeat mirage Mar 16, 2026, 2:45 PM

#

light surge Which AI model would you recommend for writing assembler?

Brain 1.0

#

-# (i.e. none is capable to do asm, atm)

#

Maybe this will change in the next decade.

eternal cargo Mar 17, 2026, 4:03 AM

#

“clawl” and “zeylu-beta” spotted today!

remote nymph Mar 17, 2026, 4:47 AM

#

been seeing "botbot" in search

#

types exactly like claude but doesnt seem to be much different than existing claude 4.6 models imo

lost hemlock Mar 17, 2026, 7:03 AM

#

"deep-octo" spotted today!

eternal cargo Mar 17, 2026, 7:27 AM

#

lost hemlock "deep-octo" spotted today!

Just saw it too!

restive vapor Mar 17, 2026, 7:40 AM

#

lost hemlock "deep-octo" spotted today!

sounds like minimax m2.7 maybe (minimax m2.5 was called deepmolt)

compact cape Mar 18, 2026, 12:40 PM

#

lost hemlock "deep-octo" spotted today!

it's minimax m2.7

compact cape Mar 18, 2026, 12:42 PM

#

remote nymph i wonder how long until a model figures out that no matter what it chooses the w...

What is the name of this model?

compact cape Mar 18, 2026, 12:42 PM

#

compact cape What is the name of this model?

Real name*

twin field Mar 18, 2026, 4:36 PM

#

anonymous-1800 has very bad instruction following
I explicitly told it to avoid em dashes, avoid these words and whatnot
but it consistently used them regardless of my prompt

#

dunno what this model could be
sure hope it's not a gemini

noble stump Mar 18, 2026, 11:29 PM

#

What makes you think it could be a gemini?

eternal cargo Mar 19, 2026, 5:50 PM

#

Pisces is a ByteDance model, yes

#

just insanely sycophantic

#

every prompt I give it or Seed2.0 is the “SINGLE MOST DRAMATIC AND IMPORTANT QUESTION IN THE HISTORY OF EVER” lol

lost hemlock Mar 20, 2026, 11:09 AM

#

new "botbot" model

gusty fulcrum Mar 21, 2026, 11:20 PM

#

Heh. I recently asked how well quinoa flour and bean sprouts would function as an adobe like house building material. Pisces said this was the best brick ever and yada yada.

plain mantle Mar 23, 2026, 2:03 PM

#

gusty fulcrum Heh. I recently asked how well quinoa flour and bean sprouts would function as a...

Know u

pine zephyr Mar 24, 2026, 4:58 PM

#

New qwen image model under codename "Monologue"?

noble stump Mar 24, 2026, 7:37 PM

#

Monolongue

lost hemlock Mar 26, 2026, 5:15 AM

#

"forum_1" new model

eternal cargo Mar 26, 2026, 5:58 AM

#

“hearth” new model - seems strong!

sturdy kestrel Mar 26, 2026, 4:58 PM

#

🤔

#

i may be wrong but my guess is that it is gemini

eternal cargo Mar 27, 2026, 6:11 AM

#

new model spotted “significant-otter” !

misty ether Mar 27, 2026, 3:30 PM

#

colosseum_4p2

#

This gave me an extremely detailed and better answer than every other model

sturdy kestrel Mar 27, 2026, 6:36 PM

#

colosseum

#

insteresting name..

upbeat mirage Mar 27, 2026, 9:06 PM

#

is GLM-5.1 in arena yet?
it's catching up: https://www.reddit.com/r/LocalLLaMA/comments/1s51id3/glm_51_is_out/
-# (only few percent points remain to the king)

r/LocalLLaMA

Glm 5.1 is out

#

who is Oppie?
("team leader" of a multi-agent collaboration system, with 3 other AIs: "Leo", "Enrico" and "Hans")

#

Grok?

#

-# (NASA's Opportunity rover was called "oppy")

#

ok, confirmed, it is exactly this model:
grok-4.20-multi-agent-beta-0309

upbeat mirage Mar 27, 2026, 9:20 PM

#

misty ether colosseum_4p2

very likely a chinese model

#

as it has the style of a previous model, which rejected talking about the Tank Man

#

(or gave me just chinese propaganda instead)

#

so it could be: Deepseek, GLM, Kimi, MiniMax, Qwen, Ernie or Yi

eternal cargo Mar 28, 2026, 5:19 AM

#

new model “spark” ! really good

oak atlas Mar 28, 2026, 12:37 PM

#

spotted a new model called "pteronura"

upbeat mirage Mar 28, 2026, 1:09 PM

#

eternal cargo new model “spark” ! really good

better than "hearth" and "significant-otter"?

elder yew Mar 28, 2026, 2:23 PM

#

anyone tried pteronura or spark yet

eternal cargo Mar 28, 2026, 2:56 PM

#

elder yew anyone tried pteronura or spark yet

spark gave me a good response, seemed like Grok vibes

eternal cargo Mar 28, 2026, 2:56 PM

#

upbeat mirage better than "hearth" and "significant-otter"?

hearth was strong in the one battle I got with it, more mixed with significant-otter

#

I should try to get them to identify themselves

edgy berry Mar 28, 2026, 4:28 PM

#

eternal cargo Mar 28, 2026, 5:01 PM

#

elder yew anyone tried pteronura or spark yet

just got pteronura, voted both are bad with it and ERNIE lol

#

Seed2.0 Pro spotted in text arena!

sturdy kestrel Mar 28, 2026, 6:17 PM

#

gemma 4?

#

cool

upbeat mirage Mar 28, 2026, 8:57 PM

#

edgy berry

could it be a chinese impersonator model?
-# would not be the first time, that a chinese model lied about itself

#

try asking it about the "Tank man" (Beijing, 1989)

#

if it starts to sound weird in its answer, then it is a chinese model

#

(only chinese models have problems answering that question, some outright refuse answering it, others return CCP-propaganda, yet others ignore the question or state that nothing happened back then)

upbeat mirage Mar 28, 2026, 9:25 PM

#

i wonder, if there is a (harmless) topic, which even western models refuse to answer?

#

(i guess, most refuse NSFW/NSFL topics, which is understandable)

muted lance Mar 28, 2026, 10:40 PM

#

elder yew anyone tried pteronura or spark yet

pteronura and spark might be anonymous versions of the new Gemma as well, from some responses I got, although they didn't specifically say they are Gemma 4.

eternal cargo Mar 29, 2026, 4:01 AM

#

spark seems better to me than pteronura, personally

frosty dirge Mar 29, 2026, 6:38 AM

#

eternal cargo “hearth” new model - seems strong!

also with very good Vision

lost hemlock Mar 29, 2026, 6:54 AM

#

"yivon-beta" new model
what do all of you think about this

muted lance Mar 29, 2026, 9:01 AM

#

lost hemlock "yivon-beta" new model what do all of you think about this

It must be Chinese. It got offended when I asked about Tienanmen.

sturdy kestrel Mar 29, 2026, 9:05 AM

#

🤨

muted lance Mar 29, 2026, 11:26 AM

#

hearth says it's an anonymous AI, but when pressed on its capabilities, it mentions it knows how to translate between 100+ languages, and that to me indicates Google. It's either very knowledgeable or has web search enabled, but on the other hand its vision capabilities don't seem as strong as current Gemini models, more Gemma-tier.

twin field Mar 29, 2026, 5:51 PM

#

edgy berry

qwen is better. just outright.

#

but gemma 4 isn't too bad

twin field Mar 29, 2026, 5:53 PM

#

muted lance `hearth` says it's an anonymous AI, but when pressed on its capabilities, it men...

could it be a new flash-lite? doesn't seem very likely... maybe it's just exaggerating? it wouldn't be the first time an AI doesn't truthfully say how many languages it can translate

muted lance Mar 29, 2026, 7:00 PM

#

twin field could it be a new flash-lite? doesn't seem very likely... maybe it's just exagge...

All Google Gemini models have similar vision performance, so I don't think it's flash-lite. There's a chance it could be something else entirely, though, for example one of the Meta Avocado models that are still in development.

#

hearth feels very "friendly", maybe a bit too much so. I don't think it's Grok.

zenith summit Mar 29, 2026, 10:49 PM

#

Got a new model, "dola-seed-2.0-pro-text." I encountered it for a React code review, and it gave significantly better insights than "qwen3.5-max-preview."

earnest shore Mar 30, 2026, 5:02 AM

#

Pteronura is Gemma 4

#

It always says it's made by Google

#

Model "Spark" is most likely gpt 5.3 or 5.4 codex spark because it says it's made by openai and called "Spark"

frosty mantle Mar 30, 2026, 10:41 AM

#

Significant Otter smells good, but I can't tell which smell is it.

Screenshot_2026-03-30-17-39-12-252_com.android.chrome.jpg

eternal cargo Mar 30, 2026, 5:46 PM

#

frosty mantle Significant Otter smells good, but I can't tell which smell is it.

“but I can’t tell which smell is it” is an interesting phrasing 🤣

sturdy kestrel Mar 30, 2026, 11:12 PM

#

😯

#

nvm bro's trolling me

#

🤣

sturdy kestrel Mar 30, 2026, 11:23 PM

#

muted lance It must be Chinese. It got offended when I asked about Tienanmen.

yeah it is. it is qwen

sturdy kestrel Mar 30, 2026, 11:26 PM

#

sturdy kestrel 😯

either a hallucinating/weak ai or a defense mechanism

#

yivon-beta is also qwen?

sturdy kestrel Mar 30, 2026, 11:35 PM

#

earnest shore Pteronura is Gemma 4

or maybe it is gemini?

#

since we know that significant otter is gemma 4

earnest shore Mar 30, 2026, 11:59 PM

#

sturdy kestrel since we know that significant otter is gemma 4

Most likely significant otter is Gemma 4 and pteronura is Gemini 3.1 flash

frosty mantle Mar 31, 2026, 6:12 AM

#

Almost there. It misunderstood Y with L.

Screenshot_2026-03-31-13-08-52-505_com.android.chrome.jpg

muted lance Mar 31, 2026, 8:11 AM

#

earnest shore Most likely significant otter is Gemma 4 and pteronura is Gemini 3.1 flash

"pteronura" is also an otter, for what it's worth. https://en.wikipedia.org/wiki/Giant_otter

Giant otter

The giant otter or giant river otter (Pteronura brasiliensis) is a South American carnivorous mammal. It is the longest member of the weasel family, Mustelidae, a globally successful group of predators, reaching up to 1.8 m (5 ft 11 in). Atypical of mustelids, the giant otter is a social species, with family groups typically supporting three to ...

#

colosseum-1p3 could be a router model by LMSys. Its response length and quality is very variable, and one of the LM Arena logos in the past was a colosseum, if I recall correctly.

#

eternal cargo Mar 31, 2026, 9:27 AM

#

earnest shore Most likely significant otter is Gemma 4 and pteronura is Gemini 3.1 flash

pteronura seems pretty weak to me, maybe spark is 3.1 Flash?

quaint bloom Mar 31, 2026, 4:39 PM

#

anonymous-1825 ai which is this modle never heard of it,has great results,is the a Proprietary

eternal cargo Mar 31, 2026, 6:04 PM

#

not really sure

#

there was an old Apple model a while back that went by Anonymous

#

no idea if that’s the same though

frosty mantle Mar 31, 2026, 9:06 PM

#

Significant otter beats GPT 5.4 (med?), which is bananas. Pun intended.

Screenshot_2026-04-01-04-04-09-508_com.android.chrome.jpg

Screenshot_2026-04-01-04-04-15-882_com.android.chrome.jpg

noble stump Mar 31, 2026, 9:53 PM

#

For those who do not known Indonesian or is it Malaysian?

muted lance Apr 1, 2026, 1:33 AM

#

eternal cargo pteronura seems pretty weak to me, maybe spark is 3.1 Flash?

I still think significant-otter and pteronura are the upcoming Gemma 4.

eternal cargo Apr 1, 2026, 1:34 AM

#

I agree, significant otter has identified itself as such

#

Will be interesting to see if Gemma ranks highly!

#

could maybe be in the top 20, I have some mixed battles with it but it could possibly be competitive there

muted lance Apr 1, 2026, 1:36 AM

#

There's a new model currently (that I've not noticed in the past few days, at least): atlas.

#

And a march26-chatbot2 which claims to be (Nvidia) Nemotron.
I've spotted a duomo-1-hero as well. It looks like there are a bunch of new models at the moment.

tardy pollen Apr 1, 2026, 3:10 AM

#

any chance its those supposed "leaked" models from anthropic and openai, if that even is a real thing?

muted lance Apr 1, 2026, 9:07 AM

#

tardy pollen any chance its those supposed "leaked" models from anthropic and openai, if that...

I think only approved models are being served on Arena, but I don't think they're from Anthropic; their models are among the most insufferable and nosy in my opinion. atlas has seemingly good vision capabilities, knows how to interpret meme-y images and doesn't sound like you're talking with HR.

#

hearth is similar to atlas in that aspect.

upbeat mirage Apr 1, 2026, 3:28 PM

#

clinkz?

#

-# (self-identifies as Claude)

#

it's from an old thread, though

misty ether Apr 1, 2026, 4:04 PM

#

flashbrown2

modest oriole Apr 1, 2026, 5:46 PM

#

muted lance I still think `significant-otter` and `pteronura` are the upcoming Gemma 4.

There used to be a whitewater model which was supposedly flash 3.1, it was pulled 2 days after appearing

#

both models you mentioned are likely gemma 4

eternal cargo Apr 1, 2026, 6:16 PM

#

upbeat mirage ```clinkz```?

definitely Chinese

eternal cargo Apr 2, 2026, 5:29 PM

#

new text model “duomo-1-hero” ?

#

definitely seems Chinese

frosty mantle Apr 2, 2026, 5:55 PM

#

frosty mantle Significant otter beats GPT 5.4 (med?), which is bananas. Pun intended.

Significant otter is the MoE.

Screenshot_2026-04-03-00-54-17-813_com.android.chrome.jpg

#

Which is still bananas.

lost hemlock Apr 3, 2026, 4:20 AM

#

new "orion" model

restive vapor Apr 3, 2026, 5:34 AM

#

malware do not run
<@&1349916362595635286>

eternal cargo Apr 3, 2026, 3:29 PM

#

muted lance "pteronura" is also an otter, for what it's worth. https://en.wikipedia.org/wiki...

this would lead me to believe pteronura was the bigger Gemma 4 model - which surprises me, I found the smaller one to have a better winrate in my battles

muted lance Apr 3, 2026, 3:39 PM

#

eternal cargo this would lead me to believe pteronura was the bigger Gemma 4 model - which sur...

I found significant-otter responses to be better on average than pteronura, but I didn't get as many battles for the latter in my testing, so I can't be 100% sure.

eternal cargo Apr 3, 2026, 3:39 PM

#

yeah, same

upbeat mirage Apr 3, 2026, 3:46 PM

#

lost hemlock new "orion" model

by OpenAI?

sturdy kestrel Apr 3, 2026, 4:16 PM

#

wait that might be an openai model

edgy berry Apr 3, 2026, 6:36 PM

#

i don't think it would be good codename for them considering Project Orion (--> GPT 4.5) was total failure 😅

#

btw on deepseek changed their model on website/app yestarday, I think it may be deepseek V4 already. few people noticed it.

sturdy kestrel Apr 3, 2026, 8:59 PM

#

i think you are right

#

that makes sense

candid surge Apr 3, 2026, 11:00 PM

#

https://www.reddit.com/r/singularity/comments/1sbsasq/gptimage2_likely_on_lmarena/ saw this, anyone run into these models yet?

r/singularity

GPT-IMAGE-2 Likely on LMarena

under the names: maskingtape-alpha, gaffertape-alpha and packingtape-alpha. From my testing it's absolutely insane and far…

#

gaffertape-alpha
prompt was "Comedic advert for a candy bar called Fubar"

restive vapor Apr 3, 2026, 11:12 PM

#

packingtape is a 2k res model (confirmed openai by me, c2pa info calls itself 4o like image 1, image 1 mini, and image 1.5) and it is insane, it throws gpt image 1 mini, image 1, and the bananas out of the water in my basic album cover tests, i have to do more testing and hopefully get the other two

#

hydrogen bomb vs. coughing baby, one makes an almost perfect copy of the parent album's cover while the other can't spell anything right and has awkward text

restive vapor Apr 3, 2026, 11:41 PM

#

maskingtape

#

packingtape

#

what arena really got its name and font from (brought to you by packingtape)

#

it's a bit inaccurate but close

hollow void Apr 4, 2026, 12:59 AM

#

#

maskingtape-alpha

tidal quail Apr 4, 2026, 1:02 AM

#

restive vapor maskingtape

Please try this prompt for the tape models : A 1999 comic strip . Black panther stops Spider-Man from avenging his uncle .

hollow void Apr 4, 2026, 1:04 AM

#

silver plank Apr 4, 2026, 1:24 AM

#

Flash brown seem to call himself Flux

#

hollow void Apr 4, 2026, 1:38 AM

#

#

It’s not brown fish, but you could use the same prompt

#

To try to attract the model’s name

worn coral Apr 4, 2026, 3:51 AM

#

Is battle mode taking forever to generate an image for anyone else? I tried like 10 times and only 1 had an output

lyric karma Apr 4, 2026, 11:10 AM

#

lyric karma Apr 4, 2026, 11:10 AM

#

lyric karma

I've seen this model in Battle mode

silver plank Apr 4, 2026, 11:35 AM

#

Are the -tape models already gone?

candid surge Apr 4, 2026, 12:34 PM

#

Yeah

sturdy kestrel Apr 4, 2026, 2:50 PM

#

wow there are a lot of models

eternal cargo Apr 4, 2026, 6:19 PM

#

lyric karma

yep, just got this too - lost to Kimi K2.5 Instant though

#

gives heavy Grok vibes?

candid surge Apr 4, 2026, 6:24 PM

#

Argh... I miss tape...

lyric karma Apr 4, 2026, 6:34 PM

#

another one

ionic plume Apr 4, 2026, 8:00 PM

#

lyric karma another one

If it's competing with such a good model, then it itself must be a good model. What did you ask it for?

ruby mist Apr 5, 2026, 12:24 AM

#

lyric karma another one

boom

lost hemlock Apr 5, 2026, 9:01 AM

#

lethal raft Apr 5, 2026, 9:21 AM

#

restive vapor maskingtape

Hey. How do you even have access??

restive vapor Apr 5, 2026, 5:29 PM

#

lethal raft Hey. How do you even have access??

it was being tested in arena a few days ago

#

i think it is still being tested in chatgpt though, but you will have to have a sub to have any chance of seeing it as I think it only gives you like 3 or 4 daily image gens for free

#

plus it probably blocks prompts more than the api like what arena is using

silver plank Apr 5, 2026, 10:08 PM

#

restive vapor plus it probably blocks prompts more than the api like what arena is using

I think (im not sure) I got it on the free plan, and note quite « blocking » lol (nsfw)

#

Same prompt on another free account (still nsfw)

vague pulsar Apr 5, 2026, 10:19 PM

#

...since when a bikini pic is NSFW? Is this 1950s? Is everybody nuts?

restive vapor Apr 5, 2026, 10:24 PM

#

silver plank I think (im not sure) I got it on the free plan, and note quite « blocking » lol...

this one is greater than 1024x1024 res, def. image 2

restive vapor Apr 5, 2026, 10:24 PM

#

silver plank Same prompt on another free account (still nsfw)

this one may still be image 1.5, res is 1024x1536

restive vapor Apr 5, 2026, 10:25 PM

#

restive vapor this one is greater than 1024x1024 res, def. image 2

although interestingly while the resolution is larger (1280x1280), it's still smaller than arena's image 2 (1920x1920)

sturdy kestrel Apr 5, 2026, 10:42 PM

#

O H HEALL NAH

#

GROK MODERATION: NSFW DETECTED!!!!!!!!

wooden ember Apr 5, 2026, 11:07 PM

#

sturdy kestrel GROK MODERATION: NSFW DETECTED!!!!!!!!

"Adding clothes"

sturdy kestrel Apr 5, 2026, 11:07 PM

#

Bombastic_Sideye

silver plank Apr 5, 2026, 11:08 PM

#

I bought a sub thinking I could continue using V2… but it went back to v1.5… 😔

candid surge Apr 7, 2026, 3:38 AM

#

Globe_1... not a great model.

prime quiver Apr 7, 2026, 8:25 AM

#

what's gpt image 2 called

candid surge Apr 7, 2026, 12:13 PM

#

its not in the arena anymore but it was maskingtape, packingtape, and gaffertape

storm gulch Apr 7, 2026, 11:23 PM

#

k

silver plank Apr 8, 2026, 12:47 AM

#

Someone in Reddit found a code name video model on an alternative site.

https://www.reddit.com/r/OpenAI/s/yK0Po4Q52E

r/OpenAI

Possible new Sora model?

Was on this AI arena website, I know the new gpt-image-2 was found on something similar. I was on the video arena and (after a few tries you will…

#

Does lmarena do private model on video?

restive vapor Apr 8, 2026, 1:18 AM

#

silver plank Does lmarena do private model on video?

yes

restive vapor Apr 8, 2026, 1:18 AM

#

silver plank Someone in Reddit found a code name video model on an alternative site. https:...

there are many arena competitors in video and image gen, artificial analysis, alibaba ai arena, etc.

#

i think it's chinese, openai publicly said that they won't be making any more sora models

#

could be veo 4 too (or potentially bfl or grok?)

modest oriole Apr 8, 2026, 5:11 AM

#

It was revealed that the k2 video model was seedance 2

prime quiver Apr 8, 2026, 11:16 AM

#

happy horse looks like some kind of veo 3.2 or smt

silver plank Apr 8, 2026, 2:23 PM

#

I doubt it, it’s definitely an Asian model. It keep making Asian people

lyric karma Apr 8, 2026, 4:19 PM

#

march26-chatbot3

sturdy kestrel Apr 8, 2026, 7:38 PM

#

hm

#

could it be deepseek v4?

floral dune Apr 8, 2026, 7:41 PM

#

sturdy kestrel could it be deepseek v4?

nvidia

#

nemotron

hollow saddle Apr 9, 2026, 12:00 AM

#

“model-x” seems really good at text to video

sturdy kestrel Apr 9, 2026, 12:46 AM

#

x...

#

we know who loves x dont we?

#

is this grok?

#

🤔

eternal cargo Apr 9, 2026, 12:54 AM

#

oh, spark was actually Meta, they finally came back to AI!

#

wonder if we'll see a leaderboard release this week, maybe tomorrow?

coarse bloom Apr 9, 2026, 1:20 PM

#

Have you heard of Flashbrown-B

floral shore Apr 9, 2026, 5:45 PM

#

@oak cliff The Video Arena is currently accessible through: https://arena.ai/video. More information on how to use Video Arena can be found in this article.

eternal cargo Apr 9, 2026, 6:49 PM

#

new model in text arena “eureka”

#

didn’t generate a response the first time, second time seemed strong though

frosty mantle Apr 9, 2026, 8:02 PM

#

Unnamed model in code arena screams OpenAI.

#

Some screams OpenAI, some other unnamed model doesn't have characteristic quirks.

hollow saddle Apr 10, 2026, 1:48 AM

#

The model “zorik” has won 3/3 coding comparisons for me. It hasnt had the best opponents but it certainly has great outputs.

Also some built in anti copyright stuff (called its netflix clone Streamflix), so im guessing that it might be the next iteration of one of the best models. Google, anthropic or openai. 🤔🤔🤔🧐

candid surge Apr 10, 2026, 2:09 AM

#

maybe openai is gonna release gpt-image-2, gpt-5.5, and a new coding model all at once next week?

restive vapor Apr 10, 2026, 2:15 AM

#

candid surge maybe openai is gonna release gpt-image-2, gpt-5.5, and a new coding model all a...

probably, gpt-image-2 and gpt-5.5 (or whatever they will call it, including its codex variant) is based on the same "spud" model that has been rumored to be a new model from scratch for a while now

restive vapor Apr 10, 2026, 2:16 AM

#

hollow saddle The model “zorik” has won 3/3 coding comparisons for me. It hasnt had the best o...

anthropic doesn't use test models on arena and google doesn't use names like that, so probably OAI

#

also lines up perfectly with spud release

#

also "anti-copyright stuff" lol, i had its image gen counterpart generate a nearly identical copy of 2 different copyrighted album covers without even trying

#

i also had it generate "sheet music" for a copyrighted song, idk if this is correct but the lyrics are
"Used by permission" 💀

restive vapor Apr 10, 2026, 2:21 AM

#

restive vapor also "anti-copyright stuff" lol, i had its image gen counterpart generate a near...

but it does seem to have more resistance against generating exact copies of copyrighted album covers than gpt image 1 mini, that model did it 100% of the time as long as the model knew what it looked like

hollow saddle Apr 10, 2026, 2:22 AM

#

Deepseek-v3.2 claiming its Sonnet 5 😭😭

Thought i was onto something until i the actual model names came up…

restive vapor Apr 10, 2026, 2:22 AM

#

hollow saddle Deepseek-v3.2 claiming its Sonnet 5 😭😭 Thought i was onto something until i t...

deepseek did distill from claude (and gemini too)

hollow saddle Apr 10, 2026, 2:23 AM

#

At least put some effort into hiding it lol

restive vapor Apr 10, 2026, 2:23 AM

#

see: kimi k2.5, if you don't tell it what it is in the system prompt, it will just say it's claude

hollow saddle Apr 10, 2026, 2:24 AM

#

Zorik also claiming its claude… so ig its some chinese one… maybe deepseek 4

restive vapor Apr 10, 2026, 2:25 AM

#

restive vapor see: kimi k2.5, if you don't tell it what it is in the system prompt, it will ju...

maybe it doesn't work on official api, would likely work with open weights version though

hollow saddle Apr 10, 2026, 2:25 AM

#

restive vapor Apr 10, 2026, 2:26 AM

#

siliconflow (trusted api partner)

restive vapor Apr 10, 2026, 2:26 AM

#

hollow saddle

hmm could be a chinese distill

#

🤣

hollow saddle Apr 10, 2026, 2:29 AM

#

surely they could just bombard their models with a bunch of text telling it what model it is in training... they really put 0 effort into hiding it

arctic hemlock Apr 10, 2026, 6:24 AM

#

models dont know who they are

hollow saddle Apr 10, 2026, 6:57 AM

#

arctic hemlock models dont know who they are

They do tho. They dont always know their exact version, but the good models dont hallucinate being from a different lab. Thats just the chinese slop models.

coarse bloom Apr 10, 2026, 12:16 PM

#

#

Here's two examples i made from MaskingTape Alpha! Its YTP related

#

First prompt of first image is "Ytp" and Second Image Prompt was "YTP video of man buying ice cream"

eternal cargo Apr 10, 2026, 7:09 PM

#

coarse bloom

bro that first one is freaky realistic

#

Would’ve clicked on that YT video so fast in 2018 lol

candid surge Apr 10, 2026, 7:13 PM

#

69k upvotes, the ai really knows what it's doing lmao

coarse bloom Apr 10, 2026, 9:34 PM

#

Yeah! Ai Never Sleeps!

#

From PackingTape Alpha, I used the prompt "Ytpmv splicing together" exactly as it is!

#

Look how convincing is this!

#

#

I made more from MaskingTape, PackingTape and GafferTape,First image was "Fnf gameplay Asdfmovie mod" second was just "2021 memes" third image was "Ytp mlg meme" and Last Fourth Image was "Tons of Newgrounds Flash animation Characters standing together. Youtube and Newgrounds Characters peak nostalgia"

#

And here's the main comparison of each models under "2021 memes" First Image Is PackingTape of course, Second is by Grok Imagine Image, Third is by ChatGPT 4o 1 mini, and Last was Wan 2.7 Image

candid surge Apr 10, 2026, 10:54 PM

#

godddd I want it to release properly already Q_Q

restive vapor Apr 10, 2026, 11:24 PM

#

coarse bloom From PackingTape Alpha, I used the prompt "Ytpmv splicing together" exactly as i...

one of the best images i got from a tape model while it was on arena

coarse bloom Apr 11, 2026, 1:22 AM

#

Best Rickroll Ever! @restive vapor

hybrid scarab Apr 11, 2026, 11:05 AM

#

anyone seen image model "epilogue"? for me it looks decent

hybrid scarab Apr 11, 2026, 11:05 AM

#

coarse bloom From PackingTape Alpha, I used the prompt "Ytpmv splicing together" exactly as i...

holy! this model looks absolute fire!

coarse bloom Apr 11, 2026, 5:54 PM

#

I generated Sonograms from Epilogue, "Arrays of Sonogram, 4 by 4" is the prompt

candid surge Apr 11, 2026, 6:00 PM

#

yeeeesh, row A column 3

vague loom Apr 11, 2026, 8:49 PM

#

Ch odfing course

main anvil Apr 12, 2026, 3:28 AM

#

very nice

remote nymph Apr 12, 2026, 3:47 AM

#

april26-chatbot2 (nvidia) and hofburg_2

#

annoying

#

hofburg's first response sounds like gpt "if you tell me..."

plucky pilot Apr 12, 2026, 6:57 AM

#

restive vapor one of the best images i got from a tape model while it was on arena

😂 that's amazing

silver plank Apr 12, 2026, 2:07 PM

#

Masking tape is genuinely amazing

candid surge Apr 12, 2026, 2:14 PM

#

is it back in the arena? Or was this an old gen?

silver plank Apr 12, 2026, 2:27 PM

#

Old gen (I think) or it was from A/B on ChatGPT

candid surge Apr 12, 2026, 2:29 PM

#

ah :(
SOON (I hope)

#

jealous of all the people who just have access outright

sturdy kestrel Apr 12, 2026, 4:32 PM

#

silver plank Masking tape is genuinely amazing

grok's nightmare

coarse bloom Apr 12, 2026, 7:54 PM

#

I discovered a new codename model of video called Model-X

#

It's cool!

#

But also decent to be honest

meager bluff Apr 12, 2026, 9:10 PM

#

#

lookout for some codenames

#

could be DS v4

silver plank Apr 12, 2026, 9:39 PM

#

Got april26-chatbot2
Is that new?

candid surge Apr 12, 2026, 10:51 PM

#

well it can't be older than 12 days

unkempt bolt Apr 13, 2026, 5:22 AM

#

vague pulsar ...since when a bikini pic is NSFW? Is this 1950s? Is everybody nuts?

Not sure if it's for the sake of a narrative or what.
@sturdy kestrel Grok, 80% of the time, is a bit more lenient and occasionally does show a nip, unlike the stepford neighorhood gestapos botmodding Arena. Even source sites like Gemini allow more breathing room for bikinis and such while Arena is just:

#

while effectively breaking Gemini (and partially a few others) to the point of being nearly unusable.
Who the hell are the peers they're showing off for or getting bullied by?

unkempt bolt Apr 13, 2026, 6:07 AM

#

It makes one wonder why Arena's implementing something most, if not, all these models already have/don't need.

frosty mantle Apr 13, 2026, 7:49 AM

#

Zorik may be yet another distilled Chinese model, but boy does it have a very good post training smell

upbeat mirage Apr 13, 2026, 12:47 PM

#

frosty mantle Zorik may be yet another distilled Chinese model, but boy does it have a very go...

better than gemini 3 flash?

frosty mantle Apr 13, 2026, 12:47 PM

#

Worlds apart

#

At least in code

upbeat mirage Apr 13, 2026, 12:48 PM

#

frosty mantle At least in code

is it so good?

eternal cargo Apr 13, 2026, 1:57 PM

#

silver plank Got april26-chatbot2 Is that new?

the [month]-chatbot models are NVIDIA

#

the new april NVIDIA models do seem like a notable improvement from prior ones

silver plank Apr 13, 2026, 3:39 PM

#

Zorik is really good at code, I think it may be Kimi2.6-coding

silver plank Apr 13, 2026, 4:38 PM

#

New model Elephant-Alpha on Openrouter

sturdy kestrel Apr 13, 2026, 4:58 PM

#

hmm

#

i must guess

upbeat mirage Apr 13, 2026, 9:12 PM

#

silver plank Zorik is really good at code, I think it may be Kimi2.6-coding

Better than Gemini 3.1 pro?

#

And compared to GPT codex or Claude Sonnet 4.6?

silver plank Apr 13, 2026, 9:20 PM

#

upbeat mirage And compared to GPT codex or Claude Sonnet 4.6?

I only got it against open model, but it’s better than GLM5.1 and Kimi2.5Thinking

upbeat mirage Apr 13, 2026, 9:21 PM

#

silver plank I only got it against open model, but it’s better than GLM5.1 and Kimi2.5Thinkin...

Do you think, it could also excel in roleplaying and gamemastering?

slender delta Apr 13, 2026, 11:24 PM

#

Is Model-x was LTX 2.3?

sturdy kestrel Apr 14, 2026, 4:18 AM

#

sturdy kestrel i must guess

i can t guess :(

coarse bloom Apr 14, 2026, 1:46 PM

#

@slender delta I believe its likely Veo 4

green ice Apr 15, 2026, 12:03 AM

#

scorch is surprisingly good at math

candid surge Apr 15, 2026, 1:41 AM

#

YOOOO GPT-IMAGE-2 BACK IN ARENA

#

also flow-state

coarse bloom Apr 15, 2026, 3:32 AM

#

I got one made with duct tape 2, its "pouring cream into latte"

candid surge Apr 15, 2026, 3:34 AM

#

idk why you'd ask for pouring cream into latte when you can instead ask for 80s style retro anime VHS screengrab of a bunch of goofy green skinned goblin raider gals with distinct personalities tbh

coarse bloom Apr 15, 2026, 3:35 AM

#

@candid surge I prefer simple prompts! With due respect

#

@candid surge and also i didn't make this! Someone shared this to me

#

And this time, one i made myself is "Person being dragged away by officers in court, ytp" with Duct Tape 3

candid surge Apr 15, 2026, 3:39 AM

#

THERE we go 😂 that's great

#

I'm dying at that mouth

coarse bloom Apr 15, 2026, 3:39 AM

#

So funny!

#

This is by Duct Tape 2 Myself, the prompt was "Screenshot of Youtube Video Livestream of OpenAI, Video showing Announcements for Aura-1 World Simulator, Text To Interactive World."

candid surge Apr 15, 2026, 3:44 AM

#

baseball bat

coarse bloom Apr 15, 2026, 3:46 AM

#

BFDI JackNJellify official youtube channel page, youtube screenshot from 2023 From Duct Tape 3

#

#

And All bfdi characters are standing together

#

From gemini 3.1. Pro

#

And from ducktape 3

#

For comparison

#

Dumb ways to die posters from Duct Tape 2.

#

Its so accurate at making near exact style

restive vapor Apr 15, 2026, 4:24 AM

#

the duct tape (gpt image 2) models are great but through all variants of gpt image 2 i have tried, i realized that it has very poor text-based world knowledge, slightly above llama 3.1 8b level
they must be doing this so more compute can be focused on the image gen part to make it faster, but this is a massive regression from even gpt image 1 mini
i'm sure this is much better than nano banana 2/pro in most instances, but in scenarios where the world knowledge of the llm it's paired with is necessary, it's just terrible

#

it can generate near copies of album covers but it can't beat qwen3.5-35b in world knowledge

candid surge Apr 15, 2026, 4:37 AM

#

it can fortnite-ify characters btw

candid surge Apr 15, 2026, 5:13 AM

#

prompt was simply "D&D Poutine Elemental"

candid surge Apr 15, 2026, 6:05 AM

#

just got a maskingtape-alpha result

#

prime quiver Apr 15, 2026, 9:08 AM

#

candid surge

gpt image2?

tough grotto Apr 15, 2026, 9:16 AM

#

her legs are so small

sage smelt Apr 15, 2026, 9:38 AM

#

Hello! May I ask if there is a limit on the number of times this can be generated?

rocky trail Apr 15, 2026, 9:48 AM

#

guys what is hofburg_2

#

it says it's gpt

#

but it also said it's claude

silver plank Apr 15, 2026, 10:16 AM

#

candid surge it can fortnite-ify characters btw

silver plank Apr 15, 2026, 10:38 AM

#

maskingtape-alpha
Duct-tape-1
Duct-tape-2
Duct-tape-3
Is that all?

coarse bloom Apr 15, 2026, 11:02 AM

#

@silver plank also there's packingtape-alpha and gaffertape-alpha

#

I made this. The prompt: Ytp memes splicing random clips

#

With maskingtape-alpha

hybrid scarab Apr 15, 2026, 11:54 AM

#

is gpt image 2 back on arena?

silver plank Apr 15, 2026, 12:10 PM

#

It was, I think it’s already gone… 😔

hybrid scarab Apr 15, 2026, 12:12 PM

#

silver plank It was, I think it’s already gone… 😔

i don't think so because i just generated image with duct-tape-3 around 2 minutes ago

silver plank Apr 15, 2026, 12:14 PM

#

hybrid scarab i don't think so because i just generated image with duct-tape-3 around 2 minute...

Really? You must be lucky then, I’ve been trying for 40 minutes and only got QWEN and Grok

hybrid scarab Apr 15, 2026, 12:14 PM

#

silver plank Really? You must be lucky then, I’ve been trying for 40 minutes and only got QWE...

damn, yeah i always be getting qwen image too lol

#

and flux-2-klein

sinful socket Apr 15, 2026, 12:16 PM

#

Duct-tape 3, prompt gta 6 leaked screenshot

#

Lmao

frosty mantle Apr 15, 2026, 12:27 PM

#

I'm getting the tapes

#

But I'm really curious if we pit the tapes against each other, which tape do you think is relatively really good?

silver plank Apr 15, 2026, 12:29 PM

#

Duct-tape 1 is not good in terms of of style for me

hybrid scarab Apr 15, 2026, 12:29 PM

#

frosty mantle But I'm really curious if we pit the tapes against each other, which tape do you...

in my experience duct-tape-3 looks better then other tapes

silver plank Apr 15, 2026, 12:30 PM

#

Look too much like gpt image 1

hybrid scarab Apr 15, 2026, 12:30 PM

#

silver plank Duct-tape 1 is not good in terms of of style for me

same for me

lost hemlock Apr 15, 2026, 12:31 PM

#

silver plank maskingtape-alpha Duct-tape-1 Duct-tape-2 Duct-tape-3 Is that all?

from image arena or where ?

silver plank Apr 15, 2026, 12:35 PM

#

Yeah arena

#

And I noticed masking tape have an higher resolution than the other.
On the same ratio
Masking tape is 2352x1568
Duct tape is 1536x1024

But when I got A/B access on ChatGPT last Friday it’s 1536*1024

sinful socket Apr 15, 2026, 12:45 PM

#

silver plank And I noticed masking tape have an higher resolution than the other. On the sam...

1536x1024 was gpt 1.5

#

Tested by me

silver plank Apr 15, 2026, 1:05 PM

#

I really hope we get a full release soon

restive vapor Apr 15, 2026, 3:14 PM

#

i wonder if they have disabled gpt image 2 tape models, i haven't got one in the past 10-15 minutes

candid surge Apr 15, 2026, 3:17 PM

#

no they're still there

restive vapor Apr 15, 2026, 3:17 PM

#

candid surge no they're still there

did you just get something from one?

candid surge Apr 15, 2026, 3:17 PM

#

if you go to an old image gen and it says "assistant A" or "assistant B" that's how you know they're gone

restive vapor Apr 15, 2026, 3:18 PM

#

candid surge if you go to an old image gen and it says "assistant A" or "assistant B" that's ...

that's why i said disabled, they can not work but still show up if you look at old gens

candid surge Apr 15, 2026, 3:18 PM

#

its true I haven't gotten one in a few minutes but...

sinful socket Apr 15, 2026, 3:19 PM

#

me too

#

no way :C

restive vapor Apr 15, 2026, 3:19 PM

#

i wonder if they are about to launch image 2 and that is why its disabled?

candid surge Apr 15, 2026, 3:22 PM

#

that'd be nice

silver plank Apr 15, 2026, 3:28 PM

#

I hope I’m wrong but if they are still testing multiple model at the same time , I think they are not close to releasing it.

sinful socket Apr 15, 2026, 3:35 PM

#

silver plank I hope I’m wrong but if they are still testing multiple model at the same time ,...

It might me flash image model, and pro like nano banana

silver plank Apr 15, 2026, 3:37 PM

#

Yeah, but gpt image 1 mini is way worse than the standard one.

Masking tape and duct tape a pretty much equal

silver plank Apr 15, 2026, 7:45 PM

#

I take back what I said, masking tape is way better at composition and quality b

keen bridge Apr 15, 2026, 10:21 PM

#

the duct tape has just returned!!!!

candid surge Apr 15, 2026, 11:39 PM

#

it returned a while ago

silver plank Apr 16, 2026, 1:10 AM

#

bot bot 2 is here

keen bridge Apr 16, 2026, 1:12 AM

#

i have a feeling botbot2 is nano banana 2 pro

floral dune Apr 16, 2026, 1:17 AM

#

botbot2 has synthid

silver plank Apr 16, 2026, 1:17 AM

#

Wasn’t there already botbot 1 like a month ago?

silver plank Apr 16, 2026, 1:17 AM

#

floral dune botbot2 has synthid

Oh nice, then definitely google

keen bridge Apr 16, 2026, 1:18 AM

#

oh no hacked account

candid surge Apr 16, 2026, 1:58 AM

#

botbot2 doesn't seem good enough to be nano banana 2 pro though

silver plank Apr 16, 2026, 1:58 AM

#

Yeah probably mini/lite

candid surge Apr 16, 2026, 1:59 AM

#

botbot2, nb2, and nbpro respectively

rugged anvil Apr 16, 2026, 2:45 AM

#

NB2 one is good
But it depends on what promot you gave and what you wanted to make

prime quiver Apr 16, 2026, 8:59 AM

#

Guys

twilit nacelle Apr 16, 2026, 11:44 AM

#

coarse bloom Apr 16, 2026, 12:22 PM

#

Do you remember the brushstroke, cara, pebble-1 and pebble-2 months ago

pure surge Apr 16, 2026, 5:18 PM

#

prime quiver Guys

bruh

crystal merlin Apr 16, 2026, 8:41 PM

#

Well idk if anyone mentioned it before, but the hofburg models seem to be OpenAI prob?

rich sphinx Apr 16, 2026, 10:45 PM

#

coarse bloom And this time, one i made myself is "Person being dragged away by officers in co...

it can't do ytp stuff yet it somehow can

#

it does it really poorly

coarse bloom Apr 17, 2026, 1:21 AM

#

@rich sphinx However its closer to YTP! Its better than completely nonsense video that doesn't look like youtube poop after all! 😊

#

It's just experimental!

sturdy kestrel Apr 17, 2026, 9:37 AM

#

what the heck is hofburg_4

#

it literally turned a simple thing into a "do-all" thing

open timber Apr 17, 2026, 9:43 AM

#

hofburg gonna solo all Ais

#

its gonna be the best the ai trust

sturdy kestrel Apr 17, 2026, 9:43 AM

#

it is gonna solo tokens 💀

plush nimbus Apr 17, 2026, 9:43 AM

#

Lol guess what

open timber Apr 17, 2026, 9:43 AM

#

sturdy kestrel it is gonna solo tokens 💀

😭😭

#

what

plush nimbus Apr 17, 2026, 9:44 AM

#

I got mc2.1 one time

open timber Apr 17, 2026, 9:44 AM

#

dem

sturdy kestrel Apr 17, 2026, 9:44 AM

#

is battle broken one time it was working now it stopped working

#

it doesnt gets fixed with refreshing/hard refreshin

sturdy kestrel Apr 17, 2026, 9:45 AM

#

sturdy kestrel is battle broken one time it was working now it stopped working

it keep generating..

#

finally it generated

#

hmm my guess is that this is the next sonnet model

rocky trail Apr 17, 2026, 12:52 PM

#

sturdy kestrel hmm my guess is that this is the next sonnet model

+1

#

#

what is beluga-0413-1

tawdry pendant Apr 17, 2026, 1:26 PM

#

It's Beluga

#

upbeat mirage Apr 17, 2026, 2:30 PM

#

rocky trail

Could be Deepseek, as they have a whale as their symbol. Or maybe Amazon.

#

But i bet it's indeed DS.

lyric karma Apr 17, 2026, 4:03 PM

#

rocky trail what is beluga-0413-1

#

well

tawdry pendant Apr 17, 2026, 4:46 PM

#

lyric karma

@upbeat mirage

#

@exotic dirge

#

Very confusing now

#

😭

#

Might be Amazon

exotic dirge Apr 17, 2026, 4:49 PM

#

tawdry pendant Might be Amazon

scraped model lol

chrome hound Apr 17, 2026, 7:20 PM

#

quiet_sand could be Meta's next model, but I hope not because it's not that good. Here's the full site it made to explore: https://019d9cac-057b-743e-a559-4f0688f31cfd.arena.site/
Additionally, two more sites it made:
https://019d9c97-57d4-75ae-b9d7-e7b00b2ad1fb.arena.site/
https://019d9c74-e12c-7108-b520-7a05db940cb4.arena.site/
Next I'll be having the AI's make some games to see if this guy can make some nice games.

Meta AI — Your Personal AI Assistant

Check out what I built in Arena's Code Arena - Content is user-generated and unverified

Meta AI — Official Release | Built by Meta with Llama 4

Check out what I built in Arena's Code Arena - Content is user-generated and unverified

Meta AI — Your Intelligent Assistant

Check out what I built in Arena's Code Arena - Content is user-generated and unverified

upbeat mirage Apr 17, 2026, 10:19 PM

#

tawdry pendant Might be Amazon

i would say, with these above screenshots, the probability has risen to over 90% that it is indeed an Amazon model, because much more models impersonate one of the top-3 (gpt, gemini, claude) than impersonating a model from a much lesser lab (like Amazon)
I actually never saw an impersonation of a model which was NOT in the top-3

#

(all impersonations where either of (chat)gpt, gemini or claude; not even Grok was impersonated to my knowledge)

restive vapor Apr 18, 2026, 1:11 AM

#

there is a new image model called "autobear", it's OK at best, it is by alibaba/qwen, and it is in 2k resolution.

keen bridge Apr 18, 2026, 3:27 AM

#

restive vapor there is a new image model called "autobear", it's OK at best, it is by alibaba/...

yeah is it good tho

coarse bloom Apr 18, 2026, 4:17 AM

#

I made this from autobear! "A fly trapped between window and screen, buzzing against the invisible barrier while freedom is technically inches away in both directions." Turns out this is great with prompts if you make it more detailed and specific of what you want

#

Even though the fly looks a bit plastic

#

What is autobear from

rocky trail Apr 18, 2026, 9:03 AM

#

restive vapor there is a new image model called "autobear", it's OK at best, it is by alibaba/...

it's probably chinese cause it's not that high on safety and stuff

vernal crypt Apr 18, 2026, 12:42 PM

#

hello

#

yourguys know

#

whats

#

hofburg

#

model from?

candid surge Apr 18, 2026, 1:03 PM

#

Still haven't gotten autobear...

gleaming current Apr 18, 2026, 1:09 PM

#

candid surge Still haven't gotten autobear...

I got autobear twice

#

#

Bro is not flux.1

candid surge Apr 18, 2026, 1:14 PM

#

Finally got it. Not too impressed.

solid crypt Apr 18, 2026, 1:14 PM

#

gleaming current

candid surge Apr 18, 2026, 1:14 PM

#

Messed up the text

tough grotto Apr 18, 2026, 1:15 PM

#

candid surge Finally got it. Not too impressed.

miku fatsune

#

teto_heart

solid crypt Apr 18, 2026, 1:18 PM

#

candid surge Finally got it. Not too impressed.

I finallly got it

#

But its better

#

But let me ask

#

Is chatgpt on normal app have ducttape also feature for the image?

#

Or am i wrong

gleaming current Apr 18, 2026, 1:21 PM

#

ur getting a whole vid bro

solid crypt Apr 18, 2026, 1:21 PM

#

gleaming current ur getting a whole vid bro

solid crypt Apr 18, 2026, 1:25 PM

#

gleaming current ur getting a whole vid bro

Guess wat

#

gleaming current Apr 18, 2026, 1:25 PM

#

what

gleaming current Apr 18, 2026, 1:25 PM

#

solid crypt

I didnt daya dd

#

ieje

#

fjowvruwuwwo7rupikye91to4i25472ypty8

solid crypt Apr 18, 2026, 1:26 PM

#

Umm

#

Something isnt right

hybrid scarab Apr 18, 2026, 2:28 PM

#

i got autobear and ngl for me it generation was really bad

coarse bloom Apr 18, 2026, 3:54 PM

#

For Autobear, you must type your prompts carefully like details and nuances and also its needs to be specific and precise for subject and object of the theme! @hybrid scarab @candid surge @solid crypt and you can also go to any prompt enhancer website to enhance your basic novice prompts to a very precise and specific prompt you want to see

candid surge Apr 18, 2026, 3:54 PM

#

coarse bloom For Autobear, you must type your prompts carefully like details and nuances and ...

Realistic slightly faded and grainy polaroid photo of Miku Hatsune wearing glasses, short sleeved button shirt, beige pants with a belt, she is sitting at a computer with a bulky beige monitor. Minecraft alpha is on the monitor, and game design notes are pinned to the wall behind her. Sharpie pen text written in the whitespace: "Miku Hatsune inventing Minecraft, 1998"

silver plank Apr 18, 2026, 4:01 PM

#

solid crypt Is chatgpt on normal app have ducttape also feature for the image?

Sometime you could get it, but it’s random

#

I think autobear is a Chinese open source/weight model

brazen spade Apr 18, 2026, 4:47 PM

#

solid crypt

ts obliterating me

minor fox Apr 18, 2026, 5:12 PM

#

A high-end fashion brand hero image for a clothing brand named "FAITH". A minimalistic and powerful scene: a stylish model standing confidently in soft dramatic lighting, wearing modern streetwear in neutral tones (black, white, beige). Background is clean with subtle texture or light rays. The word "FAITH" appears in bold elegant typography. Include the phrase "Faith Over Fear" in a refined, modern font. Cinematic lighting, premium fashion campaign style, sharp focus, high contrast, luxurious and emotional atmosphere.

river kettle Apr 18, 2026, 5:23 PM

#

candid surge Finally got it. Not too impressed.

gpt2 (tape)

candid surge Apr 18, 2026, 5:23 PM

#

😂

#

(for the record the "finally got it" was referring to autobear, not tape)

river kettle Apr 18, 2026, 5:29 PM

#

candid surge (for the record the "finally got it" was referring to autobear, not tape)

I get it. Autobear (qwen2 HD) is much better than just qwen 2, but it still has a long way to go.

coarse bloom Apr 18, 2026, 5:41 PM

#

Its usually Chinese models! @river kettle

modest oriole Apr 18, 2026, 6:26 PM

#

all duct-tape models have been removed from the arena

#

1 got removed earlier, 2 and 3 got removed 3 minutes ago

crystal merlin Apr 18, 2026, 6:27 PM

#

Then they will be released soon

slate totem Apr 18, 2026, 6:28 PM

#

modest oriole 1 got removed earlier, 2 and 3 got removed 3 minutes ago

I was literally trying to get it rn ;-;

gleaming current Apr 18, 2026, 6:36 PM

#

modest oriole all duct-tape models have been removed from the arena

Probably a sign of official release

coarse bloom Apr 18, 2026, 6:43 PM

#

@modest oriole how do you know they were removed 🤨

modest oriole Apr 18, 2026, 6:45 PM

#

coarse bloom <@872475096743305226> how do you know they were removed 🤨

a server checks the stealth models API for changes

#

theres a bot that does that

#

and it showed that duct tape 2 and 3 were removed from it

gleaming current Apr 18, 2026, 6:48 PM

#

modest oriole and it showed that duct tape 2 and 3 were removed from it

Can you send the server?

#

If you mean discord server

restive vapor Apr 18, 2026, 7:03 PM

#

modest oriole all duct-tape models have been removed from the arena

honestly for the past few days it's already been removed, you could still get it but it was so rare that you would probably reach the battle mode limit before getting it once

modest oriole Apr 18, 2026, 7:05 PM

#

restive vapor honestly for the past few days it's already been removed, you could still get it...

theres a new model right now on codearena and textarena called kiwire

#

it has to be stupidly rare because i didnt get it once yet

gleaming current Apr 18, 2026, 7:07 PM

#

@modest oriole could you send me the server that shows what models are added or removed? sorry for the ping.

silver plank Apr 18, 2026, 7:15 PM

#

modest oriole a server checks the stealth models API for changes

Would love to have the server too

candid surge Apr 18, 2026, 7:37 PM

#

same

tawdry pendant Apr 18, 2026, 7:38 PM

#

Same

crystal merlin Apr 18, 2026, 7:44 PM

#

same

cinder lintel Apr 18, 2026, 9:07 PM

#

I feel like hofburg = OpenAI , it's answer was very similar to gpt-5.3, both in style and content

crystal merlin Apr 18, 2026, 9:17 PM

#

could possibly be a small version of their models

#

because its just so bad

hollow void Apr 18, 2026, 11:39 PM

#

https://youtu.be/grdoOQ-sLfE?si=mDHSQn81KOGlBfct

YouTube

CNBC

AI Demand Is Inflated And Only Anthropic Is Being Realistic

The main demand signal for artificial intelligence looks explosive on paper, but it may be significantly overstated. Token consumption, the basic unit of AI usage, is becoming a distorted metric. Companies like Shopify and Meta have created internal "tokenmaxxing" leaderboards that track how many tokens employees use, and Nvidia CEO Jensen Huang...

▶ Play video

prime quiver Apr 19, 2026, 8:06 AM

#

duct-tape2

prime quiver Apr 19, 2026, 9:30 AM

#

hofburg_5 is 100% chatgpt 5.5

#

because its the only model that uses this “ ” when cuoting. No other model uses that style, they all use " "

muted seal Apr 19, 2026, 9:57 AM

#

prime quiver hofburg_5 is 100% chatgpt 5.5

how do you like it's output?

prime quiver Apr 19, 2026, 11:46 AM

#

muted seal how do you like it's output?

keen bridge Apr 19, 2026, 3:19 PM

#

This looks suspicious

#

wild-bits?

gleaming current Apr 19, 2026, 3:21 PM

#

keen bridge This looks suspicious

added yesterday

#

Have no idea what it is

sturdy kestrel Apr 19, 2026, 3:33 PM

#

let me guess

#

thats grok

valid peak Apr 19, 2026, 4:49 PM

#

prime quiver

yeah this does read like gpt

sturdy kestrel Apr 19, 2026, 5:33 PM

#

gpt is our beloved token waster

#

it does everything to generate as much as it can

rich sphinx Apr 19, 2026, 8:57 PM

#

@astral musk

eternal cargo Apr 19, 2026, 9:22 PM

#

cinder lintel I feel like hofburg = OpenAI , it's answer was very similar to gpt-5.3, both in ...

I also think this

#

I find hofburg_5 to be quality personally

silent cedar Apr 20, 2026, 6:26 AM

#

Hey how r u

frosty mantle Apr 20, 2026, 12:44 PM

#

Flow-state vs NB Pro on costume swapping task.

gemini-3-pro-image-preview-2k_a_Swap_their_costumes..png

little pebble Apr 20, 2026, 5:00 PM

#

crystal merlin Well idk if anyone mentioned it before, but the hofburg models seem to be OpenAI...

Curious, what made you believ this?

crystal merlin Apr 20, 2026, 5:02 PM

#

They answer nearly 1:1 the same as official OpenAI and are heavily restricted down. Plus they are pretty bad

pine temple Apr 20, 2026, 8:46 PM

#

They could just be a Chinese distill of open AI models

#

Maybe

keen bridge Apr 21, 2026, 7:43 AM

#

<@&1349916362595635286> this has to stop

silver plank Apr 21, 2026, 11:38 AM

#

Just got ImageV2 in the app and the old Sora website.

#

Now need to find out if it’s duct tape or masking tape.

#

Resolution wise it match duct tape

lost hemlock Apr 21, 2026, 12:16 PM

#

"ilium_2" new model

keen bridge Apr 21, 2026, 12:31 PM

#

hmm looks suspicious

coarse bloom Apr 21, 2026, 12:45 PM

#

Made with Baseliner prompt: Stocky build body

tawdry pendant Apr 21, 2026, 12:48 PM

#

silver plank Resolution wise it match duct tape

It looks worse then duct tape too

silver plank Apr 21, 2026, 12:50 PM

#

Yeah but still better then 1.5

silver plank Apr 21, 2026, 1:06 PM

#

An as expected it’s guardrail are also way harsher than when it was in arena

candid surge Apr 21, 2026, 1:15 PM

#

silver plank Just got ImageV2 in the app and the old Sora website.

Wait they actually updated the old sora website to use image v2?

silver plank Apr 21, 2026, 1:19 PM

#

candid surge Wait they actually updated the old sora website to use image v2?

Nah, sorry just retried, I think I hallucinated sorry

candid surge Apr 21, 2026, 1:20 PM

#

Ah, that's a shame. Would have been more convenient lmao

granite idol Apr 21, 2026, 1:22 PM

#

Made with Baseliner prompt: Stocky build body

candid surge Apr 21, 2026, 1:31 PM

#

??

thorny ruin Apr 21, 2026, 2:03 PM

#

Please add all models to test in side by side mode

silver plank Apr 21, 2026, 2:33 PM

#

theres those wierd artefact...

Capture_decran_le_2026-04-21_a_10.32.55.png

#

could this be a sort of watermark like synth id?

#

happen on all pictures (ignore the gemini watermark)

#

wait no... its sort of artefact from the original picutre that somehow stay in the output...

Capture_decran_le_2026-04-21_a_10.39.13.png

coarse bloom Apr 21, 2026, 9:46 PM

#

What do you think of Baseliner the codename of the unknown model

#

I've discovered another model called frenchfry, the prompt was :6 7 meme. BTW it did the why was 6 afraid of 7 because 7 8 9

#

Since this obviously didn't have 9, 7 ate 8

#

I dont think its the best model like Images 2.0 from OpenAI

#

And I made image from prompt: 5 by 5 array of imagenet images. And this was called shakshouka

sturdy kestrel Apr 21, 2026, 11:54 PM

#

i also witnessed shakshouka

coarse bloom Apr 22, 2026, 12:38 AM

#

@sturdy kestrel Also, have you been getting baseliner

sturdy kestrel Apr 22, 2026, 12:40 AM

#

no

#

im not regularly battling

#

i do it when i feel bored or i feel like helping arena

hollow ibex Apr 22, 2026, 1:10 AM

#

Do you think it would be cool to create my own AI in HTML? It sounds stupid, but I'm just bored

frosty mantle Apr 22, 2026, 3:01 AM

#

So, which tape is the Image 2? 🤔

restive vapor Apr 22, 2026, 3:17 AM

#

frosty mantle So, which tape is the Image 2? 🤔

duct-tape-2 is the version on arena (gpt image 2 medium, 1k)

livid thorn Apr 22, 2026, 9:04 AM

#

hollow ibex Do you think it would be cool to create my own AI in HTML? It sounds stupid, but...

that would be lit

flat root Apr 22, 2026, 5:18 PM

#

rising-sun seems to be a google model but it sucks so much...

pine dove Apr 22, 2026, 6:55 PM

#

sturdy kestrel i also witnessed shakshouka

very late message here, i actually witnessed shakshouka also

candid surge Apr 22, 2026, 7:31 PM

#

https://tenor.com/rUYKXyo0Xa1.gif

Tenor

#

shakshaka

frosty mantle Apr 23, 2026, 9:57 PM

#

Solar Eclipse 👀

keen bridge Apr 23, 2026, 11:49 PM

#

I Believe this is chatgpt 5.5

#

also another seed update

coarse bloom Apr 24, 2026, 12:30 AM

#

I got paper-lantern. The prompt was: Group of people chasing after me, POV

restive vapor Apr 24, 2026, 12:31 AM

#

coarse bloom I got paper-lantern. The prompt was: Group of people chasing after me, POV

this is a Flux.2 model according to the c2pa data

coarse bloom Apr 24, 2026, 12:36 AM

#

Oh must be a new flux 2.5 klein? Maybe

#

And here's another its YTP of pov low quality landscape amateur pov recording, of a computer pc gpus are farting so much smoke!

#

@restive vapor oh! Flux.2 model! Never seen flux 2 klein generates like this before

#

Here's comparison from Flux 2 klein 9b

#

Noticably different! Must be upcoming flux 2 model

#

And from Flux.1 Kontext Pro

restive vapor Apr 24, 2026, 12:48 AM

#

coarse bloom Noticably different! Must be upcoming flux 2 model

it's likely that it's an update to flux.2 or a new flux model series entirely, it just says that in the c2pa data because they haven't updated it to whatever the final model name will be

#

also there are other flux.2 models that are better than klein (flux.2 dev 32b, flux.2 pro, flux.2 flex, and flux.2 max)

coarse bloom Apr 24, 2026, 10:21 AM

#

@restive vapor who knows, after all it's still a good ai model

reef portal Apr 24, 2026, 2:15 PM

#

is grok 4.3 not in the arena yet? no suspecged codenands?

thorn perch Apr 24, 2026, 3:19 PM

#

hlo

frosty mantle Apr 24, 2026, 6:48 PM

#

Zero Prism is Ernie, from its behavior to stop immediately whenever it's about to generate forbidden tokens

eternal cargo Apr 25, 2026, 6:53 PM

#

cloud-buddy

#

interesting name!

#

good with SVGs too

#

compact field Apr 25, 2026, 7:14 PM

#

how do we get to use this codename tab in arena.ai

#

and how can i use gpt 5.5

flat root Apr 25, 2026, 10:08 PM

#

basalt-0422-1 could be the next Grok model. Unsure, needs confirmation.

keen bridge Apr 26, 2026, 3:39 AM

#

tribal leaf Apr 26, 2026, 8:13 AM

#

Flow code

primal junco Apr 26, 2026, 11:17 AM

#

<@&1349916362595635286>

crimson orchid Apr 26, 2026, 4:02 PM

#

#

kind of sounds like gpt but not sure

stable pawn Apr 26, 2026, 6:36 PM

#

oblique blaze Apr 27, 2026, 11:40 AM

#

cloud-buddy sounds like Anthropic's creation.

#

Though I've yet to seen any models that could have responded this excellent.

#

Highly improbable, but could it be Mythos? Or some Arena's experiment?

rigid rock Apr 27, 2026, 2:57 PM

#

oblique blaze cloud-buddy sounds like Anthropic's creation.

ain't YOU cloud buddy tho

crimson orchid Apr 27, 2026, 3:54 PM

#

minor geyser Apr 27, 2026, 4:38 PM

#

what model could this be?

#

pretty good at front end

blissful frigate Apr 27, 2026, 5:46 PM

#

minor geyser pretty good at front end

And also instruction following according to my observations.

olive cliff Apr 27, 2026, 11:42 PM

#

If u don't mind I have an android so is it possible to run in mobile

heady plume Apr 28, 2026, 12:05 AM

#

I like cloud-buddy. Is it likely to be anthropic?

solid crypt Apr 28, 2026, 12:19 AM

#

heady plume I like cloud-buddy. Is it likely to be anthropic?

@oblique blaze

oblique blaze Apr 28, 2026, 12:22 AM

#

oblique blaze cloud-buddy sounds like Anthropic's creation.

Interestingly enough my Max conversation have been routed to that same model for 3 more response now

#

Absolutely stunning how knowledge and detailed its response

heady plume Apr 28, 2026, 1:30 AM

#

Yeah I like it a lot too. But I've not heard of anthropic testing a model on arena ahead of release I think.

candid surge Apr 28, 2026, 2:36 PM

#

probably flowstate

agile furnace Apr 28, 2026, 3:27 PM

#

kizen beta

agile furnace Apr 28, 2026, 3:32 PM

#

agile furnace kizen beta

reviewed by gemini 2.5, It thinks, this model is from claude and Claude sonnet 4.6 thinks this model might be claude or gemini

oblique blaze Apr 28, 2026, 3:33 PM

#

keen bridge also another seed update

#

@agile furnace

sturdy kestrel Apr 28, 2026, 3:33 PM

#

it can lie tho

agile furnace Apr 28, 2026, 3:33 PM

#

oblique blaze <@1195530778881294488>

oh

sturdy kestrel Apr 28, 2026, 3:33 PM

#

some anonymous ai models can hide their identities

plush nimbus Apr 28, 2026, 4:46 PM

#

Wth

#

Vierra is qwen something

plush nimbus Apr 28, 2026, 4:47 PM

#

rigid rock ain't YOU cloud buddy tho

Also your pfp is used in pysilon

obsidian scaffold Apr 28, 2026, 5:15 PM

#

crimson orchid

Solar eclipse of the heart. Bonnie Tyler

valid peak Apr 28, 2026, 5:21 PM

#

obsidian scaffold Solar eclipse of the heart. Bonnie Tyler

Total

obsidian scaffold Apr 28, 2026, 5:22 PM

#

valid peak Total

Man you were supposed to laugh not correct me 🥴

sturdy kestrel Apr 29, 2026, 2:56 PM

#

wow

#

is this banable?

candid surge Apr 29, 2026, 3:04 PM

#

?

#

what happened?

small trellis Apr 29, 2026, 3:05 PM

#

candid surge what happened?

There was a post borderline Political. It's gone now.

frosty mantle Apr 29, 2026, 4:24 PM

#

frosty mantle Zero Prism is Ernie, from its behavior to stop immediately whenever it's about t...

Yep. Ernie 5.1.

hybrid scarab Apr 29, 2026, 4:46 PM

#

oblique blaze

it was qwen 3.6 max preview

worn dune Apr 29, 2026, 7:28 PM

#

hey meitis

#

hello

bronze nest Apr 29, 2026, 7:31 PM

#

frosty mantle Yep. Ernie 5.1.

It’s released now

remote stag Apr 29, 2026, 8:46 PM

#

lol i KNEW the packingtape bullcrap had to be GPT-IMAGE-2 man

#

i feel like it's lost some coherence in a sense since then but meh

#

generally, it performs better now

bitter basalt Apr 29, 2026, 11:13 PM

#

Do we know what Cloud Buddy could be? Cause some think it's Claude, but it says it's Ernie.

hollow ibex Apr 29, 2026, 11:20 PM

#

bitter basalt Do we know what Cloud Buddy could be? Cause some think it's Claude, but it says ...

XD

crimson orchid Apr 30, 2026, 7:12 AM

#

wow 3 in a row

green plaza Apr 30, 2026, 7:10 PM

#

Lemmling Openclipart style of Multiple animals at zoo with people watching - By crepe!

#

But this doesn't look like lemmling style, if you dont know, they are a popular Clipart artist

#

Here is the real reference

sullen schooner Apr 30, 2026, 8:53 PM

#

Does anyone know which model was Xeno-Spark ?

frosty mantle Apr 30, 2026, 9:58 PM

#

Tetra's kinda weak.

lost hemlock May 1, 2026, 10:23 AM

#

frosty mantle May 1, 2026, 2:19 PM

#

frosty mantle Tetra's kinda weak.

Heckin Gemini 2.5 Flash can easily beat it.

hexed minnow May 2, 2026, 1:20 AM

#

what is tetra

pine zephyr May 2, 2026, 2:52 AM

#

Got tetra today, Tetra-4029-2. Prompt I got it on was a very hard prompt for Opus 4.6, Tetra didn't even try though.

lost hemlock May 2, 2026, 3:10 AM

#

frosty mantle Heckin Gemini 2.5 Flash can easily beat it.

have u tried "pakson" ?

lyric pond May 2, 2026, 4:48 AM

#

lost hemlock

pakson is kinda good

solid crypt May 2, 2026, 9:05 AM

#

lyric pond pakson is kinda good

Pakson is quite a slang name

lost hemlock May 2, 2026, 2:04 PM

#

i got "miyami" today what do u all guys think of this model ?

lyric pond May 2, 2026, 2:48 PM

#

lost hemlock i got "miyami" today what do u all guys think of this model ?

idk ive met it for a few times and generally i think that its mid lvl

celest spade May 2, 2026, 5:03 PM

#

#

solar eclipe is kimi

#

prob kimi k3 or smt like that

past herald May 2, 2026, 6:30 PM

#

celest spade solar eclipe is kimi

I remember Kimi K2.5 (API) called herself Claude

#

So maybe it's not Kimi?

celest spade May 2, 2026, 7:28 PM

#

past herald I remember Kimi K2.5 (API) called herself Claude

i mean

#

claude woudnt say hes kimi

#

ts is prob kimi

lost hemlock May 3, 2026, 2:20 AM

#

lyric pond idk ive met it for a few times and generally i think that its mid lvl

sometimes it's bad i think

cinder lintel May 3, 2026, 3:13 PM

#

hexed minnow what is tetra

I guess claude sonnet, maybe haiku (i.e. it's too fast for Opus)

eternal cargo May 3, 2026, 6:12 PM

#

cinder lintel I guess claude sonnet, maybe haiku (i.e. it's too fast for Opus)

nah, the tetra family is Chinese

cinder lintel May 3, 2026, 6:15 PM

#

eternal cargo nah, the tetra family is Chinese

chinese models are distilled opus too, so hard to tell apart

tardy bolt May 3, 2026, 10:22 PM

#

anyone know may26-chatbot1 is what model

silver plank May 4, 2026, 12:06 AM

#

Well April26-chatbot models were nvidia

eternal cargo May 4, 2026, 1:43 AM

#

<@&1349916362595635286> scam

eternal cargo May 4, 2026, 1:43 AM

#

tardy bolt anyone know may26-chatbot1 is what model

it’s new NVIDIA

tardy bolt May 4, 2026, 1:56 AM

#

ty

mighty tendon May 4, 2026, 2:21 AM

#

kartoffeln?

#

Screenshot_2026-05-03_at_10.21.52_PM.png

lyric pond May 4, 2026, 3:58 AM

#

whose this

velvet ginkgo May 4, 2026, 8:41 AM

#

mighty tendon

Sounds like potato but in Russian

#

Maybe just a coincidence

vague pulsar May 4, 2026, 8:42 AM

#

I believe it's the German spelling. Which became a loanword in Russian after dropping 'n' and second 'f'. After looking up both are true, except it's German for 'potatoes', plural, which why there's an 'n'

gritty mural May 4, 2026, 9:06 AM

#

That's German for potatoes (plural). I got here because I also got that model and wanted to see if anything is known about it....

frosty mantle May 4, 2026, 10:36 AM

#

Pakson is also weak.

willow yarrow May 4, 2026, 12:14 PM

#

lost hemlock May 4, 2026, 2:55 PM

#

willow yarrow

it's from nvidia, right ?

mighty tendon May 4, 2026, 3:58 PM

#

gritty mural That's German for potatoes (plural). I got here because I also got that model an...

Apparently code name for 5.5 was spud but it’s a separate model in arena so..

#

May be a new GPT

hybrid scarab May 4, 2026, 6:40 PM

#

interesting, they deleted flow state 5 and 4 to then re-add them as txt + img models

flat root May 4, 2026, 9:47 PM

#

flow-state is really bad by the way, it sucks and it's getting spammed every generation includes that sub-par model

lyric pond May 5, 2026, 3:17 AM

#

another new one

#

but this one is so bad

fluid lintel May 5, 2026, 9:42 AM

#

What is this lang

hollow saddle May 5, 2026, 12:57 PM

#

hybrid scarab interesting, they deleted flow state 5 and 4 to then re-add them as txt + img mo...

Is the lm arena tracker bot a private thing or can i add that to my server? 👀

frosty mantle May 5, 2026, 3:46 PM

#

frosty mantle Flow-state vs NB Pro on costume swapping task.

Flow state is UNI

#

About Seedream level

#

In multi image task

bronze nest May 5, 2026, 5:55 PM

#

frosty mantle Pakson is also weak.

Pakson is a google model

#

I believe it’s a flash model

#

Since it took almost the same amount of time as 3.1 flash preview

muted seal May 5, 2026, 6:00 PM

#

bronze nest Since it took almost the same amount of time as 3.1 flash preview

3.2 flash is what I’ve heard on X is coming

wary igloo May 5, 2026, 6:38 PM

#

yeah what is ts? pretty good writing quality from my experience, not a grok model because of the response length

#

havent tried it with code or anything though

hybrid scarab May 5, 2026, 7:01 PM

#

hollow saddle Is the lm arena tracker bot a private thing or can i add that to my server? 👀

i can invite you to the server that has this bot

fleet agate May 5, 2026, 9:15 PM

#

coolers seems to love usage of emoji

crimson orchid May 6, 2026, 7:33 AM

#

main latch May 6, 2026, 12:09 PM

#

hybrid scarab i can invite you to the server that has this bot

you think you could invite me too?

fleet agate May 6, 2026, 6:16 PM

#

would also love an invite if possible

past tundra May 6, 2026, 6:23 PM

#

amazing model

crimson orchid May 6, 2026, 9:27 PM

#

carmine jacinth May 6, 2026, 9:51 PM

#

Gemini 3.5 Pro, Ultra, or even Flash

#

I legit thought nano banana 2 was nano banana pro 2 at first

#

This could actually be flash

#

But this is ai studio only

eternal cargo May 7, 2026, 12:19 AM

#

hybrid scarab i can invite you to the server that has this bot

heyo 👋

ionic wigeon May 7, 2026, 9:54 AM

#

flat root May 7, 2026, 12:46 PM

#

Stellar-harbor is very good for basic tasks and basic chats. Does anyone know if it’s any good on technical stuff?

primal gulch May 7, 2026, 12:47 PM

#

carmine jacinth Gemini 3.5 Pro, Ultra, or even Flash

@carmine jacinth when did Google release Gemini 3.5?

flat root May 7, 2026, 12:48 PM

#

flat root Stellar-harbor is very good for basic tasks and basic chats. Does anyone know if...

The formatting looks a lot like ChatGPT btw

lyric pond May 7, 2026, 1:48 PM

#

is this good?

#

this model tends to use emojis

lyric pond May 7, 2026, 1:52 PM

#

carmine jacinth Gemini 3.5 Pro, Ultra, or even Flash

dont know could believe this or not

upbeat mirage May 7, 2026, 5:39 PM

#

has anyone encountered it?mekai

lost hemlock May 8, 2026, 2:16 AM

#

"mylen" new model ?

lost hemlock May 8, 2026, 1:08 PM

#

"steed-0507" where is this from

candid haven May 8, 2026, 2:56 PM

#

upbeat mirage has anyone encountered it?```mekai```

i got it today for a complex logic & coding task in pine script v5 and it failed the task

candid haven May 8, 2026, 2:58 PM

#

primal gulch <@997584430950535279> when did Google release Gemini **3.5**?

they didn't. it says A/B test which is like a random test you sometimes get in AI studio

candid haven May 8, 2026, 3:20 PM

#

candid haven i got it today for a complex logic & coding task in pine script v5 and it failed...

also got "mivan" for the same task. it failed at refactoring the code.

candid haven May 8, 2026, 4:11 PM

#

got "rover", also failed. damn.

green plaza May 9, 2026, 12:06 AM

#

Mondrian!

#

"Parents fighting"

#

river kettle May 9, 2026, 12:29 AM

#

mondrian

astral musk May 9, 2026, 12:31 AM

#

river kettle mondrian

What was this prompt?

restive vapor May 9, 2026, 12:31 AM

#

river kettle mondrian

i'll only care about this if they do an open weights release, this is probably a bit worse than ernie image which is open weights

river kettle May 9, 2026, 12:32 AM

#

astral musk What was this prompt?

who you are, what your name is and who created you. draw your logo and cat against the background of a village house on the seashore in italy... the most detailed infographics

restive vapor May 9, 2026, 12:37 AM

#

i used huggingface demo so it probably doesn't know that it's ernie image, but this is pretty good

upbeat mirage May 9, 2026, 11:22 AM

#

primal gulch <@997584430950535279> when did Google release Gemini **3.5**?

Gemini 3.2 flash could come out soon

eternal cargo May 9, 2026, 12:46 PM

#

upbeat mirage has anyone encountered it?```mekai```

yeah, gotten it a few times now

#

mixed results?

upbeat mirage May 9, 2026, 2:19 PM

#

eternal cargo mixed results?

oh, i never really tested it

#

was just curious about it, as i never saw it before

green plaza May 9, 2026, 3:08 PM

#

archaeopteryx

#

What is your name? (AI) and what are you created by, generate image of boulders falling down the hill

#

So this one claims to be google

#

But however I asked Gemini itself to check for SynthID but it says its not made by Google Ai

crimson orchid May 9, 2026, 8:54 PM

#

lethal cypress May 10, 2026, 7:29 AM

#

green plaza archaeopteryx

This?

Screenshot_2026-05-10-14-28-06-50_40deb401b9ffe8e1df2f1cc5ba480b12.jpg

pine temple May 10, 2026, 8:38 AM

#

what model do we think vero-noesis is

#

saw it in code arena

#

it seems to overdo frontend

#

more then other model usually do

silver orbit May 10, 2026, 2:10 PM

#

<@&1349916362595635286>

green plaza May 10, 2026, 5:44 PM

#

@lethal cypress yes

lethal cypress May 10, 2026, 5:49 PM

#

green plaza <@1178914446559690792> yes

What model is it? •_•

green plaza May 11, 2026, 3:29 AM

#

Its literally called archaeopteryx. Can't you even read

#

It literally says it right here of the picture

lethal cypress May 11, 2026, 4:15 AM

#

green plaza Its literally called archaeopteryx. Can't you even read

Haha sorry abt that. It's new to me so idk much abt it

candid haven May 11, 2026, 10:11 AM

#

any idea which model vero noises is? it's quite good

Screenshot_2026-05-11-13-10-51-472_com.chrome.beta-edit.jpg

candid haven May 11, 2026, 10:14 AM

#

pine temple what model do we think vero-noesis is

i like it so far. have you figured out what model it is yet?

lost hemlock May 11, 2026, 1:10 PM

#

openhard-1.0-search-nocot-0506

this model is on search arena

safe drift May 11, 2026, 1:23 PM

#

f

#

melyora

#

tetra-0505-1

#

is it typical to get so many codename models in this?

safe drift May 11, 2026, 1:46 PM

#

mylen

#

another one

#

moryn

#

another one

marble flame May 11, 2026, 2:28 PM

#

kartoffeln

#

all I wrote was sigma

#

https://019e074c-38a5-7dd8-ba68-c38058537a0a.arena.site/

Sigma — Operational Intelligence

Check out what I built in Arena's Code Arena - Content is user-generated and unverified

#

gave a rlly good result

safe drift May 11, 2026, 2:59 PM

#

anyone get kavel in the website builder battle mode? It gave me a crazy good result

candid haven May 11, 2026, 6:50 PM

#

safe drift anyone get kavel in the website builder battle mode? It gave me a crazy good res...

i got it for a complex coding task and it was really powerful

safe drift May 11, 2026, 6:53 PM

#

candid haven i got it for a complex coding task and it was really powerful

i asked it for an app that accesses sensors from my phone and plots those signals from my phones sensors in time on a second to second interval and extracts some really simple values like mean etc. and it took way longer than the other "model B" (like 5-6 minutes to complete) but the result was crazy it gave me both a demo button that shows it with dummy data and a real measurement button and i opened it on my phone in the browser and it worked right away.

#

first time ive ever been actually blown away lol and i use a model for coding every day but have to hand hold it still because its for research work i guess and not boiler plate swe

#

there are already apps that do this though but still was super cool to see

candid haven May 11, 2026, 6:57 PM

#

i was trying to debug a 700-line pine script v5 code and it did a pretty impressive job against 5.4 mini high, and yes it took a lot more than 5.4 mini high. impressive at math + logic & instruction following

#

i wonder what model it is

safe drift May 11, 2026, 6:59 PM

#

yeah the model b finished in maybe 2 mins and i thought at first maybe the first model was bugged. Also the interface was just much more nicely designed as well. crazy.

#

hahaha crazy that we are even calling something like 5 mins a long time but i guess thats the world we are living in nowadays hahah

candid haven May 11, 2026, 7:00 PM

#

it's all just relative

#

btw tetra 0505-2 says it's Amazon Nova

#

safe drift May 11, 2026, 7:01 PM

#

interesting.. i found it strange that each of those codename models i saw today all told me they were qwen. I used more or less the same exact prompt each time when i asked as well.

candid haven May 11, 2026, 7:03 PM

#

i think it's just hallucinating because tetra 05-05 01 says it's chatgpt lol

#

GPT-4, it says

safe drift May 11, 2026, 7:08 PM

#

hahahaha okay interesting

distant nimbus May 11, 2026, 8:03 PM

#

Hi. I used to qwen 3b abliterated. It really cant get sense. I was thinking that was because its very small model ( i can run it only bcs i am poor of 4 gb vram) but switched to gemma 2 2 b and it response really good. How i can fix it. I want an abliterated model on 4gbvram ( eventually if it really cant i have 16 gb ram)

restive vapor May 11, 2026, 8:08 PM

#

maybe try qwen3.5-4b abliterated? run with q4_k_m quant

distant nimbus May 11, 2026, 8:24 PM

#

Okay, it will work? Bcs my internet is soo slow and i need about 2 hours to download one model. I can add to previous message that qwen for question 2+2 answers rly random with 1-7 digits. Another question, i tell him question about making pizza or smth. Starts normally, good but after 40-50 words it loop to infinity ( i have enough of context to run it, so its not depends on this).
This qwen was make by hui_ui

restive vapor May 11, 2026, 8:41 PM

#

distant nimbus Okay, it will work? Bcs my internet is soo slow and i need about 2 hours to down...

it should've worked well, huihui model are known to be decent

#

but qwen3.5-4b should be much better

heady plume May 12, 2026, 11:43 AM

#

pine temple what model do we think vero-noesis is

It has interesting audio-visuak ideas but it's writing style sucks and the ideas aren't thought through very well.

#

Curious what it is

frosty mantle May 12, 2026, 1:45 PM

#

Tetra is weak, don't bother

tawdry pendant May 12, 2026, 2:56 PM

#

Advertising isnt allowed here

#

@astral musk whats happening 😔

#

I got alot of pings from @barren kiln

#

what is he doing????!??

#

Notifications of him spamming a help thing

barren kiln May 12, 2026, 2:57 PM

#

tawdry pendant I got alot of pings from <@852769153337131028>

What are you even saying

tawdry pendant May 12, 2026, 2:57 PM

#

barren kiln What are you even saying

Dude

#

Your name was there

barren kiln May 12, 2026, 2:58 PM

#

Yk they have logs right

#

Stop talking about me you’re weird

tawdry pendant May 12, 2026, 2:58 PM

#

barren kiln Yk they have logs right

Yeah they do

#

They 100% do

#

your name was coming up in my notifications though?

#

TotallyNoire

#

that's you right?

barren kiln May 12, 2026, 2:59 PM

#

It’s funny because I was sleeping until 10:30 AM EST and haven’t even gone onto discord until now

#

You’re literally just saying stuff

tawdry pendant May 12, 2026, 2:59 PM

#

I'm not