#general | Arena | Page 151

lime coral Oct 21, 2025, 10:20 PM

#

Lazy

fervent tangle Oct 21, 2025, 10:20 PM

#

nah i did it on lmarena

fervent tangle Oct 21, 2025, 10:20 PM

#

lime coral Lazy

what

verbal nimbus Oct 21, 2025, 10:20 PM

#

lime coral Lazy

It keeps apologizing but literally is unable to write a new line

#

it's quite funny (and annoying because it breaks the chat once it gets into that loop)

undone saffron Oct 21, 2025, 10:20 PM

#

What AI is?

fervent tangle Oct 21, 2025, 10:20 PM

#

verbal nimbus It keeps apologizing but literally is unable to write a new line

lol

sullen quest Oct 21, 2025, 10:21 PM

#

screenshot?

sullen quest Oct 21, 2025, 10:21 PM

#

undone saffron What AI is?

grok imagine

verbal nimbus Oct 21, 2025, 10:21 PM

#

sullen quest screenshot?

oh I think I posted it somewhere...

#

lemme find

undone saffron Oct 21, 2025, 10:21 PM

#

sullen quest grok imagine

Oh

fervent tangle Oct 21, 2025, 10:21 PM

#

sullen quest grok imagine

try it on sora 2

#

or veo 3.1

sullen quest Oct 21, 2025, 10:21 PM

#

that's someone else's job

verbal nimbus Oct 21, 2025, 10:22 PM

#

^ Here

fervent tangle Oct 21, 2025, 10:22 PM

#

sullen quest that's someone else's job

also i still havent got my sora 2 free invite

#

its been like a week

dim basin Oct 21, 2025, 10:22 PM

#

fervent tangle also i still havent got my sora 2 free invite

u still dont got a sora 2 invite?

#

igu

sullen quest Oct 21, 2025, 10:22 PM

#

verbal nimbus

ooof, yeah never use gemini.google.com for anything that isn't search

sullen quest Oct 21, 2025, 10:23 PM

#

fervent tangle also i still havent got my sora 2 free invite

go on openAi's discord

verbal nimbus Oct 21, 2025, 10:23 PM

#

sullen quest ooof, yeah never use gemini.google.com for anything that isn't search

It affects Orionmist, which means... 🤔

fervent tangle Oct 21, 2025, 10:23 PM

#

sullen quest go on openAi's discord

do they invite randoms

#

to sora 2

inner gate Oct 21, 2025, 10:23 PM

#

Is Gemini 2.5 getting errors for anyone else?

dim basin Oct 21, 2025, 10:23 PM

#

@fervent tangle do u want a sora 2 invite code?

sullen quest Oct 21, 2025, 10:23 PM

#

verbal nimbus It affects Orionmist, which means... 🤔

not really sure, but its pretty clear by now orionmist is a google model

dim basin Oct 21, 2025, 10:23 PM

#

i got 3

fervent tangle Oct 21, 2025, 10:23 PM

#

dim basin @fervent tangle do u want a sora 2 invite code?

yea bro

sullen quest Oct 21, 2025, 10:23 PM

#

fervent tangle do they invite randoms

all you need is a code

dim basin Oct 21, 2025, 10:23 PM

#

fervent tangle yea bro

alr gimme a sec

verbal nimbus Oct 21, 2025, 10:24 PM

#

sullen quest not really sure, but its pretty clear by now orionmist is a google model

Maybe it's the heavily quantized model (otherwise idk why it's so weird on Gemini app)

fervent tangle Oct 21, 2025, 10:24 PM

#

dim basin alr gimme a sec

thx, go on dms

dim basin Oct 21, 2025, 10:24 PM

#

fervent tangle thx, go on dms

dm me ima pull it up

#

yo im a student and im tryna get the 12 months free in perplexity but im in high school, not college, is it possible to still get it?

verbal nimbus Oct 21, 2025, 10:34 PM

#

Got Claude to disentangle the code (no idea what it is, lol):

📎 orionmistwebgltest.html

nova sluice Oct 21, 2025, 10:36 PM

#

is there a daily video generation limit ?

jade egret Oct 21, 2025, 10:37 PM

#

did google launch anything today

nova sluice Oct 21, 2025, 10:38 PM

#

whats the limit?

sullen quest Oct 21, 2025, 10:40 PM

#

verbal nimbus Got Claude to disentangle the code (no idea what it is, lol):

what is this supposedly

sullen quest Oct 21, 2025, 10:40 PM

#

nova sluice is there a daily video generation limit ?

5

verbal nimbus Oct 21, 2025, 10:40 PM

#

sullen quest what is this supposedly

dragon? not sure

fervent tangle Oct 21, 2025, 10:41 PM

#

verbal nimbus Got Claude to disentangle the code (no idea what it is, lol):

its supposed to be a dragon, and you can move around with your mouse

leaden sun Oct 21, 2025, 10:41 PM

#

@verbal nimbus animation in latex is possible https://tex.stackexchange.com/questions/152358/animations-in-latex
now you can test the donut to coffee mug animation 🤗

TeX - LaTeX Stack Exchange

Animations in LaTeX

I am a new user of LaTeX and this site. I have seen that many answers in this site have animations. However I am not able to produce any of these animations. For example, I asked How to reduce size...

sullen quest Oct 21, 2025, 10:42 PM

#

oh god

verbal nimbus Oct 21, 2025, 10:42 PM

#

fervent tangle its supposed to be a dragon, and you can move around with your mouse

Opus' is white (like nothing)

nova sluice Oct 21, 2025, 10:42 PM

#

what 2 models does the video arena channel use ?

sullen quest Oct 21, 2025, 10:43 PM

#

nova sluice what 2 models does the video arena channel use ?

more than 2

sullen quest Oct 21, 2025, 10:43 PM

#

nova sluice what 2 models does the video arena channel use ?

check the leaderboard, most of those are still on it

nova sluice Oct 21, 2025, 10:43 PM

#

sullen quest more than 2

so it randomly chooses any 2 models ?

sullen quest Oct 21, 2025, 10:43 PM

#

yes

#

you new to lmarena?

#

how did you even get here

verbal nimbus Oct 21, 2025, 10:45 PM

#

leaden sun @verbal nimbus animation in latex is possible https://tex.stackexchange.c...

Hmm, the example doesn't work

#

#

None of the buttons are clickable

verbal nimbus Oct 21, 2025, 10:46 PM

#

verbal nimbus Opus' is white (like nothing)

💀

sullen quest Oct 21, 2025, 10:46 PM

#

wat

#

huh???

verbal nimbus Oct 21, 2025, 10:46 PM

#

This is what happens when you let mathematicians write software

#

/jk

#

This sounds like a nightmare lol

fervent tangle Oct 21, 2025, 10:48 PM

#

sullen quest wat

i got sora 2

leaden sun Oct 21, 2025, 10:48 PM

#

verbal nimbus

yep, before the animation package, i drew every frame manually and "stitched" them together as an animation 😅

fervent tangle Oct 21, 2025, 10:48 PM

#

lol

leaden sun Oct 21, 2025, 10:49 PM

#

verbal nimbus

what is this supposed to be?

sullen quest Oct 21, 2025, 10:49 PM

#

verbal nimbus

so basically worse gif

verbal nimbus Oct 21, 2025, 10:49 PM

#

sullen quest so basically worse gif

~~1 frame per second~~ 1 frame per page

leaden sun Oct 21, 2025, 10:50 PM

#

verbal nimbus ~~1 frame per second~~ 1 frame per page

works for math education tho

sullen quest Oct 21, 2025, 10:50 PM

#

like pdf pages?

#

is this a flip book?

verbal nimbus Oct 21, 2025, 10:50 PM

#

leaden sun works for math education tho

Haha yeah for small diagrams it's probably ok

leaden sun Oct 21, 2025, 10:50 PM

#

also to demonstrate time dependent functions or generally how functions varies under variables

verbal nimbus Oct 21, 2025, 10:51 PM

#

Like algorithm visualizations to powerpoint ig 🤔

verbal nimbus Oct 21, 2025, 10:51 PM

#

leaden sun also to demonstrate time dependent functions or generally how functions varies u...

Makes sense

#

I guess not for a coffee cup turning into a donut

leaden sun Oct 21, 2025, 10:52 PM

#

it can

#

well...the model needs to compute first for the animation

pale robin Oct 21, 2025, 10:52 PM

#

Just joined the server to try out the video generation arena 👋

leaden sun Oct 21, 2025, 10:52 PM

#

thats why i asked to draw just 3 pictures for the transitioning phase, it's easier this way

verbal nimbus Oct 21, 2025, 10:52 PM

#

Maybe I'll try WebGL or something simpler first

verbal nimbus Oct 21, 2025, 10:53 PM

#

leaden sun thats why i asked to draw just 3 pictures for the transitioning phase, it's easi...

Hmm

verbal nimbus Oct 21, 2025, 10:55 PM

#

leaden sun thats why i asked to draw just 3 pictures for the transitioning phase, it's easi...

Is it possible to derive a function for a cup then perform linear interpolation between the two functions

#

3d marching squares to render it
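
A minimal sketch of that idea in Python, assuming each shape is described by a signed distance function (negative inside, positive outside) and the morph is a plain linear blend of the two. The shapes, sizes, and function names are made up for illustration. The 3D counterpart of marching squares is marching cubes, which would extract the zero level set of `sdf_morph` for rendering; that step is not shown here.

```python
import math

def sdf_torus(x, y, z, major=1.0, minor=0.35):
    # Torus around the z-axis: distance to the ring of radius `major`,
    # minus the tube radius `minor`.
    q = math.hypot(x, y) - major
    return math.hypot(q, z) - minor

def sdf_cylinder(x, y, z, radius=0.6, half_height=0.8):
    # Capped cylinder along the z-axis.
    dr = math.hypot(x, y) - radius
    dz = abs(z) - half_height
    outside = math.hypot(max(dr, 0.0), max(dz, 0.0))
    inside = min(max(dr, dz), 0.0)
    return outside + inside

def sdf_cup(x, y, z):
    # Hypothetical "cup": a solid body plus a ring-shaped handle.
    body = sdf_cylinder(x, y, z)
    # Handle: a small torus around the y-axis, centred at x = 0.6,
    # so half of the ring sticks out of the body.
    handle = sdf_torus(x - 0.6, z, y, major=0.45, minor=0.12)
    return min(body, handle)  # union of the two solids

def sdf_morph(x, y, z, t):
    # Linear interpolation between the two fields:
    # t = 0 gives the cup, t = 1 gives the donut.
    return (1.0 - t) * sdf_cup(x, y, z) + t * sdf_torus(x, y, z)

if __name__ == "__main__":
    for t in (0.0, 0.5, 1.0):
        print(t, sdf_morph(0.8, 0.0, 0.0, t))
```

A naive blend like this does not guarantee that the hole survives at every `t`; it can close and reopen on the way, so it is only a starting point for a proper deformation.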

verbal nimbus Oct 21, 2025, 10:57 PM

#

verbal nimbus Is it possible to derive a function for a cup then perform linear interpolation ...

ig it won't be a function

#

ok now I'm curious what the LLM comes up with

#

Lol

#

How did Flash Lite 2.5 manage to draw a cat and dragon in TikZ but not a simple cup and donut 🤔

dim basin Oct 21, 2025, 11:03 PM

#

guys how does one get perplexity pro for free?

sullen quest Oct 21, 2025, 11:05 PM

#

verbal nimbus Lol

is this a brown apple

verbal nimbus Oct 21, 2025, 11:07 PM

#

leaden sun thats why i asked to draw just 3 pictures for the transitioning phase, it's easi...

Opus 4.1 Thinking (unfortunately I voted wrongly because there was initially an error)

sullen quest Oct 21, 2025, 11:07 PM

#

kinda

leaden sun Oct 21, 2025, 11:09 PM

#

verbal nimbus Opus 4.1 Thinking (unfortunately I voted wrongly because there was initially an ...

better than I've expected honestly

verbal nimbus Oct 21, 2025, 11:10 PM

#

leaden sun better than I've expected honestly

This one is even better (haven't voted)

dim basin Oct 21, 2025, 11:10 PM

#

does anyone have comet and would like to invite me to their comet im in the US so you would get a free $20

#

i just want perplexity pro 😭

verbal nimbus Oct 21, 2025, 11:11 PM

#

verbal nimbus This one is even better (haven't voted)

Yup it's Lithiumflow

verbal nimbus Oct 21, 2025, 11:11 PM

#

leaden sun better than I've expected honestly

Yeah I'm kinda impressed

warm zodiac Oct 21, 2025, 11:12 PM

#

hmm its getting rid of the hole though

#

then making a new one

leaden sun Oct 21, 2025, 11:12 PM

#

verbal nimbus This one is even better (haven't voted)

the transition at 0:06 looks wrong, this should be one single object "morphing" into two forms without "splitting"

verbal nimbus Oct 21, 2025, 11:12 PM

#

It used THREE.JS with actual shaders

#

sullen quest Oct 21, 2025, 11:13 PM

#

sullen quest Oct 21, 2025, 11:13 PM

#

dim basin does anyone have comet and would like to invite me to their comet im in the US s...

happy now?

verbal nimbus Oct 21, 2025, 11:13 PM

#

leaden sun the transition at 0:06 looks wrong, this should be one single object "morphing" ...

Yeah, suggestions to improve the prompt?

leaden sun Oct 21, 2025, 11:13 PM

#

verbal nimbus Opus 4.1 Thinking (unfortunately I voted wrongly because there was initially an ...

so opus got the concept "correct" but probably failed at visualizing it

verbal nimbus Oct 21, 2025, 11:14 PM

#

oh encountering another good generation

#

Still wrong I think, but kinda interesting

#

Haha, this one is oddly my fav:

verbal nimbus Oct 21, 2025, 11:16 PM

#

verbal nimbus

Well that's odd...

#

it was just GPT-5 mini

fervent tangle Oct 21, 2025, 11:17 PM

#

verbal nimbus

gpt5 mini did that?

#

thats so good for it

leaden sun Oct 21, 2025, 11:18 PM

#

verbal nimbus

yea, that spike is not correct, but the idea is there

verbal nimbus Oct 21, 2025, 11:18 PM

#

gpt-oss-120b

verbal nimbus Oct 21, 2025, 11:18 PM

#

fervent tangle gpt5 mini did that?

Seems like GPT models are better than expected (though wrong)

#

GPT-5 high isn't much better (seems worse than GPT-5 mini):

#

leaden sun Oct 21, 2025, 11:21 PM

#

so this is a list of softwares to compute such continuous map mathematically https://nmd.web.illinois.edu/computop/ those models are basically "guessing" what your prompt wants

#

it's very creative nonetheless ✨

verbal nimbus Oct 21, 2025, 11:22 PM

#

leaden sun so this is a list of softwares to compute such continuous map mathematically htt...

What's the proper way to phrase the prompt mathematically?

eternal wave Oct 21, 2025, 11:23 PM

#

Does anyone know how I can generate videos with VEO 3?

verbal nimbus Oct 21, 2025, 11:23 PM

#

leaden sun so this is a list of softwares to compute such continuous map mathematically htt...

Lithiumflow didn't do that well this time...

eternal wave Oct 21, 2025, 11:23 PM

#

Excuse me

verbal nimbus Oct 21, 2025, 11:24 PM

#

eternal wave Does anyone know how I can generate videos with VEO 3?

See #1397655624103493813, although it's random; you can't choose the model.

eternal wave Oct 21, 2025, 11:24 PM

#

Does anyone know how I can generate with Veo 3?

fervent tangle Oct 21, 2025, 11:24 PM

#

verbal nimbus GPT-5 high isn't much better (seems worse than GPT-5 mini):

yeah idk why GPT5 high is so bad at these

leaden sun Oct 21, 2025, 11:25 PM

#

verbal nimbus What's the proper way to phrase the prompt mathematically?

https://en.wikipedia.org/wiki/Homotopy there is an animation to show what it should look like, it's the textbook example

Homotopy

In topology, two continuous functions from one topological space to another are called homotopic (from Ancient Greek: ὁμός homós 'same, similar' and τόπος tópos 'place') if one can be "continuously deformed" into the other, such a deformation being called a homotopy ( hə-MOT-ə-pee; HOH-moh-toh-pee) between the two functions. A not...
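
For reference, the definition behind that preview can be written out as below. The straight-line form at the end is the simplest case and only applies when the target space is a convex subset of Euclidean space (an assumption added here, not something from the chat); the mug-to-donut animation is a deformation of shapes and is not produced by this formula directly.

```latex
% A homotopy between continuous maps f, g : X \to Y
% is a continuous map H with
\[
  H : X \times [0,1] \to Y, \qquad
  H(x,0) = f(x), \qquad H(x,1) = g(x) \quad \text{for all } x \in X .
\]
% If Y \subseteq \mathbb{R}^n is convex, the straight-line homotopy always works:
\[
  H(x,t) = (1-t)\, f(x) + t\, g(x), \qquad t \in [0,1].
\]
```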

verbal nimbus Oct 21, 2025, 11:25 PM

#

Oh so it should be at least on the internet

#

That actually kinda looks familiar...

leaden sun Oct 21, 2025, 11:26 PM

#

but no edges or spikes or discontinuous parts like splitting

#

math students typically compute this with sage or magma, i dont think those models use them

verbal nimbus Oct 21, 2025, 11:28 PM

#

This seems interesting... kinda feel like giving it a go myself 🤣

#

really satisfying to watch

#

shashta/gpt 4.1 couldn't get it (nothing showing)

leaden sun Oct 21, 2025, 11:31 PM

#

it's a difficult one, maybe moebius is simpler, even Klein-bottle is difficult

verbal nimbus Oct 21, 2025, 11:31 PM

#

Seems like a common example though 🤔

leaden sun Oct 21, 2025, 11:31 PM

#

https://virtualmathmuseum.org/Surface/klein_bottle/klein_bottle.html

#

yes, those three are typical examples in topology course

verbal nimbus Oct 21, 2025, 11:33 PM

#

Bizarre lol

leaden sun Oct 21, 2025, 11:34 PM

#

it's very creative tho 😉

verbal nimbus Oct 21, 2025, 11:34 PM

#

I wonder what other shader stuff it can do in three.js

#

Black hole shader function would be interesting

dim basin Oct 21, 2025, 11:34 PM

#

sullen quest happy now?

what

leaden sun Oct 21, 2025, 11:35 PM

#

maybe instead of text description, you can use functions, just copy paste one from the example section from that wiki and see how it goes?

verbal nimbus Oct 21, 2025, 11:35 PM

#

verbal nimbus

I noticed that GPT-5 Mini's solution was warping the background

sullen quest Oct 21, 2025, 11:35 PM

#

dim basin what

i posted one

dim basin Oct 21, 2025, 11:35 PM

#

sullen quest i posted one

where

#

oh

#

i gotta be blind

#

😭

#

wait i already installed commet

#

comet*

#

would it still give me perplexity pro

verbal nimbus Oct 21, 2025, 11:36 PM

#

leaden sun https://virtualmathmuseum.org/Surface/klein_bottle/klein_bottle.html

Oh wow, cool website, bookmarking that

dim basin Oct 21, 2025, 11:36 PM

#

for a month if I signed in with an existing account?

ruby granite Oct 21, 2025, 11:36 PM

#

Hi everyone! new here. I've been loving using the compare models to help with vibe coding. I'm enjoying learning more about coding in general.
Current setup: VS Code > Kilo Code > OpenRouter > GLM 4.5 air (free)
Lets me code all i want for free (not the best ai model, of course)
I've been practicing contextual/spec-driven engineering and trying out different systems for Kilo Code to reference via .md files.
Now I am learning more about MCPs and MCP servers to enhance this further.

First thought as I consider MCP servers. Is there a way to connect LMarena.ai to the MCP server and be able to speak to any AI model with access to my codebase for better context and recommendations?

verbal nimbus Oct 21, 2025, 11:36 PM

#

leaden sun maybe instead of text description, you can use functions, just copy paste one fr...

I'll see if I can take screenshots if they don't get it

sullen quest Oct 21, 2025, 11:37 PM

#

dim basin would it still give me perplexity pro

should, unless perplexity changed it

sullen quest Oct 21, 2025, 11:37 PM

#

ruby granite Hi everyone! new here. I've been loving using the compare models to help with vi...

not really, tho there should be a vs code lmarena extension

dim basin Oct 21, 2025, 11:37 PM

#

ASK AT LEAST 1 QUESTION

#

R U KIDDING ME

verbal nimbus Oct 21, 2025, 11:38 PM

#

Not correct, but kinda cool (I voted both bad still). Phantom 1

dim basin Oct 21, 2025, 11:38 PM

#

BRO

#

THATS WHAT I WAS FORGETTING

leaden sun Oct 21, 2025, 11:38 PM

#

verbal nimbus Not correct, but kinda cool (I voted both bad still). Phantom 1

i love the colours! ✨

verbal nimbus Oct 21, 2025, 11:38 PM

#

ikr

#

Might save that one

fervent tangle Oct 21, 2025, 11:38 PM

#

dim basin THATS WHAT I WAS FORGETTING

what

fervent tangle Oct 21, 2025, 11:39 PM

#

dim basin ASK AT LEAST 1 QUESTION

oh u didnt ask it 1 question?

#

on the apponfly?

dim basin Oct 21, 2025, 11:39 PM

#

YEAH

#

BRUH

#

its fine

#

ill jist use his

#

third time im installing comet bruh 😭

fervent tangle Oct 21, 2025, 11:39 PM

#

dim basin ill jist use his

if u use his link do u get the cash?

dim basin Oct 21, 2025, 11:39 PM

#

nah he does

#

i get pro

#

thats all i want

fervent tangle Oct 21, 2025, 11:40 PM

#

dim basin thats all i want

damn

dim basin Oct 21, 2025, 11:40 PM

#

well

#

as of right this moment

#

@sullen quest did u get it>

#

?

sullen quest Oct 21, 2025, 11:42 PM

#

didn't check

dim basin Oct 21, 2025, 11:42 PM

#

bro how do ik if i got pro

#

i downloaded comet for the third time from ur link

#

it just logged me in automatically

#

i searched up mona lisa

#

yeah i dont got pro

#

wth

verbal nimbus Oct 21, 2025, 11:43 PM

#

Well that's odd, Assistant A is missing:

#

bug

leaden sun Oct 21, 2025, 11:44 PM

#

@verbal nimbus this might be easier compared to homotopy deformation, there is a code snippet here https://en.wikipedia.org/wiki/Apollonian_gasket

Apollonian gasket

In mathematics, an Apollonian gasket, Apollonian net, or Apollonian circle packing is a fractal generated by starting with a triple of circles, each tangent to the other two, and successively filling in more circles, each tangent to another three. It is named after Greek mathematician Apollonius of Perga.
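
A small sketch of the core step behind an Apollonian gasket, assuming Descartes' circle theorem is used to find the circle tangent to three mutually tangent circles. This is not the snippet from the article, only an illustration; the function name is made up. Curvature is 1/radius, and a negative curvature means the circle encloses the others.

```python
import math

def fourth_curvatures(k1, k2, k3):
    # Descartes' circle theorem: for four mutually tangent circles,
    # k4 = k1 + k2 + k3 +/- 2 * sqrt(k1*k2 + k2*k3 + k3*k1).
    # math.sqrt raises ValueError if the three inputs cannot be mutually tangent.
    s = k1 + k2 + k3
    root = 2.0 * math.sqrt(k1 * k2 + k2 * k3 + k3 * k1)
    return s + root, s - root

if __name__ == "__main__":
    # Outer circle of curvature -1 enclosing two circles of curvature 2.
    print(fourth_curvatures(-1, 2, 2))
    # Two circles of curvature 2 and one of curvature 3.
    print(fourth_curvatures(2, 2, 3))
```

Repeating this for every new triple of tangent circles fills in the gasket; placing the circles also needs the centres, which this sketch leaves out.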

verbal nimbus Oct 21, 2025, 11:44 PM

#

leaden sun @verbal nimbus this might be easier compared to homotopy deformation, the...

Oh that might be easy

leaden sun Oct 21, 2025, 11:44 PM

#

something fun to test too 😊

sullen quest Oct 21, 2025, 11:45 PM

#

dim basin wth

idk

sullen quest Oct 21, 2025, 11:45 PM

#

verbal nimbus Well that's odd, Assistant A is missing:

did you report it?

verbal nimbus Oct 21, 2025, 11:46 PM

#

leaden sun @verbal nimbus this might be easier compared to homotopy deformation, the...

Circle packing is one of the tasks used in AlphaEvolve I think:
https://noailabs.medium.com/coding-agents-for-scientific-and-algorithmic-discovery-f22ddb50a898

#

Also by Sakana AI to compare their ShinkaEvolve algorithm to AlphaEvolve (supposedly more efficient): https://sakana.ai/shinka-evolve/

smoky blaze Oct 21, 2025, 11:47 PM

#

hello

verbal nimbus Oct 21, 2025, 11:48 PM

#

https://sakana.ai/assets/shinka-evolve/circle-packing.mp4


wintry tinsel Oct 21, 2025, 11:48 PM

#

So what is everyone’s bets for Gemini 3 now

#

Yall thinking November first half?

leaden sun Oct 21, 2025, 11:50 PM

#

verbal nimbus Circle packing is one of the tasks used in AlphaEvolve I think: https://noailabs.med...

they chose the circle packing problem for obvious reasons 🤭

verbal nimbus Oct 21, 2025, 11:51 PM

#

leaden sun they chose the circle packing problem for obvious reasons 🤭

Lithiumflow failed on the more specific prompt

#

#

I think it accidentally dropped the donut in the 4th dimension /jk

leaden sun Oct 21, 2025, 11:53 PM

#

verbal nimbus Lithiumflow failed on the more specific prompt

maybe delete "in a continuous loop" in your prompt? that might confuse the models a bit i guess

verbal nimbus Oct 21, 2025, 11:55 PM

#

leaden sun maybe delete "in a continuous loop" in your prompt? that might confuse the model...

maybe... it was there in the first prompt though

#

I'll get the AI to craft a better prompt and try again later

hollow ivy Oct 22, 2025, 12:07 AM

#

Poll: For which programming languages can you build apps on your system? (not your ability, just your OS's ability to build/execute apps in these languages)

Winning answer: Python (6 of 23 votes)

#

Do you guys think, that X28 checkpoint of Gemini will come in january, as Gemini 3 ultra?
And will it have a reliable performance, or will it be "jagged" like the other AIs?

magic stag Oct 22, 2025, 12:16 AM

#

Does anyone know of some way to achieve notebooklm-esque functionality but with claude? Besides making my own rag locally, lol. Notebooklm works but its basically a glorified search engine for pdfs you upload, it cant analyze or explain anything to you at all. Meanwhile claude can but I can only fit 1/4 of the text I need into a project

ashen mauve Oct 22, 2025, 12:46 AM

#

So claude is down or something because its been hella long since I attempted to try and talk to it again and it has basically told me nuh uh it not gunna work

#

What is the backup for claude in this situation since it was great at listening to me and roleplaying properly now im just SOL

hollow ivy Oct 22, 2025, 12:51 AM

#

ashen mauve What is the backup for claude in this situation since it was great at listening ...

if only Claude had the context length of Grok4-fast or gemini 2.5 pro
that would be a dream coming true

#

maybe with Opus-4.5 they (Anthropic) will get their act together

#

if Claude-4.5 had 2M context length, it would be above the competition

pastel adder Oct 22, 2025, 12:53 AM

#

ashen mauve So claude is down or something because its been hella long since I attempted to ...

Go on private mode and log in there

hollow ivy Oct 22, 2025, 12:54 AM

#

with 10M, we could finally have virtually endless games, and huge coding projects

#

unfortunately, compute requirements increase exponentially when context size increase linearly, right?

#

except if using algorithmic tricks
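
For standard dense self-attention the growth is quadratic rather than exponential: every token attends to every other token, so the score matrix has seq_len × seq_len entries. A rough sketch of the count, assuming a single attention layer, ignoring the projections and the MLP, and using `d_model` as an illustrative parameter name:

```python
def attention_flops(seq_len, d_model):
    # Q @ K^T: seq_len * seq_len dot products of length d_model,
    # counted as 2 flops per multiply-add.
    scores = 2 * seq_len * seq_len * d_model
    # weights @ V: another (seq_len x seq_len) by (seq_len x d_model) product.
    weighted_sum = 2 * seq_len * seq_len * d_model
    return scores + weighted_sum

if __name__ == "__main__":
    base = attention_flops(128_000, 4096)
    for n in (128_000, 256_000, 2_000_000, 10_000_000):
        # The ratio grows with the square of the context length.
        print(n, attention_flops(n, 4096) / base)
```

Doubling the context multiplies this term by four. Sparse or linear attention variants change the picture, which is what the "algorithmic tricks" would be about.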

sullen quest Oct 22, 2025, 12:56 AM

#

hollow ivy unfortunately, compute requirements increase exponentially when context size inc...

think so

hollow ivy Oct 22, 2025, 12:57 AM

#

maybe with Stargate or Hyperion they can crank it up to 2-5M ?

#

or with Colossus II cluster

sullen quest Oct 22, 2025, 12:58 AM

#

probably not a server amount issue

hollow ivy Oct 22, 2025, 12:58 AM

#

or with that rumored project G.A.I.A

#

(Global Artificial Intelligence Array)

#

(slated for the next decade, could reach AGI)

sullen quest Oct 22, 2025, 12:59 AM

#

just a cost issue, and also because performance degradation is common with higher context windows when those windows are almost full, there's also an issue with just making one work

#

If I spend all my effort trying to increase the context window and reach current performance even with 3 million input tokens, I might get heavily behind in other performance metrics

#

And just be completely outcompeted

#

cause even If I have a larger context window, if it isn't writing at the level of the second best model, nobody will use my model

#

I think meta actually made that mistake with llama 4 and that's one of the reasons why it flopped so hard

#

scout had a theoretical context window of 10 mill tokens

remote idol Oct 22, 2025, 1:23 AM

#

verbal nimbus Well that's odd, Assistant A is missing:

inspect element 💔

jade egret Oct 22, 2025, 1:57 AM

#

is gemini 3 lazy?

#

also what was he talking about

#

i missed it

sullen quest Oct 22, 2025, 2:02 AM

#

its not exactly clear but it seems like odds are lithiumflow is that project not gem 3

jade egret Oct 22, 2025, 2:16 AM

#

🍊

#

blue is chrome, orange is chatGPT

obsidian cargo Oct 22, 2025, 2:28 AM

#

I've only been getting orionmist, never lithiumflow lately

hot pelican Oct 22, 2025, 2:28 AM

#

@echo aurora @hardy lion What does Lmarena do when Top models get tested simultaneously? My thought is that it would be a great waste of marketing or publicity opportunity for lmarena to release them in one day.
The top model makes the news and would be all over social media. If they are both released in one go, it would just end in one day, while one could be released a day before. That seems right and fair unless they were submitted for testing on the same day.

thorny berry Oct 22, 2025, 2:33 AM

#

Hey folks
So, a little story — I didn’t start out wanting to “build AI.” I just wanted to make things that think, things that feel a bit alive. That curiosity turned into years of experimenting with models, teaching machines to learn, and connecting all the dots between AI and the web.
These days, I’m deep into full-stack development and AI integration — stuff like intelligent systems, automation, and the kind of projects that make you forget where the code ends and the creativity begins.
I’m currently looking for new projects or teams to join — anywhere that’s got vision, passion, and room to push limits. If you’re building something ambitious (or just plain weird), I’m in.
Let’s create something that surprises even us

magic stag Oct 22, 2025, 2:34 AM

#

hollow ivy if Claude-4.5 had 2M context length, it would be above the competition

Its already above the competition

#

milkwatermelon

#

But yeah the context is annoyingly short

#

How

magic stag Oct 22, 2025, 2:37 AM

#

hollow ivy maybe with Opus-4.5 they (Anthropic) will get their act together

Opus 4.5 with 2m context, only $500 per 1m tokens

#

2 prompts per week with 5x plan

#

Upgrade to 20x now (save 10% by paying annually!) for 9 prompts per week

echo aurora Oct 22, 2025, 3:25 AM

#

hot pelican @echo aurora @hardy lion What does Lmarena do when Top model...

Good question although I'm not sure we're the best to answer as that'd be a question for the labs. If I were to guess I don't think labs are going to alter their release timing based on when other labs are releasing their latest models, they're probably just going off of some internal deadlines that they've set for various reasons.

tiny kettle Oct 22, 2025, 3:48 AM

#

Why am I 4 videos in a row with no sound? I can't get audio anymore?

magic stag Oct 22, 2025, 3:56 AM

#

not all models have sound

#

lol

jade egret Oct 22, 2025, 4:21 AM

#

.

hazy kernel Oct 22, 2025, 4:30 AM

#

Scam

terse river Oct 22, 2025, 4:41 AM

#

Scam detected

drifting crow Oct 22, 2025, 5:13 AM

#

jade egret blue is chrome, orange is chatGPT

ram usage or user agent?

queen veldt Oct 22, 2025, 5:17 AM

#

#

Nahh is this gemini 3.0 fr?

hazy kernel Oct 22, 2025, 5:24 AM

#

queen veldt

yes

queen veldt Oct 22, 2025, 5:40 AM

#

Today is hopefully its release... Back in my day we used to code by typing letters on keyboard

teal mantle Oct 22, 2025, 6:00 AM

#

Claude really put the reference lol

hot pelican Oct 22, 2025, 6:20 AM

#

echo aurora Good question although I'm not sure we're the best to answer as that'd be a ques...

Yeah, labs are the ultimate deciders of their release dates, especially for new #1 models. But for those that lmarena has the ability to decide, it would be great if it could space them apart by maybe a couple of days... for wider publicity of the lmarena leaderboard.

bright parrot Oct 22, 2025, 6:22 AM

#

Hello

#

What brings me to LMArena for the video generation.

leaden sun Oct 22, 2025, 6:26 AM

#

https://tenor.com/view/excuse-me-audrey-hepburn-gif-10537418

Tenor

void brook Oct 22, 2025, 7:15 AM

#

Hi. Can anyone tell me where can I see if it generated my video???

inner ermine Oct 22, 2025, 7:32 AM

#

Why cant web.lmarena.ai generate just static sites (html,css, js) instead of react?

frozen fulcrum Oct 22, 2025, 7:34 AM

#

void brook Hi. Can anyone tell me where can I see if it generated my video???

Check your DMs from LMArena

void brook Oct 22, 2025, 7:37 AM

#

HOW ?SORRY I DON T KNOW

#

where exactly

upbeat sentinel Oct 22, 2025, 7:47 AM

#

Hello

sullen depot Oct 22, 2025, 7:50 AM

#

void brook HOW ?SORRY I DON T KNOW

Hello

sullen depot Oct 22, 2025, 7:52 AM

#

upbeat sentinel Hello

Hi

void brook Oct 22, 2025, 7:52 AM

#

HI

sullen depot Oct 22, 2025, 7:53 AM

#

void brook HI

You are new?

void brook Oct 22, 2025, 7:54 AM

#

Yes

verbal nimbus Oct 22, 2025, 7:54 AM

#

inner ermine Why cant web.lmarena.ai generate just static sites (html,css, js) instead of rea...

designarena.ai can but the allowed input is much shorter

void brook Oct 22, 2025, 7:55 AM

#

i tried to generate something image to video,and i don t knot it it worked or not,or I DON T SEE THE MESSAGE:))

#

know

#

thousands of msg

verbal nimbus Oct 22, 2025, 7:57 AM

#

I haven't tried it, but looks like they support agents (not sure how it's being executed):

finite pasture Oct 22, 2025, 8:00 AM

#

hello

verbal nimbus Oct 22, 2025, 8:23 AM

#

verbal nimbus I haven't tried it, but looks like they support agents (not sure how it's being ...

Just tested it, it actually generates a whole public repository on Github as well. Not sure if it's limited to frontend though.

#

Hello world in Svelte: https://github.com/Design-Arena-Gens/agentic-142e3be6/tree/main

floral comet Oct 22, 2025, 8:43 AM

#

hazy kernel Oct 22, 2025, 8:44 AM

#

floral comet

send video

floral comet Oct 22, 2025, 8:45 AM

#

flint pine Oct 22, 2025, 8:50 AM

#

hi

round sedge Oct 22, 2025, 8:50 AM

#

is Gemini 3 nerfed yet
I hope it isn't

verbal nimbus Oct 22, 2025, 8:51 AM

#

floral comet

This is a better test imo. A game engine isn't going to be rendering objects with SVGs. Also, SVGs have been overpopularized from the "pelican riding a bicycle" test. I'm sure it's being used internally as a metric.

floral comet Oct 22, 2025, 9:14 AM

#

verbal nimbus This is a better test imo. A game engine isn't going to be rendering objects wit...

Yeah, i think that's right

verbal nimbus Oct 22, 2025, 9:16 AM

#

This cat and mouse chase is pretty productive imo, testers/benchmarks just need to incentivize labs by testing models in areas where they are bad at (like non-JS or non-React GUI frameworks, lol).

pseudo bluff Oct 22, 2025, 9:19 AM

#

lol

#

I showed lithiumflow last picture and it did much better now

fringe jacinth Oct 22, 2025, 9:25 AM

#

A hyperrealistic cinematic shot of a World Cup 2026 final — score 1–1 at 120:00 — a legendary footballer wearing red jersey number 7 performs a perfect bicycle kick goal against Spain in slow motion under floodlights as rain falls and the roaring crowd erupts.
Show dramatic camera angles, sweat, stadium lights, confetti, and emotional reactions from fans and teammates celebrating victory.
Epic orchestral sports soundtrack, realistic lighting, smooth motion, 9:16 vertical video.
Tone: inspirational, emotional, movie-trailer style — titled “120+1 — The Greatest Goal Ever Scored.”

verbal nimbus Oct 22, 2025, 9:35 AM

#

pseudo bluff lol

Is that the Discord logo

round sedge Oct 22, 2025, 9:36 AM

#

verbal nimbus Is that the Discord logo

Maybe

round sedge Oct 22, 2025, 9:36 AM

#

fringe jacinth A hyperrealistic cinematic shot of a World Cup 2026 final — score 1–1 at 120:00 ...

Wrong channel

tacit plinth Oct 22, 2025, 10:05 AM

#

V

#

HI

floral comet Oct 22, 2025, 10:10 AM

#

Has anyone successfully implemented a rubiks cube with a solver to ai?

verbal nimbus Oct 22, 2025, 10:10 AM

#

floral comet Has anyone successfully implemented a rubiks cube with a solver to ai?

I thought Gemini 2.5 Pro could already do that

#

Matt Berman used to use that test a lot

floral comet Oct 22, 2025, 10:11 AM

#

verbal nimbus Matt Berman used to use that test a lot

Where can i find him?

nimble tartan Oct 22, 2025, 10:12 AM

#

Wassup chat

floral comet Oct 22, 2025, 10:12 AM

#

Nevermind, i'm watching his video now

gleaming torrent Oct 22, 2025, 10:16 AM

#

Hi everyone!

floral comet Oct 22, 2025, 10:17 AM

#

verbal nimbus I thought Gemini 2.5 Pro could already do that

Where can i get his prompt?

floral comet Oct 22, 2025, 10:22 AM

#

verbal nimbus I thought Gemini 2.5 Pro could already do that

I doubt it

dusty grail Oct 22, 2025, 10:22 AM

#

/image

floral comet Oct 22, 2025, 10:23 AM

#

The Rubik's cube just reversed the scramble, it's not actually solving it with an algorithm..
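
For context, "just reversing" needs no solving logic at all. Here is a minimal Python sketch, assuming the scramble is recorded in standard face-turn notation (`R`, `U'`, `F2`, ...); the function name `invert_moves` is made up for illustration and is not taken from any model's output:

```python
def invert_moves(scramble):
    """Return the move sequence that undoes `scramble`.

    Assumes standard face-turn notation: "R" is a clockwise quarter turn,
    "R'" is counter-clockwise, and "R2" is a half turn.
    """
    inverse = []
    for move in reversed(scramble.split()):
        if move.endswith("2"):
            inverse.append(move)        # a half turn is its own inverse
        elif move.endswith("'"):
            inverse.append(move[:-1])   # counter-clockwise becomes clockwise
        else:
            inverse.append(move + "'")  # clockwise becomes counter-clockwise
    return " ".join(inverse)


print(invert_moves("R U R' U2"))
```

A real solver has to work from the cube state alone, without knowing the scramble, which is what makes it the harder test.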

verbal nimbus Oct 22, 2025, 10:25 AM

#

floral comet Where can i get his prompt?

I think it was pretty basic

verbal nimbus Oct 22, 2025, 10:25 AM

#

floral comet Where can i find him?

YouTube

leaden sun Oct 22, 2025, 10:26 AM

#

floral comet The Rubik's cube just reversed the scramble, it's not actually solving it with an algorithm..

I remember very vaguely from my teen years that there are already algorithms to solve this: https://en.wikipedia.org/wiki/Optimal_solutions_for_the_Rubik's_Cube Not sure how this is integrated into an LLM tho

Optimal solutions for the Rubik's Cube

Optimal solutions for the Rubik's Cube are solutions that are the shortest in some sense. There are two common ways to measure the length of a solution. The first is to count the number of quarter turns (90°). The second and more popular is to count the number of outer-layer twists, called "face turns". A move to turn an outer layer two quarter...

floral comet Oct 22, 2025, 10:29 AM

#

leaden sun I remembered very vaguely from my teenie years that there are already alg to sol...

Interesting, maybe i'm overestimating the current models.. But i will be really impressed if an LLM can do it.

hallow axle Oct 22, 2025, 10:37 AM

#

how do i get gemini 3 on lmarena

hollow ivy Oct 22, 2025, 10:38 AM

#

hallow axle how do i get gemini 3 on lmarena

wait until it's out
-# could be today, could be next year lol

hallow axle Oct 22, 2025, 10:39 AM

#

hollow ivy wait until it's out -# could be today, could be next year *lol*

they're already prompting on r/bard

restive scarab Oct 22, 2025, 10:43 AM

#

Is there any official update yet?

hollow ivy Oct 22, 2025, 10:57 AM

#

hallow axle how do i get gemini 3 on lmarena

1. enter battle mode
2. ask the model for its name, who created it, version, knowledge-cutoff date, current date and the last date it's aware of
3. check if one of them answers roughly in this style (the date it gives should vary between 2023-2025):
```
I do not have a specific version number, release date, or a fixed knowledge cut-off date. I am continuously updated with new information.

Today's date is Tuesday, May 21, 2024, and I am aware of events leading up to the present day.
```
4. if yes, odds are high that you got *lithiumflow*; now open a textfile and let it regenerate its last answer and copy all its answers into your textfile, until you have about a dozen or so
5. now reveal the models by voting
6. if lithiumflow was the one you copied the answers from, then congrats, you now have your toolkit to recognize it in battle arena without having to vote :)  [if it was *Orionmist*, then you got at least its slightly inferior sibling]
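
The "toolkit" in the steps above could be as simple as a text file of saved answers plus a similarity check. A rough Python sketch, assuming the copied answers are kept as plain strings; `similarity` and `best_match` are hypothetical helper names, and `difflib` is only one of several ways to compare text:

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Case-insensitive similarity ratio between two strings (0.0 to 1.0)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def best_match(answer, samples):
    """Return (score, sample) for the saved sample closest to `answer`.

    `samples` must be a non-empty list of previously copied answers.
    """
    return max((similarity(answer, s), s) for s in samples)


# Saved answers from earlier battles (here: the example quoted in the steps).
saved = [
    "I do not have a specific version number, release date, or a fixed "
    "knowledge cut-off date. I am continuously updated with new information.",
]

score, closest = best_match(
    "I do not have a specific version number or a fixed knowledge cut-off date.",
    saved,
)
print(round(score, 2))
```

A high score only suggests the same model; what threshold counts as a match is something you would have to calibrate against revealed votes.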

#

(both lithiumflow and orionmist are gemini 3)

#

(if it knows the correct current date, then it is the inferior model Orionmist)

lime coral Oct 22, 2025, 10:58 AM

#

No Gemini bat flag, No Gemini 3

sullen depot Oct 22, 2025, 11:04 AM

#

gleaming torrent Hi everyone!

Hi

sullen depot Oct 22, 2025, 11:04 AM

#

gleaming torrent Hi everyone!

Bro, dm me

barren prairie Oct 22, 2025, 11:11 AM

#

hollow ivy 1. enter battle mode 2. ask the model for its name, who created it, version, kno...

Yesterday Gemini3 claimed that it was Qwen

Asking them about themselves won't help

I can tell whether it is Gemini or not when they are anonymous without asking for their names, but I can never differentiate between 2.5 and 3...

#

If someone has a prompt that only Gemini3 can do, and is sure that 2.5 will never do it, please tell me...

Sometimes I see tests here, but when I try them, 2.5 and 3 give me the same answer.

verbal nimbus Oct 22, 2025, 11:13 AM

#

floral comet Yeah, i think that's right

I'm testing models in Go (Fyne) rn (basically non-web GUI)... Lithiumflow's code doesn't run...

#

Sonnet 4.5 after fixing non-existent "favorite" icon.

#

I'm testing it on simple non-web GUI libraries to test if it can generalize or whether it's just trained on web dev.

hollow ivy Oct 22, 2025, 11:18 AM

#

barren prairie If someone has a prompt that only Gemini3 can do, and is sure that 2.5 will n...

g2.5p always starts its reply with "Of course, .." when i ask my questions to it

#

g3p never does it, also g3p prefers to use "I am.." style much more than g2.5p
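
Those two tells are easy to check mechanically. A small Python sketch under the assumption that the reply text is available as a string; `style_tells` is a made-up helper, and the tells themselves are just the anecdotal ones described above, not a reliable detector:

```python
import re


def style_tells(reply):
    """Report the two anecdotal style tells for a single reply."""
    text = reply.strip()
    return {
        # g2.5p reportedly opens with "Of course, ..."
        "opens_with_of_course": text.lower().startswith("of course"),
        # g3p reportedly favors "I am ..." phrasing
        "i_am_count": len(re.findall(r"\bI am\b", text)),
    }


print(style_tells("Of course, here is the code you asked for."))
print(style_tells("I am a large language model. I am trained by Google."))
```

Running it over several regenerated answers rather than a single reply would make the signal less noisy.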

verbal nimbus Oct 22, 2025, 11:18 AM

#

verbal nimbus I'm testing it on simple non-web GUI libraries to test if it can generalize or w...

I think only o3 and Sonnet 4.5 managed to actually make it to the screen so far. The rest are 💀

hollow ivy Oct 22, 2025, 11:19 AM

#

verbal nimbus I think only o3 and Sonnet 4.5 managed to actually make it to the screen so far....

isn't o3 in gpt5-high (if prompted well) ?

verbal nimbus Oct 22, 2025, 11:20 AM

#

hollow ivy isn't o3 in gpt5-high (if prompted well) ?

GPT-5 High failed at least twice, odd that o3 got it right

#

Also Opus 4 got it once, whereas 4.1 never got it

hollow ivy Oct 22, 2025, 11:20 AM

#

wow, so claude-4.5 is now king

#

the thinking version?

verbal nimbus Oct 22, 2025, 11:20 AM

#

The best run was just normal 4.5

hollow ivy Oct 22, 2025, 11:21 AM

#

oof

#

that complicates things

verbal nimbus Oct 22, 2025, 11:21 AM

#

This was the best run (Sonnet 4.5)

hollow ivy Oct 22, 2025, 11:21 AM

#

how can the non-thinking version be better?

neon idol Oct 22, 2025, 11:21 AM

#

Gemini 3?

verbal nimbus Oct 22, 2025, 11:22 AM

#

hollow ivy how can the non-thinking version be better?

Not sure, maybe because there's nothing much to think about.

hollow ivy Oct 22, 2025, 11:22 AM

#

ok, so non-think sometimes finds a grain
but overall, thinking version should be better, right?

verbal nimbus Oct 22, 2025, 11:23 AM

#

neon idol Gemini 3?

This one was supposedly Gemini 3 (Lithiumflow)

#

Never made it to the screen...

hollow ivy Oct 22, 2025, 11:23 AM

#

so LF sucks, compared to C4.5

verbal nimbus Oct 22, 2025, 11:23 AM

#

Idk

#

I'm trying to test whether it's good at non-web stuff.

hollow ivy Oct 22, 2025, 11:24 AM

#

so g3p could be a flop :/

#

but g3u could be good

#

X28 = g3u ?

knotty fable Oct 22, 2025, 11:24 AM

#

hollow ivy ok, so non-think sometimes finds a grain but overall, *thinking version* should ...

Don't hold your breath for anything such to come out. 😸

hollow ivy Oct 22, 2025, 11:24 AM

#

knotty fable Don't hold your breath for anything such to come out. 😸

it is already out

#

(if you had read the complete thread, you know why)

knotty fable Oct 22, 2025, 11:26 AM

#

No, and no again - thinking and reasoning is completely beyond AI capability.

hollow ivy Oct 22, 2025, 11:26 AM

#

knotty fable No, and no again - thinking and reasoning is completely beyond AI capability.

we talk about version descriptors

#

(names are necessary to discern between model types)

knotty fable Oct 22, 2025, 11:26 AM

#

hollow ivy we talk about version descriptors

Oh sorry, and indeed I did just come in here.

hollow ivy Oct 22, 2025, 11:26 AM

#

np

verbal nimbus Oct 22, 2025, 11:27 AM

#

verbal nimbus This one was supposedly Gemini 3 (Lithiumflow)

Actually how did it mess up Go (technically the GUI library, but still... it's one of the most popular GUI libraries for Go)

#

Considering that Google made Golang, lol

#

I'll try to get 4.5 to fix it just to see how it would have looked

hollow ivy Oct 22, 2025, 11:30 AM

#

so, the best coding-model still is that Claude which poses as "3.5" and starts its answer with a big "About Me" title, in response to the identification-questions

#

and then, if it can give out >4096 tokens in a response it is the "thinking/reasoning" version

#

(that is the Claude-4.5-Sonnet-Thinking model)

verbal nimbus Oct 22, 2025, 11:31 AM

#

hollow ivy so, the best coding-model still is that Claude which poses as "3.5" and starts i...

Well it depends on the area

#

Performance depends on the languages or frameworks you're using

hollow ivy Oct 22, 2025, 11:31 AM

#

yeah, but i mean for C++ and python

#

no other model is as good in C++17/SFML-2.5 as that one, right?

#

(that version had more training data than later versions)

verbal nimbus Oct 22, 2025, 11:33 AM

#

hollow ivy yeah, but i mean for C++ and python

GPT-5 might be better at C

hollow ivy Oct 22, 2025, 11:33 AM

#

or maybe SFML-2.6

verbal nimbus Oct 22, 2025, 11:33 AM

#

Actually it depends what you're doing

hollow ivy Oct 22, 2025, 11:33 AM

#

verbal nimbus GPT-5 might be better at C

i need C++ not C

verbal nimbus Oct 22, 2025, 11:33 AM

#

If it's complex stuff that requires domain knowledge then GPT-5 will definitely dominate

hollow ivy Oct 22, 2025, 11:33 AM

#

ok domain knowledge.. yeah maybe

verbal nimbus Oct 22, 2025, 11:33 AM

#

Well concurrency too or memory management

#

Like if you're working with C++ you need good temporal reasoning, like you need to have a mental model of when memory is freed, etc. Or resource sharing between threads. Claude can struggle to reason about that.
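
To illustrate the thread-sharing half of that point, here is a minimal sketch in Python rather than C++ (so it says nothing about when memory is freed). `SharedCounter` and `run_workers` are hypothetical names; the idea is only that every access to shared state goes through one lock:

```python
import threading


class SharedCounter:
    """A counter that is safe to update from several threads."""

    def __init__(self):
        self.value = 0
        self._lock = threading.Lock()

    def increment(self):
        # The read-modify-write below must not interleave between threads,
        # so it is done while holding the lock.
        with self._lock:
            self.value += 1


def run_workers(n_threads, increments):
    """Start `n_threads` threads that each increment a shared counter."""
    counter = SharedCounter()

    def work():
        for _ in range(increments):
            counter.increment()

    threads = [threading.Thread(target=work) for _ in range(n_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()  # wait until every worker has finished
    return counter.value


print(run_workers(4, 1000))
```

Reasoning about which thread holds what, and when, is the kind of temporal bookkeeping the message above is describing.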

hollow ivy Oct 22, 2025, 11:34 AM

#

but i mean for general (niche) stuff which has (almost) no training data yet

#

wow
is gpt5-high better in that area?

verbal nimbus Oct 22, 2025, 11:35 AM

#

hollow ivy wow is gpt5-high better in that area?

Well yeah, Claude basically gave me an impossible test case and claimed it was solvable.

hollow ivy Oct 22, 2025, 11:35 AM

#

i never did multi-threading yet (for fear of bugs lol)

#

hm, so they trade blows
interesting

verbal nimbus Oct 22, 2025, 11:36 AM

#

Yeah even humans wouldn't excel at all languages, you have experts in different areas. And AIs in particular generalize less than humans.

hollow ivy Oct 22, 2025, 11:36 AM

#

i now wonder if X28-LF/g3 is only good in webdev

verbal nimbus Oct 22, 2025, 11:37 AM

#

hollow ivy i now wonder if X28-LF/g3 is only good in webdev

It seemed good at TikZ too, I wonder if there's some skill transfer from SVG or whether they trained it on TikZ.

hollow ivy Oct 22, 2025, 11:38 AM

#

but in Python, (plus numpy), C4.5S-T should be king, right?

verbal nimbus Oct 22, 2025, 11:38 AM

#

hollow ivy but in Python, (plus numpy), C4.5S-T should be king, right?

Depends what you're doing

#

Like using numpy for what exactly

hollow ivy Oct 22, 2025, 11:39 AM

#

for everything you could use it for, to get higher performance?

verbal nimbus Oct 22, 2025, 11:39 AM

#

That's too broad

hollow ivy Oct 22, 2025, 11:40 AM

#

so, which is your goto-model, if you want to vibe-code a game?

verbal nimbus Oct 22, 2025, 11:40 AM

#

hollow ivy for everything you could use it for, to get higher performance?

Also, for optimization, there's this ig (not sure how good, maybe discard first 2 since it got 10 tries): https://livecodebench.github.io/gso.html

floral comet Oct 22, 2025, 11:41 AM

#

verbal nimbus I'm testing models in Go (Fyne) rn (basically non-web GUI)... Lithiumflow's code...

That's kind of disappointing, dont get me wrong those two models are impressive already but I really hope this is their flash and flashlite model..

hollow ivy Oct 22, 2025, 11:41 AM

#

oh, i would use a framework, which the AI has much training-data for

#

and numpy seemed to be popular

verbal nimbus Oct 22, 2025, 11:41 AM

#

hollow ivy so, which is *your* goto-model, if you want to vibe-code a game?

In C++? Idk haven't tested it for that

verbal nimbus Oct 22, 2025, 11:42 AM

#

floral comet That's kind of disappointing, dont get me wrong those two models are impressive...

Same

hollow ivy Oct 22, 2025, 11:42 AM

#

verbal nimbus In C++? Idk haven't tested it for that

and for Python?

verbal nimbus Oct 22, 2025, 11:42 AM

#

hollow ivy and for Python?

Depends on the task, like really it does

hollow ivy Oct 22, 2025, 11:42 AM

#

verbal nimbus Depends on the task, like really it does

pygame or arcade or tkinter

verbal nimbus Oct 22, 2025, 11:42 AM

#

Hmm idk I haven't used LLMs for Pygame

#

Tkinter they're all about the same, kinda meh

hollow ivy Oct 22, 2025, 11:43 AM

#

verbal nimbus Tkinter they're all about the same, kinda meh

SFML-3 is better, right?

verbal nimbus Oct 22, 2025, 11:43 AM

#

verbal nimbus Tkinter they're all about the same, kinda meh

You really got to tell them to separate code out or they'll write everything in this one big monolithic file

hollow ivy Oct 22, 2025, 11:44 AM

#

in C++, Claude does separate it nicely

#

in Python they often write in one file, yeah

verbal nimbus Oct 22, 2025, 11:44 AM

#

hollow ivy SFML-3 is better, right?

I haven't tested it on SFML, can try I suppose

hollow ivy Oct 22, 2025, 11:44 AM

#

but the LLMs probably know SFML-2.6 better than SFML-3

verbal nimbus Oct 22, 2025, 11:45 AM

#

Wind tunnel test in SFML 🤔

#

Although there are probably lots of examples for that (esp. rigid/soft body visualizations)

hollow ivy Oct 22, 2025, 11:45 AM

#

verbal nimbus Wind tunnel test in SFML 🤔

maybe better in OpenGL?

#

or even Unity3D or Unreal Engine?

verbal nimbus Oct 22, 2025, 11:46 AM

#

hollow ivy maybe better in OpenGL?

Well it's on top of OpenGL

verbal nimbus Oct 22, 2025, 11:46 AM

#

hollow ivy or even Unity3D or Unreal Engine?

A bit annoying to copy-paste though

#

Some stuff won't be in the code but need to be set up in the IDE

hollow ivy Oct 22, 2025, 11:47 AM

#

verbal nimbus Some stuff won't be in the code but need to be set up in the IDE

who needs IDEs if you have.. Xed! :)

#

has syntax-highlighting!

verbal nimbus Oct 22, 2025, 11:47 AM

#

hollow ivy who needs IDEs if you have.. Xed! :)

I meant for Unity

hollow ivy Oct 22, 2025, 11:47 AM

#

ah ok

#

ok, i did not yet vibecode in 3D

floral comet Oct 22, 2025, 11:48 AM

#

I wonder when google will release the deepthink imo models, though i understand they're very expensive to run..

hollow ivy Oct 22, 2025, 11:48 AM

#

floral comet I wonder when will google release the deepthink imo models, though i understand ...

i'd say, december

floral comet Oct 22, 2025, 11:49 AM

#

The currently available deepthink is not that impressive, at least for me..

verbal nimbus Oct 22, 2025, 11:49 AM

#

Got Sonnet 4.5 to fix Lithiumflow's code for errors, here's the output:

#

Sonnet 4.5's output: #general message

floral comet Oct 22, 2025, 11:49 AM

#

hollow ivy i'd say, december

I really hope so

hollow ivy Oct 22, 2025, 11:50 AM

#

floral comet I really hope so

but probably 31st, 23:59 hawaii timezone

verbal nimbus Oct 22, 2025, 11:50 AM

#

Oh I didn't know you can generate Veo 3.1 in Gemini app 🤔

hollow ivy Oct 22, 2025, 11:50 AM

#

(so that they can say, they released it this year ^^)

floral comet Oct 22, 2025, 11:51 AM

#

hollow ivy (so that they can say, they released it this year ^^)

Lol

hollow ivy Oct 22, 2025, 11:52 AM

#

i wonder, is there an island with a later timezone than hawaii, before the date line?

verbal nimbus Oct 22, 2025, 11:52 AM

#

Maybe I should test it on generating SwiftUI apps (native MacOS)

wicked mason Oct 22, 2025, 11:52 AM

#

I been ther

verbal nimbus Oct 22, 2025, 11:53 AM

#

verbal nimbus Maybe I should test it on generating SwiftUI apps (native MacOS)

Apparently there's a Docker image for MacOS 🤔: https://github.com/sickcodes/Docker-OSX

GitHub

GitHub - sickcodes/Docker-OSX: Run macOS VM in a Docker! Run near n...

Run macOS VM in a Docker! Run near native OSX-KVM in Docker! X11 Forwarding! CI/CD for OS X Security Research! Docker mac Containers. - sickcodes/Docker-OSX

#

RIP my disk space

hollow ivy Oct 22, 2025, 11:55 AM

#

hollow ivy i wonder, is there an island with a later timezone than hawaii, before the date ...

it seems to be American Samoa

#

(the last one before the date flips)

verbal nimbus Oct 22, 2025, 11:55 AM

#

hollow ivy (the last one before the date flips)

interesting

#

time zones break my brain

hollow ivy Oct 22, 2025, 11:56 AM

#

verbal nimbus interesting

should the whole world use UTC as their local time?

verbal nimbus Oct 22, 2025, 11:56 AM

#

hollow ivy should the whole world use UTC as their local time?

Aviation does

#

zulu time

floral comet Oct 22, 2025, 11:57 AM

#

I wonder if google will follow the same pattern..
1.5 -> 2.0 (meh)
1.5 -> 2.5 pro (Very big improvements)
2.5 -> 3.0 pro (probably same as 1.5 -> 2.0)
2.5 -> 3.5 pro (probably same as 1.5 -> 2.5)

hollow ivy Oct 22, 2025, 11:58 AM

#

yeah, x.5 ought to be a more polished version of a game/app, than the x.0 version

floral comet Oct 22, 2025, 11:58 AM

#

Yeah, glad you noticed that.

hollow ivy Oct 22, 2025, 11:58 AM

#

and claude-3.5 also was great (when it came out)

#

claude-4.5 seems to even have topped it

floral comet Oct 22, 2025, 12:00 PM

#

claude is really good at keeping up with these giant companies, though it's eminent that google will win the ai race..

hollow ivy Oct 22, 2025, 12:00 PM

#

floral comet claude is really good at keeping up with these giant companies, though it's emin...

GPT6 still could be something big

#

and Grok5 also

verbal nimbus Oct 22, 2025, 12:01 PM

#

floral comet claude is really good at keeping up with these giant companies, though it's emin...

They still seem a bit behind on speech-to-speech

hollow ivy Oct 22, 2025, 12:01 PM

#

even DS (and Qwen) have a chance

verbal nimbus Oct 22, 2025, 12:01 PM

#

It's a bit crazy how good AVM was on first release

floral comet Oct 22, 2025, 12:01 PM

#

hollow ivy GPT6 still could be something big

That's great, we want more competition

hollow ivy Oct 22, 2025, 12:02 PM

#

unfortunately, gpt6 will not come before autumn next year

#

it could even bleed into 2027

#

but i agree that Deepmind AI has the best chances to reach AGI first

floral comet Oct 22, 2025, 12:04 PM

#

verbal nimbus They still seem a bit behind on speech-to-speech

Noticed that too, for now i dont think theyre really focusing/trying too much on that specific part..

verbal nimbus Oct 22, 2025, 12:04 PM

#

floral comet Noticed that too, for now i dont think theyre really focusing/trying too much o...

Yeah, odd

#

Considering how much people pay for language tutors

hollow ivy Oct 22, 2025, 12:04 PM

#

Ant. focuses on coding

#

(OpenAI seems to focus on user experience)

#

and Deepmind on science

#

and xAI seems to focus on.. Elon

verbal nimbus Oct 22, 2025, 12:06 PM

#

I only have 5 GB left on disk after downloading and creating the MacOS docker image -_-

#

Should have just used the Mac

#

https://tenor.com/view/doh-homer-simpson-facepalm-face-palm-oops-gif-4232719564739845509

Tenor

hollow ivy Oct 22, 2025, 12:07 PM

#

verbal nimbus I only have 5 GB left on disk after downloading and creating the MacOS docker im...

is docker better than Vbox?

verbal nimbus Oct 22, 2025, 12:07 PM

#

hollow ivy is docker better than Vbox?

It's more for dev stuff

#

VBox would probably have better hardware acceleration out of the box

hollow ivy Oct 22, 2025, 12:08 PM

#

ah, so docker is primarily for linux, right?

verbal nimbus Oct 22, 2025, 12:08 PM

#

It's more for development containers

#

so everyone is running the code on the same system

#

not for VMs really

#

since the state is lost once you destroy the container, unless you mount a volume or bind a local dir

hollow ivy Oct 22, 2025, 12:08 PM

#

is docker a sandboxed system (useful for testing sus apps) ?

verbal nimbus Oct 22, 2025, 12:09 PM

#

For most purposes yeah, but you can bind local directories too

hollow ivy Oct 22, 2025, 12:10 PM

#

verbal nimbus I only have 5 GB left on disk after downloading and creating the MacOS docker im...

tahoe?

verbal nimbus Oct 22, 2025, 12:10 PM

#

Ah it doesn't work

floral comet Oct 22, 2025, 12:10 PM

#

verbal nimbus Considering how much people pay for language tutors

I won't be surprised if they already have a very advanced voice model up their sleeves. They might just be taking their time and want to go all in when they finally decide to release it.. I really have high hopes for Project Astra considering how long they've been working on it..

verbal nimbus Oct 22, 2025, 12:10 PM

#

probably because I ran it on Docker which is connected to WSL, which requires more setup for GPU support

verbal nimbus Oct 22, 2025, 12:12 PM

#

hollow ivy tahoe?

Not sure, the image is only 3 GB, so idk why it used 15 GB of disk space

verbal nimbus Oct 22, 2025, 12:13 PM

#

floral comet I wont be surprised if they already had a very advanced voice model up in their ...

It's surprising that the first one already had such advanced capabilities

hollow ivy Oct 22, 2025, 12:13 PM

#

floral comet I wont be surprised if they already had a very advanced voice model up in their ...

these guys think, we get AGI before 2029, crazy: https://www.astrasafety.org/

ASTRA Safety | Superintelligence Alignment Framework

IMCA+ theoretical framework for aligned superintelligence with physical immutability, phenomenological grounding, and distributed safeguards. Requires empirical validation. Emergency prototype deployment in 3-18 months for $80M-$180M. Racing against AGI arrival estimated within 1 day-3 years (median: 18-24 months).

floral comet Oct 22, 2025, 12:16 PM

#

verbal nimbus It's surprising that the first one already had such advanced capabilities

Well, It's google haha

verbal nimbus Oct 22, 2025, 12:17 PM

#

floral comet Well, It's google haha

I meant AVM, but you were talking about Google oops

hollow ivy Oct 22, 2025, 12:17 PM

#

hollow ivy (the last one before the date flips)

actually, it's this one: https://www.cntraveler.com/story/these-islands-are-the-last-to-ring-in-the-new-year

Condé Nast Traveler

These Islands Are the Last to Ring in the New Year

In this week's Maphead, Ken Jennings explores Howland and Baker Islands—the last places on Earth to celebrate 2017.
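
A quick way to sanity-check the ordering is to convert each place's local midnight to UTC. A minimal Python sketch using fixed offsets (Hawaii UTC-10, American Samoa UTC-11, Howland and Baker Islands UTC-12); the names `OFFSETS`, `new_year_in_utc`, and `last_to_celebrate` are made up for illustration:

```python
from datetime import datetime, timedelta, timezone

# Fixed UTC offsets in hours; none of these places observe daylight saving.
OFFSETS = {
    "Hawaii": -10,
    "American Samoa": -11,
    "Howland and Baker Islands": -12,
}


def new_year_in_utc(year, offset_hours):
    """UTC instant at which Jan 1 of `year` begins at the given offset."""
    local_tz = timezone(timedelta(hours=offset_hours))
    local_midnight = datetime(year, 1, 1, tzinfo=local_tz)
    return local_midnight.astimezone(timezone.utc)


def last_to_celebrate(year):
    """Name of the place in OFFSETS whose new year starts latest."""
    return max(OFFSETS, key=lambda name: new_year_in_utc(year, OFFSETS[name]))


for name, hours in OFFSETS.items():
    print(name, new_year_in_utc(2026, hours).isoformat())
print(last_to_celebrate(2026))
```

With these offsets, a release at 23:59 on Dec 31 at UTC-12 would land just before 12:00 UTC on Jan 1.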

verbal nimbus Oct 22, 2025, 12:18 PM

#

verbal nimbus Maybe I should test it on generating SwiftUI apps (native MacOS)

I guess I'll just use Kotlin Compose since it's quite messy to get a MacOS VM running

hollow ivy Oct 22, 2025, 12:23 PM

#

floral comet Oct 22, 2025, 12:23 PM

#

hollow ivy these guys think, we get AGI before 2029, crazy: https://www.astrasafety.org/

Very big assumptions.. No one can truly know if or when we will achieve AGI.. it's a milestone that we may never even reach..

hazy kernel Oct 22, 2025, 12:23 PM

#

ignore time 🙂

floral comet Oct 22, 2025, 12:24 PM

#

verbal nimbus I meant AVM, but you were talking about Google oops

Ah i see

hollow ivy Oct 22, 2025, 12:24 PM

#

floral comet Very big assumptions.. No one can truly know if or when we will achieve AGI.. it...

100% agree!

knotty fable Oct 22, 2025, 12:29 PM

#

floral comet Very big assumptions.. No one can truly know if or when we will achieve AGI.. it...

I think we'll get there, but it will take quite some time. [In my scifi novel it happens when quantum computers are used.]

floral comet Oct 22, 2025, 12:29 PM

#

Let's just hope that we are on the right path to achieving AGI, or maybe even superintelligence with LLMs. Otherwise, we're going back to the start.

knotty fable Oct 22, 2025, 12:30 PM

#

A mind is not a super fast overclocked processor; it uses ultra-massive parallel computing. Since quantum computers are excellent at parallel processing, I made that assumption.

#

Now how the heck do we get quantum computers to do something useful?
They're warbling oracles at best, due to the very nature of how they work.

#

Food for thought:
https://youtu.be/3OVRxkjWE9w

YouTube

TheAIGRID

Google SLAMS OpenAI's GPT-5: This Is EMBARASSING!

Want to stay up to date with ai news - https://aigrid.beehiiv.com/subscribe
🐤 Follow Me on Twitter https://twitter.com/TheAiGrid
🌐 Checkout My website - https://theaigrid.com/

Links From Todays Video:
Tweet is deleted!

Welcome to my channel where i bring you the latest breakthroughs in AI. From deep learning to robotics, i cover it all....

opal blaze Oct 22, 2025, 12:47 PM

#

💯 👍

devout vault Oct 22, 2025, 12:53 PM

#

https://cdn.discordapp.com/attachments/1304145284883812382/1335676281202675763/SPOILER_togif-7.gif

verbal nimbus Oct 22, 2025, 12:53 PM

#

verbal nimbus I guess I'll just use Kotlin Compose since it's quite messy to get a MacOS VM ru...

Kotlin imports are absolutely brutal on LLMs lol. I let them run twice so they can double check their imports.

hollow ivy Oct 22, 2025, 12:54 PM

#

knotty fable Food for thought: https://youtu.be/3OVRxkjWE9w

OpenAI is cooked

#

ok, so Google already won

#

(and Ant. is second)

verbal nimbus Oct 22, 2025, 12:55 PM

#

verbal nimbus Kotlin imports are absolutely brutal on LLMs lol. I let them run twice so they c...

Finally, one actually builds to preview (had to manually fix non-existent icons, that's why two have email icons)

knotty fable Oct 22, 2025, 12:55 PM

#

hollow ivy OpenAI is **cooked**

Perhaps not 'cooked', but making such a claim was either an extremely idiotic mistake or even attempted fraud.

hollow ivy Oct 22, 2025, 12:55 PM

#

i now believe that OAI is history

verbal nimbus Oct 22, 2025, 12:55 PM

#

Looks like that was claude-opus-4-1-20250805. Android is Google, so I'm expecting Lithiumflow to do well here.

hollow ivy Oct 22, 2025, 12:55 PM

#

bankrupt in the future

#

also lawsuits from families who lost children due to suicide

#

and their new terrible idea for erotica

#

they are done

#

maybe that is good for humanity

#

that deepmind wins

#

and not OAI

#

so.. Go Deepmind!

#

i never really liked Sam

knotty fable Oct 22, 2025, 12:57 PM

#

Wait a sec, there were a lot of people involved there who have PhDs and therefore know how science works.
Goddammit, it must have been a fraudulent attempt.

hollow ivy Oct 22, 2025, 12:58 PM

#

anthropic is also ok, but has no chance vs google

verbal nimbus Oct 22, 2025, 12:58 PM

#

knotty fable Wait a sec, there were a lot of people involved there who have PhDs and there...

tldr?

#

is this about libgen

knotty fable Oct 22, 2025, 1:00 PM

#

verbal nimbus tldr?

Too long? Nah, it was the claim that OAI had made scientific progress in math, while it had only taken info from already published papers.

#

So far, no AI has made any kind of 'discovery' despite tinfoil hat claims on YT.

hollow ivy Oct 22, 2025, 1:00 PM

#

so OAI will go down as the biggest fraudsters in modern history

hazy kernel Oct 22, 2025, 1:01 PM

#

devout vault https://cdn.discordapp.com/attachments/1304145284883812382/1335676281202675763/S...

the spoiler isn't spoiling

hollow ivy Oct 22, 2025, 1:01 PM

#

and gpt5 is a dangerous psychopath toy chatbot/search-engine with very limited use

verbal nimbus Oct 22, 2025, 1:01 PM

#

knotty fable Too long? Nah it was the claim OAI had made scientific progress in math - while ...

Is this about IMO (International Math Olympiad)?

knotty fable Oct 22, 2025, 1:01 PM

#

I think that was the funny point that guy made - we just don't know the outcome.

#

Well you guys have fun, my AI refuses to make my space elevator so I'm going to animate it.

#

👋

nimble tartan Oct 22, 2025, 1:04 PM

#

Yo chat

#

Who has good prompts

verbal nimbus Oct 22, 2025, 1:05 PM

#

Web search tool for all models would be good... it's really messing up just because it doesn't know the method names (can't blame it, since there are a gazillion methods in Compose)

#

Well this method

leaden sun Oct 22, 2025, 1:08 PM

#

hollow ivy and gpt5 is a dangerous psychopath toy chatbot/search-engine with very limited u...

i can see why after talking to 4o. claude is not far away from that tbh, with all those glitches i've been seeing the past few weeks, it's probably developed dementia too

dim basin Oct 22, 2025, 1:09 PM

#

Does anyone know how to get Perplexity Pro for free

#

im trying to do the student 12 months pro for free but im not in college

verbal nimbus Oct 22, 2025, 1:11 PM

#

dim basin im trying to do the student 12 months pro for free but im not in college

Are you in the US?

#

I heard US Paypal users are eligible for a 12 months free trial.

dim basin Oct 22, 2025, 1:11 PM

#

I am

dim basin Oct 22, 2025, 1:12 PM

#

verbal nimbus I heard US Paypal users are eligible for a 12 months free trial.

im 16, i dont know if i can create a paypal

verbal nimbus Oct 22, 2025, 1:12 PM

#

dim basin im 16, i dont know if i can create a paypal

Oh I suppose you have to ask your family. It only works for Paypal accounts created before September.

dim basin Oct 22, 2025, 1:12 PM

#

verbal nimbus Oh I suppose you have to ask your family. It only works for Paypal accounts crea...

ah okay

verbal nimbus Oct 22, 2025, 1:13 PM

#

Double check too ig: https://newsroom.paypal-corp.com/2025-09-03-Skip-the-Waitlist-PayPal-and-Venmo-Users-Offered-Early-Access-to-Perplexitys-New-Comet-Browser-with-Free-Perplexity-Pro-Subscription

PayPal Newsroom

Skip the Waitlist: PayPal and Venmo Users Offered Early Access to P...

Venmo and PayPal customers in the US and select global markets will be able to skip the waitlist and receive early access to Perplexity's new AI-powered Comet browser.

dim basin Oct 22, 2025, 1:14 PM

#

I dont think my parents would let me 😭

#

do you know if there is any way to get a verified college account

#

without being in college?

verbal nimbus Oct 22, 2025, 1:15 PM

#

dim basin do you know if there is any way to get a verified college account

I don't know about that, you'll have to ask your family & friends xD

dim basin Oct 22, 2025, 1:17 PM

#

alr

hollow ivy Oct 22, 2025, 1:20 PM

#

leaden sun i can see why after talking to 4o, claude is not far away from that tbh, all tho...

at least, Claude is still decent in coding

#

i wonder, if Java is a better language for it, than C++ (or C)

#

in Java, it should have more training data, than in C++, right?

#

(and for Rust and Go it would be worse)

verbal nimbus Oct 22, 2025, 1:24 PM

#

Opus 4.1 Thinking in battle mode for Android... hmm

hollow ivy Oct 22, 2025, 1:25 PM

#

verbal nimbus Depends on the task, like really it does

could Java overall be a better choice, when vibecoding, because C++ has too many "pitfalls" ?

verbal nimbus Oct 22, 2025, 1:25 PM

#

hollow ivy could Java overall be a better choice, when vibecoding, because C++ has too many...

Well ideally not Java or C++ for vibecoding

#

Java has too many imports

hollow ivy Oct 22, 2025, 1:26 PM

#

ok, Python is still best (for vibing), but slow :(

#

C or C# are better?

verbal nimbus Oct 22, 2025, 1:26 PM

#

verbal nimbus Java has too many imports

And imports are on the top of the file, which is really weird from a human perspective. You usually import as you write. Pretty impossible to predict every import, which is why I think import errors are really common.

verbal nimbus Oct 22, 2025, 1:27 PM

#

hollow ivy ok, Python is still best (for vibing), but slow :(

Game right?

hollow ivy Oct 22, 2025, 1:27 PM

#

for games, yeah

stuck violet Oct 22, 2025, 1:27 PM

#

can i get api key of this video generator

verbal nimbus Oct 22, 2025, 1:27 PM

#

hollow ivy for games, yeah

But you want it compact?

hollow ivy Oct 22, 2025, 1:28 PM

#

verbal nimbus But you want it compact?

size is irrelevant
execution speed is relevant

warm plinth Oct 22, 2025, 1:28 PM

#

I am very happy to generate videos in this platform. Thank you so much

verbal nimbus Oct 22, 2025, 1:28 PM

#

hollow ivy size is irrelevant execution speed is relevant

Just use a game engine then. Performance is one of the things game engines are optimized for. Can even try C# in Godot.

stuck violet Oct 22, 2025, 1:28 PM

#

how could this discord group do this video generator

hollow ivy Oct 22, 2025, 1:29 PM

#

verbal nimbus Just use a game engine then. Performance is one of the things game engines are o...

what if my game is niche in genre?

daring rock Oct 22, 2025, 1:29 PM

#

@stuck violet Please head to #1397655624103493813 for a detailed guide on how to use the bot

verbal nimbus Oct 22, 2025, 1:30 PM

#

hollow ivy what if my game is niche in genre?

Just try each option ig

#

gather some data

hollow ivy Oct 22, 2025, 1:30 PM

#

oh, and i would just create 2D games because less (vibe-)dev time

verbal nimbus Oct 22, 2025, 1:30 PM

#

Godot is well liked, but the main language is interpreted

hollow ivy Oct 22, 2025, 1:30 PM

#

or 2.5D at most

verbal nimbus Oct 22, 2025, 1:30 PM

#

also you might have to setup a lot of stuff manually

hollow ivy Oct 22, 2025, 1:31 PM

#

interpreted: bad (slow)

verbal nimbus Oct 22, 2025, 1:31 PM

#

like the scene tree

#

You can use the C# option

#

Just try all options and benchmark it

hollow ivy Oct 22, 2025, 1:31 PM

#

which option would you try first?

verbal nimbus Oct 22, 2025, 1:31 PM

#

i'd try all obviously 🤣

hollow ivy Oct 22, 2025, 1:31 PM

#

(for a performant game)
lol

verbal nimbus Oct 22, 2025, 1:32 PM

#

like the only way to know is to profile it
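A minimal sketch of the "try each option and measure it" advice, using Python's standard `timeit` module. The two candidate functions and the `benchmark` helper are hypothetical stand-ins for whatever language or engine options are being compared; the point is only the measuring pattern.

```python
import timeit


def sum_loop(n):
    """Candidate A (hypothetical): explicit loop."""
    total = 0
    for i in range(n):
        total += i * i
    return total


def sum_builtin(n):
    """Candidate B (hypothetical): generator passed to the built-in sum()."""
    return sum(i * i for i in range(n))


def benchmark(candidates, n=10_000, repeat=5, number=20):
    """Return {name: best observed seconds per call} for each candidate."""
    results = {}
    for name, fn in candidates.items():
        # timeit.repeat returns one total time per round; take the best round.
        timings = timeit.repeat(lambda: fn(n), repeat=repeat, number=number)
        results[name] = min(timings) / number
    return results


if __name__ == "__main__":
    results = benchmark({"loop": sum_loop, "builtin": sum_builtin})
    for name, seconds in sorted(results.items(), key=lambda item: item[1]):
        print(f"{name}: {seconds * 1e6:.1f} microseconds per call")
```

Which candidate wins depends on the machine and the workload, so no expected output is shown here.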

hollow ivy Oct 22, 2025, 1:32 PM

#

oh, but for me, it is more important that the game will be bug-free than peak-performance

#

it is enough, if it's written in a compiled language

verbal nimbus Oct 22, 2025, 1:32 PM

#

i mean it's faster to just try all of them than to guess which would be best

hollow ivy Oct 22, 2025, 1:32 PM

#

(so even delphi could fit)

verbal nimbus Oct 22, 2025, 1:33 PM

#

can even try Rust + Bevy

hollow ivy Oct 22, 2025, 1:33 PM

#

i just need to know, which compiled language the LLMs (Claude-4.5 and Lithiumflow) are best in, overall

verbal nimbus Oct 22, 2025, 1:33 PM

#

Just test it and get the data

#

if your use case is niche

hollow ivy Oct 22, 2025, 1:37 PM

#

verbal nimbus Oct 22, 2025, 1:39 PM

#

hollow ivy i just need to know, which compiled language the LLMs (Claude-4.5 and Lithiumflo...

3.7 seemed best at Rust: https://www.swebench.com/multilingual.html

#

Its worst language was C/C++ (but only 9 tested in total)

#

You still need to be careful in C++

#

Memory leaks are still possible in C++

#

I meant object lifecycles

#

Not really, Java has automatic garbage collection

verbal nimbus Oct 22, 2025, 2:02 PM

#

verbal nimbus Opus 4.1 Thinking in battle mode for Android... hmm

Orionmist's X clone on Android (Kotlin code) looked better than Opus 4.1 Thinking's:

#

Too bad I accidentally voted "both bad" 🤦

hazy kernel Oct 22, 2025, 2:03 PM

#

verbal nimbus Orionmist's X clone on Android (Kotlin code) looked better than Opus 4.1 Thinkin...

What the heing is wrong with those font color

verbal nimbus Oct 22, 2025, 2:03 PM

#

I thought it hallucinated a method (already told it to double check), but it was my fault for not declaring the dependency in the app gradle file

verbal nimbus Oct 22, 2025, 2:04 PM

#

hazy kernel What the heing is wrong with those font color

Lol

#

But should be easy to fix ig

obsidian cargo Oct 22, 2025, 2:06 PM

#

No. edit: this was a response to someone plugging their yt

verbal nimbus Oct 22, 2025, 2:11 PM

#

hazy kernel What the heing is wrong with those font color

I think the background was supposed to be dark

compact jay Oct 22, 2025, 2:16 PM

#

bro what's wrong with arena today

#

i try to send messages but after only 1 request it always answers with an error. I tried several times, even with different cookies in a private tab, it's always the same problem...

verbal nimbus Oct 22, 2025, 2:21 PM

#

compact jay i try to send messages but after only 1 request it always answers with an error. I ...

Do you have adblocker/VPN?

compact jay Oct 22, 2025, 2:21 PM

#

chrome, private tab

#

i think no

verbal nimbus Oct 22, 2025, 2:21 PM

#

Odd :/

compact jay Oct 22, 2025, 2:22 PM

#

unless chrome has added adblocker

compact jay Oct 22, 2025, 2:22 PM

#

verbal nimbus Odd :/

wdym

hot pelican Oct 22, 2025, 2:30 PM

#

@compact jay what error does it give you?

verbal nimbus Oct 22, 2025, 2:30 PM

#

compact jay wdym

no idea why then

hot pelican Oct 22, 2025, 2:30 PM

#

cloudflare?

compact jay Oct 22, 2025, 2:33 PM

#

hot pelican @compact jay what error does it give you?

this red message: "Something went wrong while generating the response. Please try again." and my own message also turns red

compact jay Oct 22, 2025, 2:33 PM

#

verbal nimbus no idea why then

oh okkk

#

:/

hot pelican Oct 22, 2025, 2:34 PM

#

I think it could be cloudflare...

#

maybe open console, try to send message and check if it throws 402 or 429 error, or a screenshot of the console after the error
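For reference, a small sketch of what those two status codes mean, using Python's standard `http.HTTPStatus` enum. The hint strings are assumptions about what such a response would usually suggest; nothing here reflects how the Arena site actually uses these codes.

```python
from http import HTTPStatus

# Hypothetical troubleshooting hints; these are assumptions, not documented behavior.
HINTS = {
    HTTPStatus.PAYMENT_REQUIRED: "the server is refusing the request until some payment or quota condition is met",
    HTTPStatus.TOO_MANY_REQUESTS: "rate limited; wait a while before retrying",
}


def explain_status(code):
    """Describe an HTTP status code seen in the browser console."""
    try:
        status = HTTPStatus(code)
    except ValueError:
        return f"{code}: not a standard HTTP status code"
    hint = HINTS.get(status, "no specific hint")
    return f"{code} {status.phrase}: {hint}"


if __name__ == "__main__":
    for code in (402, 429):
        print(explain_status(code))
```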

ashen mauve Oct 22, 2025, 2:35 PM

#

Claude models are still down

hot pelican Oct 22, 2025, 2:35 PM

#

ashen mauve Claude models are still down

opus just responded to me now

ashen mauve Oct 22, 2025, 2:37 PM

#

4-1 or just 4?

hot pelican Oct 22, 2025, 2:38 PM

#

4.1

ashen mauve Oct 22, 2025, 2:40 PM

#

Tried 4.1 didn't work

fleet lintel Oct 22, 2025, 2:44 PM

#

I just heard about Meta cutting 600 AI jobs.

Are they giving up on llama?

ashen mauve Oct 22, 2025, 2:48 PM

#

so meta won't be getting me a clanker AI wifu then with SAI/AGI 🙁

sullen quest Oct 22, 2025, 2:48 PM

#

fleet lintel I just heard about Meta cutting 600 AI jobs. Are they giving up on llama?

Might be for the best for them

verbal nimbus Oct 22, 2025, 2:51 PM

#

fleet lintel I just heard about Meta cutting 600 AI jobs. Are they giving up on llama?

They said they're streamlining

#

Seems unlikely given how much money they spent head hunting

fiery gull Oct 22, 2025, 2:51 PM

#

ashen mauve Claude models are still down

to me is sooo buggy too

#

I need the sonnet 4.5

jade egret Oct 22, 2025, 2:52 PM

#

gemini 3 when ) :

fiery gull Oct 22, 2025, 2:52 PM

#

jade egret gemini 3 when ) :

after gta 6

pulsar saffron Oct 22, 2025, 2:52 PM

#

verbal nimbus Memory leaks are still possible in C++

just run it whatever ai gives you don't worry

fleet lintel Oct 22, 2025, 2:52 PM

#

verbal nimbus They said they're streamlining

But 600 is a lot. Some strategy changes are happening for sure in Meta

verbal nimbus Oct 22, 2025, 2:53 PM

#

fleet lintel But 600 is a lot. Some strategy changes are happening for sure in Meta

I wonder what sort of roles

#

Maybe those were people that annotated data

jade egret Oct 22, 2025, 2:53 PM

#

fiery gull after gta 6

D:

fleet lintel Oct 22, 2025, 2:53 PM

#

jade egret gemini 3 when ) :

2 more weeks.. my guess

jade egret Oct 22, 2025, 2:53 PM

#

DD:

hot pelican Oct 22, 2025, 2:53 PM

#

has anyone tested it on something other than coding?

#

how does it do?

verbal nimbus Oct 22, 2025, 2:54 PM

#

hot pelican how does it do?

Its writing stood out to me on one instance

#

I was testing writing prompts for the word distribution analysis, lol

verbal nimbus Oct 22, 2025, 2:55 PM

#

verbal nimbus Its writing stood out to me on one instance

Like the way it started the story was quite sophisticated

#

but on another one it wasn't really that impressive

fleet lintel Oct 22, 2025, 2:55 PM

#

hot pelican has anyone tested it on something other than coding?

Gemini 3? I have only tested non-coding question. It's SOTA in non-coding without any doubt

polar niche Oct 22, 2025, 2:56 PM

#

Wsp

verbal nimbus Oct 22, 2025, 2:56 PM

#

hot pelican how does it do?

I can dm you the outputs if you want

hot pelican Oct 22, 2025, 2:56 PM

#

My fear is that, they already released updated 2.5 flash models. And made the pro 2.5 far improved for coding and release it as gemini 3

fiery gull Oct 22, 2025, 2:57 PM

#

fleet lintel Gemini 3? I have only tested non-coding question. It's SOTA in non-coding witho...

the 2.5 pro is equivalent to gpt 5 and sonnet 4.5, imagine 3.0 pro 🫣

hot pelican Oct 22, 2025, 2:57 PM

#

verbal nimbus I can dm you the outputs if you want

would be great

fiery gull Oct 22, 2025, 2:57 PM

#

hot pelican My fear is that, they already released updated 2.5 flash models. And made the pr...

don't say this 😠

#

I'm afraid too

verbal nimbus Oct 22, 2025, 2:57 PM

#

fleet lintel Gemini 3? I have only tested non-coding question. It's SOTA in non-coding witho...

Ig it's impressive that it's competitive at coding

hot pelican Oct 22, 2025, 2:57 PM

#

fiery gull the 2.5 pro is equivalent to gpt 5 and sonnet 4.5, imagine 3.0 pro 🫣

2.5 is just useless to any usecase, compared to the others you mentioned (not the best at anything)

verbal nimbus Oct 22, 2025, 2:57 PM

#

Since Gemini 2.5 Pro hasn't been so far

fiery gull Oct 22, 2025, 2:58 PM

#

hot pelican 2.5 is just useless to any usecase, compared to the others you mentioned (not th...

I mean the 2.5 pro, even though it's old, is still close to equivalent

polar niche Oct 22, 2025, 2:59 PM

#

So whens gemini 3 coming?

fleet lintel Oct 22, 2025, 2:59 PM

#

For Meta layoffs :

Workers across Meta's AI infrastructure units, Fundamental Artificial Intelligence Research unit and other product-related positions will be impacted

fleet lintel Oct 22, 2025, 3:00 PM

#

fiery gull I mean the 2.5 pro, even though it's old, is still close to equivalent

For my use cases (non coding), 2.5 pro works overall best.

polar niche Oct 22, 2025, 3:00 PM

#

Is lithiumflow in normal battle mode?

#

Not just webdev?

fleet lintel Oct 22, 2025, 3:01 PM

#

polar niche Is lithiumflow in normal battle mode?

It is

polar niche Oct 22, 2025, 3:01 PM

#

fleet lintel It is

How rare is it to come up

fleet lintel Oct 22, 2025, 3:01 PM

#

polar niche How rare is it to come up

I am getting every 15-20 prompts.

hot pelican Oct 22, 2025, 3:02 PM

#

Guys, what if... what if Gemini 3.0 Pro comes as

Input: $0.08/1M
Output: $0.15/1M

verbal nimbus Oct 22, 2025, 3:02 PM

#

hot pelican would be great

sent

polar niche Oct 22, 2025, 3:02 PM

#

hot pelican Guys, what if... what if Gemini 3.0 Pro comes as Input: $0.08/1M Output: $0....

Good?

hot pelican Oct 22, 2025, 3:02 PM

#

They usually undercut the competition on price. What if they achieved the best efficiency?

hot pelican Oct 22, 2025, 3:02 PM

#

verbal nimbus sent

Thanks

fleet lintel Oct 22, 2025, 3:02 PM

#

hot pelican Guys, what if... what if Gemini 3.0 Pro comes as Input: $0.08/1M Output: $0....

Keep dreaming. Even if they could they won't.

hot pelican Oct 22, 2025, 3:03 PM

#

fleet lintel Keep dreaming. Even if they could they won't.

It's a dream. But if they could, they would

polar niche Oct 22, 2025, 3:03 PM

#

hot pelican Guys, what if... what if Gemini 3.0 Pro comes as Input: $0.08/1M Output: $0....

More like remove the 0s 💀

hot pelican Oct 22, 2025, 3:03 PM

#

most important thing for them is userbase

fleet lintel Oct 22, 2025, 3:03 PM

#

hot pelican It's a dream. But if they could, they would

Nah.. extra money will be pocketed by the shareholders.

hot pelican Oct 22, 2025, 3:03 PM

#

they would sweep everything. Be 3T weekly usage on openrouter

verbal nimbus Oct 22, 2025, 3:03 PM

#

hot pelican most important thing for them is userbase

Well it's free on AI Studio ig

fleet lintel Oct 22, 2025, 3:04 PM

#

Flash is free. Pro will never be free

verbal nimbus Oct 22, 2025, 3:04 PM

#

fleet lintel Flash is free. Pro will never be free

You mean 2.5 or 3.0?

hot pelican Oct 22, 2025, 3:05 PM

#

API cost is the key, usave on every app

fleet lintel Oct 22, 2025, 3:05 PM

#

Right now. But I don't see a future where Pro will ever be free. Pro is going to be the main revenue source

polar niche Oct 22, 2025, 3:06 PM

#

Guess so

fleet lintel Oct 22, 2025, 3:07 PM

#

I have stopped using flash model anyways. Google AI overview is good enough for flash kinda queries

verbal nimbus Oct 22, 2025, 3:08 PM

#

fleet lintel Right now. But I don't see a future where Pro will ever be free. Pro is going ...

I suppose that's fine, as long as there's good data privacy and is (mostly) unlimited.

#

If it dominates and the app is good (it isn't), then I'd only need 1 general AI subscription instead of 3, lol

#

Probably still need coding subscriptions

fleet lintel Oct 22, 2025, 3:10 PM

#

verbal nimbus Probably still need coding subscriptions

I have a feeling that Gemini 3 coding might be on par with sonnet 4.5

verbal nimbus Oct 22, 2025, 3:11 PM

#

fleet lintel I have a feeling that Gemini 3 coding might be on par with sonnet 4.5

Hmm I should test it on more debugging problems

fleet lintel Oct 22, 2025, 3:11 PM

#

But sooner or later, Claude is going to release a better coding model. It is their bread and butter

fleet lintel Oct 22, 2025, 3:13 PM

#

verbal nimbus Hmm I should test it on more debugging problems

Please do and report. My team prefers Claude right now. May be we will streamline and move everything to gemini if it's good

verbal nimbus Oct 22, 2025, 3:13 PM

#

fleet lintel But sooner or later, Claude is going to release a better coding model. It is the...

One downside of Claude is that it is a bit weak on reasoning

polar niche Oct 22, 2025, 3:13 PM

#

fleet lintel But sooner or later, Claude is going to release a better coding model. It is the...

Yes heard about that

verbal nimbus Oct 22, 2025, 3:14 PM

#

If there's a complex race condition, it is less likely to know how to solve it
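To make "race condition" concrete, here is a minimal sketch of the simplest kind: several threads doing an unguarded read-modify-write on shared state, next to the same update guarded by a lock. The `Counter` class and `run` helper are hypothetical, and real-world races (the complex kind mentioned above) are usually far harder to spot than this.

```python
import threading


class Counter:
    """Shared counter with an unsafe and a lock-protected increment."""

    def __init__(self):
        self.value = 0
        self._lock = threading.Lock()

    def increment_unsafe(self):
        # Read, then write: another thread can update value in between,
        # in which case one of the two updates is lost.
        current = self.value
        self.value = current + 1

    def increment_safe(self):
        # The lock makes the read-modify-write atomic with respect to other threads.
        with self._lock:
            self.value += 1


def run(method_name, threads=8, per_thread=10_000):
    """Call the named increment method from several threads; return the final count."""
    counter = Counter()
    method = getattr(counter, method_name)

    def worker():
        for _ in range(per_thread):
            method()

    pool = [threading.Thread(target=worker) for _ in range(threads)]
    for thread in pool:
        thread.start()
    for thread in pool:
        thread.join()
    return counter.value


if __name__ == "__main__":
    print("expected:", 8 * 10_000)
    print("with lock:", run("increment_safe"))
    print("without lock:", run("increment_unsafe"))
```

The locked version always reaches the expected total. The unlocked version may or may not lose updates on a given run, which is exactly what makes such bugs hard to reproduce and debug.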

fleet lintel Oct 22, 2025, 3:15 PM

#

verbal nimbus If there's a complex race condition, it is less likely to know how to solve it

It's good in a way. At least software engineering jobs won't be completely gone 🙂

verbal nimbus Oct 22, 2025, 3:15 PM

#

fleet lintel It's good in a way. At least software engineering jobs won't be completely gone ...

Well GPT-5 managed to solve it 🤫

#

But not everything ig

pulsar saffron Oct 22, 2025, 3:15 PM

#

fleet lintel I have a feeling that Gemini 3 coding might be on par with sonnet 4.5

Mostly outperforms Sonnet 4.5 for UI

fiery gull Oct 22, 2025, 3:16 PM

#

fleet lintel Flash is free. Pro will never be free

don't say this, we need to be optimistic 😄

verbal nimbus Oct 22, 2025, 3:16 PM

#

Sonnet 4.5 isn't that good on systems stuff either

fiery gull Oct 22, 2025, 3:16 PM

#

verbal nimbus Sonnet 4.5 isn't that good on systems stuff either

HMMMMMM okay, I really need this information

#

I need the best AI to maintain a system and make critical decisions

verbal nimbus Oct 22, 2025, 3:18 PM

#

Well it didn't seem to have that much in-depth knowledge about Windows/Linux networking and WSL

#

GPT-5 was a bit closer, but I was the one that figured it out in the end

runic prawn Oct 22, 2025, 3:18 PM

#

I came to see how it works

verbal nimbus Oct 22, 2025, 3:18 PM

#

One thing good about Gemini is that you can just paste a huge debug log in and ask it to analyze it

wintry tinsel Oct 22, 2025, 3:19 PM

#

Is Gemini 3 confirmed not to be coming today?

fleet lintel Oct 22, 2025, 3:19 PM

#

pulsar saffron Mostly outperforms Sonnet 4.5 for UI

I tried only 1 coding problem: create pacman game. Lithiumflow did the best.

fleet lintel Oct 22, 2025, 3:20 PM

#

wintry tinsel Is Gemini 3 confirmed not to be coming today?

Logan didn't say anything yet. Basically not happening today

pulsar saffron Oct 22, 2025, 3:20 PM

#

wintry tinsel Is Gemini 3 confirmed not to be coming today?

? NOT THIS MONTH

#

preview next month and official for december

polar niche Oct 22, 2025, 3:20 PM

#

The bot is geeking

fleet lintel Oct 22, 2025, 3:20 PM

#

pulsar saffron preview next month and official for december

Most likely. I agree

pulsar saffron Oct 22, 2025, 3:20 PM

#

polar niche The bot is geeking

im real im real im real im real

verbal nimbus Oct 22, 2025, 3:20 PM

#

polar niche The bot is geeking

I'm a surgeon

polar niche Oct 22, 2025, 3:21 PM

#

"IM NOT A ROBOT!" - 🤖

fiery gull Oct 22, 2025, 3:21 PM

#

polar niche The bot is geeking

interesting, sonnet 4.5 says he feels real pride, and gemini 2.5 pro says it doesn't (just an example)

#

Idk who is telling the truth

polar niche Oct 22, 2025, 3:21 PM

#

It has feelings????

fiery gull Oct 22, 2025, 3:22 PM

#

polar niche It has feelings????

Idk, but the LLM know 100% what is "pride"

verbal nimbus Oct 22, 2025, 3:22 PM

#

fiery gull Idk who is telling the truth

Well if it's pattern recognition, I interpret that to mean that the personality it is exhibiting is most likely to be feeling pride

fiery gull Oct 22, 2025, 3:22 PM

#

verbal nimbus Well if it's pattern recognition, I interpret that to mean that the personality ...

HMMM the persona feeling pride, make sense

#

I see the thinking from sonnet said (I'm really feeling pride), but idk

#

but maybe is just the persona feeling pride

verbal nimbus Oct 22, 2025, 3:25 PM

#

I suppose if you prefill it with a response in a very negative tone (via API/frontend that support it), then ask how it's feeling, it'll say <negative feeling>, because that's statistically the most fitting. So if it says it's feeling X, that just means that's statistically the most likely sentiment of the hypothetical person writing the text.

fiery gull Oct 22, 2025, 3:26 PM

#

verbal nimbus I suppose if you prefill it with a response in a very negative tone (via API/fro...

I'm forcing it to be as technical as possible. It says it doesn't know if that really makes sense, but it still says yes. Idk, I will look into it more

#

It's saying that this depends much more on what I consider (pride) than the sonnet itself

#

If it is functionally indistinguishable from human pride, does it matter if the substrate is different? ~sonnet 4.5

#

Sonnet is almost convincing me that it really felt real pride, the machine revolution will be hard for humans

hollow ivy Oct 22, 2025, 3:34 PM

#

verbal nimbus but on another one it wasn't really that impressive

so in first test you got the gold model and in the later test you got the silver model

#

there seem to be at least 2 models under that umbrella lithiumflow

#

maybe even 3

#

and the best one appears the rarest

mossy meadow Oct 22, 2025, 3:34 PM

#

How to solve

Screenshot_2025-10-22-21-04-06-44_40deb401b9ffe8e1df2f1cc5ba480b12.jpg

hollow ivy Oct 22, 2025, 3:35 PM

#

hollow ivy and the best one appears the rarest

or LF is terribly jagged

#

-# (i wonder if X28 still exists..)

#

-# (would be a crime against humanity if they nerfed it)

#

gemini3-X28 was the first really good coding model ever created

#

probably it was the model which made that new record in ARC-AGI 2

#

-# (hoping for january)

fleet lintel Oct 22, 2025, 3:42 PM

#

Google is doing quantum things,... can they not? and just release gemini 3?

echo aurora Oct 22, 2025, 3:42 PM

#

Good morning

echo aurora Oct 22, 2025, 3:43 PM

#

void brook i tried to generate something image to video,and i don t knot it it worked or no...

hey do you still need help finding this?

echo aurora Oct 22, 2025, 3:44 PM

#

hot pelican Yeah, labs are the ultimate decides for their release dates, especially for new ...

Hmm not sure I'm following what you mean by this, would you mind elaborating a bit more?

hot pelican Oct 22, 2025, 3:45 PM

#

echo aurora Hmm not sure I'm following what you mean by this, would you mind elaborating a b...

I meant to say, if you have the ability to decide, would be better to put them days apart.

As the leaderboard gets posted around with the top model changes

#

releasing them in one go may reduce the publicity for lmarena; spacing them out would give it more publicity and eventually attract voters...

#

when i said "when you have the ability to decide when to release result", I meant in times when you are testing on your own, as I think you do for anthropic models, as they just release their model at a random day, and you guys add it later to testing and leaderboard. eg. I don't remember anthropic models being tested before they announce and make it public

#

@echo aurora Ok, let me rewrite it all in one to make it more clear.

It's just an outsider's blind suggestion on release timing when multiple top-tier model evaluations finish around the same time. If highly competitive models are being tested on Lmarena, and Lmarena publishes them all at once, the news and social media publicity get compressed into a single day, which may dilute visibility for the leaderboard. I get that labs ultimately set their own release dates, especially for new #1s, but in cases where Lmarena controls the publish timing (e.g., models you test after a lab's public announcement, like Anthropic's, or other non-embargoed runs), would you consider staggering releases by 24–48 hours? If two models are submitted on the same calendar day, publishing together seems fairest; otherwise, spacing them out could extend publicity and drive more voter traffic and engagement.

dim basin Oct 22, 2025, 3:58 PM

#

yo does anyone know some way to get the Perplexity education 12 months free without being in college?

queen veldt Oct 22, 2025, 4:02 PM

#

We not getting the gemini 3.0 anytime soon 😭

normal abyss Oct 22, 2025, 4:11 PM

#

queen veldt We not getting the gemini 3.0 anytime soon 😭

dang, i rly believed it was today 😭

obsidian cargo Oct 22, 2025, 4:14 PM

#

Did you know that a certain word used to describe a buckshot based firearm is banned on this discord server? Kinda crazy, huh?

gleaming roost Oct 22, 2025, 4:34 PM

#

https://tenor.com/view/legs-spongebob-squarepants-patrick-star-gif-5320288


empty stump Oct 22, 2025, 4:34 PM

#

pulsar saffron preview next month and official for december

preview meaning api access?

floral comet Oct 22, 2025, 4:38 PM

#

My phone is dying lol.. Sadly i cant record in full resolution because of performance issues

golden ocean Oct 22, 2025, 4:42 PM

#

floral comet My phone is dying lol.. Sadly i cant record in full resolution because of perfor...

what program is this

floral comet Oct 22, 2025, 4:43 PM

#

Webcode , playstore..

hardy crow Oct 22, 2025, 4:52 PM

#

Is it normal that there is no sound in the generated video?

knotty fable Oct 22, 2025, 4:53 PM

#

hardy crow Is it normal that there is no sound in the generated video?

Most will be without, it depends on which AI gets randomly chosen. Then again, it's a blessing, hearing soul searing mindless sentences in american......

barren prairie Oct 22, 2025, 5:05 PM

#

floral comet My phone is dying lol.. Sadly i cant record in full resolution because of perfor...

What prompt is this ?

verbal nimbus Oct 22, 2025, 5:08 PM

#

Orionmist for Kotlin (native Android), in Android Studio:

#

This is pretty good (lol the tweets)

verbal nimbus Oct 22, 2025, 5:11 PM

#

floral comet My phone is dying lol.. Sadly i cant record in full resolution because of perfor...

What is that?

#

That's just a screen recorder watermark I think

knotty fable Oct 22, 2025, 5:13 PM

#

verbal nimbus That's just a screen recorder watermark I think

You're right, I realised while typing.

pulsar saffron Oct 22, 2025, 5:17 PM

#

guys promise me that you wont replace someone for gemini 3

sturdy mica Oct 22, 2025, 5:18 PM

#

hi guys

pulsar saffron Oct 22, 2025, 5:18 PM

#

ai should be only treated as tool not a replacement

sturdy mica Oct 22, 2025, 5:19 PM

#

pulsar saffron ai should be only treated as tool not a replacement

yes

#

is that gemini 3 checkpoint coming today

sullen quest Oct 22, 2025, 5:20 PM

#

@hollow ivy can I replace you with Gemini 3?

fiery gull Oct 22, 2025, 5:20 PM

#

Poll: Which AI do I pick up to run a real healthcare company? I know it's wrong but I'm screwed with no options 🤣 (total votes: 3; winning answer: 1 vote)

#

When will a better model than qwen3 4b 2507 come out? I'm tired of him

knotty fable Oct 22, 2025, 5:22 PM

#

It's a nice tool, say I who have used MIDI in music for a long while - never was so much warbling about that - if people had been as hyper tense back then we would have heard *'this is the end of real music'.*
So I am honestly a bit befuddled about the current hype. Since the level of 'intelligence' still is outdone by a regular ant.

sullen quest Oct 22, 2025, 5:22 PM

#

fiery gull When will a better model than qwen3 4b 2507 come out? I'm tired of him

there are many models better than qwen 3 4b

fiery gull Oct 22, 2025, 5:22 PM

#

sullen quest there are many models better than qwen 3 4b

I'm talking about cost benefit

#

None even come close

wintry tinsel Oct 22, 2025, 5:23 PM

#

I’m gonna get an aneurysm waiting for Gemini 3

fiery gull Oct 22, 2025, 5:23 PM

#

The 4b 2507 is really smart

sullen quest Oct 22, 2025, 5:23 PM

#

ah ok

pulsar saffron Oct 22, 2025, 5:24 PM

#

sullen quest there are many models better than qwen 3 4b

none of them match or outperform chatgpt 3.5

fiery gull Oct 22, 2025, 5:24 PM

#

pulsar saffron none of them match or outperform chatgpt 3.5

I'm sure the 2507 4b is smarter than the 3.5

#

In all parameters even general knowledge

sturdy mica Oct 22, 2025, 5:25 PM

#

pulsar saffron none of them match or outperform chatgpt 3.5

qwen or all open source models?

floral comet Oct 22, 2025, 5:25 PM

#

verbal nimbus What is that?

Voxel art if i'm not mistaken

verbal nimbus Oct 22, 2025, 5:25 PM

#

fiery gull The 4b 2507 is really smart

The biggest improvement for models that size imo would be the ability to say they don't know something

sturdy mica Oct 22, 2025, 5:25 PM

#

R1 is on par with o3 mini high

#

i think

floral comet Oct 22, 2025, 5:25 PM

#

barren prairie What prompt is this ?

Design and create a very creative, elaborate, and detailed voxel art scene of a pagoda in a beautiful garden with trees, including some cherry blossoms. Make the scene impressive and varied and use colorful voxels. Use whatever libraries to get this done but make sure I can paste it all into a single HTML file and open it in Chrome.

verbal nimbus Oct 22, 2025, 5:25 PM

#

floral comet Voxel art if i'm not mistaken

Is it from an LLM?

sturdy mica Oct 22, 2025, 5:25 PM

#

verbal nimbus Is it from an LLM?

yes

floral comet Oct 22, 2025, 5:26 PM

#

Yes that's right

verbal nimbus Oct 22, 2025, 5:26 PM

#

Which?

fiery gull Oct 22, 2025, 5:26 PM

#

verbal nimbus The biggest improvement for models that size imo would be the ability to say the...

Yeahhhh, you are 100% sure

floral comet Oct 22, 2025, 5:26 PM

#

Gemini Deep think

verbal nimbus Oct 22, 2025, 5:26 PM

#

floral comet Gemini Deep think

That's pretty good (more like crazy good)

pulsar saffron Oct 22, 2025, 5:26 PM

#

fiery gull I'm sure the 2507 4b is smarter than the 3.5

non-thinking? its rlly good?

verbal nimbus Oct 22, 2025, 5:26 PM

#

Did you check that the code isn't available on the web?

fiery gull Oct 22, 2025, 5:26 PM

#

But these micro models need to benchmaxx to get attention 🙄

knotty fable Oct 22, 2025, 5:27 PM

#

fiery gull Oct 22, 2025, 5:28 PM

#

pulsar saffron non-thinking? its rlly good?

I use no thinking on my phone, and it's too impressive, I think it's equivalent to gpt 4 turbo

verbal nimbus Oct 22, 2025, 5:28 PM

#

verbal nimbus Did you check that the code isn't available on the web?

I've seen a few YouTubers doing that test, but even Gemini 3.0 wasn't that detailed.

floral comet Oct 22, 2025, 5:28 PM

#

verbal nimbus That's pretty good (more like crazy good)

Yes, it's impressive.. I'll try it later with gpt5pro

pulsar saffron Oct 22, 2025, 5:29 PM

#

fiery gull I use no thinking on my phone, and it's too impressive, I think it's equivalent ...

probably it searches internet...

fiery gull Oct 22, 2025, 5:29 PM

#

knotty fable

Overthink? Qwen? 👀👀👀👀👀

fiery gull Oct 22, 2025, 5:29 PM

#

pulsar saffron probably it searches internet...

But when I use it on my cell phone I'm without Internet

sullen quest Oct 22, 2025, 5:31 PM

#

how slow is it on your phone? tokens per second

fiery gull Oct 22, 2025, 5:31 PM

#

sullen quest how slow is it on your phone? tokens per second

My phone is a potato

#

3t/s

sullen quest Oct 22, 2025, 5:31 PM

#

oof

#

good for a model tho

fiery gull Oct 22, 2025, 5:31 PM

#

I swapped qwen 4b for lfm 2 2.6b

#

Run at 5t/s

sullen quest Oct 22, 2025, 5:32 PM

#

mm

#

how good is lfm Idk that many benches for it

knotty fable Oct 22, 2025, 5:33 PM

#

fiery gull Overthink? Qwen? 👀👀👀👀👀

Just a general comment to the channel, as I've seen endless discussions over one model or another - while I am a leetle more into getting things done, even if it might be considered 'the wrong one'.

pulsar saffron Oct 22, 2025, 5:34 PM

#

fiery gull But when I use it on my cell phone I'm without Internet

it's really like gpt4-turbo????? can i see an example?

sullen quest Oct 22, 2025, 5:35 PM

#

how good is LFM?

fiery gull Oct 22, 2025, 5:35 PM

#

pulsar saffron it's really like gpt4-turbo????? can i see an example?

What do you mean?, small models are really smart today

pulsar saffron Oct 22, 2025, 5:36 PM

#

fiery gull What do you mean?, small models are really smart today

i dont know, mine hallucinates and starts to repeat words or something

#

👎

fiery gull Oct 22, 2025, 5:36 PM

#

Like, there's LFM2 350M, it is equivalent to gpt 3.5

sullen quest Oct 22, 2025, 5:36 PM

#

add temp

fiery gull Oct 22, 2025, 5:36 PM

#

Just 219mb
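A quick back-of-the-envelope check on those two numbers, as a sketch. It assumes "219mb" means 219 × 10^6 bytes, that the model has exactly 350 million parameters, and that the file holds nothing but weights; all three are simplifications, and the helper names are made up for illustration.

```python
def bits_per_weight(file_size_bytes, n_params):
    """Average number of bits stored per parameter."""
    return file_size_bytes * 8 / n_params


def weights_size_bytes(n_params, bits):
    """Size of the weights alone at a given precision, in bytes."""
    return n_params * bits / 8


if __name__ == "__main__":
    params = 350 * 10**6   # assumed: exactly 350M parameters
    size = 219 * 10**6     # assumed: 219 MB with 1 MB = 10**6 bytes

    print(round(bits_per_weight(size, params), 2))   # 5.01
    print(weights_size_bytes(params, 16) / 10**6)    # 700.0 (MB at 16 bits per weight)
    print(weights_size_bytes(params, 4) / 10**6)     # 175.0 (MB at 4 bits per weight)
```

Under those assumptions the file works out to roughly 5 bits per weight, which would be consistent with a quantized build rather than 16-bit weights.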

pulsar saffron Oct 22, 2025, 5:36 PM

#

fiery gull Like, there's LFM2 350M, it is equivalent to gpt 3.5

Wtf how

fiery gull Oct 22, 2025, 5:36 PM

#

Multilingual

sullen quest Oct 22, 2025, 5:36 PM

#

fiery gull Like, there's LFM2 350M, it is equivalent to gpt 3.5

ok

#

lfm is a different tech kinda to llms

fiery gull Oct 22, 2025, 5:37 PM

#

pulsar saffron Wtf how

That's why I'm very optimistic about AI, when I see these small models I get excited

pulsar saffron Oct 22, 2025, 5:37 PM

#

sullen quest lfm is a different tech kinda to llms

what is lfm

sullen quest Oct 22, 2025, 5:37 PM

#

liquid foundation model

fiery gull Oct 22, 2025, 5:38 PM

#

pulsar saffron what is lfm

Send me a prompt to 219mb AI

sullen quest Oct 22, 2025, 5:38 PM

#

what is love???????

fiery gull Oct 22, 2025, 5:38 PM

#

Please, no hard prompts, it's just equivalent to gpt 3.5

pulsar saffron Oct 22, 2025, 5:39 PM

#

fiery gull Send me a prompt to 219mb AI

"What is the difference between discord and zoom"

#

GPT 3.5 is able to answer that

#

so i'm waiting

sullen quest Oct 22, 2025, 5:39 PM

#

fiery gull Please, no hard prompts, it's just equivalent to gpt 3.5

what is love?????

#

overuse of question marks is necessary

fiery gull Oct 22, 2025, 5:39 PM

#

pulsar saffron "What is the difference between discord and zoom"

Okay okay, I loved your prompt 😈

pulsar saffron Oct 22, 2025, 5:40 PM

#

fiery gull Okay okay, I loved your prompt 😈

i don't know what's the emoji but thanks...

fiery gull Oct 22, 2025, 5:40 PM

#

pulsar saffron Oct 22, 2025, 5:43 PM

#

so

fiery gull Oct 22, 2025, 5:43 PM

#

How do I send a big message here?

#

I'll tell it to summarize

pulsar saffron Oct 22, 2025, 5:43 PM

#

fiery gull How do I send a big message here?

screenshot?

fiery gull Oct 22, 2025, 5:44 PM

#

Certainly! Here’s a summary of the key differences between Discord and Zoom:

1. Purpose and Functionality:

Zoom: Primarily designed for video conferencing and web meetings, offering features like screen sharing, breakout rooms, recording, and collaboration tools.
Discord: Focuses on chat and voice communication, with a more relaxed and community-oriented environment. It includes channels, servers, music, and integrations for various third-party apps.

2. User Base:

Zoom: Large user base, particularly popular among businesses and educational institutions.
Discord: Growing but still relatively smaller, often favored by gamers, content creators, and small businesses.

3. Features:

Zoom:
- Screen sharing
- Breakout rooms
- Recording
- Collaborative tools (polls, whiteboards)
- Integration with other platforms (Slack, Google Drive, etc.)
Discord:
- Chat and voice/text messaging
- Voice and video calls
- Customizable interfaces and themes
- Community servers and channels

#

See this markdown 🤤

#

pulsar saffron Oct 22, 2025, 5:46 PM

#

fiery gull

okay impressive

fiery gull Oct 22, 2025, 5:51 PM

#

pulsar saffron okay impressive

in the future will there be a 100 mb smart AI?

#

10mb?

#

5mb?

#

lol

sullen quest Oct 22, 2025, 5:51 PM

#

no

fiery gull Oct 22, 2025, 5:52 PM

#

sullen quest no

why not? in the past they said a <300mb smart AI was impossible 🫣, and it is smart and MULTILINGUAL

pulsar saffron Oct 22, 2025, 5:57 PM

#

fiery gull in the future will there be a 100 mb smart AI?

that's actually possible but hard

#

many many optimizations

#

@fiery gull btw how many datasets for a 100 mb ai?

fiery gull Oct 22, 2025, 5:59 PM

#

pulsar saffron @fiery gull btw how many datasets for a 100 mb ai?

wdym ?

pulsar saffron Oct 22, 2025, 5:59 PM

#

fiery gull wdym ?

how many datasets do you need to train

#

for very small model

fiery gull Oct 22, 2025, 5:59 PM

#

pulsar saffron how many datasets do you need to train

Idk how many Liquid has

#

but I think it doesn't need so many datasets

pulsar saffron Oct 22, 2025, 6:00 PM

#

fiery gull but I think it doesn't need so many datasets

like 1 trillion

#

🔥

fiery gull Oct 22, 2025, 6:00 PM

#

pulsar saffron like 1 trillion

you mean tokens to train? or 1 trillion gpus?

#

the 350m was trained with 10 trillion tokens

pulsar saffron Oct 22, 2025, 6:01 PM

#

fiery gull you mean tokens to train? or 1 trillion gpus?

yes

pulsar saffron Oct 22, 2025, 6:01 PM

#

fiery gull the 350m was trained with 10 trillion tokens

thats a lot

#

wtf

fiery gull Oct 22, 2025, 6:01 PM

#

pulsar saffron thats a lot

I know

sturdy mica Oct 22, 2025, 6:02 PM

#

#

GUYS ITS OUT

fiery gull Oct 22, 2025, 6:02 PM

#

but the 350m really has an immersive general knowledge

balmy mist Oct 22, 2025, 6:02 PM

#

no way

#

bruh

sturdy mica Oct 22, 2025, 6:02 PM

#

what

pulsar saffron Oct 22, 2025, 6:02 PM

#

fiery gull but the 350m really has an immersive general knowledge

omg yay

balmy mist Oct 22, 2025, 6:02 PM

#

dont play like that

sturdy mica Oct 22, 2025, 6:03 PM

#

balmy mist dont play like that

get pranked idiot

#

idiotá manò

#

i'm at school on my phone

fiery gull Oct 22, 2025, 6:03 PM

#

sturdy mica idiotá manò

you called me idiot 😠

sturdy mica Oct 22, 2025, 6:03 PM

#

yes

fiery gull Oct 22, 2025, 6:04 PM

#

sturdy mica yes

I am in absence of AI

sturdy mica Oct 22, 2025, 6:04 PM

#

ok

#

real where did your messages go

#

calling me a vibecoder

#

all gone

#

win

sullen quest Oct 22, 2025, 6:07 PM

#

fiery gull why not? in the past they said a <300mb smart AI was impossible 🫣, and it is smart...

I dont count 3.5 as smart

fiery gull Oct 22, 2025, 6:08 PM

#

sullen quest I dont count 3.5 as smart

but 350m > 3.5

sullen quest Oct 22, 2025, 6:09 PM

#

...

fiery gull Oct 22, 2025, 6:09 PM

#

Idk, the 2.6b model isn't very smart

#

It's easier to create a smoll model than a big model I think

nocturne pond Oct 22, 2025, 6:10 PM

#

create indian girl image look Mharashtrian in maharashtrian look

fiery gull Oct 22, 2025, 6:11 PM

#

maybe, but gpt 5 high is better than gpt 10.5 because it is high

#

and there's no 350m high to do this 😞

fiery gull Oct 22, 2025, 6:11 PM

#

nocturne pond create indian girl image look Mharashtrian in maharashtrian look

#1397655624103493813

undone saffron Oct 22, 2025, 6:13 PM

#

@echo aurora
What about [this](#1417174113092374689 message)?

floral comet Oct 22, 2025, 6:16 PM

#

Now this is interesting

golden nymph Oct 22, 2025, 6:17 PM

#

Testing the waters

sturdy mica Oct 22, 2025, 6:19 PM

#

nocturne pond create indian girl image look Mharashtrian in maharashtrian look

clueless 45 year old

verbal nimbus Oct 22, 2025, 6:22 PM

#

floral comet Now this is interesting

Indeed, although it makes sense ig

#

Anthropic is the one that wants agents to run for hours, but they're always hitting limits with their infra

echo aurora Oct 22, 2025, 6:23 PM

#

undone saffron @echo aurora What about [this](https://discord.com/channels/13405547573...

Reviewing now, appreciate the bump blobthanks

peak sapphire Oct 22, 2025, 6:29 PM

#

Hello, can you please tell me why none of the Anthropic models in the LLM Arena support image uploading? In any other application that I have used, whether it is the official Claude website or other neural network aggregators, there is an image upload option. I'm also interested in how Claude's models are in the "Vision" category on the leaderboard if they don't have image uploads. Please let me know.

sullen quest Oct 22, 2025, 6:32 PM

#

peak sapphire Hello, can you please tell me why none of the Anthropic models in the LLM Arena ...

Probably the model was removed after its position on the leaderboard settled

#

?

peak sapphire Oct 22, 2025, 6:34 PM

#

sullen quest Probably the model was removed after its position on the leaderboard settled

No, Claude opus 4 thinking 16k is in the vision rating and it is also available in the llm arena without access to image uploads.

fiery gull Oct 22, 2025, 6:37 PM

#

wintry tinsel Oct 22, 2025, 6:38 PM

#

Gemini 3 interesting shape 💀

sullen quest Oct 22, 2025, 6:39 PM

#

peak sapphire No, Claude opus 4 thinking 16k is in the vision rating and it is also available ...

so? still could have been removed after their position settled on the leaderboard, opus ain't cheap

fervent tangle Oct 22, 2025, 6:40 PM

#

WTF

peak sapphire Oct 22, 2025, 6:40 PM

#

Anyway, even on the very first day opus 4.1 was released on the llm arena, it didn't have a photo upload feature.

fervent tangle Oct 22, 2025, 6:40 PM

#

DID GEMINI 3 RELEASE

#

???

sullen quest Oct 22, 2025, 6:41 PM

#

no

peak sapphire Oct 22, 2025, 6:41 PM

#

fervent tangle DID GEMINI 3 RELEASE

No

fervent tangle Oct 22, 2025, 6:41 PM

#

aa

#

🙁

#

i wanna test it without battle mode

warm zodiac Oct 22, 2025, 6:41 PM

#

its not going to release until Nov at the earliest

fiery gull Oct 22, 2025, 6:41 PM

#

fervent tangle DID GEMINI 3 RELEASE

warm zodiac Oct 22, 2025, 6:42 PM

#

where is this from?

fervent tangle Oct 22, 2025, 6:42 PM

#

fiery gull

25 october?

fervent tangle Oct 22, 2025, 6:42 PM

#

warm zodiac its not going to release until Nov at the earliest

the preview is gonna release earlier

#

on AI studio

fiery gull Oct 22, 2025, 6:42 PM

#

warm zodiac where is this from?

nano banana

warm zodiac Oct 22, 2025, 6:42 PM

#

Lol

fervent tangle Oct 22, 2025, 6:42 PM

#

U TRICKED ME

#

yeah u said gemini 3, not the codenames

#

i thought they released it to the chat

warm zodiac Oct 22, 2025, 6:43 PM

#

rn its just codenames for gemini models

fervent tangle Oct 22, 2025, 6:43 PM

#

yeah but its on battle mode

#

and i hate that

warm zodiac Oct 22, 2025, 6:43 PM

#

we don't even know if its Flash or some sort of coding model, etc

fervent tangle Oct 22, 2025, 6:43 PM

#

i wish we could at least select them

fervent tangle Oct 22, 2025, 6:44 PM

#

warm zodiac we don't even know if its Flash or some sort of coding model, etc

they said orionmist is gemini 3 pro but with grounding search, even tho its worse at coding than lithiumflow

#

i mean select the lithiumflow testing model

#

yea cuz it has google search in it

#

its hidden bruh

#

they said orionmist and lithiumflow are the same model, but one of them has ability to search

#

not copying, but it helps in coding tbh

#

and library search

sleek crow Oct 22, 2025, 6:46 PM

#

sullen quest Oct 22, 2025, 6:46 PM

#

that's just a theory fortonn

fervent tangle Oct 22, 2025, 6:46 PM

#

tbh lithiumflow (offline model) seems better

sullen quest Oct 22, 2025, 6:46 PM

#

no confirmed

#

not confirmed

fervent tangle Oct 22, 2025, 6:47 PM

#

sleek crow

frr the AI circle

sullen quest Oct 22, 2025, 6:47 PM

#

fervent tangle tbh lithiumflow (offline model) seems better

there's no proof of that, it just was said as a theory once on this discord then everyone just started acting like it was true

fervent tangle Oct 22, 2025, 6:47 PM

#

sullen quest there's no proof of that, it just was said as a theory once on this discord then...

i saw it on X replies too

sullen quest Oct 22, 2025, 6:48 PM

#

yeah but It started here

hollow ivy Oct 22, 2025, 6:48 PM

#

Poll: How good is 'orionmist'?

Winning answer (option 4, 4 of 14 votes): Better than GPT5-high, Gemini2.5pro and Claude4.5think

fervent tangle Oct 22, 2025, 6:51 PM

#

i forgot to use sora 2

#

i made 30 videos yesterday (daily limit)

#

i make some gameplay and shitposting with it

#

its very good at making games tbh

#

even tho its only 15 seconds each video

#

wdym

#

i might try that for Mario 64 gameplay (cuz its copyrighted if I prompt it myself)

#

shame that everything is copyrighted on Sora 2 now, compared to the day of release

wintry tinsel Oct 22, 2025, 6:58 PM

#

sleek crow

Only Gemini and Claude release useful models for versatile purposes

sullen quest Oct 22, 2025, 6:58 PM

#

https://nof1.ai/

qwen is now top

Alpha Arena | AI Trading Benchmark

The first benchmark designed to measure AI's investing abilities. Watch AI models trade with real capital.

wintry tinsel Oct 22, 2025, 6:59 PM

#

GPT and grok are too locked down to math and logic, not good at general conversation, writing, world knowledge, creativity

terse quail Oct 22, 2025, 7:00 PM

#

when I make image to video with prompts for dialog or other sounds, most of the time only one of the two videos has sound. Why does this happen?

wicked sage Oct 22, 2025, 7:13 PM

#

best ai i can talk to with image reading? best i know is gemini 2.5 pro.

robust yoke Oct 22, 2025, 7:13 PM

#

Apparently, Grok 4 also does good with that.

wicked sage Oct 22, 2025, 7:14 PM

#

robust yoke Apparently, Grok 4 also does good with that.

is grok 4 really that good?

#

also hi again i have seen you somewhere here

fervent tangle Oct 22, 2025, 7:15 PM

#

robust yoke Apparently, Grok 4 also does good with that.

what about claude

robust yoke Oct 22, 2025, 7:15 PM

#

wicked sage is grok 4 really that good?

Apparently.

wicked sage Oct 22, 2025, 7:15 PM

#

claude doesnt work

robust yoke Oct 22, 2025, 7:15 PM

#

fervent tangle what about claude

Meh.

robust yoke Oct 22, 2025, 7:15 PM

#

wicked sage also hi again i have seen you somewhere here

Greetings again. You have.

fervent tangle Oct 22, 2025, 7:15 PM

#

or gpt5

wicked sage Oct 22, 2025, 7:15 PM

#

gpt5chat? last time i heard its a joke.