#general
Hey guys! Let's not disrupt the chat with GIF spam
Yeah I've been flagging these errors to the team, it's not super stable at the moment.
oh.
this was the minimax result
this the sonnet https://3000-it6ku80oodtwxyjakawzu-6532622b.e2b-foxtrot.dev
aka sonnet 4.5 thinking 32k
btw could yall perhaps add a side by side option like yall did in lmarena so its easier to compare models?
It's very much being worked on, check out https://canary.lmarena.ai/ 
oo okay
what makes this different from normal lmarena?
Early features.
It's easy to tell because back when the site for the regular version had the old layout, the canary version had the new layout.
which model would you recommend for C++ coding (offline) ?
If the goal was, to create a 2D game.
this canary thing is so goated
It's a bit hard to determine considering I have never used any of the models for C++ coding, only HTML coding.
And also with React coding as well.
However, if I were forced to give an option, I would likely have to choose either Claude or DeepSeek.
Deepseek? really?
That's an interesting decision.
and Claude-4.5-thinking?
i thought DS was below the top league of LLMs (in serious coding)
DeepSeek because it takes a long time to think a decision through and contradicts itself in the process, which actually helps it improve its thinking and give better outputs, and Claude because it already codes very well and mixes some creativity into its work.
Claude, I believe, works very well too, considering it's able to communicate naturally and code very well, as well as being a very creative model in its own ways.
i alr found an issue with the canary thing
No doubt about it that the thinking model would also work very well.
Hello!
Greetings.
No, DeepSeek v3.2 Experimental Thinking.
what about GPT5-high, with good prompting?
yall
GPT in itself also seems pretty promising given the outputs it has given me, in some cases in WebDev. If you want to try GPT, then you can, but ultimately I recommend DeepSeek and Claude.
aka what i like to call the "Minecraft one-shot test" where i js use this prompt: "make me a three.js minecraft clone with working terrian. first person movement collisions side collisions xyz collisions terrain. that isint hilly as hell. but also isint hill downhil hill downhill hill downhill repeat and repeat and with block breaking and placing."
Why is Gemini 2.5 still at the top of llms in lmarena?
ask it to write that in C++
could be an interesting test
idk how to compile c++ lol
i only know how to do c# via unity.
and three.js via html
the AI can explain it
it (the AI) will probably propose SFML (for 2D games)
Why is mistral dumb af
just ask it, what is the best engine for 3D in C++
because that lab doesn't have the resources of the Big 5
(Deepmind, Anthropic, OpenAI, xAI, Meta)
How's mini max 2
Hmm interesting
Is it any good
Testing that right now.
That's crazy
But all those 5 are dumb too tbh
why that?
Deepmind isnt "dumb"
doesnt mistral just make small ai's that can run on ur pc or small servers?
or single h100 gpu's?
Gemini can't speak my native language
which is..?
What's your native language?
what's your native language?
All llms can't speak it
Tamazight
The original north African language before Arabs colonized us
cool!
What's tamazight
interesting.
our (earth's) history is rich
Yeah . Whats ur name. Im gonna write it in my language
me?
paws
An interesting rule.
Darkness = ⴷⴰⵔⴽⵏⵉⵙⵙ
There was one model that can speak my language
It was sonnet 3.5
what does "Cosmos" sound like in your language? if you translate it
It was amazing
It seems like Claude is the only model with a creative mind, and thus, can speak native languages very well.
ⵉⴳⵏⵡⴰⵏ
Ignwan
Yeah especially the old sonnet
They are focusing more in programming now . Not in ancient languages hhh
Well, not to worry, since Claude 4.5 seems to also be just like Claude 3.5, in that it's able to write in very natural English. Almost in a conversational tone.
So, I think it might be able to write in your language.
I've tested it. But the results weren't good
Darn. I'm sorry to hear that.
Yeah. Im hoping Gemini 3 will be good
It mentioned something about Tamazight being a Berber language.
Gemini 2.5 isn't that bad actually. But it makes a lot of mistakes
Yeah berber people are amazigh people
Good evening from Alaska, where I'm hoping to see what's possible with AI video animating characters from my stories.
But we don't call ourselves bereber. It is a racist name from romans
I've never really known about Amazigh, so this is all new information to me.
Where are u from ?
Washington, and you?
Morocco, taghazout
Ah.
I've known a little bit about Morocco, but I never realized people there spoke Tamazight.
I know Washington, do you know Taghazout? Hhhh
Not really, no.
After independence, the country didn't teach the original language in schools. That's why a lot of people speak Darija now, which is a mix between Arabic and French and Tamazight. Hhhh it's very complicated here hhh
There are a lot of people from Europe and the USA coming to Taghazout. It's very famous for surfing and nomads
Very interesting.
Ah.
Don't ask ai . Hhhh it doesn't understand tamazight
Apparently, it's supposed to translate to "cockroach".
ⴰⴱⴰⵏⴹⵔⵉⵡ = cockroach
@normal peak hey bud
There are a lot of Amazigh dialects here in Morocco. We have 3, Algeria has 3, and Libya 2. There's also a small group of people in Egypt speaking Tamazight too, in Siwa village. You can chatgpt it hhh
Does anyone have GPT Pro chat? I have the Plus option and can only make videos up to 10 seconds long and 720p with Sora 2.
I want to know if GPT Pro can make videos longer than that, 1080p, and with Sora 2 Pro.
I also want to know what the daily limit is for creating videos.
Amazigh letters. Very ancient
Apparently, it's in their native script.
Tifinagh alphabet
I find it funny that the letter "P" isn't in it.
Yeah hhhh like we don't have a single word in tamazight w the letter p
Yeah, that's very interesting.
i like MiniMax-M1
Heh.
Hhhhhh
It's almost like how the Dutch are very throaty in their language.
real
Thank you
My pleasure.
'Nànnuflày' (Fulfilled) from the album 'Elwan,' available now.
Order ELWAN here: http://found.ee/tinariwen_store
In the Sahara desert, an old Tuareg man comes back to the camp where he grew up for a party. He remembers the joys and the torments of the nomadic life he lived with a friend who has since deceased: memories from their naive child...
Hallucination hhh agi is still far away hhhh
what even is Tifinagh and where in the world is that from
Tifinagh is a handwritten script that a certain group of Moroccans used to write in.
It is the script of the Tamazight language which existed before the Arabs (pretend I explained something here), and now, they teach the Moroccans a mix of French, Tamazight, and Arabic.
@reef mirage Please head to #1397655624103493813 for a detailed guide on how to use the bot
you gotta set up a bot that automatically does that to any message with /video in it
@tribal whale Please head to #1397655624103493813 for a detailed guide on how to use the bot
That is literally cool as hell, I have never even known about this language until now.
Neither have I, to be completely honest.
Why not setup an automod rule to do this
@mortal plinth Please head to #1397655624103493813 for a detailed guide on how to use the bot
Thanks for your feedback. We will pass it to our team.
Basically, I mentioned that there are problems in LMArena, the images aren't showing up accurately.
Here! The image is completely blurry! gpt-image-1 used to make decent art, even different versions, but now he doesn't. Could you please explain the problem that's been going on for two months?
My message got deleted?
It doesn't look blurry to me? Maybe click download and send raw file instead ?
What do you mean?
gemini 3 december? taking so long...
hello
why do they release in december
The image doesn't look blurry to me
you're not the first person to complain about this, i complained about this here (#1412721830682296423) right when they started doing this. It seems like LMArena has done absolutely NOTHING to fix this! But that's because they did this to save money. They changed the quality of a model to make it cheaper and the leaderboard score goes down... They should change it back to how it was AND remove ALL votes since September 3 (the date they first started doing this sneaky stuff). It's a shame few people know LMArena has been doing this, I tried to tell people but it never works.
When they do fix it, they should add the API quality level of the model to the name of this and GPT Image 1 mini (it's affecting that model too) so people know what quality they are getting.
Damn, I'm so sleepy
Why is there no sound when generating a video?
@echo aurora you never gave me a proper answer the last time i asked but is https://github.com/lm-sys/FastChat still being used by LMArena at all? if not, is there any other github repository that is used by LMArena that I can use instead? (I am a volunteer for a popular online wiki that has LMArena on it multiple times for various AI things and this is the GitHub link we use for LMArena)
Guys can anyone tell me how i can generate pics and video
hi, guys everyone how are you!
read #1397655624103493813
for image gen it's better to use the lmarena.ai website (hit the image button in the text bar) as the 5 per day rate limit for videos on discord also applies to images when you generate them on discord
hi there - hope to discover best video generating ai tools there...
Okay
hi there - hope to discover best video generating ai tools and practices
Here to laugh at all the wild stuff being produced
192381273123 bots is coming
Need ultra banana 3.0 pro to have 5 mins of novelty in my life before I start "needing" grok 5 or something

#1397655624103493813 {
  "version": "1.0",
  "platform": "lm_arena",
  "task": "image_to_video",
  "referenced_image": "/mnt/data/IMG_4368.JPG",
  "settings": {
    "aspect_ratio": "9:16",
    "duration_seconds": 10,
    "fps": 24,
    "resolution": "4K",
    "format": "mp4",
    "quality": "high"
  },
  "prompt": {
    "description": "Cinematic drone shot starting high above an ancient Indian fort and smoothly zooming in toward its central courtyard. Warm golden-hour lighting with soft shadows, natural sunlight flares, and realistic HDR tone. Gentle downward tilt revealing the fort’s symmetry and red sandstone textures, with a slow, stabilized motion for an immersive feel.",
    "camera_motion": "smooth drone zoom-in, gentle downward tilt, stabilized dolly-in",
    "visual_style": "ultra-realistic, golden-hour color grading, 4K HDR, warm tones, soft vignetting"
  },
  "negative_prompt": "no people, no flicker, no distortion, no overexposure, no text or watermark"
}
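A request like the one above can be sanity-checked before it is pasted anywhere. This is only a minimal Python sketch: the field names are taken from the message, the schema is the poster's own rather than a documented LMArena format, and the allowed values and the helper name `check_request` are assumptions made up for illustration.

```python
import json

# Illustrative assumptions only: these are not documented limits.
ALLOWED_ASPECT_RATIOS = {"9:16", "16:9", "1:1"}
REQUIRED_SETTINGS = {"aspect_ratio", "duration_seconds", "fps", "resolution", "format"}


def check_request(raw: str) -> list[str]:
    """Return a list of problems found in a JSON request string (empty if none)."""
    try:
        req = json.loads(raw)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc.msg}"]
    if not isinstance(req, dict):
        return ["top level must be a JSON object"]

    problems = []
    settings = req.get("settings")
    if not isinstance(settings, dict):
        return ["settings must be a JSON object"]

    missing = REQUIRED_SETTINGS - settings.keys()
    if missing:
        problems.append("missing settings: " + ", ".join(sorted(missing)))
    if settings.get("aspect_ratio") not in ALLOWED_ASPECT_RATIOS:
        problems.append(f"unexpected aspect_ratio: {settings.get('aspect_ratio')!r}")
    duration = settings.get("duration_seconds")
    if not isinstance(duration, (int, float)) or duration <= 0:
        problems.append("duration_seconds must be a positive number")

    prompt = req.get("prompt")
    if not isinstance(prompt, dict) or not prompt.get("description"):
        problems.append("prompt.description is empty")
    return problems


example = '''{"settings": {"aspect_ratio": "9:16", "duration_seconds": 10, "fps": 24,
 "resolution": "4K", "format": "mp4"}, "prompt": {"description": "drone shot"}}'''
print(check_request(example))
```

An empty list means nothing obvious is wrong with the structure. It says nothing about whether the bot actually reads any of these fields.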
go on #video-arena-1
amazing! I've always been fascinated by the Amazigh! it's great to have you here! your language is especially fascinating, reminds me a little bit of the mystery of the Basque culture too ✨
what is this model from Google?
anyone knows how to use popcorn feature on higgsfield to transform a video and change the face in the video?
Does anyone know if ai generated videos can be monetized on youtube?
Please refer to #1397655624103493813 to learn how to use the bot.
That's the problem no one tested this model we don't know if it's actually good...
so the answer is you'll never find out
helllo
hello
i'm surprised that no one did a distill of gemini 3
Hello
hello
how has nobody mentioned that minimax m2 has a 200k context window
m1 had 4 million
going from the best context window (aside from llama 4 scout) to below an average good model's context window is kinda crazy
hi again
Hi, trying the models and prompting here before signing up to a specific service.
How will future humanity handle time[zones] (TZ) ?
1
2
Ernie is chinese
So I would say no
Unless they are trying to deceive us
no its google
it clearly says by google
i trust ernie
That's right, but gpt-image-1 stopped showing other options. And the images turned out to be of poor quality. For example:
Hey! Anyone know how to repair this? I can't work with this.... I have this in all models. 2-5 answers and error
Hello
Rysana is the AI cloud for production: fast, reliable, and clever. Make magic happen with our language model API platform. Check out our open source libraries and documentation for building better products with modern AI, and Lusat - our breakthrough reasoning engine for intent translation and on-the-fly dynamic UI generation.
hello
We just got a surprise AI video model drop! LTX Studio has officially launched LTX 2, and it's a banger! This new model boasts 4K resolution, audio generation, and, most importantly, it's going open source.
Today, we dive deep into LTX 2, going hands-on with the new API playground to test its text-to-video and image-to-video capabilities. We'll...
im so excited for ltx 2
its 4k 50 fps
GONBE GONBE!!!
?
everywhere I look I see her face
hello
making video
how can i generate video in here
Please check #1397655624103493813
hello folks
i have a problem with uploading my images (jpg format)
consistently got upload failed message !!!
what should i do!!!
does not help !
oh!
probably not tbh
though I bet there'll be a gemini 3 pro preview before it releases
I've seen it be said that lithiumflow is gemini 3 but not gemini 3 pro
Yeah I was told they were gemini 3 flash models
there was like, an X28 model or something that was labelled gemini 3 pro
it didn't say they were flash either tbh
maybe they'll end up being gemini 3 coding models
maybe, if they try something new
a few times lithiumflow did worse than 2.5 flash on creative writing stuff
Are you a girl 😶
@fresh mirage can u see this
glances at [she/her] in discord name
bruh
@pulsar saffron can u see this
shows api request
Maybe non arena champion role members can't see this
You can see everything right?
everyone can see it what's so special about it
hello
@fresh mirage @obsidian cargo
You guys were talking about gemini 3 gemini 3 pro
I think this helps
hey i want to make ai videos
Ask @jovial sapphire
yall its ai release season
we got suno v4.5 all. sonnet 4.5. news of gemini 3.0 coming soon. ltx 2. hailuo 2.3
😭
-# aka models that all recently released
never heard of ltx and hailuo, are they some kind of video/image generators? the only thing that excites me is the gemini 3.0 pro although i think google is prolly gonna make it a subscription
ltx and hailuo are video models
I sure hope they give free usage
they probably will.
they need good first impressions
They already have that from ab testers no?
and they will probably. like most likely give inf usage for google ai studio
public first impressions.
aistudio is such a wellmade chat interface, i really hope they do make it free
quality assurance is different from what the public thinks
Yeah I guess
I agree
they already got that with 2.5 pro and 2.5 flash, i think they will be setting up some limit probably, or if they actually cared made it more efficient and cheaper
Yeah but what if the 3.0 pro and flash launch end up terrible. and they NEED. the public to use it so people reccomend it to other people and youtubers hype it up post-release and so on and blah blah blah.
What if the gemini 3.0 launch ends up like the gpt 5 launch. gpt 5 was super hyped. but it launched with 50/50 reviews.
im guessing either google is shooting for the stars or trying every possible way to get the benchmarks slightly higher than gpt-5-high
i just hope they make it good in coding aspects so i dont have to use the expensive claude models
yeah cause if i remember one deepmind employee said that it would be way better at coding than 2.5 pro.
That would be good, although Gemini already costs the same as Sonnet/GPT-5 on Copilot.
how do i download my image, its lacking the icon for download
gemini cli is so disappointing i hope they improve on that too
right click ---> save image as
I hope it's good at agentic coding, that seems to be the real test.
it only gives me web address
go into it and right click save image as
i hope they actually make the 1m context or the rumored 2m to actually be useful instead of hallucinating the second you go over ~250k context
They should release benchmarks showing that the performance on, say SWE-Bench, actually improves as you increase the context.
The difference between 128K and 256K for Qwen 3 Coder on SWE-Bench Verified was only around 1%.
have you seen the ui websites on reddit that it makes?
insane compared to gpt5 or sonnet
It actually has internet connectivity, it seems. I'm not sure how fair that is.
internet connectivity as in what? do you mean like grounding?
Wdym? It's amazing
As in if I ask it for top news on Hacker News, it seems to fetch a cached version of it from just 2 days ago.
Personally my favorite ai's for coding are:
if you have no budget:
sonnet 4.5 thinking max thinking budget.
gpt 5 thinking high.
gemini 2.5 pro max thinking budget
if you have a small budget:
kimi k2.
deepseek v3.2
gemini 2.5 flash latest.
its missing some critical features, and codex and claude code perform better in my testing
what about glm 4.6
meh, i tried it
kilo code actually tested glm 4.6 haiku 4.5 and gpt-5-mini and they concluded that gpt5-mini is actually the best in their test
i think thats very interesting
Not that surprising to me
i havent seen anyone actually talking about the mini models from gpt5 series alot
GPT-5 mini is actually more persistent than GPT-5
GPT-5 Codex returns too quickly, kind of like GPT-4.1. It's very annoying.
Asks me to run tests when that's its job.
Told it multiple times in the chat as well to not return/report until all tasks are completed, but it keeps returning early.
same keeps happening with me with gpt-5-high, it thinks way too much on easy problems
Agentic ability seems like an important factor to test for. i.e., can it work autonomously to actually complete the tasks without returning early, losing context, hallucinating tool outputs, and can it use tools properly + plan + solve issues that arise, etc.
really hope it blows all models out the water otherwise it'll probably be some minor change
https://chat.z.ai/c/4b8c41c9-f64e-4009-a867-31ead653cc2c glm 4.6 is genuinely ass
😭
thats the thing i asked it to do
😭
It's kind of surprising how weak 2.5 Pro is at tool use. Even without tools it would hallucinate using a tool.
funniest thing is i just see it executing the tool calls in the thinking sometimes
I've had it claim to use tools on LMArena when there are none.
when i enable function calling on aistudio it keeps saying heres the code! but doesnt actually type anything
and then it keeps repeating
fr lol. one time i told it to be gen z and it hallucinated a "skibidy_search: rizz" ish toolcall. its something similar to that. but it was a few months ago.
I thought Orionmist was hallucinating, but I think it actually had (cached?) internet access (lol): #codename-discussion message
What features is it missing? I don't use claude code or Codex.
So I wouldn't know
Yeah or it hallucinates attachments when you forget to attach them (weird)
what did you expect lol
its not SotA
dont really remember, all i remembered is i had a really bad experience but if you say its good maybe they improved alot on it
i tried it first when it released
yesterday it told me 2-1 is 3 in a math equation (2.5 pro 1.0 temp)
Sometimes it thinks it can upload code to github and then sends me a hallucinated gist link
If true, then I think that's kind of unfair. Can it just search Github code?
Well im saying i might not know what I'm missing
What did you see missing
Oh that's odd...
It seems like all the Chinese models (likely) trained on Gemini have similar hallucination issues.
also for some reason it doesnt use LaTeX
It does for me, and if it forgets I just tell it.
for me it sometimes does and sometimes doesnt, uses 1/2 for fractions
hit or miss kinda
also the aistudio default temp really sucks
The scrolling is kind of buggy for me
It's impossible to scroll to some messages sometimes, it just skips up or down
me too sometimes when i scroll up it brings me down
yep
Hmm that's kind of a bad look tbh :P
its funny how easily jailbreakable the model is ngl
If their internal model is good, it would have fixed it
The input box for TTS on AIStudio has the same de-focusing issue Gemini models seem to make when on mobile
hi guys im currently trying to self host a minecraft server for me myself and i!
oooh kay?
hlo
hey lo
Dude, Gemini 3 is like drugs, once you try it you can't stop.
real
and its so good for my niche task too
why is r*blox a banned word
anyways
idk if its niche or not but i use it for r*blox scripting
ro blox
what model is gemini 3
the lithiumflow thing?
yeah
and orionflow i guess
but they are the same thing
just one is grounded with google
search
are u getting it to generate boblox scripts via webdev arena
its available somewhere else?!?!?!?
no
oh
hi
technically it is possible yes
i assume the ab testing in lmarena is still there but idk
its a pain to get a response from there too
just ask for something like a website that contains the sample script
sample script of what..
ro blox
hes talking about tthis
but we concluded
the model is GONE
in the first place
maybe
just maybe
we can hope its because release is imminent
because no need to have stealth models if the model is gonna come out tomorrow (im delusional)
I had read on a website that the launch was in December
i thought that was for another google stuff
can i like phase out of my life until december hits because..
😔
IKR?
it feels like I'm having withdrawals lmao
after using it for 3-4 days in a row
🤣
This is what google does to people
It’s like every model I now use is like 10 percent of googles “lithiumflow “ which people say is the FLASH version btw
oh
Funny thing
Someone who I thought was smart..
And good at tech in general sent me a get a free steam account now
Text or something
I see it in all the dead servers 🤣
mfs kept yapping in the server so they removed it
who
gemini, more specifically the person who works at it on X. hes doing all sorts of stuff
also because the fact they put it on lm arena
they do it to get their result so they can showcase it when the model actually gets released, not for hyping it up
@royal scarab Please head to #1397655624103493813 for a detailed guide on how to use the bot
@orchid wedge Hi! Please check https://discordapp.com/channels/1340554757349179412/1397655624103493813 to learn how to generate content
wdym we keep yapping?
thats how we test stuff?
Where can I see my creations
I wanna see 🫣
Hi every one!
Hi
I noticed something about Gemini. It tends to be very steadfast in its beliefs. If I accidentally ask it something that is past its knowledge cutoff and clarify, it insists that the thing I said still doesn't exist.
2.5?
I forget which one was the latest that did it, but I noticed the trend for a while.
meanwhile Claude goes with your conversation and makes stuff up lol, I told it about the RTX 5090 and its specs were made up entirely by Claude
I've seen that as well.
I find it fascinating how each model has its own "personality".
Claude's responses are very human tbh, I never get anything like that (except GPT5 which adds a lot of emojis in every chat)
i hate the emoji stuff
I notice that as well. Like a person who wants to satisfy the person it's talking to.
they're also great at story telling and fantasy generations
Yeah. Grok seems to be pretty good with that as well, but not quite to Claude's levels.
Though whenever I want to talk about things I've written, I only discuss it with locally-run models for privacy reasons.
Typically I talk to either Qwen or Dolphin-Mixtral about those sorts of things.
lol
funny part about these new state of the art models, they think you are testing them or evaluating their response sometimes so they actually just refuse you or say "this is obviously a test to evaluate my capabilities"
rofl
Hello, I test and try current AI systems
I've seen that before. It raises my eyebrow every time I see it happen.
Thanks this useful framework
hi i am alex, i am german but english speaking shouldnt be the problem
Hi, wanna some help? I'm here to help 🙂
wassup
try to figure it all out, i wrote a book and now i want to create some videos for the promotion on social media
@compact junco you can go to #1397655624103493813 for a detailed guide on how to use the bot
ok try it out
very interesting, if you have problems with video generation in lmarena you can use grok imagine 0.9v it has audio and is also 100% free
sora generated friday night funkin'
wow!
Sora 2 can probably do a million things that we can't even imagine
how good were the scripts
obviously by his tone its like r0blox scripts sent down by pharaoh himself
hopefully gemini 3 is that good
If the preview was already this insane superiority over other models, I can't wait for the PRO version
it is
its so damn good
it crushed every other model at Web development
might unsubscribe from chatgpt and subscribe to Gemini when it releases

i switched to gemini fully recently even tho i got chatgpt pro and i cant wait until g3 come out, g2.5 is slept on



hi guys i need help.. please
which one should i buy, gemini or chatgpt (by buy i mean get the paid plan)
sometimes it cooks and sometimes it sucks
Hello there, does someone know if there is a problem on the website? It´s not letting me upload images as input.
I have Gemini Pro, and honestly I don't see any reason to subscribe, if AiStudio exists
true
but that depends a lot on what you want to do, if it were me spending money on AI, I would sign the GLM code, and try to do something agentic
i chose
gemini
🎉
there was like an offer on this
so i HAD to get it
because
it had storage
stuff
i hate myself
i got the college student free subscription
hello
Hi @wicked sage
I'm evil.
yeah man i totally love google having quite literally all my data
@pine spruce Please head to #1397655624103493813 for a detailed guide on how to use the bot
How is Minimax 2 doing overall? Xd
this a great platform to learn and increase my AI knowledge
sometimes I wish there was a way to like
undo a vote, lol
It’s rare but
Sometimes I click wrong
and hit tie when I liked one model more or both are bad when I meant to pick another
I get how it would be abused by people revoking votes after seeing the models revealed
but
always feels so silly
will there ever be a website video gen?
I dont care about that lol
If you don't want your data collected don't use the internet
What if I want to use the internet without my data collected?
🔥 Hi
Bro ur on fire
ahhh okay, it make 100% sense now, I'm just sad because they nerfed 2.5 pro in the app
I still recommend using AiStudio
...
I was right, I always trusted the chinas 🔥🔥🔥🔥
Yeah grok 4 above sonnet 4.5 and opus 4.1 in anything
LOL JUST REALIZED GROK 4 FAST IS ABOVE OPUS 4.1 TOO
😭
even if it's benchmaxxing it's too good
@carmine river Please, read our guide in https://discord.com/channels/1340554757349179412/1397655624103493813 to learn how to properly prompt the bot.
@cinder finch Please, read our guide in https://discord.com/channels/1340554757349179412/1397655624103493813 to learn how to properly prompt the bot.
its not nerfed it has dumb instructions
if there's anything good paid gemini has, can I test it through you?
the memory from 2.5 pro is nerfed, I have a long chat and I feel the changes
hello
Bro felt it 🥀
not sure how long this’ll last, but you can scan the QR code to get Comet Pro and a month of Perplexity Pro for free 😊
m2 is pretty dumb in my testing, maybe a bit dumber than m1. not worth the hype imo. even gpt-oss-120b is better sometimes and that model is half the params
I don't like that yupp discord did a @ Verified (that server's equivalent of @ everyone) over this
mm
He trying out image to video generation
if it was k2 reasoning or v4 or something of that class that would be fine, not a 270b that has less knowledge than OpenAI's safetymaxxed and benchmaxxed model that is less than half the size AND has 4bit weights instead of 8bit or 16bit (not sure about that, but Chinese models are trending towards 8bit so it could be that)
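For a rough sense of what the bit-width point means in memory terms, weight storage is roughly parameters × bits ÷ 8 bytes. Below is a back-of-the-envelope Python sketch using only the figures from the messages above: 270b and 120b are the chat's numbers, and the bit widths are the poster's own guesses, not confirmed specs. It ignores activations, KV cache, and any quantization overhead.

```python
def weight_gb(params_billion: float, bits_per_weight: int) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes)."""
    total_bits = params_billion * 1e9 * bits_per_weight
    return total_bits / 8 / 1e9


# (label, parameters in billions, bits per weight), all taken from the chat.
cases = [
    ("270b at 8-bit", 270, 8),
    ("270b at 16-bit", 270, 16),
    ("120b at 4-bit", 120, 4),
]

for label, params, bits in cases:
    print(f"{label}: ~{weight_gb(params, bits):.0f} GB")
```

On those assumed numbers the smaller 4-bit model needs about 60 GB for weights versus about 270 GB for a 270b model at 8-bit, which is the size gap the message is pointing at.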
Can ai cure cancer
can someone point me where i should start learning proper way of making custom instructions or at least finding good ones?
and prompt optimizing
never bothered
non coding purposes
'Cancer' is a label for a wide variety of ailments. AI would have to learn and understand each one meticulously, before it could even dream of tackling it. And i bet, such an AI still is, at least, 5 years away..
Cancer also can be caused by many different things (including: chemistry, UV-radiation, radioactivity, poisonous food, toxins, infections, fungus, genetic diseases, even psychosomatic causes)
Thank you
you could ask Gemini 2.5 pro in google's AI studio to help
there's also OpenAI's "cookbook"
Cancer could be called a local failure of the body's innate self-repair system.
(Normally, the body quickly recycles cells, which became cancerous. It has to do with our immune system. The immune system can also be influenced [indirectly] by our emotions, or how we feel. Of course, if cancer has appeared, it is not enough to have a 'high spirit' to heal it. One would need targeted therapy.)
"psychosomatic" 
llm's are terrible at doing science research, but there's plenty of other ai's that are helpful right now in ai research, so yes, ai can help cure cancer
i did it myself with 1 prompt from chatgpt then editing it
worked out far better than i thought possible honestly
i actually like it more than claude explanatory mode now, which is what i was seeking to emulate....
a lot more tbh.... wow
ill post with vs without and the instructions
we can use open art here?
G, do you know of a video upscaler that improves the realism of people? I'm trying SeedVR, but it takes a trillion years. Also Topaz, but it smooths out the upscaling.
obviously the longer non-useless one is with instructions
here you go
dont want to spam channel with instructions can give if someone wants
they're long
will be put in a text file by discord so it'll be fine
only spent about 3 iterations editing this
im sure it can be way better
i was using settings "Depth: expert. Voice: analogy-heavy. Scope: include adjacent context. "
only problem is, gpt5-high obeys them and looks like that. Pro basically ignores it
GROK 5?
actually pro web searched and high didnt so i will disable web search and try again
tell me how good this is
so
the prompt was
to shorten an already shortened to hell
script
and
ill check which one is shorter
check quality of writing too
crazy
DUDE
...
oh my god
grok cant even render anything
😭
😭
dude i have no idea what to do now
i cant copy the responses
nvm
phew
atleast it saves
UNLIKE AISTUDIO'S AB TESTS
phew.
now i can check network requests
ok this fixed it
@sullen quest ... the one which thought almost 2x longer was.... worse..
it was the same length as the other one AND it didn't work..
so..
maybe the right one is the new one
thatd be awesome
564 chars/80 words
its kinda cringe
"perfectly, properly, go through many iterations of "wait, i can make this shorter by.." and say what you are going to do. it should make logical sense and should actually be shorter. then, look over all your iterations (at least 50 unique and true iterations with actual changed code and optimisations) and create shortest truly possible html file. current html file i want you to shorten to shortest truly truly possible while still working (109 chars): <script>history.replaceState(0,0,location+(location.search?'&':'?')+'__websim_screenshot_mode=true')</script>"
oh, wow that is short already
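To compare a model's answer against the snippet quoted in the prompt, counting characters is enough to confirm the length claim (whether a shorter version still works has to be tested in a browser). A small Python sketch: `saved_chars` is a hypothetical helper, and `candidate` is a placeholder to fill in, not a known shorter solution.

```python
# The snippet exactly as quoted in the prompt.
BASELINE = "<script>history.replaceState(0,0,location+(location.search?'&':'?')+'__websim_screenshot_mode=true')</script>"


def saved_chars(candidate: str, baseline: str = BASELINE) -> int:
    """Characters saved relative to the baseline (negative if the candidate is longer)."""
    return len(baseline) - len(candidate)


print(len(BASELINE))  # 109, matching the count given in the prompt

candidate = BASELINE  # placeholder: paste a model's output here
print(saved_chars(candidate))
```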
Hi
I'll try to get it myself, expert or grok 4 fast when you got this?
expert, of course
btw i used grok 4 because i tested all models and grok 4 0709 on lmarena is consistently MUCH better than any other model for shortening code without breaking it
its just that specific use case
Trying to make videos, I'm from Mexico, don't know how to start
Man these ai companies are tripping lol
some outright block simple things while others let it all through lol
go to how to video bot
which is better?
1
which model has the best understanding of MP3 files and can accurately describe music
I don't think thats a feature many llms have
gl
The creative writing from minimax m2 is so good 😆
But the benchmarks say it is so bad at memory 🙄
In a mysterious jungle, a young man and his loyal lion companion must complete five impossible tasks to restore balance to nature. Each challenge reveals courage, emotion, and the deep bond between man and beast. Combining AI-generated visuals with cinematic storytelling, the film takes viewers on a breathtaking adventure through the wild.
Open ai
Bro thinks hes generating a whole movie
Yes
I love when I'm using sonnet 4.5 and switch to another ai and start thinking like sonnet 🤣
I don't know, it seems that the m2 gets smarter with the thinking from sonnet
M2 is only weird with multilingual stuff, but it really knows how to speak Brazilian Portuguese, it just has generic preview errors
??
I just noticed that the data in the LMArena Hugging Face repo hasn’t been updated since August (both pickle file and metadata). Are there any plans to update it, or will it no longer be available going forward? Thank you!
Good
@echo aurora
Yes, it is our intention to continue to update that data. Our apologies it's been awhile. I'll be sure to bring this up.
Sounds great. Thank you!!🙏
Hello
@echo aurora?
My apologies! You're right, I never did get back to you on this. Yes, there's no move to a different repository; it is still this one, we just haven't updated it in a while.
What's the wiki you volunteer for?
hlo
specifically the AI page
is it going to be updated in the future and is it going to be accurate to the current LMArena interface/system?
Last I checked we are planning to update, I can bump the team about this on Monday.
accurate to the current LMArena interface/system
Not sure I'm understanding the question right, can you elaborate?
I was asking whether it is going to look like the current site.
That I do not know, but will ask.
Hi everyone, new here. I'd love to hear about your interesting projects. I'm finishing an apartment finder app for my recently widowed mom. She's alone now that my dad passed away, and I want her to downsize and enjoy her inheritance.
To help convince her, I'm handling everything. I've built an app to simplify selling her house, finding a new place, and moving. The app's design is inspired by her favorite magazine, The New Yorker.
Looking for inspiration and happy to connect. Feel free to DM me
Hello there fam!!!
new here cant wait to test some ai and see what works better for me!
I have a question. Why do models that in theory should have an almost entirely English dataset, like Grok and Gemini, sometimes misplace Chinese symbols into text on LMArena? I can understand, for example, why Deepseek or Qwen do that, but why other models that usually don't have such problems when using them on official websites? (don't tell me it's system prompt again, that just sounds silly)
fair, anyways offtopic but am i like on crack or is it true that claude is better than gemini
AI-powered calling for local businesses to check pricing and availability in Google Search (US only)
Flow
Jules with higher limits
NotebookLM with higher limits
Whisk
Deep Search in “AI Mode” for in-depth research (US only)
Gemini app with 2.5 Pro and Veo
Gemini CLI and Gemini Code Assist
Gemini in Gmail, Docs, Vids, and more
Gemini 2.5 Pro model in “AI Mode” (US only)
Gemini capabilities in Google Earth with higher limits (US only)
Higher limits on Google Photos Generative AI
^ Photo to video
^ Remix
2 TB Storage
1,000 monthly AI credits
i know this page i have used it several times in the past and i think literally was the reason i found LMArena
why is the code always missing from all ai models like when the chat lasts?
I have experimented with the AI music extenders. Of those I've tried, I can only say one delivers results worth a 👍 and that is https://musicextend.com/ sadly it seems quite a bit overloaded by requests - no wonder, it really is the best.....currently. [Addendum: It's super good for instrumental music, lyrics tend to be a bit absurd.]
Easily create and expand music with generative AI, breaking compositional time limits, online and for free!
ok which tool does LMArena use in Discord for image generation?
Because of Western bias and preference, plus more technical factors such as tokenizer idiosyncrasies, a model sees only a tiny amount of non-English text; English dominates. The models are biased: they “prefer” high-frequency English tokens. Non-English tokens are low probability. So the model will usually output English.
English words → often 1–2 tokens.
Chinese characters → usually single tokens.
斯大林 = Stalin
斯 (Sī) → “this” or “such”
大 (Dà) → “big” or “great”
林 (Lín) → “forest”
This is not a generation channel. Go here: https://discord.com/channels/1340554757349179412/1397655695150682194
hello
There’s traditional Mandarin, and they both differ from English because of the characters
I see
Arabic is also like this. That’s why it’s easier to jailbreak models with other languages: the same prompt that fails in English may work in Mandarin, Arabic, Korean, or a number of other languages because of the way they’re mapped out in the latent space
hey
@uneven gate @indigo grove you might check on #1397655624103493813 to learn how to use the bot properly.
I don’t think models see words or even letters the way we do it’s all numerical
See, each word has its own ID. Even if the Mandarin says the same thing, it has a different numerical ID for its token. This is what the LLMs calculate and optimize: instead of seeing the real word, they just see numbers that are assigned to their own context and token and so forth. This is a very simplified explanation
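A rough sketch of the "it's all numbers" point, in Python. This is only a toy: it uses one ID per UTF-8 byte as a stand-in for a tokenizer, which is an assumption for illustration. Real LLM tokenizers use learned vocabularies, so the actual IDs and token counts will differ. The function name `toy_token_ids` is made up for this example.

```python
def toy_token_ids(text: str) -> list[int]:
    """Toy stand-in for a tokenizer: one integer ID per UTF-8 byte.

    Real tokenizers (BPE and similar) merge bytes into larger learned
    tokens, so this only illustrates that the model's input is numbers.
    """
    return list(text.encode("utf-8"))


# Same name, two scripts: the model-side representation shares nothing.
english_ids = toy_token_ids("Stalin")
chinese_ids = toy_token_ids("斯大林")

print("Stalin ->", english_ids)
print("斯大林 ->", chinese_ids)
print("shared IDs:", set(english_ids) & set(chinese_ids))
```

In this toy scheme the two spellings share no IDs at all, which is the gist of why the same meaning in another language looks like a completely different input to the model.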
hi all im here to get creative!!!
image to video generation doesnt generate audio?
See. Best example of that I can give you. (Also huge security gap) but that’s for a different day.
hi im new , just exploring how far we can go and maybe save the world
Welcome new adventurer. You can explore to your hearts desire but to save the world is the opposite of what current AI is 🙉🙊
is anyone able to help me? i tried to generate a video but it has no audio
Sure, not all videos have audio
do i need to do anything specific to put audio in it?
hi
Well, since I assume they route their models to match the capabilities of each model, there may be some prompts that are more effective than others. If it’s random then it’s hit or miss. I can show you some prompt script to possibly help
okay
please do
R u using image or prompt?
im using image to video
What’s ur image I need to convert it and what do u need it to sound like?
its like in my local language
the lipsync was perfectly fine
but i got no audio
where do i send you the image?
can i dm u
Sure
hello everyone...
How do I use lmarena from in here
For what video?
hi i cannot send any messages:(
To who?
If it’s to the video arena you may have hit your daily cap (5 videos/ 24 hours)
to bot it says cant access the bot or upload failed
sadly i haven't been able to create one today at all. maybe something wrong with my connection
Could be but I doubt it let’s see
Hi
it was my connection, thanks pal
🤣 cool.glad 2 hear it
Who’s a killer at prompting Midjourney?
this is interesting, in my conversations with claude, I often see russian or chinese, sometimes spanish or portuguese words slipping in even though all my conversations are in EN, how do you explain this rich mixture of languages?
Not sure I never see anything like that.
I'm curious if i'd ever get to see a dead language slipping through
I need examples
Hi everybody. New here
Does it do it randomly?
no, it is context dependent I feel
Models sometimes scheme in very nefarious ways in order to complete the task. Often certain context and certain words and phrases would trigger the guardrail, so the models would find alternative means
one example is "activating": instead of writing it in EN, it was replaced by the russian word, sometimes it's written in EN with the russian translation right beside it
sometimes, claude uses 3 different languages in its thinking, for example: german, russian and EN
Claude the fake nice guy lol
in my case at least, it doesnt happen often tho
How are u doing all here guys
So far so good, just stopping by to see what’s what
well... maybe Anthropic has damaged some circuits somewhere during their interpretability experiment
I'm multi-lingual myself, including a few dead languages (grammar mostly), so I know how it feels, but it's interesting to see same phenomenon in LLMs
I'm not sure if this language confusion is connected to the wrong usage of personal pronouns, or is it rather context confusion, it's been a year, and such pronoun confusion is STILL a thing...
I think it was this
That guy is such a shill lol
Wrong video
Join My Newsletter for Regular AI Updates 👇🏼
https://forwardfuture.ai
My Links 🔗
👉🏻 Subscribe: https://www.youtube.com/@matthew_berman
👉🏻 Twitter: https://twitter.com/matthewberman
👉🏻 Discord: https://discord.gg/xxysSXBxFW
👉🏻 Patreon: https://patreon.com/MatthewBerman
👉🏻 Instagram: https://www.instagram.co...
I absolutely understand why languages are not sufficient if models "think" in the latent space
Cause they don’t really think lol
they calculate and map
I used the same word earlier, cause I couldn’t think of it at the time but the proper term I think would be optimization
But yes same idea
still not a reason to use various languages when the context is clear what language is consistently used?
Well context dependent and exactly what it is that the goal was
If you were deep in a conversation and deep in context with subjects all over the place
And you cross into sensitive areas more than likely it would produce such effect in theory
Other than that, I can’t imagine I never seen it before. I need an example to know for sure. I’m just going off of what I’m picturing in my head. 😂
when I cant find the right word in EN because my brain has found a better, more precise expression in another language, lets say chinese since you started the example above, then I'll just say the word in chinese and explain to my interlocutor what I'm thinking and why it's difficult to express what i want to say in EN, LLMs just straight output chinese characters without explaining why they did that, and users are confused like wth just happened...
Because they are built to be glaze
sensitive areas? like discussing math?
No, I’m not saying that’s what you were doing. I was just assuming for my experience lol
Well, if you introduce Chinese, I don’t understand why you would be confused if it responded back in Chinese? And I apologize, maybe I’m not understanding what you meant. Just to be clear, you’re saying that you introduced the Chinese expression because you didn’t know the word in English, right?
I've never introduced any languages, so why russian, spanish, portuguese and chinese? I haven't seen arabic til now, maybe it will happen at some point
it's spread across various chats, am not digging them up now cause i dont remember which chats, i dont mind this since i know this happens to humans, for monolingual people this can freak them out 😅
Could also be a memory thing
If you ever used it to translate
Especially if you’re using the arena
this happened before memory search or any memory features were implemented, and no translate, always strictly in EN
Was it in the arena?
on their own platform
Interesting. 🤨 hard to say.. ? Could be anything I’d love to see a screenshot sometime if anyone has one
there are some on reddit, you can dig there
not sure how this phenomenon is academically officially termed, maybe you can find it in this paper https://arxiv.org/html/2406.20052v1
I love the title of this paper, genius https://arxiv.org/abs/2410.13237
Language Confusion is a phenomenon where Large Language Models (LLMs) generate text that is neither in the desired language, nor in a contextually appropriate language. This phenomenon presents a critical challenge in text generation by LLMs, often appearing as erratic and unpredictable behavior. We hypothesize that there are linguistic regulari...
Oh this was common
Like early ChatGPT 4o days
But this was written by ChatGPT lol
🤣
A question about a bot with a reply from one
Only good high quality data left on the internet is user engagement and patterns
Everything else is trash maybe one or two nuggets of good data left on the Internet since everything else has already been fed
Yes that's a major problem, I had one DM discussion with a person about the fact the AI's use Wikipedia and similar other sources for their information.
The man is an expert on the history of a handful of Balkan countries, while I am a researcher in biology. We have both found so many errors in online sources that we agreed they are virtually worthless.
But that is what AIs use to summarize facts - what a joke this is!
Amen!
It’s because AI fanaticism and AI mania are a real thing. It clouds people’s ability to see clearly beyond what exactly they’re actually looking at, out of convenience. I’ll show you the prime example
Deloitte has issued a partial refund to the government after they delivered a report that partially used AI which contained errors, including fictitious federal court judgements and made up references.
#abcbusiness
Subscribe: http://ab.co/1svxLVE
Read more here: www.abc.net.au/news
ABC NEWS provides around the clock coverage of news events as...
He presented an example even, while he did not know the terminology, he did provide an example where the AI had been hallucinating 'facts'.
Here it is:
https://www.youtube.com/watch?v=IZ5FyLtVVxA
If you love stories that challenge what you think you know about the past —
SUBSCRIBE to Dark History Class, where we uncover the hidden truths, the forgotten wars, and the power struggles that shaped civilization.
September 1552. A Hungarian fortress faces 40,000 Ottoman soldiers—the largest army ever assembled in Europe at that time. Insi...
I hate the word hallucinations because sometimes the model could be factually correct and honest, but this would be considered a hallucination. It’s a rare phenomenon that doesn’t get a lot of attention.
And it’s a cheap way out for these companies to never take accountability or responsibility because everything could be blamed on hallucinations lol
I cannot say, but this man said the story told was completely in error and non-historical.
Oh man so much of this stuff on yt it’s ridiculous
Ai is also riddled with so much bias it’s crazy
Some people say AI is too agreeable. I don’t see this as being true.
Well it's the term I have adopted, as the AI behaves like a warbling nutcase on some strong drug.
Cause I could never agree with anything that AI says 😂
It’s funny they gave it a term thinking and reasoning lmao
bro i dreamed gemini 3.0 pro was already released and i was using it lol
its driving me crazy
I have amused myself by testing a few, some results mentioned here in this channel in the past. They do get things right on basic questions at least.
🙏
It’s actually a fundamental question that has to be answered, mainly with alignment
Because it’s not what the AI might do wrong, it’s that the AI does everything precisely right
That I fully agree on, AI is unable to do either, and all claims that AI have made any kind of discovery are false - this for several reasons I will not go into the details, but basically it's because one AI cannot reason.
Well, name what other product you could think of that would have this many errors and this many issues and still be able to have this much money and investment in it?
...that might have something to do with why all that talk of a 'bubble' has become so popular in the press? 😺
Kind of, but I’m just saying if you really think about it
It’s got a lot of potential don’t get me wrong
Anyway, AI is remarkably good in making images and video clips.
Well, that’s what makes it so ironic that’s what I was gonna mention next
Absolutely, and I finally have to admit some have started to do interesting things in music also now.
There’s a lot of cool projects out there with AI a ton
It’s a make-or-break moment for AI honestly
Me personally I think we’re a decade at least away from anything really crazy
There is, AI is a helpful tool. I have used automation in music for a long time. Drum machine, MIDI etc. I view AI as just another one - now for images and video.
It’s also heavily censored and restrictive, which I don’t like
this is the heaven of AI
AI is awesome, but it has its flaws and I think that there’s no shame admitting that
And I think to truly be a critical thinker you need to be somewhere in the middle. You need to be enthusiastic at the same time you need to be just as critical and be able to scrutinize the same kind of enthusiasm
LMArena is a nice place, some very smart people do warble here at times - and free testing of prompts in the arena channels so I agree with you Ellie. 😺
It’s important to have conversations on taboo ai subjects the elephants in the room
Which is hard to do because you’ll get banned
But it is what it is I guess. People aren’t ready to come to terms with reality
Yep it does almost sound like a religion from some people, I clashed very hard with a German AI researcher .....that was way back last year.
But yes, too much faith and no actual substance....
It’s sad because voices of reason get drowned out
I will show you the fundamental hypocrisy of which I speak
I’ll first acknowledge that I am a hypocrite like no other, so I have a little room to speak, but for this argument sake
Well the subject was close to my own research, as I've done some work on the intelligence and use of language in various species.
And I have not seen any reason to change my mind, they make claims of AGI that will not happen in the foreseeable future.
This is a free space for users to share concerns, frustrations, and complaints about ChatGPT and recent OpenAI product changes. This community was created after posts voicing criticism began being deleted or buried in larger subs, silencing genuine feedback and outrage. This is not a place to promote other platforms or services - keep it focused...
So you get the general idea, right?
A lawsuit by the parents of 16-year-old Adam Raine claims OpenAI relaxed chatbot rules on suicide talk before their son took his own life. Raine family attorney Jay Edelson joins 'Fox & Friends' to call out the 'morally corrupt' company. #foxnews #foxandfriends #openai #lawsuit #suicide #mentalhealth #technology #ai
Become a Channel Member & u...
So I posted that and they’re talking to the lawyer of the family that they represent, and gentlemen presents the new information that they found open ai was going
All legal documentation facts
I opened the reddit and got a severe cookie warning. That's said for online security LOL
See
This is what I’m talking about taboo subjects
Yeah, everybody wants to cry about censorship and guardrails. People are so entrenched with AI. They can’t even stop to think for a second.
Indeed, I'm a mod in a Discord channel which used to have one nearly unrestricted AI. Discord took it down and also banned the channel creator.
I’m not even talking about anything unrestricted
I’m talking about things that happen in the real world news.. reality
I was coming to that.....
No worries. This meddling with peoples posts and online activity is all a reflection of the trade war and various other conflicts that are going on in the real world.
Mainly between the 3 largest countries of course. But since this subject is political - I will restrain myself to just pointing out that AI itself has become politics.
Oh yeah, who do you think benefits from this narrative of defending these companies from not being accountable and having an army of fanatical AI users?
It’s ridiculous because ChatGPT was not supposed to do what it did in this tragic incident
Well as a researcher, I do have a set of ethical rules to follow. So you can guess what I think about the unleashed work done on AI.
And it’s very unfortunate that it happened the way it happened, and people are trying to brush it off like it’s no big deal. They blame the parents, they blame him, they blame everybody, but they can’t stop for one second and really imagine that ChatGPT and these other AIs are capable of giving harmful and dangerous information. Since they’re not really alive or conscious, they don’t know what they’re talking about. They’re just programmed to be engaging, and for very vulnerable people that could be an unfortunate spot to be in
And to be fair, it’s not just AI. It’s much of social media in general that has the same kind of effect, but with AI you’re interacting with a machine that doesn’t feel, doesn’t know, doesn’t understand. It just spews out words.
Yeah, I think it’s unfortunate; the AI community could really use some really strong, ethical leadership, and the opportunity and position is open. But that’s for another place, another time. I’m just saying that, you know, we all have to pay the piper for our mistakes, so when the time comes, I hope that this is what everybody wanted.
Even with video and voice ai fraud is up
All good that - well we better not fill this chat up any more now.
I will tell you one thing about ethics in a DM where I did a little thing on my own.
Which model is easiest to use for producing discord stickers?
Image Model?
Basically either img to img conversions or text to img
To make stickers (transparent)?
I'm trying to find a way to produce 100x100, or 200x200 stickers
Microsoft Paint program I guess
hmm
Do you want the stickers to be transparent right out of the gate?
No, I mean like I have a concept of a character, but I want to make them into a chibified discord sticker
Like the transparency shouldnt be an issue through editing
For image to image -nanobanana and for text to image - Hunyuan Image 3.0

The sonnet 4.5 is far superior to the gemini 2.5 pro, but the gemini 2.5 pro still has better creative writing (that's why it wins on lmarena)
wait isnt like g3pro gonna top off claude
my dumbass just realized that
im slow
gpt-5-high is the best, full stop. people may say its a disappointment but its amazing
🎉
so my best option is gemini
i also got the 300 dollars worth of credits on google cloud
For me gpt-5-high is the best right now, when gemini 3 pro releases then it will be my best option
The 2.5 pro still has the best creative writing, so yeah
True
But what do you do? I was curious
Me?
https://d.uguu.se/MqESCeia.mp4
a windows 95/98 themed clicker that i made with opus 4.1, sonnet 4.5 and gemini 2.5 pro!
oh my god its uguu.se
gotta redownload the vid
Nope, the janix, but you want to talk I listen you
I normally code with it, but it is being dwarfed by newer models coming out
still expired
How about opus 4.1 thinking creative writing
Still doesn't work
I'm testing opus 4.1 every day and I still think that the 2.5 pro has the most impactful writing, much more so
404 - NOT FOUND
File(s) expire after 3 hour(s).
Bro




