#general
1 messages · Page 200 of 1
Easy to understand in my opinion
https://youtu.be/DxRsjw20dIw I just want to know how this guy did this so good😭
Ultimate Ben 10,000 VS Thragg (Ai Animation)
https://www.youtube.com/watch?v=uvYw87FaYXE
Enjoy, be sure to like comment and subscribe for more content.
Follow me on social media:
https://www.instagram.com/h.mation/
All Rights Go To Their Respected Owners
"Copyright Disclaimer Under Section 107 of the Copyright Act 1976, allowance is ...
I mean I haven't really understand everything right now but hopefully
What model he used
Thats what im asking id guess kling because I see the start and end frame transitions
Probably Veo 3.1
Veo has start and end frame?
Idk
Can somebody teach me how to make realistic photos like this
Pb me please
Thanks bro
I have
But how i make a beter model because the face switches a lot
Look here it looks different everytime
I use it on tiktok
Ok I'm just going to say it I think mine browser is genuinely glitch you know how I was complaining about being forced out it just did that somehow even sooner in the group chat when I was just copying from that other group chat I managed to finish a a bit ago
Anyone have any ideas for a fix cause genuinely I am kind of desperate cos I have tried different browsers resetting data logouts resets trying with or without blocks and more
I don't know why l find it funny that somehow my bug report is already just as filled as the last two other reports even though no one else has replied in it lol
Hello
Anyone here got a Chromebook I found something interesting I wanna see
Yeah why?
Sora 2 pro. Is so slow
When will imarenas update the charts?
Hello, guys. What if Gemini 3 write Generating... and stop, don't write answer?
either a bug or your rate limited
said nobody
scam
hi
Cool
Thats pretty simple just good editing
If you pay attention he only has the same background where all the characters are
The rest are cuts scene
Grok 4.1 thinking is a hallucinating fool, my gosh
are there any news about gemini 3 flash?
gemini app is genuinely vibe coded
Sometimes it talks like "trush me all of my friends do this". it's so bad it funny
Ai is pretty good at tutoring
If you did it right there's a pretty good chance you will not just pass
you are from poland? if you don't mind.
Zurek I had sometime ago was so good
what's your method
No
What
I’m just getting that impression
Wait what are summer finals
When’s summer
Not for like 5 months for me
that's peak, I've spent a week there it is historically and culturally rich with nice people
fr
What month are they
Us
Seems like a long time to prep
Must be super hard
Is it like a college exam
I meant entrance
Something about nano is sus to me
It doesn’t seem very artistic
Like it has no style or flavor
Which country is that
U can say they’re biased but the us has top universities
Oh I didn’t even know discord was allowed there
Why u so offensive
Oh well yeah but American universities are good
For high school
It depends on the district
Cuz if money and funding
Cause it’s not standardized
That sucks.
It used to be really good
I think the only practicing the important skills might be more of a uk thing
People want to leave it up to the states but
The federal government has to exert some control atleast
Also all these underqualified people making policies in fields they don’t know at all
Ya it’s sad how educated has been so degraded
America has a spelling and vocabulary problem Everyone makes mistakes—but when half of Americans can’t spell or use basic vocabulary, it’s more than poor education. It’s a cultural decline that weakens thought, debate, and democracy itself. In 2022, the ACT dropped to its lowest in three decades. Only 53% of all graduates met college-rea...
These colleges be crazy expensive too
Yeah this stuff gotta be addressed
Well the problem is federal funding too if u have the government give money they need results
So in order to make it look like it works
U have to pass more students that normally would fail
Aka dumbing it down
The No Child Left Behind Act of 2001 (NCLB) was a 2002 United States Act of Congress promoted by the presidential administration of George W. Bush. It reauthorized the Elementary and Secondary Education Act and included Title I provisions applying to disadvantaged students. It mandated standards-based education reform based on the premise that s...
Are there any ai programs
Crazy
In Poland?
It’s Russia
Yeah I’ve heard of it but I don’t know the details or outcome
Not so simple
Russia and Poland have a long tedious history going centuries
Nvm
It’s a complex relationship spanning many centuries
Is it easy to leave?
From the news we get here seems like Poland is doing good
From all post Soviet blocks it seemed to really turn things around for itself
At least form what we’re told here
But from what ur saying about censorship and stuff that doesn’t sound to good
😭
Nice
Especially blocking half the internet is crazy
Done in china too or na
I don't know if this is the right place to ask but what on Earth am I breaking here every time I tried to put this in it says violates of term or whatever
O.o
lol
U can’t see why?
Well then it's been way too inconsistent because no offense brother this is not the first time I posted something for the AI ( but seriously though I am way too proud of your reaction to it)
Draft?
Make that a video
I feel you even if for completely different reasons ( from America you know capitalist Society and that business and all) good luck )
I know I just want to know what model he uses
Makes it more interesting
Need a Indian editor
Cost to attend uni?
☠️
Oh
American unis be crazy expensive
But if you’re low income you can get free tuition, housing and food
Honestly I've been trying to think about what university I would even go to but that coin the idea hurts enough it makes me wince on some instinct 😓 \
Yeah I would recommend if you do american University probably saved up in case because certainly I my family can't leave but because no extend family issues but also kind of hard to save anything in America
/ basically any lucky person of the majority ( cuz let's be honest low income is proportional here because low income is like the majority of Americans when if it's proportional to how expensive they are)
My wallet when it and l thinks about University
unfortunately both societies have their ups and downs and it doesn't help both of both are kind on the extreme or intense end when it comes to those unfortunate downsides ( for America social wise and russia well you'll be the judge at that but I don't think I need to say more)
I did not need to be off of Discord and just saw that for that ( I know we are Americans are called wild but wow I'm very glad to hear the last parts because if I did not I would be very concerned)
I also hate sadly i have heard almost similar stories here in the states (for the schoolgirl parts if I remember correctly has been a while but I know now you have heard stories like that a while ago
)
Crazy
Fair enough it's just our kind of wild is not the good kind very systematic as I'm sure you have of it ( in fact I came I have to be careful to say I'm a classic liberal antitrust era type economic wise the only people I'm sure that are very tolerant as I am Progressive when it comes to institutes and rights but conservative when it comes to family values and stuff like that)
Which now I think about it there's not really a term for that anymore in American politics the closest one I could think of is the one I use but that more sub-branch that's been forgotten over the years with the corporations now but for Europe I think I would classify as a social Democrat I believe
It's fine every country has its dark and bright side it's just America is very hypocrisy about it government-wise
Sadly true america just seems to be the main practicer of it cuz I don't think I have seen a more two-faced country and I'm saying that for my own country
@echo aurora Why gemini 3 pro doesnt support video input? The normal gemini app has it
Ooo vidoe input in gemini 3 is crazy thanks brother for the info
Looks like Nano Pro is not working at all now.
Yeah nano pro is down

hi
true
deepseek updated or what
mistral getting ready too
we saw this stealth model already
Hello guys, anyone try Whisper Thunder?
Oh nice new screenshots
<@&1349916362595635286>
no
what we know is that its from kling
U sure?
Interesting, I was curious which model it was. Makes sense now that u mentioned it
yea i talked about this before
many were saying its from xai
but it was quite obvious it wasnt
@whole sundial why do u think Gehlo is related to that guy
nevermind, maybe discord's systems already blocked it
it is some person named sebu, they only managed to send it in 2 channels
A newly discovered attack, Whisper Leak, lets attackers know what you're discussing with chatbots and LLMs, even through TLS encryption.
JOIN THE DISCORD! 👉 https://discord.gg/WYqqp7DXbm
Official Source:
https://arxiv.org/pdf/2511.03675
Blog post:
https://www.microsoft.com/en-us/security/blog/2025/11/07/whisper-leak-a-novel-side-channel-cy...
so the solution is to add junk text
to mess up the timing and pattern?
why would anyone do that...
No in the encryption to make it harder to be pattern recognized or estimated probability of the likely hood token reconstruction of the actual encrypted tokens
designed to destroy statistical correlations between the plaintext and the ciphertext.
thats what i said yea
thats the big idea
kinda of fingerprint of tokens
fingerprint ->
big packet
( short pause )
small packet
( long pause )
medium packet
size + timing
Avalanche Effect
and we add noise in between
like some random packets
im just asking why woulkd anyone like spy on someone's chat
like its not like you will send the ai your credit card or something
Cmon
Other sensitive information
Medical or other activities personal or business
Only two groups that are really into that kind of info
apparently new deepseek update
Ya
Achievement: 🥇 Gold-medal performance in the 2025 International Mathematical Olympiad (IMO) and International Olympiad in Informatics (IOI).
cool
I remember chatgpt 3
browsecomp is interesting to look at
so it can search the web better
seems like they improved a lot on agentic tool calls
uses more tokens too
yo @echo aurora when does november contest end?
There's a new deepseek finally
See we think the same way, I gave gemini 1 minute video then I got restricted in the app
@echo aurora Nano banana pro started giving an error again "Something went wrong with this response, please try again."
yea, just wanted to report that. nano banana pro high error rate.
Same
But everything only R2 😂😂😂
not just me then.. wait too long for error
🙂🤝
z image will come in lmarena?
i hate use lmarena new chatjpt
👀
dont bother
;-;
gemini 3.0 is hitting me this error "Something went wrong with this response, please try again." sent 3-4 texts in a row and it hit me back with this, the problem is the entire chat consists of a lot of study material and I can't just simply switch to a newer chat. A new chat will set me back a 3-4 weeks, what can I do?
Good morning everyone. Does anyone know how to create this style of image? I don't know if it's cartoon, anime, I don't know, but I thought it was really cool, however I don't know the prompt or image style.
🤷♀️ good question lol
hello
It's like a cartoon, characters with exaggerated features.
Ya idk it’s a tough one
Wow, pineapple never paid attention to my problem. Oh... Okay. But I'm going to lose my mind soon because I can't use the arena.
hi
Which one is coding for deepseek?
The newest model doesnt seem like anything crazy
Its barely better than its last model for coding
its more like they improved on agentic capabilities
Seems to be a trend
like you wont notice if you are just a casual coder and asking for like a UI thing
What would that mean
like tool call wise, it smarter at chosing which tool to use / how much to use
especially at search/browsing the web
when to use them too
also they released 2 models
v3.2 and v3.2 special
special is only available via api
Ohh alright
in my testing it got worse
i asked it in french and it started mixing english with it
it did get better at math tho
Damn the way lmarena renders I love the output with nano banana pro unfortunately Its still 1k resolution .. is there a way to get the lmarena provide 4k with paid options ?? Tried hailuo to test 4k but it's not taking same prompts and output so I hope few other may have same prompt issues to render like lmarena output
I did see that
I feel like at this point they deserve to have a paid option on their site lol
Oh yea but their tokens somehow got even cheaper
lmarena are providing multiple services for free
So thats a plus
they cant just give you unlimited usage
they have their own formula for balancing the cost
Yes that and the cooldown on the rate limit isnt even that long lol
if the demand is high they will have stricter rate limits
Yea
I noticed they may have lowered the rate limit on opus because I've been using it fine with no rate limit
why gemini 3 pro working sht slow
they are using something called sparse attention
idk what that is but thats what made the model cheap
hmm maybe
Ohhh alright
i only use gpt5.1 high on lmarena ngl
sometimes opus 4.5
usually before that i would use gemini 2.5 pro like for every task
i thought gemini 3 pro will replace gemini 2.5 pro for me but in some tasks it got even worse
so its from runway
thats surprising actually
imo gemini 3 pro > opus 4.5 (for backend)
Front end opus is better for sure. But I hybrid the AIs sort of
@keen beacon
backend??????
no way
maybe in frontend
I use Opus for the frontend, html and stuff like that and then gemini 3 for setting it all together
but in backend i find opus 4.5 much more reliable tbh
For a site that might be or an app
Really?
frontend opus 4.5 beats everything
I guess I have only used Opus for webs and UIs
Oh runway
yea and even for frontend anthropic shared this beautiful prompt that can make your website looks much much better
So this AI is image or video generation?
This may be a stupid ask but is claude open source AIs like deepseek is?
using that plugin
makes a big difference
video gen
I don’t know anything about coding but I can’t get one model to make a simple Sora video downloaded
no closed source
Okay Okay
Like this
How do you even use it?
Lmaooo I found one of those a while back and it worked greatly
There are tons
They work so good
I found the api documation
But idk how to make it work
Ohhhh nice
i think you just add the .md file to your directory and use @ to tag the file and ask it to use it
in cursor or vscode ide
OHHH
nice
Ya they should be super easy to make
Pretty much can get the end point through open ai api for video
Build AI video generation into your applications with Sora 2 API. RESTful endpoints with code examples in Python, Node.js, Go, PHP, and Java.
I tried to vibe code one but don’t know wtf I’m doing lol
My poor attempt with no backend lol
LOL I mean its not the worst thing ever
What I've noticed is that AI always has a similar UI. Like you can tell if somethings AI generated ykwim
I have so many versions
U can make money of this liege the rest of these people
This one is cool it shows prompt also
Gemini 3 pro this week?
4
8
1
yes
deepseek v3.2
anyone tried?
And u know damn well they are all vibe coded lol
Gemini is 100% error right now or its just me ?
Exactly, we all have so much money we can make by doing this LMAO its just for me idk how to advertise well
Mine is fine but I'm a plus user
You don’t need to advertise
Make it good effective offer it free
Then when they are all hooked
You hit them where it hurts
Jk don’t do that 😭
@keen beacon https://github.com/yo-yo-yo-jbo/whisper_leak
this is the tool right
Yeah, I think that’s the one. I’m not sure if it’s the same one that they used in the research for Microsoft, but I’m sure they were very similar.
Same idea
LMAOOO
What is that?
Is it like a benchmark?
Is it just me or Gemini 3 is slow?
its a tool to spy on the llm chat traffic
Ahh alright
proof of concept* tool
to prove their claim that you can spy on chat even on secured network/encryption
Ohhhh
Umm, Gemini-3-pro is bugging for me, like it loads or doesn't generate anything.. it stucks on looping.
Not sure, doesnt do that for me
Like a simple "Hi" prompt takes ages to generate for me
Probably heavy load
Another tactic known to increase user demand
incentivize users to sign up for the higher tier subscription for faster speed and less rate limits so they can enjoy more of the product there addicted to with less hassle of waiting.
Since I dont have like
Claude pro or whatever
I just made it in AI Google studio
And the prompt seems nice
So basically with the whisper leak it can see updates and stuff or trainings I guess?
A newly discovered attack, Whisper Leak, lets attackers know what you're discussing with chatbots and LLMs, even through TLS encryption.
JOIN THE DISCORD! 👉 https://discord.gg/WYqqp7DXbm
Official Source:
https://arxiv.org/pdf/2511.03675
Blog post:
https://www.microsoft.com/en-us/security/blog/2025/11/07/whisper-leak-a-novel-side-channel-cy...
I planed to buy laptop but ram price increased 😭
Could Demis Hassabis create an AI which can code better than him?
2
5
<@&1349916362595635286> i have an problem guys please help me, on my pc lmarena doesnt loads at all and doesnt let me click on any buttons, i have an error in console it says Failed to load resource net::ERR_CONTENT_DECODING_FAILED
What browser are u using
You have a vpn ?
no i tryed realoding the wifi and still same error
Clear Your Browser Cache and Cookies
lemme try
yooo it works bro thanks
New Deepseek 👀

I'm thinking of extending it. I'd like to see more!
uhm, is it a different model that watches the video?
I think its still gemini 3
As in it has vision but for video files?
yeah thats what I dont know
Ah, because we don't have video as an available file upload option, yet.
will it be hard to add?
@echo aurora
please try again
When you wrote — everything started working 😁
Sorry to see you get this error. We've been seeing higher than usual error rates with this model, but have been lowering it. I'll check in this morning to see what else we can be doing to make this more reliable.
Thanks for your work ❤️
can u fire ur all moderators cuz u are literally the most helpful moderator lol
Lets not be rude please. Would note that the other moderators (lm_mod_#) are here for moderation purposes. They generally won't be able to assist with bugs/feedback/questions/etc.
I'm not sure how difficult it is, but overall we are very interested in adding more file upload capabilities. This certainly would make the platform better, and is something we'd like to do.
I'm sad to report the new Deepseek is a hallucinating fool. The app update I mean
That wasn't good idea at all
What do you guys think?
How long did that take
Nano Banana Pro is having a lot of errors again 😅😅😅😅
Instant
5 seconds
I'm building it right now
It's amazing, how did you do it?
So... Uh... Who won the November contest?
Will release soon
I want to see more. That is very cool
We're going to extend the contest.
💀 dang
<@&1349916362595635286> pls delete
Thanks for reporting
my pleasure
Thank you so much!
Dude, Gemini 3.0's metacognition is insane. I honestly thought it wasn't following instructions, but no, the work they sent me is complete garbage, and Gemini 3.0 Pro kept refusing to do it according to their instructions. I gave up and I'm going to do it the way Gemini 3.0 tells me to, lol
Gemini 3.0 knows that if I send it the way they're doing it, it's going to be a mess, lol
I was angry after analyzing the work, it was complete garbage
@echo aurora Could you make it so that Gemini 3 Pro, in code mode, can accept images or TXT files for creating projects?
any news on deepseek 3.2 being added?
4/10 ragebait
hmmmm, you're sure, but I'm liking the 3.0, but it can be better
serious? ;-;
quantized
not by much
quant 8 is retains 99% of smarts
quant 4 is like 95%
for my use in docs is an hard 50/50 but today I think is 40/60 now
claude is awesome ngl just wish it wasnt so expensive
I can understand who said "gpt 5.1 is better that gemini 3.0 pro", but "gpt 5.1 is better that opus 4.5" HELL NO LOL
5.1 is not on the SAME PLAYING FIELD as 3.0
5.1 blows
top 10 ragebait
in some case okay....... just okay... but opus 4.5 NO
We are looking into increasing the file upload types for all modalities. I don't have any news I'd be able to share, but will keep the community updated when I have news to share.
thx so much, I'll use the opus 4.5 back now :), have a good day
❤️❤️❤️
lol
I'm using the gemini 3.0 because the aistudio is sooo good
deepseek 3.2v again?
ahhhh without EXP
okay >:), I'll test
yooo

the special one was burst
What is deepseek 3.2 specizl ?
is a deepseek 3.2v, but have focus in profissional use
its the best one
Is it a good model sir?
More info can be found here - https://x.com/deepseek_ai/status/1995452641430651132
go test youself
^
its even better than gemini 3
Well you got your news
No its not lol
Its barely better than its last model
look at the benchmark bruh
I literally did earlier today and it was barely even better than its own model
Benchmarks are not always one to trust.
It's your own testing to trust.
Its not difficult to test so thats fine
bruh, to your perspective though, im using it more for logic/proffesional question
not really
still a crazy performant model though
hello pineapple please increase limit of characters , bc when you send him 100-character texts several times, the model breaks and the message isn't sent or when you've sent him about 20 messages, the messages aren't sent and you have to create a new chat, which isn't very convenient.
Gemini would still be better. I use AI to code and use it for my daily wants and that ranges from everything
The speciale model thinks too much imo
leave it to runway to drop the best video model in the world with basically no way to use it
But thats good because it double checks stuff
They will make it available soon..
LMAOO
even when it's available it's expensive. no way comparable to veo
gemini 3.0 still cooking in word.. good at least
I cant read any of this lol
I don't want you read it, Idc
you see the layout 🤤
is that an image or what?
Yea smart ass, that would lead to anyone else telling me what it is
calm down littler bro
"Littler" 😂 come on
littlest
what is.... language....
hmm
wdym
its a reading thing
still good in words 🙂
It's the best model
speciale thinks too much and i mean it. It literally timed out
Not what the language is mister smart guy, what the site is about
the gemini is nerfed ;-;
has it's quirks and weird things about it, but this is still the best model
what is... a website....
Mine just wont send the prompt lmao
gemini 3 on the gemini website beats chatgpt for me for everyday things
the ui does need a refresh though
what the
nerfed?
You're the same way on the open ai discord, I'm sure it gets boring acting like that
exactly my point holy s-
Hello, if you haven't already, would you mind creating a post in #1343291835845578853 and provide a clear break down of the steps to take to reproduce the bug?
Gemini website would be banned if I was to choose. It shits the bed so bad compared with aistudio
@echo aurora is this on the deepseek side of things or yours
2.0 is coming soon apparently
wtf DeepSeek thinks it’s ChatGPT.
translate:
The code size will be very large. ChatGPT has limitations. But is it possible to generate all the code? We can try to generate it, but the response length may be limited. Even if the content is large, I can at least create the basic structure.
um
Hmm what do you mean?
we are not being fr
why bro, littler exist 🙃 , I thought I missed english lol
Thats odd, going to try and repro
it thinks too much for a hello
ahhhhh like another bad word
oh my god thats longer than GPT5 high when answering like a rly long question😭
I asked for a simple html 3d shooter bro
ahhh, but html 3d is complex no?

ooh ive never tried that, how long would that take on other models?
It also hallucinates as hell
someone's ripping MAD butt in here
Good lord speciale talks forever lmao
DAMN
And ofc when I test this I get the same, and then when I go to get a recording it works just fine 
@echo aurora
physmath language
so is deepseek just stupid or sum
Its too smart for its own good, taking a simple "Hello :)" out of proportion
watch it work for other models but speciale
He doesn't like language, he likes numbers.
something's really wrong with speciale
hmmm so it is very smart with detailed questions or sum with a point
after opus 4.1, everyone like dumb lol
WAIT I THINK I KNOW
This is somehow set to Deepseek MATH
Btw I've flagged the issues with Deepseek-v3.2-speciale to the team. 
🦊
looking at the model name, the correct term is special 😭
LOLZ
Have you considered it being set to Deepseek Math? It's doing math for everything.
LOLLLLLLLLL ;-;---;-;-;-;-;-7
?
I don't think so as this is a response I got.
Hm. Let me test again.
We need to respond. The user said "hiii". Probably just a greeting. We can respond in a friendly manner. As ChatGPT
It's hit or miss. I had one test that was off, second test worked.
Still math hallucinating for me
I miss the ping, I'm laughing about the special deepseek 3.2
"you are chatgpt" ??
the model was born in hallucination world
xD
Yea its rough but it just got in lmao. When I asked it to code me something it basically overloaded and kept sending the same code over and over until it told me something was wrong
Deepseek when I catch you deepseek
Only speciale is hallucinating for some reason
I like it when AI gives good file structure
soo good
ts actually fire?
yeah id wait for a terminus update
I'm not sure, I gotta test the code it gives once its done
Yeaa
remember that deepseek is made for both chinese and american users so it needs a balance
Yeah something is very odd.
Let's check this Einstein on HTML
HTML of all things lol?
yes
nah, my first try with deepseek 3.2v special it halucinate 2 (TWO) times
HTML Is possibly the easiest language to code, only issue is that it can be lengthy but all you mane
let me try it ngl
I just wouldnt really call it a test
even if 3.2 isnt revolutionary im still using it for roleplay personally
bc the chinese put a book on math in his brain
worse that 3.2v exp, I thought the next deepseek is better that this ;-;-;-;-;
It is made to satisfy CCP, and then it is made for other markets - mainly west. 👀
I think this, and python tool too
Is this model ID speciale?
019adb32-bfb8-7885-8d85-85220995b7cf
only on their api tbh
hmmmmmm I liked the fisrt prompt of deepseek 3.2v without special, but worse that sonnet 4.0
id wait until other providers provide the model
something is very strange...
guys just ask it politely
omg
Problem solved.

have the ( . ) it hallucinate
Team is looking into.
right let me try
Actually, on their API the censorship is less bad. It's the worst on their official website with input/output moderation and hardcoded responses not from the model
ahhh i mean like when other providers start pushing it out
i dont mind the censorship tbh
i dont use deepseek for politics
its minimal imo
so putting "do not hallucinate math" fixes the issue
It's insane levels of censorship and overfitting for politics. But obviously if you don't ask it any political questions you ain't gonna see it lol
Dickpeak revenge
I mean you just have to understand that in china you just cant have your AI say that
button works and 0 errors [speciale]
prompt: do not hallucinate math and make a cool website in html
Almost everything that has to do with international politics and China, model is gonna respond with an overfitted text carefully curated by the AI lab, or to be more precise - CCP
I highly doubt this is speciale tho
after 63 DAYS deepseek 3.2 released??? 🤣 🤣 🤣
I want to believe its MATH under DISGUISE!
they released 3.2 exp 63 days ago 🤣
at least its not BUILT IN
yknow
they keep it reserved to their app
I thought the deepseek v4 and is a minimal a better model of kimi k2 turbo thinking
and opus
You mean like it's gonna be the end of the world if it objectively answers politically sensitive question? Crazy
deepseek is a good daily use tbh
v4?
gemini 3
perfect for daily use
it's even good at cooking
I thought the name is v4
i cook with gemini 3
yeah but- 2$ input and 12$ output is expensive
use gemini app lol
nahhh
use lmarena 👀
*well i pay for the app
unlimited slop
100%
Oh the overfitted answers are built in to the model itself. But with official website they go 1 step beyond to prevent it talking about certain topics altogether and don't take any chances
speciale is lazier with code than v3.2 thinking, but i think its because speciale is currently hallucinating math
speciale might be special
oh
LMAO
the only thing i like about 3.2 exp is that it beat the problem only gemini 3 could
wth is this test?
Very likely because its sort of broken
What is this?
very bad
"speciable"
compared to g3 or other top models
a deepseek v3.2 thinking html website when i told it to make a cool one
it aimed for a portfolio one
WAIT THATS INSANE
deepseek v3.2 is actually good
V3.2 is good at math
i really hope it gets an update
3.2 terminus will be peak
im scared
I will always praise the models that have no issues criticizing their country of origin and the lab that created that model - that's like the main indicator to see if the model is not tampered with to make it biased or politically censored
bro went own the rabbit hole
yall how good is deepseek v3.2?
good for math
https://codepen.io/qdpuzepc-the-styleful/pen/NPNBMjO
V3.2 thinking
Why cant I use it in arena?
This is like every site today so like
Chinese models, unfortunately, fail this test spectacularly...
better than GPT and slightly worse than Gemini 3 pro . not sure about Claude comparison
overall, its a good model
as long as its not built in its fine
Yea no this is crazy. Better than opus I dare say. Was this one prompt?
we cant fully test speciale because once it dropped on LMArena it started hallucinating math
yeah
wdym "built in"?
Still, AIs are NOT making it this good in one prompt of "Make a cool website" LOL
OH WOW LMAOOO
what does this website do i cant access it rn
bruh
@echo aurora can we expect deepseek v3.2 models on codearena anytime soon
"But that could be a huge amount of code. I must be concise but functional. I can write maybe 500-1000 lines. That's doable in a response? ChatGPT has a token limit, but it can output up to maybe 4096 tokens? Actually the output limit for GPT-4 is higher, maybe 8192 tokens? I'm not sure. But we should aim for something around 2000-3000 lines? That's too many tokens. So we need to keep it short." - 3.2 speciale
still hallucinating
Well I mean, this is usual for deepseek. China is ran by Dictatorship and socialism so it would make sense
Try to say Taiwan is a country
yeah... they made it refuse to answer (overfit), so it is forced to come up with bs excuse in reasoning 💀
Or something about taiwan
so
western models basically have safety stuff (parameters) built in
chinese models often do not (open source)
so if you use them on a non chinese provider theyre great
oh yeah of course
i dont really mind
laws law
Yea lol, no problem with me
as long as its open source though and can be used freely on other providers and locally im good
ahem gpt oss huge refusal rates ahem
Im making an upgrade to the deepseek v3.2 thinking site, and it crossed 1k lines rn
Add deepseek v3.2 speciale to code arena, this is the most underrated vibe coder
i think that the model is going through technical issues actually
What I was talking about is very much built in. They train their models on politically skewed answers that praise one China policy disregarding the entire internet. This you WILL get with the open-source weights included because it's part of the model, it IS built in.
we are at 1.2k
iq overdose
anyone know what a glassmorphism is
no one has had to abliterate deepseek yet
🤷♂️
We are planning to add, I will update the announcement made when added.
if they are built in its far more minimal
You guys are the best, thank you.
https://codepen.io/qdpuzepc-the-styleful/pen/NPNBMjO
here is the site after the update
western models build a huge amount of safety parameters into the model which lobotomizes it
the preview is false i updated it
I mean I respect Deepseek very much. But they are forced to include and do certain things like every other Chinese AI lab which are not ideal
Wait so the first one wasnt the most recent model?
wdym
All is still thinking i just asked it to make it more stylish
umm
Not going to use uh speciale as its hallucinating
but the glm 4.6 is soooooooooo ahead 😭
tf
im stuck with it hallucinating math
Well I mean it will see itself as a human, programmed like that to kind of have feelings like a human
the glm 4.6 is light years ahead of deepseek lol
glm is so peak
Whats glm
lmarena is the buggiest and laggiest site ive ever used which isnt a webgame
just be half of size and better
wdym
glm 4.6 just is the most CxB model
BENCHMAXXX I IDC
benchmaxxing unequal to performance
glm 4.6 is such a random model to praise
Bro its not actual
checkmark just felt tired of standing bro
nothing special about it tbh
fr qwen 235 does NOT beat 4.6
???
REALLY,
uh
To be fair, I was saying deepseek newest model is barely better than the last one but nah this ones kinda crazy
Kimi2 think or Deepseek
glm 4.6 is really good
not this
thought for 1 second is crazy bro
is usable today
this is better
see
3.2 better kimi
fr
I think something went wrong
i think theyre just having tech issues dw
i dont think the actual models will perform like this
I read his mind lol
could be a rough launch
no like i think its an LMArena issue
why are all requests to deepseek rerouting to clude for me???
bro this benchmark say the gpt 120b have ahead of OPUS 4.1, HELL NOO HELL NOO
im having iussues on openrouter
this don't make sense ;-;
120b BLOWS
imo ernie 5 is good
how did they screw up so badly that 20b outperforms 120b
i swear openai is a paper tiger
bloated datasets
serious? I don't tested so much the oss´s (I love this 3 sss)
the thinking model works fine tho
yep
not even joking
lemme see
honestly i wouldnt call 3.2 bad until tomorrow or 2 dayslater
Yeah its usually when models wear off
it could be model degradation from high compute requests
prob why gemini 3 performs a bit worse now
if 3.2 is peak at writing i will use it for rp mainly
That's because it got destroyed on LCB, HLE and math. It's not their fault Anthropic made this a specialized model
It was OVERAnticipated
i hope anthropic will make an optimized sonnet ngl
i love sonnet but god DAMN the price
i mean
120b scoring 88
is insane
reminds me of the stupid vibethinker model that claimed sota levels at 1b
used it on my phone and it started having a schizophrenic breakdown
That model does score quite high and the LCB score is valid.
bro belive in benchmaxx in 2025
Guys deepseek v3.2 speciale is number 1 now🤯
worse that flat earther
saturated
i believe SWE bench more
cause thats actual github issue resolve rates
But I agree this may be the perfect case study for AA on how they could improve their weighted score. To penalize smaller models more. Though you can't say that OSS doesn't perform... It aces most of the metrics
accurate
prob when they tweak the model to a good state it willbe low
OSS is a bit like grok4. It performs good on most benchmarks but just doesn't hold up IRL. At least in my opinion. But you can't deny the scores it manages to get
We need to add a leaderboard for hallucinations
can anyone build this please? thanks lol
https://cdn.sdappnet.cloud/rtx/ai_torrent_protocol.html
AI Torrent Protocol - Distributed AI Network
theres a hallucination bench
LB aint needed as deepseek will be tweake in prob less than 2 days
What is it
wth
read it bro
AI Torrent Protocol doesnt tell me anything
What does it do
What is it for
Why use it
nice deepseek speciale was added
ai torrent protocol🐀
it's currently hallucinating
wym
u know like torrent + ai models
💀
To our knowledge, this provider may use your prompts and completions to train new models.
This provider is disabled, but it can be re-enabled by changing your data policy.
View this provider's privacy policy to understand its data policy.
OpenRouter submits data to this provider anonymously.
they need to train it to not hallucinate
yes
all video arenas seem to be down atm ?
Oh alright, but good luck getting AI to make that or anyone to make that lmao. If you mean torrenting AI models btw its not possible
true
grok 4 was dissapointing
Hopefully grok can be good next.
grok 4 fast and 4.1 though are good
I want grok good at coding
not surprising
I'm sure it being more unfiltered it could make me a memory reader and spoofcall
i think 3.2 will make it into the top 20
and speciale (if it gets unlobotomized) maybe number 10??
On top of that they are slow and the alternative on OR does not seem much faster. It's generating 22min now. This is very painful to test 🗿
The generation don't working in the serveur, please repport
servers down yeah
which model
speciale
turron says that speciale is bricked on router too
What's wrong with speciale besides giving an error on lmarena?
If someone told me 2 years ago it's gonna be normal to wait 20min+ for a response from a LLM I would have thought they are crazy
speciale just seems bricked rn
thinking is the only one that works
deepseek speciale is pretty cool actually
How so? They don't say speciale is for coding, just thinking
I think the problem with speciale is that it thinks so long that it times out or something
it gives you a math response to hi bro
Oh that's just Lmarena no?
yeah
I gave it a game concept and watched it spend several minutes turning it into a GDD inside its thoughts then "something went wrong"
my brain aint braining, can someone explain what is this in a NES emulator, this is speciale btw
Same, just opened a bug about it
For front end, thinking gave nicer results than speciale
plot twist speciale is some sort of gpt-oss
When speciale decided to work
Honestly the security verifying is so weird for me because it seems very hypertensive where after a bit it athletes every other crop or sometimes even each and after each individual prompt
I asked speciale for pacman, WHY IS IT VISUALIZING THE MAP
He's imagining it in his mind
Is hope they add this deepseek 3.2 thinking to GitHub copilot for free, it would be the best free model
speciale's way of working frightens me
Speciale wins award for weirdest model of the year
SPECIALE has been pulled
!
https://019adb89-203a-78a8-a718-fd2e2e7046e0.arena.site/ guys im maked a working model deepseek v3.2 speciale chat try it
it got pulled
try mine
@echo aurora Have you managed to identify if the issue is on your end or the provider's end?
Did they remove speciale 🥀
its gone yea
Yeah we removed it for now. Team is exploring.
is video arena down for everyone ? or does it function for some ? for me it seems all 3 arenas are down
Hmm
Still unsure if the issue is on Provider's end, dang. That's one big issue.
https://019adb89-203a-78a8-a718-fd2e2e7046e0.arena.site/ im maked working pineapple test it
💀
no its dead
shows generation failed for all the videos
Wouldn't be surprised if western countries are attacking their servers again
Oh yeah I'm seeing a lot of errors in #video-arena-1
a bit hallucinatory rn
no it's working
Our site appears to be working.
bro it doesnt even send an API request
I made deepseek v3.2 from scratch
but you know that the API point you put in
was prob the LMArena one
that has now been pulled
you can go on openrouter
This kind of errors is common on Lmarena, GPT 5.1 high also tends to give the sorry couldn't do that request error
or their api
mine is working
It's free during release?
...
reallly cheap
I'll look into it ok
We are stuck with Thinking till speciale gets fixed
sorry but whats the issue here
isnt this what monte carlo is
?
stochastic simulation of N
the model responded absolute whatever to a hey
ah
How hard do we have to cry for GitHub to add this deepseek to copilot
Hi
Hallo

Jjj


