#general
1 messages · Page 175 of 1
and then i created a tampermonkey script to intercept the checkpoint
and display it when it was on A/B
I'm not sure they don't assign the same properties to different models...
they assigned the same
in a/b
it wasnt random cuz
we all got same performance for same checkpoints
just go on google and type "x28 checkpoint gemini"
you'll see
i'm trying to get
an a/b
to show you
just got it
i'll show you how you can see the checkpoint
now
so first you wait until theyre done
you said you are a LLM researcher befoee.. and now you are a Security Resercher? So what? are you trying playing on us? OMG!
Perhaps, I mean plausible... I miss when the days when you could just direct chat on lmarena some of the biggest unreleased models
check my twitter
when did i say i was
an llm researcher
lmao????
link?
i said i can create a group with a stanford ML researcher
yeah crazy
so thats im correct!!
i'm a security researcher
i can create a group with someone that is an llm/ml researcher
LLM same as ML.. but LLM is bigger..
i didn't say i was
oh my mistake..
np !
so what you can do with your programs?
hahahah oh my god
the result is amazingh!
prompt: "
User
Create a neo brutalist web page, esthethic that looks like a modern museum., 1000 lines, multiple pages, effects etc."
one shot
@ocean vortex
looks like a site..
he might got to sleep..
yes it is a very beautiful one for a one shot prompt!!!
SOTA in coding
@ocean vortex
so when you submit A/B test you go here
this is a one shot?????!
WoW! then if it was one shot!!
i couldnt do that inone shot!!..
yes it is one shot
and then you can go to ```"{"temperature":1,"topP":0.95,"topK":64,"jl":[],"maxOutputTokens":65536,"safetySettings":[[null,null,7,5],[null,null,8,5],[null,null,9,5],[null,null,10,5]],"enableCodeExecution":false,"enableBrowseAsATool":false,"Fj":false,"responseModalities":[],"zf":false,"enableSearchAsATool":true,"googleSearch":[],"thinkingBudget":-1,"outputResolution":"1K","model":"52c1a3b8c57b46fb"}"
and at the end you have ""model":"52c1a3b8c57b46fb"
and this is the checkpoint
So where does it say "x28..."?
or the actual checkpoint this is
"model:"
so when it was x28
it used to say
"model:x28blablabla"
but now it's gone
you mean its now renamed to 52blablablabla?
no, it's a different checkpoint
but very good
it's gemini 3
oh okay..
this is gemini 3 result
no other models
can one shot
this @ocean vortex
that's why i'm saying it will be sota
how can i?
make urself an automating script
that seems encoded
or just obfuscated
bro
when you do another
a/b
it's not the same
and sometimes it comes back
and when it does
it's the same performance
that you had previously
with the same checkpoint
is your head full of air
i'm done with this debate
lookes like u typed very fast and got removed very fast..
cani connect to gemini 3.0?
You are not understanding what I'm saying. It is clearly obfuscated as that is not the real name. Not saying that you can't get the same one the 2nd time
hey guts does anyone know why i cant send photo to ai chat it wont let me send
please.. can i chat to you, from me to you?
how did u get a gemini 3 checkpoint on AI studio
Let him do his 'security research' with inspect element Network tab 🤓
lmaaooo!!!
@jovial sapphire yo bro how do u get A/B tests on AI studio
cuz i never got them
is it locked to some regions?
maybe
i wanna try Gemini 3 checkpoints on AI studio
maybe they're better than riftrunner
in coding
.
pls could u add an option to share chat?
u mean in lmarena?
bro
i know
it's not the real name
yh
captain obvious
thats why it's called checkpoints
and we call them by their codename (x28, ECPT)
try usa
how can i encounter the checkpoints
usa
ip
i can run prompts for you if you want
if you have very good prompts
does it require some prompting to get a checkpoint
its cool if from the site u could even download the chat to ur device.. beacuse i want that !
yh
hmm lemme search through my prompts
is your chat in lmarena is direct --chat?
yes
also are the new checkpoints better/on par with x28/x58
some of them are on par
but x28 was very good overall
i love to see non-nerfed outputs
then we must develope together a tool that gets the chat messages very easily..
from gemini 3
yes x28 was crazy
well if you have other coding prompts
don't hesitate
yeah i saw crazy demos
i'll run them now
youre the first one who knows what checkpoints here are lol
happy to see not everyone is a skeptic
how can i join the x28?
well, i've known checkpoints since lithiumflow
lol
like 3-4 weeks ago
yup, a/b testing first starts on
ai studio
then lmarena
'''Create an interactive 3D wormhole simulation using three.js, featuring a tunnel-like geometry with animated textures that give the illusion of depth and motion. Implement custom shaders or materials to simulate swirling energy, stars, or nebulae moving along the tunnel, and animate the camera to travel smoothly through the wormhole. Include controls to adjust parameters such as tunnel length, speed, color scheme, and distortion intensity in real time. Provide well-commented JavaScript code, an HTML setup snippet, and brief explanations of key three.js concepts used (geometry, materials, shaders, animation loop, and camera controls).'''
thats a good prompt
no problem
how many checkpoints are live rn on AI studio
and how much time does it take you to encounter one?
js curious
i've seen 3-4
some are very bad
sometimes a few mins
sometiomes 30m
shi wait i can upload
did it come out already
the code to codepen
nope not yet
but
i can show you one run i did earlier
hey guys does anyone know why i cant send photo to ai chat it wont let me send
prompt was to create neobrutalist website
thats a good prompting idea tbh
was the output any good
wait brb
i got something rn
codepen is down...
yes
here
one shot
"Create a neo brutalist web page, esthethic that looks like a modern museum., 1000 lines, multiple pages, effects etc."" prompt
it is one of the bests i have tried rn yes
im pretty sure, there are other gemini 3 models in there like flash and flash-lite
thats why some checkpoints suck at your attempts
this is so cool hahaha
yes
hmm, do u mind sharing it
wait paste the html file here
only 10KB, lets see
how about riftrunner
its a gemini 3 checkpoint too
on lmarena
lets see how it compares
how do u know
maybe riftrunner is a little worse/better
oh bruh
.
i thought u meant that the checkpoint is riftrunner
imma try the same prompt with g2.5
oh no nono
its far better imo
i'll try SVG
imma try it with riftrunner, maybe it produces something interesting or different
trying with classical g2.5 pro
super trippy
(gemini 3, not gemini 2.5)
LMAO
gemini 2.5 pro completely failed @gaunt spade
doesnt even work
you can still get gemini 3 through a/b on studio?
which one is that
wait
oh yea
i thought u tested another one
im yet to get riftrunner in lmarena
its quite difficult
u can, but i think you need to be in the US now
have u ever jail breaked gemini ? grok? , i have and worked with them.. , but the next day , it have been restricted.. even if i tried encoding the prompt.. looks like lmarena shares the chats with companies , and maybe have told them i had tried how to make something , if i told him as a test to the jailbreak prompt , he will say , "Sorry, I can't assist with that."
well riftrunner might be close, cuz its Gemini 3 too
havent u tried it yet?
its a little worse than lithiumflow
i guess
i found it very bad
i guess even gemini 2.5 pro , got updated everyday...
beacuse it got better by everyday...
yes
trying opus 4.1 on the wormhole thing
to see if it performs better than g3
so u mean if i tried to edit the thing in browser, i can connect with gemini 3.0?
it's nothing to edit
it's just a/b
just wait until it releases bro
it will release in a few days
no worries
can you do this one "Create a fully functional, identical and playable DOOM (1993) clone created in three.js <{5000 lines of code}>"
it crashes
after
3k lines average
how can i understand what does a/b means? u mean a dual ai of gemini 2.5-pro making them selfes a 3.0?
launching now
np
nonono
can you explain this guy
a/b isnt means a for gemini and b for other one? working together?
basically on ai studio
when you send more messages
you have a probability
that they will ask you to choose between two answers
for testing purposes
and one of them is gemini 3
i rememberd that..
launched your test buni
howw did u know??
a/b is a rare testing thing that comes up in Google AI Studio sometimes, where you can choose between two models by the one who responds better, choosing A or B
looks very bad imo @gaunt spade
this is Opus 4.1
for wormhole
looool
very cheap looking particles animation
looks overengineered
and doesnt even look like a wormhole
i have rememberd that happening alots of between a few long period of times..
doesnt look like a wormhole at all
gemini 3 one was so much better
howw did u know if it was really a 3.0???
and how can i connect to gemini-3.0-pro?
someone made that prompt in X28, i remember how crazy it looked. but it still looks good on the new Gemini 3 checkpoints
@jovial sapphire
because the demo looks very good
and its detailed with proper coding
and how can i connect to him?
you need to be in a specific region (the united states)
and run some prompts on AI studio until you get the A/B testing checkpoint
that means a better model will produce your output
and you get to choose between one of them
it takes a long time
you are same as gemini and ill tell you same as what im gonna tell him when he does that:
" Why Are You making the things more complex and trying hiding the true solution to me? you could easily told me use v.pn!! please dont do this again ever, as i have eyes, and can know , so u better dont hide, otherwise ill use gpt "
and how can i know which one?
so i can use both ? instead of seleceting one? and how?
lets go
it does so well
okay try again
didnt vote lol
nice!
i know how riftrunner does
and i dont vote
if it got it
so i get to chat with it
a little more
@jovial sapphire imma make the wormhole and then doom
looks like his feet is connected with the cycle.. 😆
okay!
waiting on doom
no a/b for now
bruh riftrunner FLOPPED SO HARD
i voted and it turned out its indeed riftrunner
wow i didnt expect it to fail at the wormhole
AI studio is the real gemini 3 pro
LMAOOOOOO!!
riftrunner is a nerfed gemini 3
sadly
if they release this one
i will be so so so disappointed
the code looks like from 2016!!! 🤣🤣🤣🤣
well well well, this checkpoint is the same on that appeared on canvas on gemini app 😭
the outputs were the exact same
i didnt laugh this hard on whole this day until nowww!!!!!
no canvas
was not as good
as ai studio one
yea
thats why
it was like riftrunner
im hella confused
thats my point
they released the worst checkpoint on canvas
thats my point 😭
yea....
i dont know what they will release
in the end i mean
maybe riftrunner is gemini 3 flash
with some thinking
cuz rift-runner sounds like a FAST model
for its codename
@hallow hemlock maybe u tried the Gemini-3.0-ultra-lite 🤣🤣🤣🤣
YEA
wow
i think theres a way to bypass the phone number thing
but maybe it got patched or something
yes
wait
actually no it s kind of good
enemies walk towards me and attack me
when i approach them
which type
the red eye thing?
this actually looks close to my riftrunner doom
pretty good
yup
i'll fix the one that has
one error
and see if its better
lets see
looks like there is just one monster and there is no good controls..
i think the one we got
wazs bad checkpoint
actually no it was the same
52c1a3b8c57b46fb
here check this
i think problem here was a very bad prompt, it was very short
"
User
can you do this one "Create a fully functional, identical and playable DOOM (1993) clone created in three.js, all in a single html file, 2000 lines of code."'"
still better than every other model tho
blegh
✦ The Team ✦
AdamX: https://twitter.com/AdamEHKS
Brooskee: https://twitter.com/brooskeeb
Ciara: https://twitter.com/bresnahammy?lang=en
Checkers: https://twitter.com/chexchess
Corin: https://www.instagram.com/corinkeen/?hl=en
Colleen: https://bsky.app/profile/solarcitrus.bsky.social
Claudia: https://bsky.app/profile/hiyfi.bsky.social...
i have....
but its not a web or for web.. yes it csn be a web but its a fornt end could be a web..
nevermind..
i need web
how about explaining monero how it works in web page for a teenager like me cannot stand how monero works ?
like i know bitcoin how it works.. but monero i cannot stand it ..
like i cannot stand how it works in hiding the amount of current of what the sender have and to the reciever and everything is hidden.. i cannot stand that..
everytime i asks ai about it such as gemini 2.5 pro , it cannot make me stand it..
yes , just tell him make it better and standable for a teenage guy such like me..
just dont host it ur self online.. just give here the .html and ill host it myself in the terminal..
"Your task is to create a single-file, highly interactive HTML educational experience designed for a teenager who understands Bitcoin but is confused by Monero's complex privacy features. Focus on using relatable, casual language, and high-contrast, brutalist styling to demystify how Monero hides the sender, receiver, and transaction amount. The entire page must feature accurate, step-by-step animations or interactive visual analogies that explain the core mechanics of Ring Signatures, Stealth Addresses, and Ring Confidential Transactions (RingCT). The final product should be a visually striking and functionally clear demonstration that makes the user finally "get" the system. A neo-brutalist style axed on Monero visual identity. Make a masterpiece, minimum 1000 lines."
this is the prompt
u know how did u make the prompt?
also do u know that i dont have gemini 3.0?
i use it
alright.. ill try gemini 2.5 pro..
i will try Gemini 3
and send you the html
alright..
if he did make a short one.. say to him that the user dont love the short answers and wants a longer ones..
and tell him he will read it..
also tell him the user know what the pgp is.. and what the signature meaning and the publickey means..
why u didnt give the html file or code?
cuz i cannot interact with it..
such as in the image , i cannot press buttons..
thats funny what he is saying at the end.. dont do something.. HahHaha
this one is a different checkpoint "ovdf5tin1bp9d2o3"
@regal zealot
one shot is crazy
i still laugh at the end of it..👀
hahah ayes
but you see, the result is amazing
gemini 3 is very good
yeah.. it could make it better such as like of what i imagine of how the sender or receiever when it wants to send , i thinking of maybe if the node of blockcha1.n it uses if the hidden hands knows it.. perhaps they might know him..
or may thought is wrong?
let it fix it.. so i can know more about moner0.. and also know why my only-view wallet needs alots of time just for syncing..
so u mean my thought is right?
ask gemini 2.5
as of what i see?
delete this NOOOWWWW!!!
you have been blocked..
alright after i did blcok , now im comfertable, that all i see is a "blocked message--Show"
alright.. so sorry..
i rememberd that i asked that question in the old past of time when the 2.5 was new..
but it didnt yet explained to me and maked me stands it its secure , so its not have that issue that i thought of..
how and why its bad?
i meant if u dont get good answers
maybe its because you dont ask the right questions
G3 out?
but i can access
via ai studio
if u have good prompts
i can try for you
can someone help me please i have an issue
huh? , u mean my question is bad or my language? , i mean when i did ask it, i did in my own language witha little english..
.
i cant import photo to ai chat why please help
Yeah let me prep them lol
oktysm
Which one?
claude 4-5
if u have some respect to my privacy, u better not have asked me here..
On which app lmarena?
yes
bro
its not private
who cares
fe.ds may cares..
@echo aurora Can u help with this?
u dont have my issue ?
even if i didnt do anything wrong.. they still logs me in their logs..
Could be an issue with the app, but send screenshot of error message to help them debug I’m not dev but I can guide u in right direction lol
Sry what's the TLDR? I was headed out the door, how can I help?
Wassup bro, how u been?
hello please help me i have an issue i cant import photo to ai claude sonet 4-5
Paste error too as an image if you can
ok
Can you fill out a post in #1343291835845578853 if you haven't already? I won't be able to answer now, but that's the best way to flag and technical issues.
Bro same
Thanks! Next time I’ll just forward them them lol
heere
But u got this tho, u using ai to help?
i did but noone answer me
@balmy mist its jpg
brah whats even that slang ?? "ADDH"??
How to generate image to image please
Try refreshing(ctrl + shift + r), I’m outside right now, but I could help when I get back home,
Ah yes, this model IIRC we had isssues with enabling vision with. At the moment it's not going to work with image upload sorry to say.
maybe "AD OF UHD?"
wich ai model has import photo
i did stil not wokrs
I’m the same way, I’m tryna start my family now, that la how far ai pushed me away from tech and I minored in it lol
wait actualy working on gpt but all model claudes has this issue
That’s weird, there could be an issue with Claude models in arena, dm I got something u can use
lol yeah 23
Finish college a while back
But kinda miss it, adulting hard bro😂
Man i would say it get better but … lol im thinking about changing careers
Im a software engineer rn, but I want to branch out so bad, i hate being at my desk at home all day, ik its a privilege but this making me crqzy lol
what grade u in
just be chill and enjoy some moments on different stuff
u dont have to always learn about AI
hello here
i need some help
after uploading my image for video animation and prompt what key to press for generation because enter isn't working?
U right, it’s just so hard not to cause it’s become everything now tbh
Hi @echo aurora Is there any update for the UI?
lmao chill why u suddenly mad
Hello, I want to learn, share ideas, and connect with others passionate about AI.
hello, want to learn prompts, generate videos and share ideas
Proof of your success?
<@&1349916362595635286>
its a scam bot
As I thought
$100k wow
What it was?
there was a spammer who said you can make 100k by doing whatever he said or some nonsense
scammers everywhere... i hate them
Does google login not work for anyone else
its still in every other channel
i cant even open lmarena cf under attack just spins forever no matter what browser i try
same
guyz du u beliv mi
Hey everyone! I've been sifting through a lot of posts here trying to find information about two battle arena models: Crystal and Quasarflux, and I've only found guesses that Crystal is LLama and Quasarflux is Grok. Is there any definitive information published anywhere, or is it always a secret? 👀
New models?
@quartz light are these models new
I saw Crystal a long time ago, about a few months ago, and Quasarflux, yes, it’s new.
That's chatgpt
@ocean vortex bro even your system prompt is not doing anything against the heavily nerfed gpt 5 pro
The custom instructions are not un nerfing it
The prompt you are trying it with nearly all current models are gonna struggle by chance depending on their tokenizer. As far as I'm aware nothing major has changed with their current tokenizer. So essentially the model can't get this right without being overfit. You could test the tokenizer and how it performs or to what extent reasoning can remedy it, but then you would need much more prompts like this than just 1 random word. Different models are gonna fail on different ones
@ocean vortex you said your prompt will make it think longer, is 4-6 minutes long? I tried many tasks which should take long stem and non stem
yeah 4-6min is not bad. I'm kinda fascinated by the 5.1-chat performance though currently. It is impressive how good it performs for how fast it is
But it's hard to say definitively if system prompt is really the only change their website to API
on API it is much more blunt
Reminds me of grok4 while being as fast as non-reasoning model, on API
Bro before it took 10-12 minutes for lower tasks than what i tested
And the quality is worse
And I'm not talking about 5.1 I'm talking about 5 pro
I mean how can gpt 5.1 heavy thinking take longer than 5 pro and give better answers
For pro there's no 5.1 Pro yet afaik
on API I only see 5.0 Pro
I don't have ChatGPT Pro. It's a waste of money as far as I'm concerned lol
I don't too
but hence can't comment on it much
I'm using business subscription free trial
Do you have any prompt that you ran earlier and tried now without any changes?
Cause it''s very hard to know without retrying the same exact thing
You may think it was 'easier task', but models do not exactly always work the way you may expect them to. They will end up thinking longer for tasks that you anticipated to take less.
And then the quality is next to impossible to compare if you are comparing different tasks
Yes
Many infact
yeah but i havent seen crystal
btw look
guys i have a question
my thinking injector for any model
it worked on grok's stealth model
Seems like there's a lot of the usual reddit shtposting there to be completely honest. I don't see a single prompt and output in there.
What is shitposting?
Yes
Comments not based on reality but rather unfounded emotions, in this context
on god?
say on god @hollow imp
because chat gpt 4o is like a human buddy in old days
Can you try one now, to compare it directly all things being equal? Also shorter reasoning alone does not mean it is worse. It is actually a big plus if the final answer turns out better.
THE FINAL ANSWER TURNS OUT SIGNIFICANTLY WORSE
On skibidi
Then show one example of it like I asked lol
🤯
say on god bro.
on skibidi is everything
if you say it, i believe...
I can't, that prompt has some personal stuff
On taro sakamoto
Cause that reddit post does not document a single one as far as I can see
imma search skibidi, wait
just a pointless thread for the most part
Dom please give some nudge phrase to make gpt 5 pro answers more quality
you said there are many. Pick ONE you can share. 🗿
lmao
What AI is better
13
21
1
Gemini
As good as 5.1 is, very excited for gem3
5.1 sucks at coding wym
Did I say I'm coding?
No idea how good it is at coding, but clearly it's the only use case...
gemini 3 is close
This week... I will be using it a lot
Will be curious how flash is compared to 5.1 too
Oai really neutered 5.1 by disabling legal assistance too. So gemini will be an even bigger deal there.. It was already better than many lawyers
What experience have you made with Claude-4.5-Sonnet-Thinking?
When does its performance begin to degrade?
4
5
1
after 90k tokens already
Yeah they told gpt not to do any legal stuff. It was already better than a lot of lawyers I'm familiar with in gpt5.0
So disappointing. I know people will be switching to gemini.
If you're a lawyer that's not double-checking the legal reasoning, or are using any case law, you deserve to get screwed
No, whats your main use case?
No, it's simply not better than 5
A few.
Translations, creative writing, legal, summarizing, transcriptions of audio. Probably more I'm forgetting
No it is better than 5...but incrementally. And then they disabled legal assistance
Too many idiot lawyers used gpt without using their brain, so oai disabled it... Disable the lawyers instead
Yeah, that is a issue
Gpt 5 is better than a average lazy lawyer, but he will be barely feed of context
Hi, I'm new here 😄
Yup. If you go through it many times it usually gets there (with better than avg reasoning).. Rarely on first couple attempts
for translations and transcriptions, AI is pretty much good enough (as long as you give it just enough context to know what it's about)
imma use deepseek ai bro
Gemini
GPT 5.1medium is around 72% (plus), and high is around 76% (pro)
is SWE verified not the best benchmark for coding?
anyone know why its not working?
🔥
It isn't. It's only part of the story and a coding benchmark that is being exploited the most currently
And is being exploited for this very reason. People like yourself thinking it is the best 👀
lmao.. isnt that a bubble..
Anthropic did it first, now Google and OpenAI doing the same. Focusing on SWE much more than they did before. Things like codeforces, LCB, Aider etc are secondary now
Kinda predictable if you ask me. They are always gonna focus on the things people are looking for. SWE is also easier to rig than having to worry about all the alternatives equally, in my opinion. Like with o3 they had to test a ton of different coding metrics. For 5.1 people are kinda happy just with improved SWE... 👀
i dont code, i dont know either way. just curious. Are there better benchmarks?
I also keep using deepseek xd, 3.2 is goated
Maybe, but now it will be benchmaxxed even at unwill
It's not a bad metric, but there isn't really 1 definitive coding benchmark that would stand in isolation of everything else. Variety is always important against contamination. So the score is only 'valid' if the scores are high in other related coding benchmarks as well
Not correct on what exactly?
It happens mostly because they try to adjust it "better"
yea
Just like, lmarena nerfs Gemini 2.5 pro somehow since it is weird to it be better than sonnet 4.5
the order varies a lot depending on what you choose to look at. For LCB just about all Claude models suck but this is still coding
It is only partially true, for example an model that have higher world knowledge (Gemini 2.5) can outperform sometimes other model like o4 mini, not because the quality of model is consistent, but because the sum of competences
You are way oversimplifying it. There are many models from each of them. And I'm still not sure what you didn't agree with in my initial message you quoted tbh
yes
4o is bad at coding, but he knows ball and perfoms well in more standardrized tests
I dont see a reason to not count human judged tests
And also it keeps being true in automatized or LLM judged tests
Curious, i wanna read that
How i said, i am being objective and not affirmed they are really good
How do you think current benchmarks were made if they aren't human tests? By aliens?
🗿
Yeah, an test jugded by like Sonnet will be better than jugded by humans, mostly because it is smarter than a average human istead consistency
This is the most ridiculous thing I have read this week, no offense
This is like asking some entity to launch investigation into itself
LLMs are terrible at self evaluation
Anyone is bad at self evaluation, if you swear something is good, you will judge it being good and do in that way
The problem here is either you don't understand AI at all, or you are having issues trying to communicate it.
The issue is not even AI, its about self judge
yeah he can't even grasp that, true lol
Can I only generate videos here in the server, or can it also be done on the website?
That's too much schizo for me to engage.... all this 'psych' world 🗿
Classic example of someone trying to apply something he studied to different incompatible fields. You are not ML engineer my man...
An "Self Judge Benchmark" that compares the level an AI check it self perfomance comparated to an actual realiable test would be an a interessing number
I trust Claude can jugde it self but not Gpt for example
How would you even make the AI check itself without making ir reference some external human data? That approach is flawed by it's very design
And you can't make it generate test questions either cause it's never gonna come up with something it can't answer correctly. It needs to know the answer for the question to be valid
Generally from what I saw, even making it come up with the hardest questions it can answer is problematic. It's gonna sway into making the questions it can answer very easily. Because it is creating them based on what it knows and understands. To push it out of it's comfort zone you need human written/curated problems or tasks
Yep, just here 👉 #1397655624103493813
Does anyone love gaming and read manhwa especially cultivation manhwa
idk what manhwa is
Korean comic
oh damn
If you watch solo leveling that's based on manhwa
tbh i dont watch anime
or korean comics
There is a game released yesterday
Thats a f2P game
If you love manhwa you should play that game
yo guys
what is familiar to chatgpt 4o in websites ai
like really really familiar
good for coding and stuff
anybody!/
this is python?
no its html
ah
rip gemini 3
Nahh, Gemini 3 will be better than that
I see Gpt 5.1 high is just like Gpt 5 ultra high since they adjusted the reasoning effort
Ok Claude models are literally unusable without Claude max
16k and 32k reasoning tokens are not enough
Gpt 5.1 extended thinking and gpt 5.1 high are way way ahead
no its NOT
not even close
have u tested the new Gemini 3 checkpoints on AI Studio, within the A/B testing feature
yeah, it's not even remotely closed.
not saying gpt 5.1 is bad but Gemini 3 is just another league
gpt 5.1 can compare to gemini 3 flash
lol
maybe flash lite
i wont be surprised that 3 flash == 5.1
i have bigger hope in 3 flash than gpt 5.1
i guess we'll get a much better idea of capabilities this week. im curious
LOL
I am just concerned that Google will raise price of Gemini 3 because it is soo much better
gemini 3 pro is nuts
I think only pro is launching this week
its voodoo bro
i think gemini 3 might demand a premium.. but we will see. if google wants to earn extra $ or want to be super competitive
and gpt 5.1 will allow u only 5 prompts on free plan, while you'll get much higher limits with Gemini 3 Pro
lol
that was always the case
Hi, please remember that english is the only language allowed in chat channels 🙂
Guys Sherlock alpha which ai is this?
nothing important
they already did with 2.5 pro for 6 months
on open router, someone said it's grok 4.2
its from xAI (Grok)
yeah it sucks anyways
Grok is the worst
ah then grok 4.2 like they said
Nah bro, the grok 4 fast in the release I thought it really good in my use
Your standards are very low
meta is the worst. Grok is fine but has some strenght at least
even the worst chinese model beats it
I found the grok 4 fast better than the Chinese models at launch, but as it launched the glm 4.6 then it is useless now
yeah glm 4.6 is a miracle at such low price
who uses grok bruh
for its coding abilities
just shut it down already
Why some people said the gpt 5.1 dont is better that 5? I fell a huge improve
Models like that are out of conversation
grok is decent but nothing exceptional as far as I've tested. At least softer guardrails than open ai
elon musk gonna hire some robots to your house
🤖
That's not decent nibba
you know what sucks with 5.1 tho
It has the same damn style
every time
yes
it lacks creativity i noticed its all the same thing
no variety
@stray aspen ask your yupp ai to get gpt 5 pro api
i have been wrong multiple times about Elon.. I wont make the same mistake by counting Grok out
unlike gemini 3
They are so rich
which gave me different style
About grok 5, do you think it will come really good?
i mean it's not completely trash
Atleast it doesn't hallucinate as much
and it likes putting a bunch of text so much
also u can control the temperature with gemini models, which means different results each time
is it 5.1 ? what is the prompt?
its gone lol
they removed it
this the prompt
Gemini 3 never will be released 😢
2 more days for gemini 3. atleast we dont have to wait too much anymore
😭
Grok 4.2 is worse than Gemini 2.5 Flash
18 nov.. mark my words
2 more days 🥶
Just december
Lol 🤣
Yesterday I tried deepsider.ai and for now it offers free Sora 2 (10s) ai video gen 👇
scam labs ai is better than grok 4.2 lol
Scam labs? Why? Because the "" mpu"" 🤣
Deepseek v3 crushes Grok 4.2
We need more chinese spies
gpt-3.5 turbo crushes elon musks clanker
Yupp ai is removed?
Qwen is better than grok atp
scamlabs is actually glm 4.6 with Claude writings
Yoo
Maybe every ai company uses chatgpt models to train their model but grok doesn't do so because of elon musk's grudge that's why they are so behind?
Serious? 🤔 Lol
isnt gpt 2 and gpt 3.5 dead
well they actually said "we pulled some weights from qwen 3 and glm 4.6"
Gpt 2 is open
oh its opensrc
what is scamlabs?
movementlabs
Movimentlabs
Even Qwen3 4B is better imo
😹
grok providing the worst closed source models ever
Even qwen 3 4b Q2
real
No joke, that's real
vibe thinker 1.5b > Grok atp
grok is not that bad... its not the best but its ok 🙂
I feel like gpt models are better at python than html
i didnt test it, is it good ?
Is soo massive the 4b 2507 with only 0.4% of qwen3 max even be a good model
for a small model its definitely worth it i'd say
I never tryed this new model, is multilingual?
good reasoning
that's the goat of the slm by far for me
Too
If you are going to use a gpt model, i recommend for smething like pygame or python, because it aint lazy on that one [unlike html]
It does
HEY GUYS
It even runs well on phone
WHILE GENERATING A VIDEO CAN YOU NOT GENERATE A ANOTHER ONE?
CAN ANYONE PLEASE CLEAR ME ABOUT THIS IN?
@deep adder bro I asked the same gpt 5.1 model a very complex task in chatgpt ui and lmarena and in ui it completely declined my request and kept repeating system guidelines system guidelines but it perfectly worked on lmarena. So in api you not only have custom system prompt feature but also this annoying openai safety guidelines is gone?
For chatgpt they are routing 'unsafe' requests to longer reasoning
GPT 5.1 working for anyone?
yeah?
🤔

For me, it just keeps generating the message and then gives an error.
curious if OAI will release new models immediately after Gem3 comes out
I bet 10 dollars
There's nothing to do with safety for the prompt
Uhh discord is not letting me send it how do I send
it works, but gave a dumb answer
my guess is new improved image model..
5.1 or 5.1-High?
with some kind of meme-ability
thinking. probably medium because im on plus
For us ppl using their models through api is the only choice
Then pick extended thinking and use nudge phrases
HOW DO YOU KNOW IF YOUR VIDEO GENERATION IS DONE OR NOT?
Hi! You´ll receive a message from the bot when your video is ready 🙂
MAN ITS SAID 10MINS BUT ITS ALREADY AN HOUR STILL NOT GETTING
I think if you say i will give your advice to a lawyer as the next step, it restricts legal advice a bit less
I don't believe you one bit
If you're so good then tell me how do I bypass openai's system prompt and guidelines
Very annoying
Kill the legal ban system prompts