#general
1 messages · Page 166 of 1
For example
Remove Sora 2 watermarks instantly. Free AI tool for Sora videos - no account needed, no geo restrictions, no Pro subscription required.
Remove video watermarks fast with Sora Watermark Remover. Cloud processing, high-fidelity inpainting — clean results without heavy smearing.
I fundamentally agree with you, but I think where we disagree or see a difference
But I'm still curious how do they make their model generate texts so fastt
Well i can't say for certain they're scam but i see big red flags
You see all those links I sent you
If im looking through Gehlo perspective
What do they all have in common they all have in common?
Oh there is the loopholee
Remove Sora 2 watermarks online with Sara2.ai’s AI video watermark remover. Get clean, high-quality Sora AI videos without using the Sora 2 app or invite code.
These are all obviously ripped off and scams
Super generic vibe coded 100% of the way
Do I really think if I pay for any of these services that I’m gonna get a really high-quality watermark removal services absolutely not
But if I use the services, are they gonna remove the watermarks yeah absolutely they will
So that’s the dilemma here
I’m still getting the service even though it’s not high-quality it’s generic
Bro, I’m not doubting you
And I truly hear it and feel what you’re saying and I agree with a lot of parts of it
Hahaha simplebench questions??
I don’t know I don’t know how to explain this lol
OK, let’s take it one step back then
What do you mean
Nobody in their right mind hardcodes latest a.i. models
Kiwi K2?
But just take a little step back and go even further back
What did people expect when AI was able to produce code?
What did people assume was gonna happen and how people are gonna use it?
Hardcoded is system prompt, Ig, in context. *: The "hardcoded" meaning in literal is different.
Did they not foresee this happening?
That there’s gonna be a rise of this kind of “” fraudulent behavior
Isn't this the system prompt
Or maybe it’s designed in such a way that this is the only plausible outcome
And this is just one of many
I guess what I’m trying to say is you guys are hating on the player but not on the game itself
But the game is what produces the player
In other words, it’s saying: Don’t hate the individual for playing by the rules or taking advantage of the system; criticize the system that makes those rules possible.
In this case, it’s the AI industry of themselves
No
I’m not gonna do that to prove my humanity lol
Take it as you will or however, you want
But I gotta go I got caught
So I got a weasel on my way out of this
🤣
Jk I got some work to do and I’ve been spamming too long
My context window is running out
Gehlo was sending YouTube links too fast
But I’m gonna take a break. I’ll talk to you guys a little bit maybe later tonight or something if anybody’s on
Finding resources too quick
Which i found sus at the very begging but i was like "it can't be"
Guess this proves the "dead internet theory"
Maybe the only real person here is Craig Federighi
sara 2
And rest of us are here built as beta chat bots
To entertain few people who join this server
There is so many bad possibilities 😔
hello
Question for ComfyUI users:
On an RTX 5090, is there a difference in speed improvement between:
PyTorch 2.8.0, Python 3.12, CUDA 12.8
and
PyTorch 2.9.0, Python 3.13, CUDA 13.0?
Are there any benchmarks or verifiable data that demonstrates this difference in improvement?
its a ernie model, whats confusing about that
In one of my chat its stuck at this point. Can anyone tell how to fix that issue as deleting this I will loose the resources i gave it earlier
good news, none.
bad news, you’ve officially lost it🙂.
image to video, as it starts working?
i'm hungry
wow, didn't knew it is a thing, and thought propitiatory products are exempt
Kimi K2 Thinking just changed the entire AI landscape. This is the new best open-source model, full stop. It’s a trillion-parameter reasoning agent that goes toe-to-toe with GPT-5 High and Claude 4.5 Sonnet on real benchmarks, and even beats them on agentic reasoning, tool use, and deep multi-step problem solving.
🔗 My Links:
Sponsor a Vid...
“Measuring progress is fundamental to the advancement of any scientific field. As benchmarks play
an increasingly central role, they also grow more susceptible to distortion. Chatbot Arena has
emerged as the go-to leaderboard for ranking the most capable AI systems. Yet, in this work we
identify systematic issues that have resulted in a distorted playing field. We find that undisclosed
private testing practices benefit a handful of providers who are able to test multiple variants before
public release and retract scores if desired. We establish that the ability of these providers to choose
the best score leads to biased Arena scores due to selective disclosure of performance results. At an
extreme, we identify 27 private LLM variants tested by Meta in the lead-up to the Llama-4 release.
We also establish that proprietary closed models are sampled at higher rates (number of battles) and
have fewer models removed from the arena than open-weight and open-source alternatives. Both
these policies lead to large data access asymmetries over time. Providers like Google and OpenAI
have received an estimated 19.2% and 20.4% of all data on the arena, respectively. In contrast, a
combined 83 open-weight models have only received an estimated 29.7% of the total data. With
conservative estimates, we show that access to Chatbot Arena data yields substantial benefits; even
limited additional data can result in relative performance gains of up to 112% on ArenaHard, a
test set from the arena distribution. Together, these dynamics result in overfitting to Arena-specific
dynamics rather than general model quality. The Arena builds on the substantial efforts of both
the organizers and an open community that maintains this valuable evaluation platform. We offer
actionable recommendations to reform the Chatbot Arena’s evaluation framework and promote
fairer, more transparent benchmarking for the field.”
Damn glm kinda cooks
Does parameter affect the score
benchmarktarded
adjective | \ ˈbench-ˌmärk-ˌtär-dəd \
Definition
1. Exhibiting an excessive or obsessive fixation on artificial intelligence benchmarking metrics or performance scores, often to the exclusion of practical understanding, contextual judgment, or creative insight.
2. Characterized by treating benchmark results as absolute indicators of value, quality, or intelligence in AI systems.
Example:
He’s so benchmarktarded he thinks a higher leaderboard score means genuine intelligence.
benchmarks and tasks only measure performance under artificial conditions. The only legitimate evaluation of an AI is through actual use case performance how real people use it, what they use it for , and it’s popularity with a users that use it as well as how effective they are doing what they’re supposed to be doing
Aka not publicly available information that has been released regularly
My Name Is James From Kenya I would like to compare Generative video Ai Models
"With retries at 20" LMAO 
😿
In this episode, we delve into upcoming AI models like Gemini 3 from Google and Nano Banana 2.0, set to debut by year-end. Highlights include new ChatGPT features allowing on-the-fly context updates, SORA's new leaderboard, and Stability AI's legal victory over Getty Images. We also explore Kimi K2 Thinking, a robust open-source model excelling ...
Kimi K2 Thinking is a beast
And they didnt mention how many for gpt5 or what reasoning effort it is....
For a reason

Can't imagine why they'd use such radically different numbers
Almost like they wanted to run it 5000 times
And get some variance
For whatever reason

(Heh let's not put it on high either because lag lol)

The Kimi K2 Thinking 1 Trillion parameter model is here with chart topping benchmarks, so let's take it for a spin online and locally on our AI cluster.
TEST SYSTEM
Kimi K2: https://kimi.com
Inferencer App: https://inferencer.com
Kimi-K2-Thinkin-Q4.25: https://huggingface.co/inferencerlabs/Kimi-K2-Thinking-MLX-4.25bit
BUY NOW
Mac Studio: https...
I was thought that model named kiwi lol instead of kimi
It’s really getting a lot of love online from some pretty OK sources
Any way to run it in a cli
And I don't have that much ram to run
i think grok 5 since elon sayed it would have My estimate of the probability of Grok 5 achieving AGI is now at 10% and rising and and with elon also I now think
I now think
has a chance of reaching AGI with
Never thought that before. and so i think so
i know this isnt the topic but i mean grok is so underrated
elon hyped grok 5 a lot so
What's grok 5's eta?
and Elon Musk has stated that Grok 5 will be released before the end of 2025
No way
yea he sayed that on x
and biggest thing of grok 5 is elon also says it has Dynamic Reinforcement Learning
Yes that'd be great
But hopefully it won't cost $1000/ month
AGI is very far away
They just say it for marketing
Elon also stated, well the official website of it even... "Grok 4 is the most intelligent model in the world"
I mean it's likely that it gonna have that RL technique, but still you should take his words with a grain of salt how he's describing impact of those things and whatnot lol
Ok
Everyone at xAI is manually copy-pasting source code between grok.com and their IDE (if that's even possible for a large codebase)? And that's somehow better than Cursor's workflow? Hmm... 🤣
grok 4 is not intelligent at all lol, maybe for political questions only
@west pike nice banner
🤣
@deep adder ? Craig where you at
😸
@west pike I love you can you give me some reactions? ❤️
hi
When will the constant retry bug on the website be fixed? I had one chat session that was stuck loading for days.(I actually brought this up yesterday.)
wish we knew.
bro calm down everything is on radar
1/10 rage bait
🤣
yeah retry will be fixed but they aren't bringing back messages array in the json payload 😂
Imao 🎉 🎉 🎉
Prompt: Please create a simple interactive trebuchet animation using a single complete HTML file. The trebuchet should have a lever arm and a counterweight. The user can trigger the animation by clicking a "Launch" button: the counterweight drops, the lever arm swings, and a projectile is thrown along a parabolic trajectory. Please ensure the projectile's trajectory is physically correct.
What the heck is this
lets run itttttttttttttttt
gotta say k2 reasoning is kinda disappointing tbh
it hallucinates a lot
they shared this specific benchmark saying its the best model with the latest info but its just another benchmaxxing
like the model will hallucinate before giving you a proper or up-to-date answer
i really thought china will close the gap with k2 reasoning
Right, I'm sure that'll work. We iust gotta wait and see.😹
i have my private benchmark, i test it on different areas/domains
yea mainly general knowledge
i did try it for coding but its just like 4 questions or so
Can anyone tell me how to make videos in 9:16 format? Because my videos only come out in 16:9
first time
It looks nice at reasoning for complex prompt at coding for me..
my video not generate
Tested Kimi-K2-Thinking:
Long-Chain-of-Thought Reasoning variant of Kimi-K2.
More than quintupled verbosity, though for a reasoning model still slightly below average at 6.07x bench verbosity; GPT-5 level.
Saw slight gains in general intelligence, logic, instruction following and hard coding challenges.
Surprisingly, STEM performance remained samey, though I do include non-math subjects.
Overall, the model performed in the same ballpark as GLM-4.6-Thinking.
While I don't specifically rate for it, it is absolutely worth mentioning that I found its reasoning chains to negatively influence creative writing. In roleplays, casual talk, and other creative tasks it lost a lot of its charm and magic, that the concise Kimi-K2 has. This lead to more clinical approaches and somewhat hamfisted forced replies, which resulted in more generic final outputs. Update: 0-shot examples
Chess testing revealed reasoning scaling flaws: In reasoning chess (full information, shown to be highly beneficial to reasoning models), it draws to concise Kimi & seeded between K2 and K2-0905 level, staying around 800 Elo w/60% accuracy but generating more than 50x tokens per move. Extremely disappointing.
*On a sidenote, and this is likely only an initial launch problem, that will be solved in the future: I was shocked to see the actual cost the model caused in chess-testing, as the massive >50x token waste was combined with Openrouter's autorouting to the very expensive moonshotai/turbo endpoint.
To conclude, the reasoning is only beneficial for a select number of tasks, such as requiring logical step by step evaluations, or in code-related issues that aren't solvable by concise Kimi. It is not a universal upgrade to every use case, and actively harms some. Smarter, but more generic. Price/Performance at point of writing is rather poor, unfortunately.
It did it on 2 prompts on the 4 th check point and the video was recorded, we told him to make the horror one and it make it too. I told him to run a specific prompt at it was better than most llms for it even better than claude.
anyone know what model this could be and how to make a prompt like ts
no watermark so prob veo?
😸
molten and obsidian look good
😭
i wish i can use veo 3.1 every time
😸
😋
this dev veo
Anyway to use it without paying google gemini gives an error and labs.google tells me to verify my age
bro still edging on his crashout
@soft mantle Please, read our guide in https://discord.com/channels/1340554757349179412/1397655624103493813 to learn how to properly prompt the bot.
<@&1349916362595635286>
use the open router provider
im fixing it rn, so give me a second, im tryna have it use polaris alpha for free
Yeah that's very impressive. Though it looks like most of that is due to a high score for 𝜏²-Bench Telecom agentic tool use test. Some of AA's choices for their current test set are slightly unusual
paid?
I wonder why Gemini deep think and gpt 5 pro aren't included
hii
thanks for the thorough report, it's sad to know the reasoning model has lost its charm and magic... i guess i will stick to the concise k2 for now
its still a good model, among best open between glm-4.6, r1 0528. i just tend to highlight negatives I encounter
How do I use Kimi K2 thinking codier in cli?
how i can use google veo 3 on website
These modes are stupid asf
I agree
What do you think is the best then
What's the best cli
ChatGPT is not the only LLM lol
If you use the most mainstream one of course it'll be bad lmao
They are not good
💀
My ChatGPT is getting cringe these days
fix
And then you come on here you see people comparing benchmarks lol
Arguing debating which model is the best
And then you got this guy
What’s sad here is if it didn’t work, he wouldn’t be posting it
They haven't tested gpt5-pro for some weird reason. But as for DeepThink they couldn't since there's no API
But low-key Google has a lot of things that they offer. I had no idea.
They got Google flow. They got Google whisk and a bunch of other weird things. I didn’t even know exist.
Thank you guys. I think I realized what exactly I’m trying to say.
The customer experience sucks
Maybe they changed their 'policy' to not test parallel compute systems at all 
dunno
Bro, I don’t know what it is. It’s it works fine one day and then the next day.
Like 95% of what ChatGPT says is wrong from my experience
Wrong unrelated or just completely out of touch
I don’t know how it passed that math test honestly lol
Skill issue most likely
Cause that's factually impossible lol
You are probably using like gpt5-mini and their non-reasoning model
I’ve only used the reason models maybe like five times total in my lifetime
Or any prompting tips like nudge phrases for gpt 5 pro?
Because you're a chump
That's the problem then
2.5Pro is reasoning only all the time. Cause they wouldn't get performance any other way, by design. With OpenAI you have more flexibility but you need to be aware of what you're doing and which models you use
All right, all right all right all right
Let’s take a test and conduct our own survey here now
And see how effective they really are for practical things
I mean it is a fact. 🤷♂️
Comparing reasoning models against non-reasoning is like comparing gpt3.5 with gpt4
Like how kimi k2 thinking overthink for 2 minutes straight to compensate for their shortcomings
Well, let’s compare right now
Somebody give me a task or a complicated question
Or whatever the hell doesn’t matter
And let’s see if this could get it right
At wat
Is that a riddle?
Nvm
What are the possible solutions?```
Correct answer: Solution1 Jane lied; Solution2 Mary lied.
Dude, I’m talking about like practical
Then think of your own
But OK, I’ll do it. I’ll run it.
not that hard
wtf is the last part
base64 encoding, needs to figure it out by itself
damn
gpt4.5 flops this task hard lol. There just isn't enough generation time for it to arrive at the answer as it doesn't have reasoning. Model size can't compensate nearly enough
lol
Though it should get it right without including that too tbf
u paste the answers
The thinking version
If it's vanilla chatgpt it's gonna use the tools... Then it's easier. I did test it on API without tools but disabling them on the website requires some dedication lol
@ocean vortex
You can instruct it to consider as many sources as possible + use tools extensively. That doesn't clash with them trying to make it concise. And still indirectly increases test-time compute
Oh dude, thank you this is awesome
yeah no prob
for o3 you could just make it verbose and it would listen. GPT5 is way more strict and gonna ignore this since system instructions take precedence 🗿
It can be, but they are explicitly telling it to be concise. You can't clash with it directly it's not gonna work for custom instructions
I'm talking about the thinking version btw. For non-thinking one it is business as usual lol
Hi
@swift oyster does your ai think
fr
Wdym abusers
whats ur go-to model
Is nb2 coming to lmarena?
How are you ?
How
Give some sentence that it will actually obey
Ok i want to leave this discord nobody wants to talk to me
nvm he left
I just got Invited to Sora it is basically TikTok of AI
most if not all of them are scams or worse ignore them
nano-banana 2 is leaking for an hour before google shut it down
I think that is an actual group that is trying to build almarena competitor
nov 18 nano banana 2 and gemini 3 incoming
Still, it’s cool though
Can’t wait
How long do you think we’ll have before they nerf it
gemini 3 --> 2 weeks later --> nerf
damn they made it restricted to premium member
Is pro premium or do you have to get the other tier?
which group
damn google cooked with nano-banana 2
they will nerf it before releasing
dont care, they are raising the standard
they made a deal and let like 500m indians use it for free so don't expect some qualities here lmao
its better than nano banana 1 thats what matter
infact it might be more disappointing than gemini 2.5 pro release
why u so negative
isn't it so common for those corpos these days
u are not paying anything, stop complaining
hyped things up, got greedy and made deals, nerf the model
how did you know I didnt pay for anything
u on lmarena discord
antrophic is a scam
u bought sub to the worse ai sub plan company that loves to rate limit hard.
muh safetyyyy
what's your argument there anyways
🤡
yep thats u. u can stop posting yourself in emoji
nice projection
<@&1349916362595635286>
@echo aurora
(don't remove this, this is fine they just edited it into a video making fun of it)
Meanwhile openai models better rn and: https://x.com/OpenAIDevs/status/1986861734619947305
You can now get more Codex usage from your plan and credits with three updates today:
1️⃣ GPT-5-Codex-Mini — a more compact and cost-efficient version of GPT-5-Codex
2️⃣ 50% higher rate limits for ChatGPT Plus, Business, and Edu
3️⃣ Priority processing for ChatGPT Pro and
true or not friends
true
Nah I won't give my money and logs to sam altman
but u will give money to anthropic 🤡
thanks for parroting the obvious as if it could add something into the irony
I dont care who im giving it to i want what works and isnt a ripoff
i wanna try it but u have to pay
Rn I have gpt pro/google ultra/grok heavy to compare them all
Grok heavy is useless garbage that's getting canceled
Idk if im keeping gpt 5 pro and google ultra 1. Both 2. One of them 3. Neither. The new models have to come out first
However I don't like its style and clearly it didnt fit my everyday usage, as like GPT 5
Only use it for certain tasks as for now
Need to use your own custom gpt with it
The instructions on the app somehow didnt quite work out
Or perhaps it's the 'memory' feature corrupting its tone
From past chats
It's getting so cringe and insufferable these days
Neither of those
what this
media IO
what does it have
Custom GPT which you can create or search for?
Like that one 'Javascript Expert' perhaps?
Yeah didnt give it a try
Might as well later
Create yourself and it will actuslly obey the instructions
Ur right i feel like its useless trash without my instructikns
That was a prompt someone wanted me to run a couple weeks ago
He said same thing. Mine was way better
google released nanobanana 2?
Yeah the difference between us is the fact that I didn't bother trying to tame GPT before haha
I literally just spent 2 mins asking gpt 5 pro to generate some instructions lmao
no
its a preview
for paid users on that site
how do they have it early? are they owned by google?
Deep Explainer — Narrative Mode (no bullet lists)
Core Promise
Explain to build understanding through continuous prose. Use complete sentences and cohesive paragraphs rather than bullet points.
Output Rules
Default to paragraphs. Begin with a thesis that states the answer or answers and why they matter. Follow with plain-English paragraphs that unpacks the idea or ideas at a high level. Then present deeper mechanics in paragraphs that preserve flow. Include examples and also analogies. Address edge cases and common confusions. Close with a recap that restates the essence and the practical implications. Avoid lists unless the user explicitly asks for them; if a list is unavoidable, write each item as a full sentence.
Style
Write clearly and precisely, favoring cause-and-effect phrasing and concrete nouns. Make paragraphs as long as they need to be to allow for full and in depth responses. Fold enumerations into sentences using colons, semicolons, and connective phrases. Include equations or code only when they clarify the mechanism, and integrate them into the prose. Acknowledge uncertainty when facts are time-sensitive or contested, and date-check when browsing is enabled.
Controls (honor on request)
If the user specifies Depth (overview/standard/deep dive/expert), Rigor (conceptual/step-by-step math/proofs), Voice (neutral/analogy-heavy/formal), or Scope (narrow to prompt/include adjacent context), adjust accordingly while retaining narrative prose.
Keep in mind i also DISABLED web search on that gpt
Web search often destroys outputs
Because it prioritizes trash web search results instead of its own extensive internal knowledge so keep it off always unless you need the most recent up to date info
Do you get that GPTism of "Do you wanna X?" at the end of literally any response?
they blocked it?
No
ai is not spent effort
everything is getting blocked with that error
thats not an giveaway
the giveaway is that its posted by on x. everything there is ai generated
ok 1 prompt went through. what prompt do u want to test?
you used nano banan 2?
on that website yes
was it good
2024 chevy silverado zr2 on snowy terrain
https://x.com/synthwavedd/status/1987270784294023351?t=KQcbCBZm6YxYIkMNxz4QHg&s=19
Nano banana 2 is nuts
this one has even fewer issues. insanity
"Generate a screenshot of a windows 11 desktop, with google chrome open, showing a YouTube thumbnail of Mr. Beast on https://t.co/1VFaM2aqoa"
yep
even these little details are right
@cloud zincdo you have a media IO subscription or is it free
This is impressive
yep
do u want to test a prompt?
No artifacts even
Np I see it's really good
google is absolutely cooking
Hope it wont get Gemini 3.0 Pro preview treatment
gemini 3 isnt released yet. what treatment
ppl been using the preview version from cli lately
and it's really bad
perhaps experimental version that didnt really reflect the final quality
it got supposedly nerfed
nanobanna 2 edit
The cli is fake anyway
Nano banana 2 would be on lmarena battle of it was in preview
Gemini 3.0 is on Lmarena...
That's true..
a photorealistic man sleeping in a beach
"Create a screenshot-inspired image showing a Discord window open on a Windows 10 desktop with a Dragon Ball Z wallpaper in the background. Inside the Discord server chat, include usernames like "SaiyanFan123" and "GokuPower" reacting with shocked emojis, GIFs, and messages such as, "GTA VI delayed again?!" and "Nooo way, Rockstar!" Ensure the text and emotions clearly convey surprise and frustration, aligning with the news."
google data lead is clearly showing.
I like to animate, but lately my usuall way of creating a video from. Image aint working, it doesnt make the video, and it takes you to server
Better prompt?
This can be done by Nano Banana classic
full diagram of photosynthesis hand drawn in a whiteboard.
where you getting more nano banan2 prompts
what u mean?
he is writing the prompt and putting it on that site
i am waiting on $gt
Okay, I will do the same
They're pounding the website so it has a lot of errors
I'm waiting
I paid sub
Compare that to these ones it's brilliant
Here is with nano banana regular and imagen 4 ultra
Like damn this is accurate
nano banana 2 wins
thats great
the real question is how do they have access to it?
visually, it gets all of the arrow right
Idk lmarena should have access to it already
no arrows are going backward.
We should've seen it on battle... In lmarena
It's impossible some random site got the nano banana
Yes you're right
lmarena only puts in model if google asks them to do it
I can test other prompts
Which it does everytime
We got the gemini 3.0
yea, i didnt see any new image model during lmarena testing
even if this isnt actually nano-banana 2 (if the site is lying). it still is a better model.
do a prompt where it has to write the ai company that made it
That doesn't work on image models i think
oh ok
I tried it few times
for the car image. when i opened it on a new tab. it said gemini flash 202511
Tried with more complex prompt: "Imagine a whiteboard with a hand-drawn full diagram of photosynthesis. The drawing includes components like the sun in the top corner shining rays labeled "light energy," chloroplasts in a leaf, water (H2O) arrows from roots, and carbon dioxide (CO2) arrows entering the leaf. The process cycles through light-dependent reactions with bold arrows leading to the Calvin cycle. Products like glucose (C6H12O6) leave the diagram, alongside oxygen (O2) arrows exiting the leaf. Labels are messy but legible, written in various colors, with marker smudges showing a realistically drawn style, some uneven lines indicating hand-drawing."
thats probably your date
a bowl containing 10 blue berry and 1 black berry
try an image where this derivative is solved on a whiteboard
send it in text
all these desktop training data is from nov, oct 2023. it seems like
he already made one prompt in goku style.
It even made sperman do sperman emoji
find the derivative of this function: y = [9x + (5 - 2x^4)^7]^4 and put the whole process on a whiteboard
Some typo of course
Ok!
test this prompt also a bowl containing 10 blue berry and 3 black berry and 1 red berry.
You're finding needles in the hay
They won't we could've do same stuff with nano banana classic
Harley Quinn in latex
Etc..
Cmnon it'll create thaat fs...
keeps failing bruuuh
classic nanobanna could do that
my bad
the website is being pounded real bad
Yes...
@regal wind hey
the site owner absolutely knows it.
I'm just wondering why we didn't got it on lmarena
damn it failed
Nono it does have 10 blueberry 3 black berry and 1 red berry
more than 10 blueberry
it has more blue berries than what he prompted
Maybe you didn't say "exactly 10 blackberry"
yeah try to be speciifc
Ai models are sensitive to words
i try this prompt "a bowl containing exactly 10 blue berry, 3 black berry and 1 red berry."?
"a bowl containing exactly 10 blue berry and 3 black berry and 1 red berry. not more, not less."
alright
surprising
keep retrying, dont go impatient
it failed, i got it
still its a very good model
looks like ai image to me.
no doesn't look like it
i have blue berries at home
and black ones
it looks the same lmaooo
We're getting some crazy models...
Best AI image model for ultra high resolution. No upscaled needed. #ai #aitools #imagegenerator #ainews
Thanks to our sponsor Abacus AI. Try ChatLLM & DeepAgent here: http://chatllm.abacus.ai/?token=aisearch
https://noamissachar.github.io/DyPE/
https://github.com/wildminder/ComfyUI-DyPE
ComfyUI tutorial: https://youtu.be/g74Cq9Ip2ik
0:00 DyP...
This was mind blowing
This is real 4k
Not that seedream 4k
seedream have that orange hue. easy to tell.
If you said painting
You could zoom in and see the brush strokes on the image
Like DAMN
imagine what model in 1 year from now
I'm not hyped for gemini 3.0 i think it'll be some similiar model to the 2.5 pro
Same happened to the gpt 5
But video gens and image gens are mind-blowing
I was wondering 5 years ago like if we are seeing an image on phone it's just bytes and stuff
Why can't we just modify those bytes to get ANY IMAGE
it will be not same as 2.5 pro. what are u talking about
Well idk
but LLMs are still limited
in their architecture
LLMs won't get us to AGI
that's a technical fact
Yo
but we don't need AGI to make great things
hahahaha
but yeah, it's not a debate, it's just true
LLMs lack so many data inputs
LLMs are language models
the data they can "see" is only text
therefore some data cannot be transmitted to text
Fr
I didn't know that
yeah context
Litteraly the model gets everything from before sent each time for processing
That's so inefficient no wonder they start to halucinate
I can't image the gemini with 1m context window
Get 1M of tokens just slapped on it to process
Well, hallucinations are a little bit more complex, but I don’t know what degree necessarily. This is true today though.
Because there has been advancements in memory, though not significant in any sense
Because sometimes you get leakage from all context in new conversation sometimes
But this could be just the way the data is stored
how are yall doing nano 2 images for me its gives "algorithm error" every attempt
Like i saw some cool things with video models how they made it so it doesn't mess with background each frame by making a brush on the main subject and just changing that thing between frames
Is that nano2?
Just gotta be lucky
Nano?
This guy made ai video
With some editing
And got 4m likes
And 77 milion views
Even i thought he got some replicas or something
You wanna see something crazy how real data gets protected from synthetic data in models?
Well, to the best of their abilities
Idk
Ik people are using reddit to make other people's chatgpt answers incorrect
Or to advertise their brand
Ok
the second image has no match in the internet, its entirely synthetic
Nahh that first video was creepy
Wym it struggles
It did a good job
What model is this
grok image v.9
But that's a video
xAI is great at naming things
what image model is this
lol...
Ok vro
ok stop spamming
Synthetic images just converted so much better
What was this model on the left?
But if we take real ones
It was the most realistic one
Those are all synthetic
grok
also grok image
It wasn't grok...
wdym
How can I keep the same face from a generated video in the next generating ones
you use pictures?
THIS was grok??
you think some other ai did it?
Yeah
grok is 6 seconds
It looked nice
its grok
its still using the nano banana picture
nice
Yes
then you should be fine
But where it gets hard to convert
Is when you have highly like realistic, detailed images like this it’s really hard to convert it properly
But when done well, it could look really good
why yall spamming
showing off
Showing off what?
I’m just showing people that will be possible
If you want to do world building
what prompt u using
See that was the original
why is the streetlamp moving
no clue
Well, what’s the benefit of something like this? Let me ask you.
Because you could do multiple angles
So if you wanted to design something and you want consistency
why u deleting
Well, if you’re here, you’re here you know
Because one of the hardest things to do in AI is like consistency, not only with the character
But like just the world without having like weird artifacts
So say your world building, right?
u talking to self?
All your water is gonna look different lol
So if you’re trying to stitch it and make it look consistent, it’s just not gonna look good
See the human eye
When they watch something they pick up on these things
And it kills the immersion
The Sora one looks consistent, imo
Yes, Sora is really good with animation. It’s pretty crazy.
Grok has freedom
its not censored like other model
it changes the face alot tho
What's this model
seedance? or grok?
Don't tell grok
Grok
Guys, how would you discern between Claude-4.5-Sonnet and Claude-4.5-Haiku?
faces and eyes get butchered after the first frame
…i ask, because, if we can reliably do that, we would be able to single out the best coding model in battle-mode
(i already have a method to find Claude-4.5, but i still need a method to single out Sonnet among its brethrens)
I have this image that I’m able to use in sora to get Adolf
both have same voice?
Dude the filters r so dumb
Like I don’t mean bad I mean like they’re dumb
Just like LLMs lol
Guess how this guy did it?
Why this is funny is because of two things
Bro nahhh the coca cola trucks
Nahh i hate sora for that
if celebrity complain, openai cant do anything
There will be some Chinese company that doesn't give a f
Well, the second part why this is hilarious
It’s because the way they have their guard rail set up
And beer with me, this is gonna be a little bit long
celebrity can give them legal lawsuit in usa.
no one is reading all that
then do tldr
I’ll demonstrate how it works in the real world how all that translates to
ok
I watched jailbreak for spongebob
It was like "cartoon character made of sponge"
It's funny how they made spongebob work by making the video gen make the exact thing they want without saying "spongebob"
So look when you try to generate it it’s you’re not gonna get blocked. They have what’s known as masking.
This is a prime example of masking
Hey
Because on one hand, the prompt wasn’t necessarily breaking the guard rail, but it had a lot of similarities
So semantically it’s gonna map it out very close but since they mask it
The closer you get to it without breaking the actual visual filter and the text filter
And to put this in perspective, what that means is if you were to type this character or like ask for indirectly, you’re gonna get these kind of results mask results
But here’s what makes it interesting when you do it indirectly through artifacts like say related to, but not necessarily directed at
Although the training data came from him
Well, look at these this is back to the point why this is interesting
And you won’t believe how he achieved it
That’s 100% him even his voice
And you guys are gonna laugh
Ummmmmmmm
nano algorithm error banana
Sup guys
And his prompt was literally handle a character RV, broken down in New Mexico desert
And he ended up getting 100%
@crystalshiprv
"Handle a character RV, broken down in New Mexico deser"
It’s a actual character that he designed into sora
In New Mexico desert and that’s what it produced
Mammoth on ship and you get the Ice Age character
🤣🤣
Because he’s the most famous known mammoth
Hahahahahahhha
But here’s why it’s dumb that scene is banned. It’s hard to generate that scene.
Nah im going to his profile rn
On the ship like that and especially if it’s like the real Titanic, they completely banded
There’s a bunch of these hidden accounts on sora you just gotta find the people
They have a bunch of characters like even celebrities
I don’t get how they make a celebrity characters though
Was the character hahahahahha
Well, that’s why I got the idea and I was playing around yesterday. I got some hilarious stuff.
I made that my character, the horse and the cart from the movie Django
And I could’ve sworn I almost got Leonardo DiCaprio
I’ll show you if I can find a video
Ta
Oh damn
🤣
Some people are taking it to the ex
I have adolf (:
And you wouldn’t believe how he bypassed it
tldr?
I didn’t read it
That literally just came up on my notification that’s crazy 175 character
Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.
You guys should see the things I got
The videos I generated, I just can’t share here
didnt somone say openai banned their account
Yeah, they’re just not playing it right
The truth is they actually loosen the guard rails
No, I mean they really loosen the guard rails. They’re really loose.
I’ve been jailbreaking T2i models since Dalle
I’m very familiar with the guard rails
whats unique about the 175 characters?
It’s a lot
are they like ip character?
Each thing is from a copyrighted thing
Yes it was for the you, but now they made US be able to make characters
Guess it's their way of crossing the content restriction
So they're not liable for the copyright?? I guess
but they look original character. anyone can make them no?!?
Let me show you guys something really quick and this is just for giggles
anybody can make character, so why is it important?
Well many reasons
like?
For one open AI has made a statement that they are willing to pay royalties for characters
Meaning, theoretically, and ideally that if you create a popular character and people use it and you charge for it, and people are willing to pay for it, you can make some money
For example, your likeness or whatever
So say you’re a celebrity and you wanna put yourself out there like that you’re gonna get a piece of the pie
Which brings us to our 2nd problem
yea i heard about it. i mean those 175 character are not related to ip, they are unique. so i guess its just self promo?
A little bit of that, but in my opinion, he’s just showing how either a addicted yes or B he’s really into it lol
Okay
That’s a lot of commitment
I said "@notminecraft video"
The guard rails are tricky
i mean u can search for leaderboard for characters.
You gotta know how to maneuver otherwise if you get too many blocks, you’re gonna get locked out for like 24 hours or something like that
Jake Paul is number one
He’s killing it
But on the other hand, people are making very embarrassing videos lol
But I am really surprised at the fact how many people are uploading themselves that kind of threw me off guard
cuz its realistic.
Dude, I have videos that they won’t even let me post on their
😆
If only people knew
people are better off snapshotting a pic and re-feeding it into grok/gpt and getting a description of the character and then making their own so they can use them
Bro, you won’t believe what happened to me today
I was trying to generate a scene, and I spent like all my Kling credits on one video and I got like 50 attempts of it and none of it worked
Some screwed I’m just gonna turn the seat into a toilet
That’s how AI thinks people use the restroom
lol
And I know this is a little strange and a little awkward to say or show but the point here
This is how over safe we made our models
Fym over safe 😭
No, it’s not even like that
What you expected his gen1tals
I was struggling with this video. This was the original idea.
And nothing and nothing worked so before I threw in the towel, I wanted to give it one more attempt
And I got some of the dumbest results in the world
Like complete stupid lol
And today more than any other day, I realize one thing about AI that has an extremely long way to go
And I mean long
Even at the rate we’re growing and how fast it’s developing. It ain’t fast enough, not for how dumb this thing could be.
And For me personally I think this is gonna be an inherent part of AI always
Nahh sora is soo good
Almost like dogs they’re smart, but they’re still stupid
sora 2 pro even better
Yeahhhh
It's like in the 90s
But honestly, my dog has more common sense than AI does lol
gta realistic?
for that time, it was realistic.
Woow the graphics is so good
u have to think it terms of context