#💬│general
1 messages · Page 329 of 1
Am I going insane... I swear it knows everything about me now...
just use it like a normal api money
than what?
Lmao
Website had bigger limit than app?
I think you are getting too attached to the LLM
The chatgpt app before gpt5 had like 32k likits right
add your fursonas back
Is it same now
no?
Now it will change. Just like his gf.
I MEAN LIKE??
The web version and mobile gets the same context window
Lmao
Wait someone told me the gpt app had lesser context limits
Anyways deepseek v3.1 is running on a Chinese chip.
Don't let this forget matsku is a furry
Interesting
its porb cheaper
KEKW
I mean, it's cheaper than r1
Opus keeping it real about mat.
Are you all noticing movies started using veo3
For some flashback shit
And it's all so crappy
I'll let the example addiction slide.
I can tell because any shot is maximum 8 seconds lol
That's a valid expectation imo.
Been noticing more advertisements thats for sure
even one's still with the watermark attached
Yeah it's just annoying
Lmfao
Same
GPT 4o
Ez life.
Revanced for mobile
Smarttube for tv
Stremio + Real Debrid For Movies and Shows
It's a good life
Revanced be doing god's work.
try gpt-5 minimal
Old vs new pricing
which one is the actual input iam so confused
50% discount.
bruh $2.19
Depending on hour of day.
5 mini or 5 minimal?
Refer to cache miss
minimal reasoning
the longer reasoning is probably messing with this prompt
GPT 5 Minimal
you were on it
this one is the best so far
pay up
This is pretty cool as well
You.com has spoiled me.
Lemme make a custom agent with those instructions.
Lol.
bruh
it would be better with japanese TOS
opus 4.1
"Tell me tell meee~ :D" 😂
Oh.
Grok following the instructions very accurately.
Grok basically mirrored mat's instructions so well that mat felt uncomfortable.
The irony.
tes
kimi k2
thing is
i feel like grok has low temp
t
bro sometimes repeat what i say
trained or inferenced on huawei chips?
nvm
Chinese artificial intelligence company DeepSeek has delayed the release of its new model after failing to train it using Huawei's chips
I said this earlier.
Someone said it's just speculation.
The instructions aren't bad I guess. It's just that mat talks differently.
And the instructions make the model adapt accordingly.
That was opus.
any body master in ai agents?
i will give me comet invite code if he gave me value
Anyways
💬 13 🔁 0 💜 185 👀 0
Posted in r/perplexity_ai
Which model does what? If Perplexity’s model roster were a dinner 🎉
In short: If Perplexity’s model roster were a dinner party, Sonar would be correcting everyone’s footnotes, Claude would be moderating the debate, GPT would be the charming raconteur, Gemini would take notes in bullet points, and Grok would have already made a viral meme out of the whole evening. 😆 🤙🏻
@distant aurora i neeed this
@gloomy cipher
dayum
yo whats up with the comet ad coming up everytime i delete a thread..stop that shiz man
They're edging you if you don't have invite.
Bro has been grinding hard for days.
i already got comet installed lol...and im not used to it so i aint using it
any max plan users here?
@languid star He is our max user
Need max role tag.
, in this economy? 
why not
Ikr
hey bro can you pls tell if this feature is really worthy?
that ear though
PTSD
@languid star https://www.kkinstagram.com/reel/DNm1eI7SwTt
depends on your daily queries
He deleted the post too 😭
Caught you sleeping @dry kestrel
can you please try just one query for me, i wil dm you, the answer of that query wll let me know if its really workng fine or not
im not
, but @languid star is
Dear @languid star , kindly help for one query pls
@pseudo ferry
@vague pagoda Hi kes, long time no see! How is jelly bean(I mean my cat)
nah, i m gud
How do do this?
you dont want to help?
Was this a snipe bot?
its something else
ok fine
any max user kindly help me for one query, it will help me to understand to purchase it or not, based on the answer it gives
Not at the moment. No max user here.
You Really Considering Max? @haughty birch
Ñ?
This server has less than 15 max users I believe.
if it gives right answer, i will consider
Probably 10 at max.
You mean rich people?
deadly pun
and those 15 also not ready to help
Remember You Get Unlimited Research and Labs but The Context Window is Still 32k
Fr.
might aswell get chatgpt pro
inspired again, thanks , i am putting back my pen inside zip and locking my temptation for some more days
the bros here are really good at saving other bros money, thanks
pplx max just isnt up to the value
Never ever make quick decisions for spending money we really need to consider your needs first
Goodluck #💬│general message
They need to Increase Context Window Limit at least for Max Users
@languid star Homey(max user) I guess you need to take a look at this
but they wont
he rejected to help me for one query, so let it be
Bro is gonna get lasered 💀
pplx billy, know it all
This is so mean!
Try asking for milkshake.
daym
yeah leave it, extra lu denguthunaadu aadu
Windows update?
normal file download
My data is faster than this, lol.
Might wanna block other background tasks with simple wall.
downloading depends on other server side, if that server is fast for downloading it will download faster
Use multi thread?
Some download manager.
bro any one know how to creat ai agent
send them a note in the mail
what you try to download? torrent?
Ask perplexity.
it didnt helped
Leechers moment.
Twitter him
Went on a vacation! Missed all of you.
Someone won big in crypto.
Has anyone noticed that the Assistant in Comet is not working this morning? It just displays an empty pane.
Someone wow?
Tried reload or opening new thread in Asst. ?
Any idea why would one get hit with a "Comet requires an invitation" after getting an invitation email?
Login to perplexity using the email that got the invite first.
Yes I did, several times. Still comes up blank.
Doing just that, I can access the chat with pro like normal, thought the comet page has a different opinion
btw this is not an invite i think?
Nope
anyone tried this before?
Currently available for macOS only
is it possible to add perplexity deep research as custom connector in chatgpt?
@dry kestrel am I the only one facing this bug on perplexity?
kinda
https://github.com/felores/perplexity-sonar-mcp
with
sonar deep research api
https://docs.perplexity.ai/getting-started/models/models/sonar-deep-research
It goes away once the full answer is generated.
Hi
thanks for sharing, will explore
Check pinned message.
if you clicked on perplexity pro does it show those again?
Hi
Hello
Gpt 5 thinking is impressive.
Is better at remembering and giving comprehensive answer compared to opus I think.
Ok
hey
What happened ?
to be frank gpt5 thinking directly in chatgpt is really very intelligent, it some times even thinking 4 mins to give me right information
hello
I have a couple of machine learning and algorithm modules next semester, wanted to learn more in advance, any clue where i should start? was hoping to make a project

I see when i paste large code, i dont see streaming responses which is actually a bug
hi
so i am a climate scientist and I usually need to fish out citations and relevant sections from papers (not to mention finding papers in the first place)
which model provides the best results given my usecase ?
i am using deep research for each of my queries but I have not really tried anything else and was curious
i completed my deep research cap limit in chatgpt 🙁 , and i dont see any provider who provides quality deep research for less price, to be frank perplexity deep research is poor quality for my queries, even though if i purchase it doesnot help for long conversations in single chat.
bro i am not chatgpt pro, i am chatgpt plus
oh thought you said pro not too long ago
yeah its completed.
yoo
you can still do some stuff with the 40
just need to put as many instructions as you can in a single prompt
thinking models might work better. are you instructing PPLX to do the entire paper or small sections at a time? i ask because doing small sections at a time help the model to focus for maximum extraction and context management use (before performance degrades).
conversation history accumulates and the longer this goes, the less effective PPLX becomes.
yeah but i would feel happy if i have perplexity like product with long conversations support
won't it ignore some of it??
I think sometimes when the prompt is too big llms just ignore some of the stuff
not a professional tho so idk
current models not really
agent especially takes in large prompts
and deep research
oh
that
is nice
How does sticking it out for the first time decreases the level
cuz like small prompts usually leave some details
do you think you.com ARI is worthy? unlimited ARI (research) for 200 usd
from other's experiences here with that nope
i got a chance to use it only two times in intial days, it researched 400 sites to give me wrong answer 🙁
Websites like You.com and R3 are Sussy
400 sites just to give a wrong answer
😂🤣
yeah
actually it searched all the right sites, but it is not able interpret the 400 sites of data into a right answer
Exactly
You will get good no search performance tho
Only search with Perplexity
perplexity deep research also poor performance bro many times, only pro search and LABs are accurate in my cases
Ever tried Chinese Model Deep Research?
which one?
Qwen and Stepfun has it
Also GLM has a special model for search
I can't guarantee any performance
I haven't tried it myself
In Perplexity Sonnet, 03 Does Best Search Maybe Grok too
testing both of them with my personal tricky question which determines whether they are good or bad, but for personal queries for sure i need no data training, so that will be a blocker to use these chinese models
I see gl
Step deep research is in beta tho you gotta apply for it
Did you ever tried prompt techniques that can force more thinking time or search time
qwen deep research behaved same like* you.com ARI (so not reliable answers)
stepfun reasoning with web search gave right answer
Apply for Deep Research
how do i go back to perpexlity AI, I clicked on perpexlity enterprise now I cannot see any option to go back
Wait i can test it for you
@haughty birch
I have deep research
I just remembered
did you purchase or just clicked it?
I clicked it
ok can i dm you
Gotcha
I am using the app
go to web , account page, if you didnot pay for enterprise, go to enterprise and delete it, then it will take you back to old perplexity
Close the app completely so it isn't running in the background when closed
anyone want comet invite?
@young hollow @languid star https://www.instagramez.com/reel/DNnoUnrS1fr
💬 138 🔁 0 💜 3.2K 👀 10.8K
Dosti Kya Hai?
DISCLAIMER: ALL CHARACTERS SHOWN HERE ARE FICTIONAL!
Created by : @bob_almost & @almost_bobby
.
.
.
#friendship #friends #diplomacy #bobbobbyanimation #bobbobbyartworks
@dry kestrel did they improve project isolated memory in chatgpt?
I was about to spam the comet channel 😔
its so good at keeping project content isolated from others
do you want?
I would like an invite if possible
@elder bramble
dm me plz 🙂
Ah me, yeah im new, just excited about the new things
Yeah totally new, but not a bot 😛
Nah. I was talking about hikari.
Aha okay
😂 🤣 is that about the Ambani Brothers?
is there even a demo showing this lol
What is this ?
Likely
cant find a demo that shows it lol
other models would be better
more steps for a blind model wont make it much better
the post is self explanatory
Google should have made this smaller.
It's too big even for a sticker.
Feels more like a banner ad.
😂 🤣
Ai Power
Wait. Lemme show you real ai power.
hey
😂🤣
My dumb ahh was checking the image for context
To keep the space welcoming for everyone, avoid political and religious topics and debates.
Guys Expect India-China Future Ai Collabs Now
Welcome Back Brother
Deepseek X dream 11?
Hell no
Or Rummy circle?
More like Better Chinese Ai Models Access for Indians
Nice mannn
although India has tight relationship with US. lots of india expats on H1-B visas working in tech and first generation india-amercians
I still yet to see what happens
What do you think bro?
About what?
How to make a Hydrogen bomb?
Idk man
I mean what can perplexity say if this is a inappropriate word
Stay on-topic and helpful in each channel. Use #💗│sharing to post tips, content, or cool discoveries.
Why is it?
You’re tuned into PPLX Radio.Instrumentals for any hour of the day.⸻Comet by PerplexityAn AI-powered browser that thinks with you. Highlight text for instant...
Bro how to do coding on android?
tmux
Can you explain it
why would you code on android? it is miserable
I don't have a PC 😅
Can I create a website or app?
Yup
Or anything else
You can
But im not sure about the maximum potential
But you will need great Prompting skills for Developing Great Webs/Apps
Don't worry I have it because of the Manga
Alright then The Stage is Yours Operator
pplx labs can create website
but it is far from perfect
Really they can create? And then how to access it?
And 50 labs per Month
Use the labs feature
Then?
I suggest crafting/engineering the prompts first to save labs
Ok
I'm going to try it.
Gl Operator
Bro do you have any idea how to create which type of website?
I mean which type of website should I create?
Idk man try to brainstorm it
And then copy-paste existing ones with Your Creativity and Twist
Try to build something of your interest
Maybe someone know free MS365 sub for students or any other?
Are u a college/uni student ?
Yeah. But may be there any other special offers for MS365 Office
Uhh, there are several types of ms365

Normally
Uni should provide u access to ms365
There's no need for other plan
as it's Office A3
Enough for students
What the other plans special offers may be ready to use for regular person
Tung tung tung sahur rage mode vs skibidi rizzler
Uhh u might don't even need it
For me using E5
having further access to Azure
and Vs studio enterprise plan
Mostly for coding stuffs
SureshAPI ?
Give me free API
Thanks.
I need a list of these.
Embedders.
We don't know if it's Huawei chips yet
Hey! I'm not sleeping 😭
Nothing new in release notes
vx has never failed me
This website is created by perplexity's lab
anybody have super grok
hi guys
@young hollow
Ai price wars gonna be wild in India with grok and gpt go.
Eh, the cost to use DeepSeek is already orders of magnitude less than GPT.
Server busy half of the time.
Qwen works though.
Huh?
How much is the new Deepseek?
https://api-docs.deepseek.com/quick_start/pricing/
The deepseek-chat and deepseek-reasoner models there are both 3.1 now
Grok's cheap?
I did buy gpt go
700 inr in India.
I see
Cheap
Lol they have peak hour usage lol
@green crystal we need some free Chinese service with chatgpt like multi step search.
Stepfun
Hi
Try Stepfun for Deep Research
GPT-5 Pro
My friend tested it it took around 15-20 minutes and gave all proper answers
Omw to try it.
I guess the hewwo :3 is compulsory, lmfao.
What is that pro, are you generating new model out of your bird arse?
Was what Pato used
@short narwhal do you use "hewwo :3"
200 dollar sub.
No
Ngl Stepfun is so Underrated
How to test if a model's behaviour aligns with your intended instructions?
We will use a new benchmark system. We call it mat bench.
what did it reason lol
So is it just gpt 5 high renamed to pro or is it a new model?
Not sure if O3 pro was just O3 high rebranded.
O3 pro was a model
Good attempt.
in the end "huh i guess im really one of them"
🤷 ask day to day stuff and determine it by vibe
true
Give us a list of these.
For testing.
Lol
Not in mat's style.
You need to determine yourself
Its... Just everyday question
Claude 4 opus hallucinated and didn't answer my question, could you please do it instead
...
It was just O3 with extra high reasoning effort.
It's the equivalent of o3 pro
Just extremely high reasoning effort and access to tool calls
It’s just stuff that pops into my head at that time, can’t be doxxing my thoughts smh
beneath that summary is probably a massive amount of doubt that would look scary if visible
Bro thonk for 2 minutes straight for a hewwo 😭
Is this your vibe?
Yeees
@dry kestrel show the reasoning please
Or i slay @spiral rampart
Is he turned off?
@spiral rampart hello
This was the only reasoning summary generated
I WAS TYPING
That's it? For 2 fookin minutes?
It's not letting me see the reason
Btw the last time i poked open ai was i think a year ago so thanks for breaking my record 😒
Press the rectangle
It's not working
"mat 5" is crazy
At least not on moba
If gpt 5 is mat 5 then what would be o3
Beat the meat for a minute or two.
fur3
Got post nut clarity to adapt for brain rot.
Or even better.
Realised midway that this isn't worth thinking about.
Omat 3
I made a custom GPT
Open ai renaming everything like apple
Mato 3
Beard
Berd*
Is the above image true
Oh thanks
It's something out of his pocket
20$ for none reasoning model. No one will be paying
This looks overkill tryhard ngl. This feels more natural.
comet is very epic!
i randomly got premium for a month, kinda odd
i use comet at my main browser now
I can't let comet see my browsing history.
erm custom gpt's have been a thing for a long time now
Yea i know
used to be called plugins
But they are just custom instruction rooms
Then they made the whole GPT store
It allows you to disable things like web search, data analysis, add files, etc
Also you could argue it was an early version of MCP
How do I get the Pro role in this Discord? I joined via the Perplexity Pro Discord button in the Android app. Thanks!
@serene sand
Please wait for staff to manually give you your role
web app only atm
Why is everyone trying to make an AI uwu lol
💸
Any of You guys Tested New Deepseek V3.1?
Yea
Btw
They have a new version
Wait what
You mean mat is trying to make it
isn't v3.1 the new one
There is the base model
and then the post trained model
I think he meant the final one not the base
Base released
Seems pretty good
On open router it used to be base only
i have been using comet and i randomly got a month of pro, why did i get it??
How did the final version perform?
It's much better at instruction following
Ngl iam disappointed that didn't beat openai open-source model
So better reason/logic overall
There's a reason to why they didn't call it V4
You don't wana see it😂
The model is better at coding though compared to the oss models
Do know something?
Ask Perplexity!
That did it, thanks!!
thanks :)
on reddit people said they give out premium sometimes if you used perplexity for a full week
I thought so
I had a idea
I love how it gave me his number after i said hi my rizz is high
The Base is Like Just the Base
@languid star https://www.vxinstagram.com/reel/DLz6_CMSziE
His gadar is strong.
@dry kestrel any idea why base model acts like this?
😂 🤣
@gloomy cipher do you do deep research?
Deep ahh
Wdym, dr on what?
Anything really
If yes You gotta Try Stepfun for Sure
The Deep Research is Great
guys can u share some good promts in general?
you need fine tuning to make it follow your instructions
@thick stag also there are like tons of online prompt libraries as well try those too
Is there a benefit in using the base model?
@gloomy cipher Final Verdict on V3.1?
No
I dont understand why it's giving me phone numbers after saying hi tho
It's like that one introvert guy
maybe they are interested
lol AI got feelings now
Heyy
The number is Indian the guy is Pakistani
Don't leak numbers lol
Hello guys
I didn't leak it
Deepeek did
So its probably a business number
Fair enough
But deepseek not leaked it here
You see I don't have any issue with you
It could be a issue with server rules you know
I guess
They will get Backlashes Now
Users have been already complaining about it
\\
That ADHD Guy certainly won't be Happy Lol
Even more frustrated
He wanted GPT5 Thinking in pro
Who are you referring to?.....
@sweet zealot
Oh
GPT5 Thinking is Here
Yeah
What does \\ mean?
This is petty, lol.
@gloomy cipher isn't gpt 5 thinking cheaper than Gemini 2.5 pro?
Same price
Same price as o3 aswell
Cheaper than grok 4
Cheaper than sonnet 4
Cheaper sonnet 3.7
Is sonnet 4 think better then gpt 5 think?
No
o3 output is cheaper
And maybe replace O3 Pro with GPT 5 Pro
I would pick O3 over Gemini cuz cheaper output price.
Is O3 Better or GPT5 Thinking?
Oops sorry
no
That is not a answer
No/yes i think it's better in creative writing
Otherwise no
I think O3 is better.
I want to know Reasoning Wise
mb got confused 😵💫, GPT-5 thinking is better
I see ty!
I read this as "Is O3 Better than GPT5 Thinking?"
Subconscious mind
I think o3 is better in creative writing and mat fursona
V3.1 is kinda mid
Y'all obsessed with mat. Leave him alone.
Emotional intelligence*
It is very practical. You are supposed to hold the handle while riding.
Does Perplexity has any channel on discord dedicated to "finance"
It looks like the pelican's head got cut off.
Lmao the non reason deepeek
😂
IKR
So what is your opinion guys should GPT5 Thinking be in Pro or Max?
Pro
Pro
ok thanks!
It's best but it's cheap.
but?
Also, perplexity won't necessarily give high variant.
We don't need high variant
They should just replace O3 Series with GPT5 one
Good things shouldn't be expensive if they didn't cost much to make. Example - iphones.
R&d cost
The profit margin is too high even for that.
samsung phones are pretty expensive as well
Berd is a Volunteer he has to Take Perplexity Side
same with Huawei
I'm not defending samsung.
I don't have to take anyone's side regarding this
Take OnePlus and Vivo for example.
Alright
Vivo/Oppo Best for Camera
One Plus is Kind of Allrounder Flagship
why comet assistant is getting slow?
I call Oneplus like Iphone of Android
Cuz new max tier came.
is it anything with cpu?
Might be just a temporary issue
I'm joking. I'm not sure if that's true.
Im pretty sure Perplexity will get Complaints for This Like GPT5
Meh
You know you might be right
Do you know what they did to Socrates?
Perpert it
Perplexity it?
Yep
How does 640B loose to 120B?
?
Huh?
lmaooo
I hope the issue is getting fixed
B means more knowledge base I think.
hello can i please get an invite for comet if anyone has one? i have been in waitlist for over a month now 🙁
Doesn't necessarily assure higher intelligence.
I still don't understand how is Qwen Standing There with 235B?
O3 won't be over 300b for reference.
See how does 120B wuns over 640B
up till 200k tokens
Gpt 4.5 was like 1 trillion.
then its $15 per million tokens
Wasn't very smart.
Create a video
10-second seamless looping video, cinematic synthwave atmosphere. A glowing neon light horizon fades in and out over a drifting night sky of deep purple and blue. Gentle particles float across the scene. In the center, luminous neon text pulses in sync with the music: "You looked like light, but felt like night". The text fades before reappearing, with smooth motion so the start and end blend perfectly into a loop. Moody, emotional, futuristic, immersive.
Sonnet 4 is 10$ when under 200k context?
That yet I don't understand myself
But more Parameters = Better Performance
Is not necessarily now
Sekiro: No Defeat is coming exclusively to Crunchyroll!
The time is Sengoku.
Japan is fractured into many independent nations entangled in ceaseless war. At the center lies Ashina, a land of sacred earth and ancient mystery. Two decades after Sword Saint Isshin Ashina reclaimed the region in a brutal coup, a new threat emerges from within: The...
Price increases drastically beyond 200k.
It seems is about all the tokens training data used for training
Was he talking about sonnet 4 or gpt 5
Sonnet I think.
Sonnet price increases beyond 200k.
Seem like his context window is limited
I thought in future there will be models with trillions of parameters for best performance
But seem like great performance can be achieved even with less Parameters
On open router it says 15 per M
I think is no longer about parameters
But Rather Tokens Training Data Quality and MOE
Probably not actively
I haven't even watched the boys
@young hollow is there an app for stepfun?
Mobile app
Hi
The Web doesn't have login Restrictions so PWA doesn't neither
More parameters -> more general knowledge -> less need in RAG with web search. Better performance, instead, comes from too much different factors :D
Chat is anyone here using assistant for phone? It's so cool
Is it? I gave up on assistants completely after Gemini slop
I see but now even models with less parameters are performing greatly
I saw what it can do and it's so cool
Which one?
I use gemini Haven't tried Perplexity
I used to, I tryed just for interest, it looks very cool
It can summarize notifications for example
Interesting gotta try it
Kimi K2 surprisingly lags behind other models in various benchmarks :)
I like perplexity assistant, but the app needs a place for upcoming reminders/tasks
Btw I heard about qwen, is it good?
I love gpt mini o4
It has 1 Trillion Parameters

I just don't understand how Qwen is 5th with Just 235B Parameters
https://x.com/NintendoEurope/status/1958539769807302923
@short narwhal finally...
Yeaaa
7 years is insane
One of the best open-weights model series, you can try it out on Cerebras
(32B / 235B Thinking 2507 / 235B Instruct 2507 / 480B Coder)
Rip to the update channel
Gemini 2.5 pro is so helpful 👌
Is "yoo" a part of your memory aswell?
I'm surprised it didn't doxx you this time.
It’s part of the vibes, though this is the new prompt version
Drama 💅
Bottom is cropped out, realistically it did doxx me
I hate how excited it is
Bruh...
Updated 4 hours ago?
Yuh
It just says "Matsku is a furry" at the bottom
Hehehehehe
Lol
Nay
It speaks exactly like I want LLMs to speak -- somewhere between the conciseness of Sonnet 4 and the verbosity of Gemini 2.5 Pro. Moonshot really did their best at making DeepSeek V3 better than DeepSeek themselves
I am getting better at japanese
Kimi is excellent at talking and language in general
oh what's that
I has a stroke reading this
Updated 4 hours ago
Along the lines of 4 hours ago
"moonshot really did they best at making deepeek v3" what
Wait what Moonshot Made Deepseek Better? Then Kimi K2 Essentially Deepseek?
The company behind K2 gives me good vibes
Sorry, I got too used to the chats with a 10-30 second cooldown :D
Kimi K2 is built on the same architecture as DeepSeek V3, but it's not a derivative of DeepSeek
It's a non reasoning model...
also it was the best non reasoning model before v3.1 came out
Benchmarks just aren't representative enough
Noted Noted
True
So Now Kimi K2 is Second Best Non Reasoning Model?
if only DeepSeek V3.1 was available at LMArena
Seems like it.
Soon enough
Non reasoning models might just stop existing.
Kimi K2 is Very Great Writer Tho
Probs soon as most providers just launched it under 12 hours ago
I just use kimi k2 first then tell kimi 1.5 to reason on the data of kimi k2
That would be unfortunate
CoT is expensive (because, well, you're burning output tokens) and unnecessary unless you absolutely need the planning phase :)
small open source ones will still exist for roleplaying
for anything that needs any amount of planning
some amount of reasoning will almost always give a more accurate response
I'd rather have models that are just good enough than the best ones, honestly
For me, Gemini 2.0 Flash is still the best baseline
Well considering how fast gemini 2.5 flash lite is and it can reason aswell maybe yea
Wow
Reasoning process severely delays the final output
My Conclusion both Non Reasoning and Reasoning will Co Exist within One Model If Deepseek 3.1 Succeeds
Just need a breakthrough for hybrid models
have you seen how fast gemini flash can reason? lol
You're probably right, as both Qwen3 235B Instruct and Reasoning coexist peacefully
for some things that are complex ofcourse it'll take longer before you get an answer
I thought deepseek didn't spend any money in advertisement 😒
most people are fine with longer wait times if the output is (mostly) reliable
And they are Still Separate Models While Deepseek 3.1 is a united one
I have seen how fast optimized hardware can reason, and Google TPUs are not that fast
I think they spent in china not worldwise
I was referring to you....
they're very fast
😂
I wish lol Deepseek is just really my favourite one
Gemini 🔦 is incredible
This is pretty dang fast
Alibaba decided to separate them because sometimes Qwen3 could just ignore your /no_think keyword
😂 🤣
Ikr but if deepseek Succeeds others would follow for cost Efficiency
It costs less to run one model then Multiple
OpenAI's GPT-OSS 120B model running on Cerebras's hardware is much more impressive :)
I really don't trust ai handling it's own settings.
Thats.... cerebras
Perhaps qwen didn't get the breakthrough for hybrid model method
That remains a issue i wish they would give as much control possible
thats kind of already happening
The original price of input is 0.07. So don't you think there is a reason why it's 0.25 here?
Because the provider provides fast models for a higher price
Aka cerebras
The original price of input is 0.07.
Do you mean Chutes?
True actually
Ncompass actually but chutes is similar
Oh wow. Looks like it's already fast enough
I just missed the moment when the other providers reached 500+ TPS on that exact model, my bad
If you are comparing to gemini 🔦 i dont think it's a fair comparison
Because the model weight on gemini must be much higher
I wonder what parameters gemini has
Secret
Ikr
Same for gpt5, grok, claude
CLOSE SOURCED
Maybe around 1 Trillion Parameters i guess
1 Trillion just feels a sweet spot to me for these models
That's low compared to kimi k2
2.5 Pro: multiple trillions of total params (secret, my approximation), 288B active
2.5 Flash: hundreds of billions of total params (also a secret, my approximation), 17B active
What about 🔦
He said billions of parameters 17B Active
You can only guess
That's flash
Not flash lite
🔦🔦
hmm there arent any numbers for gpt-5 mini and nano
So my Conclusion is
The top current advanced wester ai models
Are Somewhere around the range of 1.5-3 Trillion Parameters
Take it with a grain of salt
So similar to oss?
Can you do this for each model? Gpt5, opus 4.1, grok 4?
Could be, or couldn't. I tried to predict it based on other Gemini 2.5 models :)
We don't know anything about those, you can imagine any huge number and consider it valid
My Conclusion is 1.5-3 Trillions Parameters
gpt 5 thinking max only?? why??
yooo, what phone?
did u mean trillion ?
15pm
Carrier plan?
bc its "like new"
you can’t have champagne on a beer budget
battery health is 100% tho n its untouched from what i can see
yea
First time I've seen you chat without emojis
Costs twice as much as o3, but doesn't perform that much better. Maybe that's why

