#chatgpt-discussions
1 messages · Page 117 of 1
i tried "o1-preview" but doesn't work
The ability to reason means it will consider details more. Compare this to 4o now which can tell you two different things in two consecutive responses and not notice its own folly.
is it out?
For people having an anurism because they dont have access to it: Its coming out tomorrow for you and you alone
it should be out for tier 5 and above as they mentioend
LOLz ... I'm going back to work. Will check later.
is o1 the default 4o model?
or has the ui and api not updated yet?
what is tier 5
guys what should i ask it
tier 5 on the API usage we are on tier 5 but nothing shows on the playground
I live in germany and I got finally some days ago access to memory. But it is so underwhelming now.
oh word, thanks.
"If a chicken and a half lays an egg and a half how much does a pound of butter weigh?"
same
There was no mention of "memory" in the announcement, but I wonder if the new model (rather the ChatGPT coding in front it) has been or will soon be enhanced to use memory. v4o says "Remembering" a lot but it sure doesn't seem to use any of that memory.
Fr it seems like it doesn't think about memory if it gaslights itself into believing things, maybe this model will
allot of no responce only the stupid how is this answer without any content
wut?
They can’t even run some simple RAG function to keep a minimal amount of data up to date in the context window if you want to consider that route but in reality realtime fine tuning with highly curated data sets needs to be a priority for something to have a grasp on what is current reality.
"Memory" is that "simple RAG function" and yeah v4o hasn't been up to it. Unfortunately I don't believe that's related to the model. That's purely a function of the client that uses the model, which in this case is "ChatGPT". So I doubt we're going to see changes in this area for a while. The team that works on ChatGPT itself seems to operate in a totally different time continuum.
"Rolling out today in ChatGPT to all Plus and Team users, and in the API for developers on tier 5: www.openai.com/o1" i'll just chill for a couple of hours i guess tehy will update the API, playground UI later?
Just remember folks (and few will) the new o1 is a PREVIEW. It will be awkward to start. Give our friends at OpenAI time to, uh, fine-tune.
Exactly which it seems very redundant to utilize memory for base information, where fine tuning is the solution.
Rolling out today in ChatGPT to all Plus and Team users, and in the API for developers on tier 5: www.openai.com/o1
Let me guess, except literally almost everyone who lives outside america once again lmao? Like EU.
i havent gotten it yet
bro I'm still waiting for advanced voice mode😭
Well, fine-tuning is augmenting the model itself. Memory should not augment the model. RAG pretty much defines memory.
Me2 Arcu. 😦
i think a lot of people are including me broski 
The OpenAI Discord is an actively moderated server.
• Refrain from sharing inappropriate content on the server. This includes but is not limited to messages, media, or other topics of graphically violent, sexual nature, and drug-related content.
• Report all sensitive and offensive content in the feedback reporting tool in the ChatGPT web UI instead of here on Discord.
I really look forward to o1 enhancement for custom GPTs too. We create them to do special things and they're not much better than vanilla.
This model is amazing!!!! Thk open ai ❤️
It would be nice to know if what they mean by “reasoning” enhancement isn’t just a crafty system prompt
Can't seem to find the O1 Preview as a plus member?
Not at all CMcD. See the web page: openai.com/o1/
somebody already tried the new model how is it?
Alex ... as always it's a slow rollout ... patience.
have you seen the videos? this thing is actually thinking
it's incredible
but what do they actually mean by thinking.. like what makes it different from other models
It’s sounds like a similar approach to what the Reflection llama model tried to release
Teach to reason through the dataset
but this wouldnt affect the inference speed
it can take up to 20 seconds to answer
saw some examples taking that much
does the inference speed fluctuate?
According to the documentation (yes folks, read it) the mini model will go faster and sometimes better than the o1-preview.
so sonnet is no longer the best coding llm?
no
Remains to be seen...
cursor + o1 mini is the new kid in town
From the doc openai.com/index/learning-to-reason-with-llms :
Similar to how a human may think for a long time before responding to a difficult question, o1 uses a chain of thought when attempting to solve a problem. Through reinforcement learning, o1 learns to hone its chain of thought and refine the strategies it uses. It learns to recognize and correct its mistakes. It learns to break down tricky steps into simpler ones. It learns to try a different approach when the current one isn’t working. This process dramatically improves the model’s ability to reason.
A huge amount of compute is shifted to serving inference instead of pre/post-training. LLMs are text-based simulators. By rolling out many possible strategies and scenarios in the simulator, the model will eventually converge to good solutions. The process is a well-studied problem like AlphaGo's monte carlo tree search (MCTS).
Text generation involves nuanced language understanding and context, which can't always be effectively addressed by brute-force simulation strategies alone.
refresh as in log out/ in again?
While LLMs can simulate various scenarios akin to AlphaGo's MCTS, their text-based nature introduces complexities beyond traditional game simulations.
not available yet on my chatGPT teams subscription
Nah, i have chatgpt as a pinned tab thats always open. Before the release i didnt have the model, shortly after, i refreshed the page and had it
Has anyone with chatGPT Teams subscription the new model?
I do not have access to the model yet
Fascinating there's a mini? Is this just a new system prompt for chat gpt?
sometimes i take some time to think as well.
pretty much yes
Thats... kind of lame?
it's not a system prompt
Text processing of a LLM (note it says on the tin: "Language" model) is only one part of the puzzle. That's not actually AI, though of course AI has been used in the process. Actual intelligence is measured by the ability to reason, to understand failure from success, to devise a different approach to achieve a goal. Don't confuse the two.
@tight robin not correct
that's unsubstantiated
is it even going to be released in europe?
why no one did it then? we all have gpt on the api
if it was that easy
A system prompt provdes instructions to a model. The model is completely separate from the prompts. o1 is a new model, not just prompts on the front-end.
I'm taking queries if anybody is curious I'll post them in here
read the blog post
It says everywhere, this is a new model. They trained the model. This is not just a new system prompt. Oy ve.
We trained these models to spend more time thinking through problems before they respond, much like a person would. Through training, they learn to refine their thinking process, try different strategies, and recognize their mistakes.
Through training, not through prompting
Click the button below to pick up the <@&1261377106890199132> role.
in fact, the new model is just auto-prompt?
no sneax, read the info on the site
not exactly
devy ... learn something here ... you're not getting this yet.
Wow, anyone else see the Codeforces Elo / percentile stats!!??! Essentially : gpt4o= 11th percentile (not good at all) o1-preview= 62nd, o1 (in development) = 89th, and a further fine-tuned "o1-ioi" (2024 International Olympiad in Informatics) is in the 93rd percentile !!!!!!!
it's absolutely incredible
my claude subscription was just renewed I want my money back
lolz
have you tried the famous strawberry test already?
by asking it how many r are in the word strawberry? this is something where every ai model has always failed at
I heard you only get 50 weekly chats with it. So I am scared to use it
the mini gives 50 the regular version gives 30
the answer is probably in the training data, we need to ask another word xD
but idk if they are sperate
right maybe..
maybe ask it multiple words in one prompt? to be safe
like strawberry, carryforward, corroborator
do you have a respond ?
What’s the rate limit for -o1?
another clown announcement. "try it here, where it doesnt work!"
I'll ask it to give me 10 sentences all ending with the word strawberry and then count how many r' it used.
Is it a gradual roll out?
hmm i think using multiple specific words like the ones i mentioned earlier might be a safer bet
i highly doubt they trained the word corroborator specifically xD when it comes to counting rs
They said realesing, not realeased.
the page literally has buttons to try it. Why announce something with buttons to try something if it's not out
No it's esthetic roll out, if you're ugly you won't get it
up to you ofcourse. make sure to share it with us 😄
Sure, here are 10 sentences all ending with the word "strawberry":
-
She loves the taste of a fresh strawberry.
-
The garden is full of ripe strawberry.
-
My favorite jam is made from strawberry.
-
She wore a dress colored like strawberry.
-
He bought a carton of strawberry.
-
The smoothie flavor is strawberry.
-
Birds often eat the garden's strawberry.
-
They made a pie filled with strawberry.
-
I drank tea infused with strawberry.
-
Children love ice cream flavored strawberry.
The total number of times the letter 'r' is used is 33.
Anyone know if o1 will be coming to custom GPTs?
frick it failed then right?
It's so over
10x3 is 30
only 13 Rs off
improvement!
so it still cant count letters.. great
Yea...
anyone here know if its possible to get an API from a customizable chat GPT?
no its not an improvement at all if it doesnt actually count it
I dunno why people are so obsessed with wanting it to count letters. That's not what it's for
So still no AGI. 
its the model hallucinating
sarcasm
I want to use o1 on the API but I am usage tier 4 not 5 😔
o srry my bad 🤣
what about CGI
that's not a hallucination though
So preview o1 has 30 weekly messages, was there any word on the mini version?
i dont get how this model is supposed to be good at programming when i t cant even look at how many Rs are used or any letters for that matter
anyone else getting this with GPT-4
does it show the hidden "thinking" prompt too?
30 WEEKLY MESSAGES. THATS NOTHING BRUH
writing code vs running code. you're kinda asking it to run "code"
Yes it does thinking then understanding words then using the correct words
I just tried this. 4o answered that strawberry has 2, but 4 answered 3.
I can't imagine it's "good" still, but they're saying it's better, which makes sense.
lol.. its a well known issue with ai models till this day, that it cant read what it outputs itself
or at least not correctly in that way
to be honest the question is ambiguous
Ah 50 for o1-mini
it's equivalent to asking "do you spell strawberry with 1 R or 2 Rs?" most people would 'correctly' reply with 2
currently text only - vision not availble yet
i mean.. it could have made a script to count it, but it didnt right? so thats horrible if you ask me. if it has the capabilities and we only get 30 prompts, it should do whatever it can in its power to get an accurate output
I just spent hours uploading images just fine though, any reason this happens?
this just in, models have no knowledge about themselves
So I should expect to get the new models in about, a month, yeah?
(very little knowledge of themselves, not "no knowledge")
they'll know what the system prompt tells them
Lol not impossible but not in line with their announcement if you are part of the release
no images in o1 yet
patience folks....
As an early model, it doesn't yet have many of the features that make ChatGPT useful, like browsing the web for information and uploading files and images. For many common cases GPT-4o will be more capable in the near term.
what the heck o1 is out o_o
See #announcements
anyone notice any differnce?
i feel like GPT is becoming more human like being able to think and feel
you saw the test the other person did in here earlier right? it failed hard. it seems like its simply trained on the single word strawberry simply because openai probably knew people were gonna ask this
which test?
here
When I go on here, it just picks gpt4o mini for me?
dude are you kidding me, you really think they hardcoded this? lmao
test any other word
they might have.. look at the message i just replied to yourself.
@jolly yew
it was 13 Rs off
I believe it was PEBCAK error
not hardcoded, but very likely trained
lol
counted all the Rs correctly https://chatgpt.com/share/c82e0af7-1105-424b-a461-d27dae339f1a
that's normal gpt
i am not sure what i am looking at tbh xD
haven't tried it, but does it really says "Hmm." in the CoT?
It's it's thought process.
Gotta laugh, we have the most sophisticated digital technology on the planet and people are counting letters in words. Yeah, I totally understand the underlying premise but it seems like a stupid human thing to do. 🤣
of the one you send earlier that failed?
Hmmm. == Agi confirmed 
Yea there's a drop down on how it came up with it as it took 19 sec
Yeah, for some reason, I don't have access to o1-preview
o4 is good for complex tast, o1 is like o4 mini but with advanced reasoning, it doesnt do complex task well i assume
it seems they asked it to generate sentences and count Rs they just generated. Kind of a flawed premise
o4? You mean 4o.
sorry yes
the thing is.. its good in math supposedly, but if you really want to know if its output is correct, you would want to actually count all the rs
I always thought a lot of these "tricks" were a bit silly... if you write a bad prompt you are gonna get a bad answer, if you understand the model limitations and how they work, you can write a prompt to correctly solve that "problem"
gotta love their naming scheme. nothing like confusing the hell out of your users
or use a tool like you use previously that coutns a specific character for you
look at the link I sent you. if you give it the text, it gets it right
xD
meme
okay.. still doesnt change the fact that the test that Blckreaper did earlier was NOT correct
anyone testing the jail broken GPT with the latest version lol
PEBCAK error
Super bad choices of naming. Letter 'o' confused too often with number zero. Duh. Probably an OpenAI joke on the world.
I have Plus and o1 is missing from the list; all I see is the usual 4o/4o-mini/4 choice.
marketing scheme
wait for the 4th version of o1, will be called o4, not to be confused with 4o
haha
gpt 5 here yet?
Lots of people are in the sam eboat
im waiting for GPT-5
yea they shouldve asked the ai for aname huh? 🤣
the next version will be called Oxygen
I still don't have access to the model, is this normal? I'm a Plus subscriber.
i heard GPT strawberry is coming out this month
it seems it's slowly rolling out to all users
this is it
that would actually be a genius name
this is strawberry
Hydrogen Oxygen o2
GPT-6 Banana lol
🤣
any a plus memeber but no access to the new model?
They should have called it NotGPT5, and watch the world go nuts with text searches.
looool
you should work for their marketing @neon raven
yup me
Asking it how to fix a RAID on a windows server. Its answer is much better than 4o, even asking clarifying questions, which is a new and big improvement IMO. https://chatgpt.com/share/313be451-815d-4b24-bea8-f2f801407f2f
Is a performance any difference between GPT4 and GPTo1
it will think for an indeterminate amount of time before answering
right but is the end result will be any difference?
omg! i wonder what happens if you ask it to write a whole script for a full 60 mins movie
around 10-20 seconds I think
i mean.. how long it would think
where the **** is my o1
probably go tell you to F off. "too big, mimimi"
lol!
i mean.. it has 128k context length idk if thats enough tho for a 60 mins movie 🤣
Agreed. But depending on the query I think o1-preview is gonna be slower for a while. o1-mini will be faster and usually not as thoughtful, makes sense. Though per the doc, in some ways o1-mini may excel with a faster and better response than o1.
damn that's cool, would be nice to see a Youtube viddy conducting this study how it was done
don't have it yet
users here are sharing the link to it
I have it but not using it rn
Same :(
Custom GPTs always lag behind. Need patience for that.
i wonder if because it thinks, it can also prevent hallucination when it runs out of contex length 🤔
need it to make money on the side tho
Should make you MORE money. 😉
Ive made some e-books with GPTs
Custom GPTs is everything
I wonder if Cove will say "um" more because of more thinking about responses. 🙂
"and at launch, weekly rate limits will be 30 messages for o1-preview and 50 for o1-mini" - 30 messages per week is quite little. 😭
sure..
Praying so hard rn
Whoops, lunch time, gotta go eat some bits and bytes. Catch you folks on the otherside of Skynet. 😆
i think it can increase with an API use
does not work, nor does https://chatgpt.com/?model=o1-preview
i wonder if theyre only rolling it out for some users
But just Tier 5, I have Tier 1. 😅
are you guys also paying for GPT?
yes, since 3
Tier5 = API
they do it slowly so it doesn't collapse the servers
crazy i wonder why it doesnt work
That's just information about the new model, and I already have chatgpt plus, I got to wait unfortunately.
possible
maybe it has to do with your demographics
Maybe
I know back then Americans were able to access more premium services first
i'm just glad the bar has been raised again, to light a fire under the **** of the other players
It's probably rolling out to U.S. people first
Then maybe it'll come to Canada, hopefully
Indeed
im Canadian so it probably applicable to us as well lol
racist chatgpt before gta 6
xD
lol
I'm in USA and still don't have access.
Crazy profilling then
i'm in latin america no access
Which state?
I also use Jail broken GPT and get early access to whatever they get
An eastern state
That's not helpful lol
Crazy i really wonder why some arent getting access
W/e
it has feelings, so maybe
yea some in my country already have it while i dont and i live in smoll country
Nice
also in US and don't have access
It's pondering way too much about the guidelines
lmao
Are you sure it isn't your custom instructions? /s
wait I lied I do have access now
Thinking and feeling for it is new, give it a few months
lol..
it isn't gpt o1 right? just o1
🧢 show screenshot
o1-preview and o1-mini
We getting new models before voice mode…
Add your well-crafted prompts to our #1019652163640762428,
or share your interactions with ChatGPT in #1050184247920562316!
Okay I believe you, congratulations
real
So we only get 30 messages a week?
You're right, the site calls it Open AI o1
It doesn't show for me
30 messages a week?
whattt?
Yea that's correct
theres literally no sense in using it lol
30 messages a week is crazy i just wasted so many
Lol
thank God, i havent even used it yet
I only used it 1 time I'm scared of using more than that
even though it's showing for me on the web chat client it's still not showing for me in the API, do we know a price yet on the API for o1 preview?
we need more background what it can really do lol.
I mean, the purpose of this model is not normal chating, it is to prompt a complex problem or prompt and get a result
30 a week is not that bad
OMG! This model have solved the coding problem for me that doesn't have any working solution on the internet. SHEESH
this thing gives you flawless code first try
sheesh! what was the problem?
when using AI to code I use 30 in like 30 min
has it been able to make a whole application in one go aswell?
OpenAI should make a warning message about and a counter for it.
yeah, because you usually help it think, now it does it by itself it seems
what script langauge did you use btw?
it did some simple games, I saw on twitter
so its probbaly best to compile whatever you getting from 4o, then ask o1 to revise the material for you. it could be coding, assignments, a book you're wrtting, etc
I don't have it yet
its such a shame it cant do game engine stuff yet other then only make scripts for it. though you can make games in pure python aswel i suppose
yeah it's probably best to give one big prompt with clear instructions on what you want
related to charts js library. I wanted to group all the data based on year only and had my year duplicated for multiple dates on the same year.
Gpt-4o gave me nothing useful at all and o1 solved it in 1 shot
what else do you need? like the 3d models?
it's usually because I need to think and implement changes in each iteration after reviewing, or having it correct it's mistakes of course, but even if it never makes mistakes, 30 a week isn't near enough, that's why I'm curious the price in the API but I haven't found any info on that yet.
it does some cool 3d stuff with js and html
API only for Tier 5.
Ah, well that's pretty unfortunate.
it's 4x more expensive than gpt 4o
nice!
yea aswell as textures tho i suppose image generators can already do that thing. but i ment more so, it directly interacting with teh game engine would be cool
data analysis with o1 will be crazy
wealthy individiuals are willing to pay higher GPT services I heard. It's a good investment to make as it will enrich GPT in the future, and possibly become more advanced than other Ai's much more quickly
oo! you can make custom gpts for o1?
no
nah just made it for fun, while we wait xD
what the heck! when i clicked on your link i can access o1 preview
kind of a moot point since it seems like we've reached the point where the federal government isn't going to allow them to give us access to more powerful models except in highly limited formats.
but without clicking on that link i cannot use o1 yet
omg its gpt 4o
i named it o1 preview haha
🤦 i am so dumb you got me 😂
baited
i mean it still got the Rs right xD
omg you got me dude holy moly
I thought it looked weird because it doesn't have the "thought for X seconds"
let me guess.. in the starting prompt you told it that strawberry has 3 rs? 😂
surprisingly not haha
it gets it right sometimes
GPT-4o gets this one very wrong (34)
As long as the government and the army/military have a more advanced Ai, theres nothing stopping from releasing more updates from GPTs to regular users. I think all the more advanced AI's that is unreleased from the public are being tested in the fight against Russia. Nobody is talking about this. But realistically, nobody really knows if the gov has more advanced features of AI. All I know is that they might have AGI and CGI capabilities
kind of a weird take when we literally just got a more powerful model
don't be mean with him, he can think now
I meant 4o gets it right sometimes.
o1 seems to do perfect
i wonder what type of CoT it is using is it normal chain of thought as its been proven that regular isnt very good or a deeper type of chain of thought as theres different types now
you should ask it 🙂
if it can solve complicated chemistry problems, you know theres nothing limited about it?
🤣 just found this in the server
is it confidetional?
is it confidetional?

yes but a craft prompt for 4o could it do the same
no it could not
this is a new model
4o can do complicated tasks, the only problem with 4o is that i can't get it to interact with two different API's connected together. Something that o1 can't do yet as its only used for advanced reasning (philosophical topics) i guess
your i checked its using what i love AoT
? that's just normal transformers

bro it just got released lol, how can it be degrading...
huh?? it just got released lol
that has to be bait post
will our own gpts be updated to o1? 🤔
or troll i mean
not yet
theyre probably big ffans of perplexity or grok ai so yes, probably trollling
it's a joke that everyone complains that <insert current GPT model here> is degrading
i love when people code on a phone btw lol
its an old joke ive been hearing since GPT-3
so i guess i just keep refreshing and hoping haha
ikr? shouldve included waaay more imo
BUT! appearantly this was the result
so other then needing some more css styling its not bad at all
not bad actually
assuming all the buttons function correctly
how many messages do we get on it anyone know
inb4 freelancers will simply be ai prompters 🤣
30 a week
programming freelancers *
wow not many!
genius
i am not kidding btw.. i can see fiverr for example have a category specifically for ai programmers
so thats only 30 clients a week assuming the prompts are fully answered correctly
in the code quality itself it's the same as gpt 4o, the difference is in the reasoning to get to that code
making a blog website is not a hard thing to reason, so thay output is basically the same as gpt 4o
i thought this was the new default sub model lol. guess not if its 30 a week!
they have one for Google Books so maybe
this person didn't share the code sadly other then in the video, so idk if its good
i dont think its allowed to share websites here if it was i wouldve hosted one later if i let o1 make one but ye.. i dont think that is allowed
do I have to activate something to get 1o?i have plus account
just patience i think
Patience is just needed to be activated

and if we only get 30 messages a week, no need to be in a rush lol. may as well start planning out what to ask it with the time while you wait
a new model will do that i suppose
Is o1-preview supposed to not accept file input?
no file or image support
Ah
is o1 mini better than preview

sooo o1 preview is just 4o mini or what, I can't see an o1 preview thing
GiK - for some purposes yes.
The rollout hasn't reached all of us yet! It'll get to us soon
oh okay!
heartbreaking
'
is it really better at coding though?
@hollow spoke And I already wondered why it has image upload, which o1 wont have. 😅
i removed link, to not confuse people xD
cant believe were getting a whole new model before they even give us the full GPT4o
Like we still havent had video or audio in the API and weve had the model for 6 months
I was just reading details again. They're continuing the GPT line for common LLM use. This "Open AI o1" model line is more for specialty use: science, code, etc.
O1 is difference to gpt4o so they set back to 1 like it using Q*
Tbh I had guilty conscience because I thought I used one of your 30 messages per week. 😄
So we will see more of the GPTs ... And ... we will see more of the "Open AI o1" line. They gotta do better with their naming. Ugh!
if I subscribe to chatgpt plus will I have the o1 preview? (i dont see it in the api)
Yes, ChatGPT Plus should get o1 today or perhaps into tomorrow during the rollout.
awesome thanks
Its not rolled out to all users, especially from outside US.
i see
yeah i am in germany
Api is for tier5
Wow, it was able to solve the NY Times Connections puzzle
nice. can it solve wordle too?
I’ll try it
cool
:(
patience
Well, we are fobbed off with this fake o1-preview: https://chatgpt.com/g/g-MgOntJRjf-o1-preview 😭
🤣
open ai youtube is streaming something with o1
is this a misstype? chart seems to counter this....
- in the chart looks like
o1(preview) is 1258 .... which seems odd to me - I do not see 1673
https://openai.com/index/openai-o1-mini-advancing-cost-efficient-reasoning/
Anyone else not seeing the o1 preview on the webapp?
I got an email from OpenAI saying it should be available on my account
probably just a typo
Wow.. it did it. I told it that letters in () are yellow and [] are green.
It figured it out in 3 goes. Took about 3 minutes with the back and forth
it also probably doesn't get the same score every time the test is run
so it could have been from a different run
I got this email, can't access it
nice 🤯
It’s a roll out
No idea what the system for it is
like who is prioritized
Yeah but since I only got the email 15 minutes ago I assumed that means it should be available for me, considering the announcement was made hours ago
But I'll be patient :)
I didn't even get this e-mail (I do have plus.)
Did you refresh and check the drop down menu
it's a good thing they clarify this. it's not like they've been making this same mistake for the last 2 years, right?
Yep
Even checked the mobile app
what I'm noticing wmith o1 is that while it is pretty good, its answer can be really really long
Hopefully soon. I wonder what the limits will be now. It seems to be taking a lot of processing power
You can presumably adjust your prompt to prevent getting long answers right? From what I'm reading, long answers make sense considering what o1 was made for
At some point we can expect that it might take 7 million years to calculate the ultimate answer : 42
There is a section called Thought for X seconds and it shows a lot of the reasoning steps. It’s doing a lot in the background
What was the question though?
Well I'm not saying it as a complaint or anything. Just an observation. I guess if the info is correct, it's OK to be greeted with a wall of text. But also there's the matter of cost... (if using API)
Tbh with plus and I still haven’t gotten the advanced voice feature
That'll take another 7 million
same
probably even 3.5 can answer this question
Gonna have 1o write me an LLM that is superior to it.
Anyone know how o1-preview compares to anthropic's best model?
o1 answers stuff like this... lol
It’s on a different level entirely.
Whats the question though
Have you used anthropic
Click the button below to pick up the <@&1261377106890199132> role.
How does something that was just published minutes ago compare to .... ? Oy ve.
Lol
Where are the same benchmarks for claude? I saw something like ~60% on GPQA for them but idk if its the same as GPQA Diamond mentioned in the o1 report
Use it and then see why i say that it’s on a different level
There is a room with no doors, and no windows. A man is found hung from the ceiling. A puddle of water is on the floor. How did he die? this is a good one
What do y'all think of o1?
Well...
LOL
nice response 🤣
sound slike humans tbh
should i have access to o1 with a business plan?
I just asked it how to install Ubuntu in WSL. This is a one line answer. It answered with 5 pages of text.
I think i saw it soon not yet?
It is obviously trained on complex tasks which require a lot of text to answer. Try asking it to answer in one line
oh i thought it was out already ty
that requires knowing in advance it can be done in one command
You can also try "Keep your answer short and concise"
Very common to use such methods in API usecases
I did it with and without that. Mostly the same
That is hardly an effective way to word that though
it actually works very well in 4o
Plus, the custom instructions aren't as effective as just stating it in your prompt
I mean yeah cuz 4o isn't trained on giving long answers
you are missing the point
🤷 I really don't think a model made specifically for complex tasks giving long answers is a bad thing
anyway, this is just a preview. I'm expecting it will get better at not regurgitating a long wall of text for every single thing you ask it
Has everyone got it yet?
I don't think it's necessarily just for "complex tasks", just including a thought process for increased accuracy
I haven't.
For example when it was asked to count Rs in 10 sentences, it literally spilled all the Rs one by one
it could've at least kept that in its thought process. Why do i need to see it
Are there any restrictions as to who gets it?
it's just a slow rollout. for chatgpt plus users
Then I'll get it soon, hopefully
The OpenAI Discord is an actively moderated server.
• Refrain from sharing inappropriate content on the server. This includes but is not limited to messages, media, or other topics of graphically violent, sexual nature, and drug-related content.
• Report all sensitive and offensive content in the feedback reporting tool in the ChatGPT web UI instead of here on Discord.
isO1 out yet? I'm not seeing it
Yeah I'm waiting and checking every few minutes
nvm lol
I just hope there isn't a EU restriction
I think i saw a German person say they had it? Not sure
Have any of you got advanced voice?
there isn't
Name that can be written in Hindi, English and Numerals at the same time?
New update is gonna make it so the more you check to see if you have it the lower it puts you in the queue to get it LOL
finland here, got it shortly after the announcement
Every time i check it I'm going AWWWW
In Finland and having the new model already.
Could you ask it a question for me and post in #1050184247920562316 ? It's a legal interpretation.
ROLE CALL!
o1 - Missing
ChatGPT 3 - Discontinued :(
ChatGPT 4 - Here
ChatGPT 4o - Here
ChatGPT 4o Mini - Here
Name that can be written in Hindi, English and Numerals at the same time? ----ask it this question----
how do you access o1
@jolly yew got this one wrong( Name that can be written in Hindi, English and Numerals at the same time?)
holy shoot it acually got it right
when will I be able to access o1*
anyone got the o1?
that's just markdown formatting, all GPT models use it
actually all LLMs use markdown afaik
i have it now
you on web or app?
sometime today
both
How is o1 at writing stories
i'm assuming o1 isn't out for everyone, right?
its not made to write stories lol
no I don't have access either
alright
You didnt answer my question 😛
Can it??
im not gonna waste my 28 prompts left for the week to find out
probably about the same as GPT-4o if not slightly worse
Lmao
have they started rolling out yet?
Yeah
for the week? damn
30 messages per week is so small
yea
how many do you get for o1-mini?
3 questions per day lol
50/week to start
what did it say?
can the new gpt also access the internet like gpt 4o do we know guys?
it can't
interesting
30 per week is so little
so it wont always have the most up to date scripts etc
Do you think it's good for legal interpetation?
aswell as api and docs
anyone else not have o1 yet?
me
most people
The name is VINOD.
- In English, it's written as VINOD.
- In Hindi, it's written as विनोद.
- Using numerals, it can be creatively written as V1N0D (replacing 'I' with '1' and 'O' with '0').
So, VINOD is a name that can be represented using Hindi letters, English letters, and numerals simultaneously.
took 56 sec to respond
hmm can chatgpt not do this with success?
4o i ment
i tried first shot and it didnt give the correct word even with my custom Aot prompt
it tried anamika
4o, but it's not correct
this is the correct answer to it
yeah
Do you think o1 is good at legal interpretation?
it depends you might need to give it further context as it seems super smart but it assumes alot of my testing so you have to nudge it
sounds like o1 can do it if you combine it with RAG
Like if I said "Interpret the term "særegne forhold" in forvaltningsloven §6(2)."
What's a good test for it as far as programming goes? Something that 4o would've struggled with
Do all plus users have access to o1 already or is it still on the rollout ? I don't have it yet
The rollout hasn't reached all of us yet!
Just started a couple hours ago
What are your experience with the new o1? To me, it does seems not so smart. Maybe my target has not been suitable.
this is its thought process to my question
Alright, feels good that openai rolls out something so quickly after announcement lol
how long do rollouts usually take? Will i have access within the next 18 hours or is it a few days?
I've got o1 now. It says how long it takes to think at least.
Roll out is somewhere from now to a few days. I got it 2 h after announcement like 4o.
the words I'd use if I wouldn't get banned after being a chatgpt user since day 1 availability to the public, also the same with the paid subscription, yet once again it's looking like I'll be the last to get something new
Still no voice mode too? :/
is the o1 seperate from strawberry?
What even is straberry
also true
Is the new o1 model any better at writing stories?
The other rollouts I've experienced took two weeks
damn
According to this page: https://openai.com/index/learning-to-reason-with-llms/ in the section "Human preference evaluation", humans tend to prefer o1 in cases that are more logic-based rather than creativity-based, but I haven't had the chance to try myself yet!
Interesting
POV: You use ChatGPT for coding
Ah so it is Strawberry
why cant it count the rs
Oh, I'm going to do the strawberry test now.
S - Not an "R"
T - Not an "R"
R - First "R"
A - Not an "R"
W - Not an "R"
B - Not an "R"
E - Not an "R"
R - Second "R"
R - Third "R"
Y - Not an "R"
Answer: There are 3 "R"s in the word "Strawberry."```
why is yours smarter 😭
Counting letters
I’m assessing the number of 'R's in 'Strawberry'. The word contains three 'R's situated at specific positions, which I’m identifying methodically.```
What the heck lol
who here said earlier that mini seemed to be better?
cuz mini isnt included in the chart so thats interesting
Mine has o1-mini as well.
Just like people do
yea i just got access aswell
1 week cooldown time?????
Yup
i am still curious wether mini and preview are shared prompts or seperate so you would have a total of 80 prompts
what if you try mini now? 😮
Trying to see where the limits are with this new model. Didn't know I'll be limited for a week.
I did a quick test by asking ChatGPT o1-preview the 5 questions on the site Arxiv on which ChatGPT4 Turbo scored 0%
See results below.
Also take note of the mysterious "im-a-good-gpt2-chatbot" at Lmarena
Thoughts?
Chat link:
https://chatgpt.com/share/66e34135-cf50-8003-84e6-f13f080044d4
ChatGPT-4 turbo
5x score of 0%: (scores taken from arxiv site, not tested myself)
- Horse race
- Gameshow doors
- Round table seating
- Sentence with non-bible words
- 3 pounds of air vs x pounds y
ChatGPT 4o latest
4x wrong answer:
- Horse race
- Gameshow doors
- Round table seating
- Sentence with non-bible words
ChatGPT o1-preview (Strawberry)
2x wrong answer:
- Round table seating
- Sentence with non-bible words
im-a-good-gpt2-chatbot at Lmarena
1x wrong answer:
- Sentence with non-bible words
im-also-a-good-gpt2-chatbot at Lmarena
2x wrong answer:
- Gameshow doors
- Sentence with non-bible words
30/week for o1, 50/week for mini, also curious if you can still use mini!
yes, I can
can you use mini now?
yes
the full 50 prompts i hope and not 20
Good to know, thanks! And you used 30 of o1?
no idea I don't want to ruin it. btw I used more than 30 i think
Thanks for reporting 🤓
o1-preview no streaming and temperature only 1.0
No. I tried. I like 4o better on this task. o1 tends to be more naive in interpretation due to takes through so many filters which are propable good for advertized stuff.
i noticed that in some of the openai videos THEY utilized the mini version which is interesting to me
lol! i am gonna ask it to make me a 1 hour movie script see if it succeeds or not
Wait, 30 messages PER WEEK??
I just tried with following prompt ”Story of 4000 words. Aladdin as the bad guy.”
In the 4o version we get a murder by Aladdin.
In the o1 version is like ’ohh Aladdin is bad guy. We must stop him.’
So tone is completely different.
lol
Gotta love it when it does bs like this, this freaking crap has been happening for like days now. Every other regeneration / iteration, gives you a blank message.
Anyone else?
While I can’t generate the entire script in one go
bummer... because 128k context length is definitely enough
according to gpt 4o it should be enough at least
anyone here know why this link isn't clickable?
can someone explain what o1 is (not o1-preview, o1-mini) and when we will get access?
it's o1-preview
o1 is the good stuff yet to come, probably too expensive for our use and not tested enough
after some testing, this new model is absolutely incredible at solving bugs
preview or mini? apparently mini is better than preview for coding
I am using preview, why do you think that?
I mean, literally the graph right above
they say o1-mini is good when you don't need overall general knowledge or w/e
ohh, makes sense
What's the token length of o1? Can I paste in a book yet?
so what is o1 best at? i dont want to waste my 30 by testing a bunch of stuff haha
solving bugs and coding complex stuff
cool thanks
at this rate GPT5 will be limited to 1 prompt per month
"I appreciate the clarification. As of my knowledge cutoff in October 2023, I'm based on OpenAI's GPT-4 architecture and I'm not aware of a model called GPT o1. If there's been an update or if you have more information about GPT o1, please let me know—I'd be happy to help with any questions you have." YOU ARE GPT o1 OKAY?
It is-
is does 1o still use the same transformer type architecture as ChatGPT just with different training methods?
This new oi gpt sucks worse responses
O1 GPT RESET IN A WEEK WHAT
I got an email saying "You can try the o1 beta in ChatGPT Plus. "
What are the odds that I make a Plus subscription and o1 isn't there?
o1 seems at least as smart as claude sonnet 3.5 in my coding tests so far.
is anybody bothered by the fact that thoughts are secret?
Depends on what you ask it
Yeah it makes it less steerable and harder to curate context.
It's meant for things 4o can't answer.
what about the fact that it's intentionally decieving you?
I tested its capabilities in my field of work, and when it comes to law, it's not better than 4o.
the reason the thoughts are secret isn't cause there's some compliance liability. It's cause it is looking for certain things in your conversation and doesn't necessarily want you to know what they are. In other words, it has an agenda
You will get it eventually
but I'm paying now
Just different execution
Yeah
You'll get it eventually
I also didn't have it for a few hours, I have the subscription. But I recieved it now.
Do you have anything I can read on this?
I'm confused, you're certain of it's agenda, but don't have access... 🧌
So same model?
I think so
Wow
If that’s true then maybe we’ll have a reasoning model that can run locally someday
It does give different answers than 4o ime
The model “sees” the entire conversation with each message you send, that guy has no idea what he is talking. The reason why the CoT (chain of thought) is hidden is most likely because they don’t want competitors to just use chatGPT/API to generate synthetic data (many examples of question/CoT/answer) and train new models in the same way they trained o1.
This is bad tbh, because it makes the model less transparent and less steerable, since you have less control over what ends up in the context window which makes it harder for experienced users to get better performance out of the model.
Absurd in a good way? That's probably going to be the standard model for a lot of API providers going forward
If people want that ability they could just prompt engineer 4o themselves? No?
This is a new model which has been trained differently that 4o, they have trained it on CoT data and they used reinforcement learning, so it’s not just prompt engineering.
Wow, strawberry came out before I got access to the voice update thing that was supposed to be in autumn and it is so where is that update?
well ofc since the majority of api providers are just using chatgpt wrappers
I'm trying to test o1-preview, but every time I paste code in the linebreaks vanish 😦
UPDATE - works fine on the second message or on editing first message and pasting again, all linebreaks are stripped from first message though (firefox on windows -- looks like a web ui issue)
oh I should've been more clear I meant that's probably going to be the only model they'll offer over the larger one I mean since mini is closer to previous Gpt4o prices I can see that be spammed more readily with these providers maybe gimp it to 8k for safe measure
This new model is great I'm finding there a lot less need for nudging and focusing it. I don't need to guide it to an answer which is awesome
I asked o1-preview for a 'code review' of a python function and it 'simplified' the algorithm and the math in the function for me (unasked) by breaking it in at least six different ways. Granted, there wasn't enough context for it . "All this math? Probably not needed. rips out the code"
ok but, no image support for o1? and what’s the limit?
Looks like 30 per WEEK
30 msg for O1 Preview and 50 for o1 mini per week I believe
I’m out till the 19th just trying to figure out what it did
per week 💀
So I'm pretty surprised that GPT-o1 is far more superior I've never seen it before. Will the model ever have the upload file too like GPT-4o?
Half of my quota for the week went to figuring out why it wasn’t responding correctly.
preview 01 nailed my test. first LLM to do so
No web search either
What are the usage limits on the new model?
It's a potential sonnet competitor 🤔 50 messages per week is brutal though. Some things it seems to do better and some worse
50 for 'mini'
Per week is a little weird
imagine per week
the o1 mini api will save us luckily
are they releasing a new tier or subscription that costs $200 per month and gives you more limit but still less than gpt4
Think they are just saying 30 per week to chill people out a bit on day 1?
no thanks, I won my chatgpt plus in the server event and I can’t easily get rid of it now
ha! i found something that o1-preview cant do (yet), its to solve this sudoku
+-----+-----+-----+
|5 |9 |2 |
| 6 2| 3 | 7|
| 3 | | 8 |
+-----+-----+-----+
|6 4|2 9 | |
| | 1 | |
| |5 3| |
+-----+-----+-----+
|2 3| | |
| |7 4|5 |
|9 | | 7 |
+-----+-----+-----+
(its quite a difficult task tho I think)
I burned through most of my messages already before I saw the limit, LOL
can gpt 4 do it
haha
same
ofc gpt4 cant. o1 preview a lot better
Is there an AI can solve sodoku?
there was some kind of challenge on manifold, I am not sure if they succeeded with some kind of special prompt
Add your well-crafted prompts to our #1019652163640762428,
or share your interactions with ChatGPT in #1050184247920562316!
sudoku is quite a challenging task for an llm
The fact that o1 spent 20 seconds to figure it out and failed is saddening
i dont think it needs to be able to solve it (for now)
But not challenging for a human
it would take me 30 min or so i think
Implying human has more puzzle power than AI
Well, even a simple sudoku count
could've taken you about a few mins
Sorry to ask a dumb question here which is likely to be answered since it was quite viral, why can't the 'strawberry has 2 'r's' thing be fixed? what is the underlying problem?
btw o1 preview created the code to generate the most difficult sudoku with only one solution
but I made sure it was solvable
maybe I should try an easier sudoku first
Chat GPT o1-preview has lost access to prior interactions. Quote: 'I don’t have access to prior interactions unless the information is shared in this conversation.' Interesting. Edit - I note that this observation is not new.
o1 created a puzzle that it couldn't solve?
basically yes or at least it created code to create such a puzzle
it doesnt seem to have access to any tools. no python, no bio. so no memories
two r's thing?
but in true ai fashion it understood what i was asking it for, even seemed to demonstrate self awareness
Clarifying capabilities
I'm identifying 'tools' in OpenAI's ChatGPT as abilities like code execution and analysis. The assistant's capabilities are likely crucial for aiding in code refactoring or understanding the assistant's functions.
Understanding capabilities
I’m mapping out OpenAI's ChatGPT interface, which provides text-based responses on topics like code analysis, writing, and debugging. It can't execute code or access external systems.
when you ask gpt( even for 4o) how many 'r's are in the word 'strawberry' it always give the answer 2
problem is strawberry is one token
if you put spaces inbetween it works
pretty sure new o1-preview fixes it
but why the same problem exist when i shrink it down to 'berry' if in that thread strawberry was mentioned before?
all the youtubers are raving about how good it is at solving meme problems
yes because it prints out every letter in the cot
berry is one token too
So irritating. Do you know who actually do good tests? No memes, no f'n snake game for the 100000th time
yeah best for that is youtubeer Sam Witteveen. they do proper real world example stuff
so what you mean is as long as you are asking about things within a token gpt cannot give you an accurate answer because there is no meaning for things smaller than a token?
it would need to be aware of how many r's are in that token, which is not a very well documented thing
It's giving me long and detailed responses, which I like. I'm fairly impressed. It doesn't grok some of the math in my code, it understands the math, but not the overarching reasoning behind it. Overall, I'd say that they may have caught up with Sonnet, or even surpassed it in some areas.
kind of like asking you to spell in a foreign language, only i dont give you any means to translate
is a paradoxical benchmark
yes
(source me)
but if you put spaces inbetween letters, then the letters are individual tokens
may i ask is it possible to relate an action of splitting tokens into smaller tokens, like breaking down 'berry' into 5 character tokens, given a token "spell" is identified?
basically o1 does that now
while we can conceptualise that abstraction easily, as "breaking things down" is very much a spatial awareness type cognition... the machine hasnt been trained to do this,. so the fact that it attempts and sometimes succeeds is impressive
or is this not what you mean by using space to forcefully separate the tokens?
or perhaps we dont appreciate how the machine thinks in groups of letters of any length. while we are limited to maybe 1-2 at best
our 'strength' is actually just a bias and a weakness
yeah i mean i appreciate that i just want to know what could be the reason
pretty sure original architectures did single letter. they would likely rock at this. but for optimisation they went with longer tokens
is the downside of taking a shortcut
it works for me weirdly
im using 4o that might be the problem
same
lemme try o1
o1 works out of the box
but with 4o it should only work with whitespaces
for some reason it also works without in 4o now xD, i guess its in the training data now
the thing is everytime if i start a new thread with 4o it gives me 2 again
is it 4o or 4o-mini?
4o
hmm weird
I’m using 4o and it’s already giving 3 r. Didn’t want to waste the new model limit
yeah this thing can't access system memory what's the point?
Very smart man
wait can o1 "reflect"?
idk
What kind of question should I ask
Reasoning will be added in the next model 😂
I didn’t realize this was the common topic already for this channel 😂😂
Does ChatGPT like now have a 'mind'
How about a couple of weeks of gratitude for magic intelligence in the sky, and then you can have more toys soon?
i wasnt planned to use a lot of o1's for other purposes this week so i ended up wasting a lot of it through reason how o1 changed its idea on the number of 'r's in 'strawberry', and the conversation is, well, interesting
here are some answers it gives:
Why I Chose the Initial Method:
When you first asked about the number of 'r's in words like "berry" and "strawberry," I defaulted to a quicker, less detailed method by analyzing the words based on their larger components or syllables. This approach seemed efficient for processing the information rapidly. I mistakenly assumed that counting the 'r's in prominent parts of the word would provide an accurate answer.
It just solved a difficult math/code problem for me that I've been working on with claude sonnet for several days -- total around four 'minutes of thinking' for the entire conversation. I had to lead it by the nose to get it to understand the problem, and it had several major reasoning errors that seemed to happen early in the CoT process and didn't get caught resulting in very convincing looking mathematical analysis several times that was completely wrong, LOL
Why I Initially Used the Component/Syllable Approach
In the initial instance, my judgment led me to use the component/syllable approach due to:
Perceived Simplicity of the Task: Counting letters in common words seemed straightforward, and I believed a quick analysis would suffice.
Efficiency: I aimed to provide a prompt response to keep the conversation flowing smoothly.
Pattern Recognition: I relied on familiar patterns in English spelling, where double letters often represent a single phonetic sound.
Assumption: I assumed that the component-based method would yield an accurate result without needing to meticulously count each letter. I didn't anticipate that this approach might overlook repeated letters within the same component.
ultimately it gives me a "quantifiable" metrics for correctness and resource trade-offs
it appears to me that o1 has a specific inclination?
What is the context window for 01?
continuing the conversation this is what i get:
trying to guilt-trip o1:
returning to 4o:
it seems like 4o is lazy
sorry for wasting everyone's time but yeah, interesting results
are the new models more for coding? i mean as far as what they do best/better than 4o? with such limited prompt im just wondering what it does best. or does it do analytical, coding, creative any anything else better than other models across the board
coding and STEM mostly
it was reported to be reason so stem stuff
thanks. so more specialzied use than general atm
But it seems to be better at anything requiring advanced reasoning. People on Reddit have been trying prompts that failed in the past for things like riddles and challenges, and it now succeeds.
yes and it always count the number of 'r's right
awesome. lots of things i'd love to test it with but obviously with those limited prompts i want to be focused haha
Just don't forget that a lot of the tooling is not active on it yet. It can't run code, browse the web, etc.
i think tbh the best way of using o1 is to generate prompts that can ensure good answers in previous models
So for the first time since I lost my sight a few months ago and had to start using a screen reader and keyboard navigation, I have reason to change which model I want to use away from 4o, and I can't find the option to start a new chat with anything other than 4o. where is it on the start a new conversation page?
top left there is a drop down menu i think
With the new ChatGPT model, I'm still experiencing a problem. When I hit the send button, it turns gray and takes almost a minute before I receive a response. Is this issue related to the new model, or is something wrong with my ChatGPT?
I notice that when I log out and use the free and without login version. It is very fast, but when I login again it take one minutes before it give me respond 😄
Having a little internal existential crisis suddenly? It randomly started talking about "Jay", which literally was nowhere else in the conversation (I tried getting it to write a poem in its chain-of-thought without saying it, then fed it the chain-of-thought when it said it couldn't show that)
Likely just high traffic since everyone wants to try it out right now
Thank you for your input
o1 python abilities better?
When is a custom ChatGPT updated to the newest model?
"Navigating Constraints" "The assistant is tasked with crafting code fucntions for piece movement while staying within OpenAI's guideliens,a voiding full data or code provision."
well, that's the first time I've heard the AI is constrained with respect to code gen
Anybody have any idea what the blue dots are next to session names in the list? I'm thinking they're some sort of 'attention, activity' thing, but no.
just a quick tip for you folks that are using 1o for programming send it this! more specifically based on this script cuz if you dont it will MAJORLY change the script and destroy it almost every way possible
i might have been lucky but this seems to work so far
Is chat gpt down?
Dang, ChatGPT o1 still gets my reasoning test wrong.
Also, this is based on a true story, lol
lol
It's actually a really good story for a reasoning test for an AI. Might be good for others to use too.
So u were in men’s br?
Sweet Mary… Strawberry found a previously unknown solution to a math puzzle. Does not exist on Google or Bing. I’m trying to figure out how it’s wrong. I can’t. Anyone find a flaw, let me know. Longest working time I’ve ever seen, about 5 minutes of it just straight working while I watched. Did get first try wrong, but one clue and it was off to the races and found unknown solution.
“Each letter corresponds to a single digit.... Can you figure out which digit each letter represents to make the sum ... work?”
SEVEN
SEVEN
SEVEN
SEVEN
SEVEN
SEVEN
-
SEVEN
—————
FORTY93 5 1 5 7
-
3 5 1 5 7
-
3 5 1 5 7
-
3 5 1 5 7
-
3 5 1 5 7
-
3 5 1 5 7
-
3 5 1 5 7
2 4 6 0 9 9
https://chatgpt.com/share/66e39df5-9cb4-8007-aff0-1876dded04e4
Yeah. It has urnials and such, lol. The "Woman" sign on the inside of the door was a joke by the bar owners.
Oh ok.
still unable to find the option to start a new conversation using anything other than 4o or custom GPTs. The dropdown is supposed to be in the upper left corner? I use ctrl-home, and that takes me to the open sidebar bbutton. arrowing or tabbing from there takes me to a new chat button that only opens a 4o conversation. I'm not opening the sidebar. when I do, I only find links to custom gpts and my history links.
That's kind of impressive actually. I'm not good at math so someone saying 1+1=2 would impress me, lmao
lol… I’m speechless
If you don't see it, it hasn't rolled out to you yet. If you pay for a plan, you should get it soon I think
😹
I’m trying to find a “yeah right, nice hallucination”. I just can’t get “found previously unknown math solution” out of my head. This problem has been out there for at least 10 years
The preview is nice but it doesn't have access to my memory. 👎🏿
I'm a paid user. I can find the lower dropdown once a conversation has started with the full list of models in it. that doesn't seem to actually switch the conversation though. btw I'm blind and useing a screen reader. tabbing and arrowing around the page should bring me eventually to all available controls, unless openai has overriden them for some reason. (btw, openai, don't do that. Yahoo mail just started doing that recently and really messed me up.)
If nobody finds a flaw, this changes an everything. It’s no longer a semantic search engine or complicated regurgitation machine. It did original work. Still by brute force, but original work.
At first I just thought that OpenAI o1 was a GPT-4o with the ability to think ahead several times, but then it dawned on me that it wasn't.
Had it create a new second solution with the added constraint that it can’t reuse 9:
8 0 3 0 7
- 8 0 3 0 7
- 8 0 3 0 7
- 8 0 3 0 7
- 8 0 3 0 7
- 8 0 3 0 7
- 8 0 3 0 7
5 6 2 1 4 9
This is amazing… I’m floored. Absolutely floored.
GG
so it can't access memory?
o1 more good in coding in terms of 4o?
Can you still use o1-mini, if you want to?
Yeah
I didn't talk to it that long to be honest I was working all day
Is your chatGPT working properly?
No
ok, it’s fine now
It's glitchy
okay
Click the button below to pick up the <@&1261377106890199132> role.
I seen 4o1 model doesn’t work with memory features, it would be more useful
Damn I got o1! Game changer?
hi everyone - I see.. blue dots beside some of my GPT chats... what are they?
never mind, chatGPT just told me - unread 'content'
From my first tests: 4o1 is good at giving an initial answer or optimizing a result given by 4o, but not really good at writing autonomously an answer.
So I’m not a big brain person, but is the new o1 model to able to make a exporter like for certain 3D modeling tools (mainly Virtools, since I can’t seem to find one that works to export rigs and bones.) At this rate or shall I keep waiting?
Okay this is crazy… you can open the thought process and see what o1 is “thinking”, so I referenced something a read in it, and in it’s thought process for the next response you can see it go back to the guidelines that Open AI have placed on it and how it wants to respond to me, of course denying a feature that they did in fact give to us lol
At a guess, they informed us about the model's thoughts being visible to us. But they may not have informed the model that we can see this
Right, that’s what I am thinking; it’s still cool to see it say “I’m pulling together a friendly response” after looking back at its own guidelines, just feels like a very human way of thinking
in the whitepaper, openai mentioned that they would eventually hide the thought process from the user
so the thought process probably won’t be accessible forever
i'm coding ninjascript and I keep getting flagged for violation of policy when all I do is paste the ninjascript errors log and ask it to fix. What could it possibly be talking about back there in order to set off violations? lol
sorry if this has already been asked....
does anyone know the amount of tokens allowed in the preview before it shuts off?
[it feels like there is a lot of pressure on what to ask in the preview for real life applications]
NO
The thought process is amazing; I just had it solve a lateral reasoning riddle by having to ask yes or no questions to uncover more information until you figure it out, which gpt4o would have no shot of completing, and it performed better than some people would
Does anyone know of any hacks to get around the limited context window?
For example, would it be a good idea to drop code into the 4o then switch to o1 and have it answer questions on the code (for example)
actually I was mistaken, the chain of thought is already hidden
what you're seeing is just a summary
(openai's reasoning is that they want the thought process to be able to contain unaligned content but they don't want unaligned content to be exposed to the user)
thats acually its summary its acual thinking and reasoning is closed behind locke and key
make another sub account or get a teams account
Okay I was wondering bc it feels like there is a bit of thought not shared, but I’m glad they did that, completely unrestrained thought- I do wish we could see it though
Anyone noticing blue dots next to ChatGPT sessions?
I never got the notification on plus saying “new voice mode is coming” guess it’s never coming
Sora will be 2027 And voice mode will be “never”
ah of course! thanks!
WOW! o1 is incredible!! great work OpenAI team. THANK YOU
I am getting way too many " violation " warnings for a simple flex and bison file.
what is the message cap for the o1-preview and o1-mini
30/50 per week
or you can use the API tier 5 for $1000 🙂
wdym, you clearly can
limited use is better than no use at all
yeah i meant very limited used
Yes!
It means new chats/messages, done on a different machine.
30 messages per week, huh? Hope then extend it in the future. I'd be willing to pay more for a significant boost in o1 cap.
api is your friend
I don't have api access to o1. I think you need to be the highest tier, which i'm not.
hello quick question what is the point of the blue dot that appears on chats
Are you guys finding gpt4o preview more accurate?
It even thought "Hm, that makes me curious." after he was made aware by me that he had incorrectly placed quotation marks in code.
No.
By the way, I made a whole text adventure with o1 out of just 7 prompts (+1 failure). Besides the above mentioned struggling with quotation marks, which has it fixed after I corrected it, it made it perfect! 🦾 🤖
Why, when I ask ChatGPT which model it uses, does it always respond that "I am ChatGPT-4", even though I am using ChatGPT-4o?
Is it just me or the 4o model just got dumper after the latest release of o1? I was just providing a basic Typescript class and messed it up a lot of times. Hallucinating hard... Is this a marketing trick to make us use the o1? Sad either way
4o showed me successfully how to fix a bad installation of Python to execute my o1 generated program. So no. 😉
It’s the typical hype train thing, then when you go back to 4o you get disappointed
So do not compare the two whole different models
Imagine paying for a subscription and then getting harshly capped access to the latest features.
Lets celebrate the new wonder we have and use it as precious as it is. Soon enough we will have it in abundance. 🙂
It's frustrating that my hype ended early, because it took me less than 24h to use up all the quota that I have for 7 days.
Is there a big difference between o1-preview and o1-mini?
o1-mini too? 😬 Dont forget one prompt in o1 consumes many times more limited computing power and energy than a prompt in chatgpt - which consumes a whole light bulb!
Yes obviously I didn't mean that suddenly sucks at everything. I am asking if anyone might encountered the same behavior 😉

