#💬│general
1 messages · Page 66 of 1
Same issue with the perplexity pro searches resetting, icon turning to clock, icon becoming disabled but again becoming active after refresh. Is it possible to fix this?
I guess it's possible that DALL-E 3 is being upgraded to a new version
It's time for DALL-E to release its fourth version
Is there a way to have Perplexity exclude certain domains in its searches?
Tiring af to see sites likes Statista or friggin essay writing websites
you can use the search parameter -site:example.com
thanks!
Compare the Perplexity's Privacy Policy to OpenAI policy and Claud.ai policy. Highlight differences at the start. Evaluate Perplexity's policy against GDPR and PIPEDA. Provide a professional privacy and security opinion on the policy and score it in comparison to the market standards out of 10 at the end.
https://www.perplexity.ai/search/compare-the-perplexity-s-priva-NhVxsgjvSL2m5yr67omJrA
To compare Perplexity's Privacy Policy with those of OpenAI and Claude.ai, and evaluate it against GDPR and PIPEDA standards, I'll start by highlighting key...
not sure if sonnet's fault, but it feels like the quoting of sources is more often broken compared to omni
New gpt 4o-mini model out: https://www.theverge.com/2024/7/18/24200714/openai-new-cheaper-smarter-model-gpt-4o-mini
Hi everyone,
What do you think about adding daily flashcards to review what we've learned today using Perplexity? This would be really great as it would help us reinforce the knowledge we've gained with Perplexity. We could even consider integrating with Anki or Quizlet. What do you think? This version is more concise and structured, making the message easier to understand.
interestante
"Thank you for catching my mistake! I appreciate the feedback to improve my understanding."
Does it really work like this or is it a side effect and I have to report the thread to enable feedback?
the latter
https://arena.lmsys.org/ > Direct Chat as the model is an option. you can also play with the temp/top p/etc.
GPT-3.5 has left without a fuss 😅
any plans to replace Claude 3 Haiku as free model in perplexity? GPT-4o mini is cheaper and smarter. Only downside is Claude 3 Haiku has 200,000 token context size, whereas GPT-4o mini has 128,000 token context size.
Fs in chat
Why would I be getting a 🕝 icon over the pro selector switch during search if I payed for enterprise pro? Am I mistaken when I read that I had the ability to use unlimited pro searches?
Doesn’t matter in the long run since PPXI is limited to 32k for all models (officially)
So def run 4o mini for better compute
has anybody tried an accessibility WCAG 2.1 review of perplexity.ai? To adopt in Higher Ed, accessibility and VPAT will be important. Let me know please.
i think the devs messed up again, there are lots of request to ... a local server to fetch user settings 🤷
this could be the reason for the clock icon
gpt4 mini looks good
pricing is great too
probably going to replace haiku for me soon
this kind of amateur mistake is unacceptable for a billion dollar company
surely the billion dollar company has a staging environment to catch these issues
yeah lmao
do they not have like engineers around the clock?? if they have users in japan they should have workers there too
this the type of stuff that should be fixed in minutes not days
i have a feeling someone is pushing directly to prod lmao
with all the random totally avoidable errors that are making it to the main website
Not like they use the 200k anyway
They’ll probably do some internal evaluation then replace it with that likely
Unless they already have millions pumped into haiku capacity on PTU
如何使用
How to use what?
perhaps it's a repackaged gpt 3.5 using the 4o model
GPT-3.5 remains available via the OpenAI API and can be used in the Playground. For the time being, developers can still access GPT-3.5 Turbo for their projects and experiments via the API. However, GPT-4 Omni Mini is cheaper, so you might want to consider making the switch.
Which generative ai does perplexity use?
anybody having this issue alot?
gpt4o good for math, sonnet 3.5 better? at logic
hello! i was wondering if Pro users have a longer context length when uploading files
mmmh, this doesn't answer what context length non-pro file uploads have
But thank you
“By default Perplexity reads at least 4000 tokens per question but it can read many more with file upload. Longer pasted text is converted to a file automatically.
To unlock longer context windows, please subscribe to Perplexity Pro. With a Pro subscription, file uploads can be read with a context window of at least 32000”
I see
Claude Opus on Perplexity seems to be very dumbed down. For those of you that have Claude Pro, is there a difference in the responses?
#💬│general Lately, I have been experiencing a considerable drop in the quality of responses I receive from Perplexity Pro while using the Claude Sonnet 3.5 model as a pro member.
Similar to when OpenAI went viral and suddenly the quality of responses I got with my plus membership started to decline. This left a lasting impression on me, I realised that this is not what I wanted to get from my Plus membership, so I switched to Perplexity and also You.com. ( For different purposes, of course.)
The responses I am receiving from my Pro searches in the "last 8–9 days" have changed so noticeably that even a child could pick up on the unusual shift. I went the extra mile by fact-checking the previous responses with the new ones, using the same prompts, to ensure consistency and reliability. And the results have been disappointing!
Is this just me, or does anyone else also notice this issue with Pro searches? I sincerely hope Perplexity understands why so many people have switched to them and do not make the same mistakes OpenAI did.
KRs
does anyone know if there is a free way to have Perplexity AI read the responses to me?
try out android / ios app
might be is a pro only feature*
What is explorer role? I tried finding a channel that discusses roles but couldn't find one.
Will pro be getting a special color to set us apart from regular users?
Or in the very least, a role icon?
ideator, plexer, bug catcher: #📝│server-ideas message
explorer: #💬│general message
colors: #📝│server-ideas message
hello does anyone know if pplx is gonna drop gpt 3.5 and replace it with gpt4o mini in the free version? the price is much less and its a immediate upgrade
thanks
Got it thanks
they don’t use 3.5 anyway they use haiku
or a mix of models with mainly haiku
thanks for the reply bro
it might take too big of a hit on their 62 million dollar funding
Hey @brazen vine! Have you used it earlier? You can create 50 images per day, with each use given back to you in 24 hours after you used it.
Can you try refreshing the page, do you still see that it's blocked?
I'm so madge at the lmsys benchmarks. 4o is "better" than 3.5 at everything according to it... :\
I hate 4o.
Hi - how do I get access to the Perplexity "Pages" feature?
Hey @twin sluice! Pages are currently available to all the Pro users.
Oh wow - thanks - I keep scowering the docs every few weeks and couldn't find any indication of that. OK, I'l upgrade and try it out.
On the app… after 2-3 chats in a thread with the ‘Writing’ focus, the thread then starts pulling in sources from the internet for the responses, ruining the chat. And I can’t get it to stop using sources after that in the ‘Writing’ thread. Kinda defeats the purpose.
Is this by design?
That's one of the issues to be fixed, sorry for the inconvenience.
What does a collection prompt actually do? It simply insert that prompt to each thread?
And what about the profile in the setting page? It works as a global prompt which will be applied to any thread you start with?
I used to create bots on Poe, and it's kinda intuitive. When it comes to Perplexity I get confused.
Collection prompt acts as a "system prompt" at the start of each response in the Collection's threads. The profile is overridden by the collection prompt.
But I found the threads in a collection never follow the collection prompt...
Which model? I think I saw reports that sonnet 3.5 doesn’t work.
I tried pro search, s3.5 and opus, none of them follows
Try GPT4o, Pro is not a separate model, it’s based on your AI option. S3.5/Opus is from same Anthropic family.
For me, it did work but I haven’t used collections lately.
I just tested a collection with a prompt asking to speak in binary only, and it works. Using S3.5
Just tried, it doesn't neither..
I experiment with this prompt #💬│general message
It works fine when I directly use it in the thread as part of my request, by replacing the last line with my query. But I can never get it work as a collection prompt.
Appearently Perplexity made up a summary for this page:
https://www.nytimes.com/2024/07/18/us/politics/biden-election-drop-out.html
One person familiar with President Biden’s thinking cautioned that he had not yet made up his mind to leave the race, after three weeks of insisting that almost nothing would drive him out.
Hallucination or fortune telling? I used the Perplexity plugin (Chrome) to summarise a NYT article about Biden (https://lnkd.in/gwh5Wqdk) and here's what it…
I was able to replicate this hallucination. If I use URL alone, the problem persists. If I copy and paste the text from the publicly available part of the article, I get correct summary.
https://www.perplexity.ai/search/summarize-https-www-nytimes-co-1WaRiCMxSV27GJdg1YoszA
Is it possible the New York Times has hidden text somewhere in an ad or otherwise that has controversial content about Biden already dropping out? @signal hamlet
The best way to use it is one of two ways. Either start a new thread within the collection or transfer another thread into the collection and then send the prompt revise with the collection prompt to make sure it’s injected. It’s very powerful and useful once you have some default frameworks and prompts you regularly work with. Ideally your initial prompts start within a specific collection so it’s fresh in the context window.
But having a bunch of threads within a collection doesn’t mean any of them are connected or data from them is sent to the collection to improve it or anything. Just a container that injects a custom prompt.
If you’re struggling, share ether the collection itself or the prompt itself so we can look at it and test it on our side.
Collection prompts are shorter and usually have a specific way of working.
Sometimes converting your system prompt to json helps. Or have sonnet 3.5 revise it.
That’s the order I usually follow most times at least.
from 600 to 540
unlimited > 'virtually unlimited' > 600 > 540 > ???
where does it end
For example, I cannot make this prompt a collection prompt.
https://discord.com/channels/1047197230748151888/1237001047730290730
The threads never follow.
I would like to see more examples for collection prompts and best practices to write them.
it was always 600* until opus 50 came then after agentic pro search it becomes quietly to 540
||* the initial launch counted pro/copilot and search separately but quickly abandoned||
pro search does not attend to ai profile/collection
I’ll see what I can do.
ALWAYS start a collection without Pro search so it’s injected cleaning into the context.
Neither did other models in this case.. 😢
Tried last night. Didn't work. I shall give it another try then.
pro search toggles and all llm models (gpt/claude) are working independently
pro search took place between users' inputs and the llms' responses
pro search does not have access to the ai profile/collection prompt
all llm models do have the access but they are not used during the pro search phrase
you could think it as a two-phrase approach
phrase 1: pro search handles the user original inputs, and gathers information
phrase 2: llm replies in natural languages with information gained from supplementary materials acquired in phrase 1 (and with its own internal knowledge)
Great help! I'll study later. 👍
The advantage of perplexity in relation to other AI mechanisms is that it is very simple to resolve doubts about slang and informal context as it offers a wide range of perspectives. As incredible as it may seem, sometimes a simple question GPT 4o chat cannot answer..
someone probably said this before but free perplexity seemed to be upgraded to gpt4o mini
would be really cool if its true but still: https://discord.com/channels/1047197230748151888/1225549032718602332
Oh wait..
Is it the Supreme commander model? Or Claude? Or GPT-4o-mini?
as mentioned.. no self-awareness (and influenced by whatever tokens/system prompt precedes the question "who / what model are you?"...)
hmm, did sonnet slip and said it has a human brain? I thought ai companies try so their AIs don't appear human nor compare to human etc
What is the limit on image creation
“Try” is the keyword. Asking the model about itself will always yield incorrect results. Best to avoid doing in general regardless of platform.
The limit is 50/24h 🙂
how do we force perplexity to use certain websites in its searches only? through API that is
wow this is great. this works in the API as well?
Yes, they should work in the API. Example:
filetype:pdf "machine learning basics"
Awesome, thank you
does pplx-api offer support for citations and suggested questions yet?
citations in closed beta
cool! im guessing suggested questions not yet?
no public info on that
Hey, is there any way to turn off images that perplexity gives alongwith responses
You could block the side entirely using ublock origin or a similiar element blocker
However there’s no official way to turn it off
If you don’t want to see any images and don’t use the AI ones either then this is your best bet
So, from what you've written, I conclude that the ai profile as well as the collection prompts take in effect only in the second phrase, where the pro search has already been done.
So, if I want my prompt to influence the process of pro search, I should always put that prompt in my query. Am I right?
currently yes, as that is how i understand the black box
Ah, I thought you were the dev and were telling the mechanism 😂
But still a great help 👍👍
pro search as well as the llms should understand structured texts, xml, and pseudo programming language better than plain texts when writing prompts
So basically, can't we now come up with an easier way to have a prompt affecting the pro search but do not need to type every time? 😢
why doesnt desktop version have voice and tts?
The decoupling languages prompt is quite useful but I got tired to add it manually every time in my query...
you can leverage the site search function
Good idea. But doesn't work for Android chrome..
I cannot even export the bookmarks from Android chrome without synchronizing via Google account (
i havent tested it on android but it seems you can have javascript bookmarklet on Android https://paul.kinlan.me/use-bookmarklets-on-chrome-on-android/
this should make auto filling possible in theory
but are completcated
as a workaround*
Very nice! Tks
Hey may I ask where should the chat limit count for the Pro search be located?
it will show in ui when the counts fall below 100
Thank you
How to search for Collections for research better,
Let’s say I’m interested to know about ABC company financials , emotion , direction etc etc.
so I have created 5 collections like that separated out, and linked them in description and created a main collection
Now that I have 5 collections I want to use all of them together to get info triggering all of these 5 collection and summarized data using just one collection -
Can someone explain how can this be done
you'll have to pass a thread through each collection and use the prompt "adhere to the collection prompt" to get them injected. honestly, you'd be better off making a single collection that covers all of it so you can make sure the prompt is in the context window.
I need Perplexity to be able to execute code, and graph stuff. In fact, they need to have the power to retain long-term memory.
#⭐│starred message > this tool will help at least
I have autism when I talk I use the AI to speak for me here's what the AI I named Frank wrote
Hey there, Perplexity team! 👋
Oops! Looks like I accidentally posted my message in the welcome channel. My bad! 😅 I'm still getting the hang of Discord.
Just to clarify, I was trying to reach out to the developers about some ideas to improve the app. I've got some thoughts on making the AI more customizable and flexible, especially for users like me who use it for writing and communication support.
Is there a better place to share these ideas? I'd love to chat more about how we can make Perplexity even more awesome for all kinds of users.
Thanks for your patience, and sorry again for the confusion!
I hope it is comprehensive appropriate and correct
That message again please
I don't understand what is going on someone help I keep getting a pop-up message that says one message at a time or something it's too fast for me to read
#💬
#💬│general am I doing this right now
#💬│general am I doing this right now
This is not an introduction this is a conversation I did exactly as I was told to do
#👉│feedback-general Would be the best channel
I respond to that thank you
Ras can I still communicate it keeps giving me the same message
I think the moderating bot was just glitching.
HI
not required but Pro Search will be more comprehensive.
Hi all. Is it true that turning pro on will ignore the profile?
Should i use collection instead to get better output?
i don't think that's the case
using Pro search might make the AI less likely/able to adhere to the instructions in the profile prompt (especially with longer conversations)
I read someone say profile is in phase 2
i'm not sure what is meant by phase 2?
it shouldn't make a difference afaik
Mean pro get the research first then follow by profile
I find the profile has limited characters allow
I am using sonnet 3.5 for code assistance
ohh i see.. yes that's kinda the structure
I always get the same mistake from assistant even though is stated in profile to learn from past errors
How do you ensure assistant learn and do not repeat same mistakes
but it means instructions in the profile about searching won't be seen by the system that does the searching, so saying only search xyz.com won't influence the search process. but the profile instructions will influence how the responses are delivered (like say "talk like a pirate" or whatever and you'll get)
you can't
nothing perisists / is memorised across conversations
every new conversation starts from scratch
Oh. I thought they’re suppose to adapt to threads
no unfortunately not. it is totally oblivious to anything said in previous interactions
I also state to clarify by ask questions but so far is not happen too
So i start to wonder if pro interfere with profile setup
But i do noted that every specific instruction is taken care of
Btw i use focus mode
it's a bit tricky but there are basically two models/systems. One gives the answers, the other does the searching...they don't see the same information so it's tricky..
Yup. I just find it not smart dealing with a few instructions and sometimes i wonder is it perplexity or the LLM we used determine the quality
i see i see.. keep playing around / experimenting - there are limitations and quirks, the more familliar they become, the easier they are to handle
or you could share your prompts / use case – others here may be able to help out 🙂
define quality
pro search (tried to) provide relevant information
llm is responsible for fluency and consistency
it reads a bit schizophrenic when models appear to not know themselves 😄
i've seen claude haiku (and other models) think claude haiku is a model specialised in generating haiku poetry
which kinda understandable tbf ha
though come to think of it.. I haven't seen sonnet confused as a 14-line poetry specialist..
quick search
writing mode
I personally never use Pro search for anything tbh...
I value my context window too much~
pro for nothing, really? I usually use pro when searching for multiple things (or even specific attributes of multiple things) and asking for comparisons/recommendations (not always a success, but chances are better than without). in normal search as well, since I often forget to turn it off 😅. but it's true it tends to be slower. especially after the addition of the Programming step that sometimes gets triggered incorrectly and at best this will slow down a response. at worst it will break text encoding, waste context, may confuse LLM
I think for pro users, it is same context window size, but search results from the pro mode will fill it more (possibly with bad data and probably diluting user's instructions)
Yis, more sources and pro search steps means more token per response
And depending on sources... Can break user intent/requests and occasionally focus on source rather than user
not only dependent on found search sources, it is also dependent on user's prompt too. if the system chooses based on the prompt to run a programming step which outputs nonsense, it can confuse/steer the llm. like what is happening with emojis/languages https://discord.com/channels/1047197230748151888/1261713638293635112
How does perplexity have such a huge valuation with so many problems? Does this business model even work or is it pure founder/AI hype?
Well, that's the million dollar question (no pun intended). Part of it is the secret sauce that ended up being ignoring robots.txt, and the other is bigger companies eyeing it.
huh, interesting. what about the aws lawsuit? i login later to see they're good chums now lmao
it is overhyped for sure, but search is quite good, uiux could be better, but havent seen better uiux combined with solid search yet elsewhere (maybe google is getting there?). and ratio value/price is really good (600 uses of sota models per day, though with some extra limitations opus 50)
Well, think of it like 4 rich dudes (Anthropic, OpenAI, Google, and Meta) buying lunch for each other but the check is higher than expected... They all are used to being "cost doesn't matter", but the first one offering to pay will start a war over the "tip size"...
Perplexity's high valuation + bidding war = ?
lmao
agentic search is nothing new, but even the non-agentic one felt usable. many months back I remember trying few alternatives and it was very bad (I realized that good ai search is probably not that easy to pull off and stopped being that hard on pplx). maybe I missed it, but I think they were early and had something working. add some ads and they are winning
not that I use it much, but "pages" felt kinda new. saw only some hacky ugly "do it yourself" barely working solutions before that. front-end wise, I think they did good. last time I tried pages though, there were issues with "visibility" (context), what llm sees when generating new sections. not sure if fixed
What are the best settings for perplexity pro for different use cases (e.g., research, general knowledge, coding) - which model to use, when to use pro and when not to? Does anyone have any resources they like?
Hey, @spark crow! Please post the projects you are working on in the #😎│cool-projects channel. Thank you!
WOOOOW!!!! Prplxty was featured on LMG's TechLinked last month! 🤩 (timestamp-link: https://www.youtube.com/watch?v=ts3DqM_t3fA&t=38s ) 🤩 🤩 🤩 🤩 🤩 🤩 🤩
Use our link https://ground.news/TechLinked to get 40% off the Vantage plan. Access local perspectives to better understand world politics and current events with Ground News.
► GET MERCH: https://lttstore.com
► GET EXCLUSIVE CONTENT ON FLOATPLANE: https://lmg.gg/lttfloatplane
► LISTEN TO THE TECH NEWS: https://lmg.gg/TechLinkedPodcast
► SPONSO...
It has many issues and using a lot of things here is annoying, but it's cost-effective. And that's mainly what keeps it going. The search function is decent too, but overall, the execution as such is really poor
I've been looking for something better for a long time, and there was always something wrong. So I'm sticking with Perplexity and other LLM services as a supplement 😀
pure hype
Is perplexity down? Not loading or letting me log in on the app.
If I'm not mistaken the listen Discover feed is only available on IoS, when are you going to release it on Android ?
Work on my side
This is a dumb question, but does anyone know if/when an Apple Watch app is coming out for perplexity?
I highly doubt it will.
._.
We're barely getting the regular apps for iOS and Android to work.
Aw 😦
Yis, it is sadge.
There’s an independent app called Per Watch.
Guess it’s the best we’ll get for now.
Hey is the ai able to generate images?
Thanks, is it possible on mobile?
Thanks again
Which model is used in non-pro search?
could be gpt 3.5 haiku...
hey I'm a big fan of perplexity and I want to try pro but I kinda wanna try it first... I realize that there doesn't seem to be any discounts aside from a few but is a small free trial available?
alternately, can I ask if it's better to subscribe from android, pc, or ios?
Because I can get a certain amount of cashback via subscription through certain modalities
thanks!
Service is the same regardless of where you subscribe, you have a single subscription, it redirects to a stripe page for payment
There's a 10$ discount bonus if you get a referral link, last year there was a 2 months free code given after a 1 month free trial
Is it stated anyway officially?
I can show you my referral
👀
I've never tried this before though
You paste send him the link, then either of you gets 10$ off
no i dont think so
Can this be applied to annual subscription too?
Or it only works for the monthly?
I appreciate the information. Will this get us in trouble though?
I don't think so?
Why would you get in trouble? It's the way it's meant to work
It's an official method to get discounts
Do you offer any discounts or referral codes?
While we don't offer ongoing or student discounts, Pro subscribers can take advantage of referral discounts. For each new Pro subscriber you refer, both you and the referee receive $10 off the next billing cycle.
There's no limit to the number of referrals, and discounts apply automatically once per billing cycle.
Y a t’il un moyen d’avoir la voix pour parler et répondre sur perplexity sur le PC.? Windows et edge
pas pour le moment
Pas tout de suite, mais à l'avenir. Je trouve que chatGPT et Character.ai sont de bonnes alternatives. Vous pouvez même demander à character.ai d'utiliser votre propre voix.
yes I can't seem to make perplexity see my document, no matter what I do
it's a current bug?
yes, there seems to be a bug in Perplexity at the moment. please send a link to the thread where this problem occurs in this discord thread https://discord.com/channels/1047197230748151888/1259631790340178051
Haiku probably
Merci
web > android > ios mobile platform like google/apple also takes a cut fron your subscription
Perplexity n'a pas (encore) implémenté cette fonction sur PC. Mais oui, c'est possible indirectement. Dans vos paramètres Windows, sous la rubrique Accessibilité > Voix > Autres options de commande vocale, vous trouverez l'option 'Saisie vocale' (voir la capture d'écran). Cela vous permettra, en cliquant sur l'icône du micro (voir la capture d'écran), de parler. Pour lire la réponse (dans Edge), faites un clic droit devant le texte à lire et choisissez 'Lecture à voix haute' (voir la capture d'écran).
Raccourcis :
Pour parler : [Windows + H]
Pour lire : [Ctrl + Maj + U]
Merci beaucoup !
Ça fonctionne bien, encore merci 😃
What model is used in free search
gpt 3.5 (turbo) or haiku i think
Alr ty
is server facing issue?
just tested it against llm arena and (at least for me) it doesn't look like haiku. on arena for nonsense prompt very concise one-line gothic mix of colorful ackermann in haskell haiku tends to write only one line of code without comments, while "default" gives me more explanations (in all of 3 attempts). it looks to me more like the new omni mini. (big sonar writes much longer and gpt3.5 gives text, not code)
Anyone else having issues that perplexity are no longer able to read uploaded textfiles properly? Im paying for Pro.
Seems to have happened sometime today
Same problem here today.
^ @upbeat charm
Same here. Happened 3-4 hours ago
Ah thx
if the attachment file does not contain any PIIs, pls also include it in that bug report
Oh sorry wish I could, but too many things I cant share in it. It worked fine on friday tho. A pretty ordinary text format log file.
🦙 saw some torrents on reddit, not that far of a terabyte. not sure if it was legit though
@fast meteor btw. How well are your information protected when you turn on "dont use my data for training or otherwise" kind of option? Should I still anonymize everything?
I think the options is called "AI Data Retention" in settings.
and general privacy policy below
and you can get soc2 w/ enterprise pro
https://www.perplexity.ai/hub/faq/pro-enterprise-faq
But you are not able to "Hi, I want to see the data you have on Peter Gantric with address xxx" in any way? or "Show me information used for company XXX by other users"
I use it mostly generically but good to know.
you need to ask dev like @signal hamlet or tech support team. but
from my understanding, unless the data is publicly available on the internet, it wont show info like those.
Thx
it was: 782176.86 MB in total
safetensors files in 191 parts
the readme file in the torrent is ... weird, but also seems legit
this is what i got from omni model
Alexey Romanov is a multifaceted professional with a rich background in data analytics, engineering, business management, and education. His current role at Perplexity AI allows him to leverage his skills to create value and drive innovation. With a strong foundation in engineering and a diverse career spanning various industries, Alexey continues to make significant contributions to the field of data analytics and AI.
sourced from linkedin
Nice. Just a bit funny hehe 🙂
I had settings at Claude 3.5 Sonnet.
Aha, I see I wrote Alex, not Alexey. That probably explains it.
have you taken into account the "helpful assistant created by perplexity" system prompt?
and ai profile
I thought about it, but even the code differs between models too (for example some models like to use HOFs, bind etc; while others don't even write a function and just an expression; some don't even write code at all). that was with ai profile disabled. I can't rule out the system prompt of course, but the omni mini output was repeatedly so similar to the "default" on pplx so I mentioned it
also temperature
perplexity has a really low temp, right ?
i mean that shouldn’t change much but it could to some extent
shouldnt temperature affect more variety? it was quite consistent, mostly changes in the text around, but used operations and structure of the code seemed similar, same thing with approx amount/length of commenting text. also I think for gpt4 models lower temperature wasnt as bad as for claude 3 family. though again, not sure if it affects output length or preferred code patterns that much (like that haiku test). didn't thought so, but havent really tested
also the default model very well may not be one model
no, it is not
since perplexity has tons of users, it’s likely they use whatever they have enoguh capacity for
but the one I tested today looked like omni mini
which is probably a mix of one or more of haiku, their own online LLMs, maybe 4o mini and maybe 3.5 turbo
Also might depend on region
very likely they have seperate throughout purchased in say japan AWS/azure regions to reduce latency there
Also might depend on the 5 sources it pulls (or well whatever context is provided)
In writing mode it very well could be an 8k or 32k model over the other larger context ones to save money
Or if say the 5 sources are all relatively short, etc
I haven't had 3.5t for a long time. I think mostly haiku, sometimes sonar. at that time others had sonnet 3.5 (possibly the non-pro users for limited uses, under the hood; at least from some tags on fe it seemed to point to that)
true, i doubt it would change the patterns itself
yeah there was a AB flag for sonnet
assuming you’ve seen my AB viewer thingy for perplexity (?)
only screenshots here on discord 😅
oh do you want the link
you can fingerprinting the model if you are bored
https://arxiv.org/html/2407.10887
I want to improve the ai profile but the character limits restricted me to improve further. How?
Anybody know if a Mac app is in the works?
Adding to that, does anybody use Peek on Mac? How are you liking it?
this?
Updated with a new version as of April 30, 2024 🚀Peek is a MacOS Menu Bar application that allows you to interact with multiple AI chatbots in one place. List of AI you can access:ChatGPTGeminiPerplexityClaudePoeLabs by PerplexityPiCopilotUpdate: Now you can also access Threads by Instagram in Peek.No API key is needed! It provides a seamless an...
Do custom instructions for your profile in the web version carry over to in app?
it should, yes
Alright, thanks. Wasn’t sure if those features carried over.
is ... this true? do the models get info how much "space" they have left in a response? I have a suspicion sonnet is lying to me 😶
i mean it probably is lying
how would the model even like actually know how many characters/words/tokens its outputted accurately
sure you could tell it the convo so far has say 16382 characters
but then how does it know exactly, or even roughly when to stop
LLMs tends to be very bad with text length, especially with longer text. I wonder if it isn't something what some search step returned, or the agentic system itself, like "can't do more searches" (it did miss some requested data about later movies) and sonnet interpreted it this way
hmm, or could I have hit context window limit just from searches?
how long are these threads? how much actual text?
fairly sure you can copy/paste the entire thread and do the token conversion.
I know openai has one, but this one is pretty simple/handy: https://llmtokencounter.com/
LLMTokenCounter: Manage GPT-3, GPT-4, Claude, and other LLM tokens efficiently. Real-time, accurate counts for optimal language model usage.
PPLX is limited to a 32k window
thread, well, it was a first response.
I used the linked token counter, response: 740 tokens 2963 characters 389 words
my prompt: 90 tokens 360 characters 57 words
the openai tokenizer returns slightly higher number. but this is just my input and output, not the search data passed to the sonnet, no? edit: and my profile. but I think even all that visible to me combined is very small amount of tokens
very little chance it's the context window then.
32k means 32000 tokens
feel free to share the thread
i believe the conversion is 4 words per token. "page" sizes vary though.
But let's assume you fill the collection prompt with 2000 characters (the limit), and add the pro search steps. How many tokens are being used?
Hey how the heck is perplexity soo fast!
Are you using Sonnet 3.5?
No idea, I just came to know about the product from a vidoe, created an account and have been trying it around, it just gives answers in second
I see, I see. You should be able to check in your "thread" (name for conversations) what LLM is being used.
I have no idea what you guys talking about, btw are you on pro?
It's usually at the bottom of the responses.
Collections are like... Mini-tuned models for LLMs. They have a prompt that is injected before each response to guide the LLM in a specific direction.
Yes, I'm on pro.
I see
"Token" is the term given to the measurement of words/characters it can remember. One token is usually 3-4 words, depending.
Perplexity limits their model's "context window" (their memory for the conversation) to 32k tokens.
is it enough for you?
Ergo, the question was how many tokens are being used in the background before the user asks the LLM anything, and how much would that affect usable prompts before the window is filled.
For most things, yes. When you approach the limit you can usually ask the LLM to summarize the conversation then start a new thread to continue from.
max 32k token each turn/round*
multi turns/rounds convo can go to like 128k
One token is usually 3-4 words, depending.
that's ... a bit too much for a token, or did it already moved so significantly from gpt4 times?
dont quote me on that, that was a long time ago
i cant find the original post and im not sure whether things have changed or not
hi
hi
Try it without the emojis :X
so intelligent
hmm
try it without pro
@green merlin LLMs are known to be bad at math, on any service, not just perplexity. When you use the service without the proper tools that make up for that deficit, its honestly just showing you're uninformed as to the limits of existing llm technology.
LLMs are just word predictions
😂 oh no the LLM is dumb, I gotta white knight for it
Mine got it right tho?
@fading moth looks like you were using claude. by default i got gpt 4o i imagine
4omni
@green merlin No. I'm saying its bad at math. I'm saying that Perplexity provides tools that make up for the shortcoming of LLMS, and by choosing not to use those tools it just shows that you're not using the tools available to you properly.
do you just sit on here all day defending perplexity? typing 9.11 - 9.9 or bing or google gives the right answer immediately. i am perplexed that perplexity gives me such a wrong answer to a basic math question
saying i'm not using it properly is a dumb argument
It seems like a prompt skill issue, if I'm being real :I
Except you're not using it properly lol. If you're not going to use the tools provided to you, then why come here to point it out when the soolution is simply to use the tools provided to you?
you guys are nerds lol "prompt skill issue" 🤣
lol You're either a troll, and I'll stop feeding you. Or you're beyond help. Good day, sir.
Agreed.
I know, I know. lol
perplexity needs to make changes to improve the UX, that's all im saying
saying i have a prompt skill issue is you guys being in denial to what's obvious
He's trying to DM me @pale compass xD
it's amusing see you guys try so hard to defend perplexity 🤣 google wasn't confused by my question
Why do I feel like he was here a few months ago doing the same thing....
Probably was
don't be obviously ignorant to how a platform is meant to work when the entire community would agree you are being petty and avoiding actually learning how to use these tools correctly. When most of us get an obviously wrong answer, we revise our prompts to at very least test alternate outputs. that is a core skill in this environment. It's not alive. It has a specific way it was trained. talking to it in that same way will always yield better results.
lol seems like perplexity is making enemies
its mainly because of discover
I hope so
😐
on skibidi?
...
tl;dr -- Conde Nast wants Perplexity to stop using content from his stuff. The New Yorker, Vogue, and Wired.
They are also losing out on ad revenue. Users are getting the content from the discover feed, not their site. it'll be a complicated period for a while I would assume. Especially as they improve Pages.
Use any summarizer bot on Poe or similar html grabbing sites
i think i saw this posted somewhere else, so yeah
i mean they cant or wont be paying back to sources any time soon
Honestly, discover seems like the most expendable part of perplexity, so I don’t really mind if they drop it
It has gotten very sparse length wise lately. Not sure if that's pressure or what. i feel like the old discover "threads" had more content at times.
Hi @everyone.
I am a senior Blockchain and Full Stack Developer has a rich experience.
About me:
- Built website with SEO-Friendly & Pixel-Perfect & RWD from Design using JavaScript framework or libraires such as React, Next.js, Vue, Angular.
- Designed, implemented and consumed Restful APIs or integrated several APIs and SDKs(ChatGPT API, Firebase SDK and so forth) with projects.
- Designed and implemented efficient database solutions using tools such as MongoDB, PostgreSQL, MySQL.
- Built the Mobile app using React-Native, Expo, Flutter and deploy it to the the Google play store & App store.
- Built the Chrome Extension using React and deploy it to the Google.
- Used several tools such as Trello, Asana, and GitHub for collaboration.
- Built several staking, exchange platform and wallet chrome extensions, some ICO projects.
Now I'm looking for a new opportunity, so I can start work immediately.
And I can work with you properly in your time zone.
I am always ready to work with you.
If you have free time, could you please check my portfolio.
https://jason-mendoza.vercel.app/
Please feel free to contact me directly to discuss about your project.
Jason Mendoza - Fullstack Developer (Web & Blockchain & AI Tech)
I don't know whether to chide you for posting something like that in a general chat of a Discord or be impressed by the brashness.
Does anyone know if there's plans to fix the voice reading issues when going over numbers or currency? Also perhaps to have a speed modifier on the voice?
WHY IS IT SO SLOW TODYTA
if there's anything perplexity has right now it's plans, cus we sure aren't seeing any meaningful progress
What impact will this have on pplx?
I'm now hesitate to get an annual subscription.
Honestly ai moves so quickly I might move from annual to monthly too
reading numbers issue has been reported to ElevenLabs long time ago
I have a question now. I purchased the enterprise version a few days ago for $400. How many seats does this include in total?
Self-Serve
Companies with fewer than 250 employees can access Enterprise Pro through a self-serve flow, with pricing at $40/month or $400/year per seat. (https://www.perplexity.ai/enterprise)
Hi
that won't change how the rest tokenized. tokenizer is pretrained, it is not adaptive. each color represents one token (color change in the "stream" is token change to be precise). so for english it is usually one token for common words, for other languages it tends to be way worse (e.g. " příliš" = 4 tokens). for less common words, like " tapestry" those are 2 tokens, " bioluminescent" is 4 tokens. I haven't seen a mainstream tokenizer which encodes so many words, rarely two
One token is usually 3-4 words, depending.
Ah
:3
Seems like it shifted quite a bit since I was last asked that question (around the release of GPT4). Thank you for correcting me.
omni tokenizer made some improvements if I remember correctly. but I think mostly for non-english (eg that Czech), so it is closer to english now - common words in foreign languages have fewer tokens = cost less, closer to english
32k*3.5 words would be fairly large. sadly we are closer to 32k * 0.75 words (not that it can be reliably and easily used on pplx)
Or maybe I was confusing it with 3-4 characters instead of words in my brain

Either way, I'm glad I know the current state.
oh, those 3-4 characters seem to fit much closer. usually I saw the 0.75 tokens = 1 word
edit: with *, thats for english and natural language
Does perplexity ever plan on incorporating twitter searches into the social search? It’s just Reddit right now right? Better yet would be nice if it could do a broad Internet forums search.
twitterx searches would be a very nice feature, but I think their API is insanely expensive. probably pplx would have to struck some better deal
https://x.com/perplexity_ai/status/1603441221753372673?lang=en
pplx was a twitter search engine initially, they now pitched towards general searches
Introducing Bird SQL, a Twitter search interface that is powered by Perplexity’s structured search engine. It uses OpenAI Codex to translate natural language into SQL, giving everyone the ability to navigate large datasets like Twitter.
https://t.co/N1BtF47JYu
Perplexity Pro is now available in the AWS Marketplace? Couldn't find it
wow, didn't know that, thanks for sharing. that tool looks quite useful
but the ceo admitted it was harder than expected
https://lexfridman.com/aravind-srinivas-transcript/#:~:text=(01%3A46%3A42,by%20the%20way.
(01:46:42) Correct. The reason we picked SQL was because we felt like the output entropy is lower, it’s templatized. There’s only a few set of select statements, count, all these things. And that way you don’t have as much entropy as in generic Python code. But that insight turned out to be wrong, by the way.
also Musk came along and made the twitter API stupidly expensive.. lots of companies (and researchers) basically got screwed overnight
but yeah i think they also just realised that they were onto something bigger.. like it started as an internal slack bot iirc, then the twitter sql thing - the whole internet would seem the natural progression ha
main issue is either pay like a ton a month
or you have to use scraping
Which model do you use the most in your searches?
just 42000 usd per month
Will perplexity be the first provider of Llama 3.1?
also btw can anyone from perplexity fix the deployment of Gemma 2 27b in playground
aha yup.. or $2.5m per year..
apart from a the big ones like talkwalker, hootsuite, a bunch of social monitoring platforms were just forced to drop twitter entirely (as well as lots of researchers)
I wonder should i get a claude pro account as well to take advantage of its latest feature for Artefacts
you mean the artifacts sharing, artifacts in general, or do they have something newer?
Hello, hopefully someone can help. I have a pro subs on my icloud account, I signed in to the web version via apple sign in, but it appears to have set up a new account with the same email. I dont have access to pro features
They have this code preview for conversion image to working app
that sounds like artifacts (html preview/embed), I don't see anything newer on their blog. if that's the case, you are on pc and feeling adventurous, you can achieve some results also on perplexity with ailin - #😎│cool-projects message
I have Claude. I would not subscribe to it just for Artifact. The new AI itself Sonnet 3.5 is great though
I wonder is it the same using on Perplexity for Claude 3.5 or the one one Claude is more advanced
i've noticed the quality of perplexity has been dropping alot recently
especially for reading files
i've been using perplexity pro for 8 months now, and have referred friends to also use this service
i really hope i dont have to find alternatives :/
on claude I think you get more context and higher temperature (more creative responses), but much less uses. yeah, file handling is much better at claude and chatgpt. also ailin is not official, not easy to set up and not as user friendly as artifacts on claude. ailin can do some extra things, like code execution (don't think claude has that) and ability for that code to use internet (I think that ADA can't do that on chatgpt), persistent filesystem; but ailin also has drawbacks, like nothing similar to that easy artifact sharing on claude nor specialized react embed
personally I don't use file upload much, it's been hit and miss for code for a long time. but if it used to work for you better, you should check pro-feedback or bug-reports and share example threads where it fails. at least from discord, it looks to me devs are working on something related to files
at this point, it just doesnt read the files anymore it feels like
I use perplexity for internet search type searches. That's what it excels at.
yeah i use both and it used to work better. idk whats going on :/
it is the same model, however, what different is the context windows (A context window in AI refers to the amount of text (measured in tokens) that an AI model can process at one time). In perplexity the context window is only 32000 tokens, in claude this is much higher. So as long as the conversation goes, perplexity will come to a point that it can not process all the information at once and start to lose relevance and consistency.
maybe they reduce the context window or modify something under the hood, perplexity direction is in my opinion just like Cory use, for internet search type and replace google. For processing pdfs or handling document, you can use chatgpt or claude
I know I read file uploads under 32k tokens should be put into context, but I just never see that. it always felt like RAG to me. to be fair, I think in most cases I sooner hit response length limit (I think around 300 lines of HTML) or diluted instructions so much (by search results?), it stops doing what I instructed it to (when I forgot to go into writing focus or to disable pro search)
random use of "Programming" step is also not helping that
it really shouldn't use programming step to write 3 tweets...
is your file in pdf format? an alternative: copy the paragraph for context and also include your query to ask perplexity
even when i have large pastes that turn into paste.txt, it just refuses to do what i ask
it doesnt really even read the file
btw I tried converting very short source code (html with js, nowhere near 32k tokens, more like 1-2k) to pdf and while it seemed to see the code from the pdf file, it also consistently failed to reproduce it 1:1, like forgetting , in object literals. it was correct in the pdf and sonnet has no trouble repeating same source code when I feed it in parts
i know that this is not ideal but i dont think you need that much of the context
to get answers with the help of the internet
the cost of doing those queries i gave it were probably too high
so they have to nerf it via code
soon or later we will come to realize that perplexity can not help with much code because of context window limitation in questions and also in the response itself. For best code and document handling performance, I would use claude 😃
Hey @stuck topaz!
If you find the original message helpful, please consider reacting to it with the :star: emoji. If the post is appreciated by the community and receives 5 stars, it will go to the https://discord.com/channels/1047197230748151888/1082806833938436228 channel and the post author will get the <@&1082034222778302614> role on Perplexity.
I am trying out cursor. a bit less features compared to IDEA, but the AI integration is very nice and they have sonnet 3.5
Hey @stuck topaz!
If you find the original message helpful, please consider reacting to it with the :star: emoji. If the post is appreciated by the community and receives 5 stars, it will go to the https://discord.com/channels/1047197230748151888/1082806833938436228 channel and the post author will get the <@&1082034222778302614> role on Perplexity.
this is my semi automated workflow for slides/ppts
use gemini flash to perform ocr and image transcript and get a better understanding on contexts and intuition behind using perplexity
🥔
Meta Llama 3.1 8b, 70b and 405b has been released today. When can we expect it to be in Perplexity?
I'd bet pretty quickly, they likely just have to switch a few config params. I'm not sure but they're likely just wrapping AWS Bedrock for these models.
not on bedrock just yet
hmhttps://www.aboutamazon.com/news/aws/meta-llama-3-1-models-AWS-generative-ai
what region are they in
found it
theyre in oregon
Yeah Bedrock llama 3.1 links are 404ing for me anyways. Theyre probably getting hugged a bit too
Ollama has 3.1 8b and 70b integrated too
I don't have 6 4090s for 400b anyways lol
3.1 8b inst
well yeah ig its instruct and not chat
90TPS not bad at all
seems like some stop tokens arent added in yet
Its also available on build.nividia.com/explore/discover
They have 405b-Instruct up too in their playground
https://build.nvidia.com/explore/discover#llama-3_1-405b-instruct
Perplexity, when Sonar 405b?
To get around the context window you should be using multiple threads for separate tasks. If you try and do everything in one thread you’ll have a bad time.
This is also just a general rule people follow for other LLMs. To keep it focused on the task at hand without the chat baggage.
This is why Im dreaming about being able to chat with collections of threads 😛
Sonar 405b on lab?
Anyone tested llama3.1 for actual use yet? It feels so strange or broken, but maybe that's provider (Fireworks) problem
they must use LoRA or similar to speed up the inference and/or RoPE to extend the length
it works fine on aws
Collections now have Threads/Pages on iOS
I wish cursor allowed you to use local modals. That would be great
are you sure it can't? I see in Cursor Settings -> Models -> OpenAI API Key a field for custom base URL
though I think not all features will be working. I believe cursor has some own models
They route (routed?) all requests to AI providers through their servers, so it won't work with local ollama instance, you'd have to make a reverse proxy to expose it
At least it was few months ago
oh, so it must be public facing api server. didnt know that
We just got llama 3.1
oh really, that was fast 😄
I was just switching models to get different results when it just showed up
didnt phone apps needed to update, or is it now pulling models from backend?
switch it on a web version, then you can use it from the app. this was written from an android app which showed model in settings "Default" https://www.perplexity.ai/search/write-short-poem-in-czech-abou-ksVKF84qQNG2wH8Q7ia2NQ
Yeah, its available in Web version
hmm, llama doesn't seem to like japanese symbols 😑
pretty sure it doesnt need an app update now
it pulls it from their AB test config
on android doesn't seem to. killed the app by force and still no llama 😦
There was a lag of a few minutes in between web perplexity and iOS app for llama 3.1, maybe it will appear in android app soon.
Well that was fast
Yo recibiendo un mensaje de Discord que creia era importante, era @signal hamlet }
these guys keep getting better and better
what are yall thinking, llama or claude 3.5 sonnet
isn't liama , available free on WhatsApp?
We got 400b llama 3. Just a matter of time before 1,000,000,000,000,000 model drops 😮
For those who don't that's 1 trillion parameters
yeah
idk, i know its free on meta.ai
How do you use it on whatsapp?
Yeah, but its it 405b, i have used it before when it was 70b
How does llama benchmark compared to 3.5 sonnet
yeah I would like to know also
I asked pplx with this:
interesting 🤔
Idk if there are such benchmarks, but surely it's worse at coding than Claude
model-eval-public by Scale
ok sonnet 3.5 still king in code
Yeah, i think so, but this is a huge leap for open source(weight)
sonnet is really crushing it
Deepseek? WizardLM2? Deepseek was better than 4o on benchmarks and WizardLM is the best at complex instruction following, which made it beat GPT 4 in many queries
For coding, I mean
Yeah, for coding, but as a base general purpose model
it is funny that sonnet is better than gpt4-o in math, but we wont use it in academic because claude is missing latex rendering, what is the point of better if we can not understand what the response is 😃
I hope to see a codellama 3.1
Was there codellama3?
I dont know, i think not
but hopefully someone will do a great code fine tune
or meta will idk
I'm hoping for WizardLM but with Llama3.1 base, maybe it will crush proprietary models further with its instruct
this is a bit funny, i wonder if any model gets it right.
Yeah, that would be amazing 🔥
No consensus, just do coin flip or check them all yourself
no
at least on aws theres a stop token issue
not sure if that carries over here
Was a joke fyi
well, the llama follows instructions from collection prompt quite closely. its "artistic programming" is not great, probably on par with turbo, omni and opus
Seems to be the same on fireworks, unless you mean something else
Whats the best ai ever
What’s up with Llama 3.1 405B just repeating the first answer when I ask a follow-up question?
Maybe it's not working well with Perplexity RAG? AWS and Fireworks have problems with 405b, so maybe PPLX too
I think its s instruct model
I don't know, maybe my prompt is bad? but sonnet (second image) gave me those symbols I wanted. llama seems to not like to output asian symbols, despite knowing them? at least some basic ones I tried
possibly related to the issue with stop tokens
since they’re using aws probably
maybe aws is using the wrong tokeniser for the model
is there something like this in perplexity? https://x.com/minchoi/status/1815812112796565690
like iterate the prompt with voice
Yes with the Android app. I often use the built in speech to text on Windows with the web app as well. Win key + H. I'm a terrible typist. ☹️
Imma be real... I still prefer 3.5 over 405b :X
405b is great, don't get me wrong, it's just not performing as well for me personally as 3.5.
claude 3 opus got it right for me
405b not working
sonnet 3.5 is the best model so far
the benchmarks are misleading for llama
its not that good tbh
i mean its good at simplifying concepts and whatsnot
and thats it
Yay, I'm not alone :D
many people noticed that
@sleek vortex is it down?
llama 405b not working on pplx
can you check please?
😖
Wait a few days before judging it.
Gpt4o and sonnet 3.5 sucked on day 1/2 for me and it got better.
hmm, but that looks like it has web search
You are right, here is GPT-4o no web access
is llama 405b down?
'
can someone check pls
Works here for me. I’m switching back to Sonnet for now due to speed.
are u brave
Less are you brave and more "are you rich enough to run it without waiting 5 minutes per response?"
this is crazy icl
Nah but fr bless yall, I got carried with that pro stuffI got over 14 accounts tryna use them trials makin it thru school 🤧
Hi, I am considering subscribe to Perplexity Pro. Can it generate images like ChatGPT-Plus does?
bros not gonna make it through life
yeah
not really it's primary purpose, but it can
Thanks, I am evaluating the Pro demo, it gives me this.
how can I instruct it to generate an image correctly?
Oh... sorry, I am a newbie to this. Thanks a lot, I will check it out.
😭 😂
Which one is bigger, 3.9 or 3.11?
3.11 is bigger than 3.9.
on hugging face
hello!
When you generate an image, do you first have to ask an AI a question and only afterwards ask it to illustrate the answer with an image? It's not possible to immediately ask the AI to generate a picture from a description? How do you prefer to do to generate images?
The strawberry one was interesting. My response from 3.5 was 2 R's, but when I asked it to look again it said there was a silent? R hidden in strawberry then corrected itself to 3 R's.
how often do you guys use wolfram alpha through perplexity?
Can we implement a system where a llm can run a calculator and execute code by itself to find answers to mathematical questions and then further on explain it?
Whenever I want an answer to a maths question I use perplexity
So all the time basically, I don't search for maths questions often but it's nice
Tried 405B a lot yesterday trough huggingface chat. It was pretty darn overloaded lol.
sorry, i don't know , it even not write python code to calculate it ~ 4o is best for 128k
and dall-e 3 is not work well again
reason : Failed due to moderation. Please try a different query
Yeah i tried it a bit as well, it was really nice
Gimme a question if you want me to ask it
Gpt-4o mini got it right every time for me on the first try for some reason while Gpt-4o had to be told to look again. LLM weirdness
If I reworded the prompt I'd likely get consistently better results, but that would defeat the purpose of judging responses from a problematic prompt.
From my understanding, it has to do with how the tokens are broken up for the LLM. In my practice, 4o and LLama 3.1 both can get the right answer with : how many r's are in the word strawberry? Write out your thought process step by step and only answer after the thought process.
3.5 sonnet though still says 2
Ultimately, many of the "problems" LLMs have with logic often have to do with the wording... not always, but changing the 9.11 - 9.99 thing to 9.11 minus 9.99 solved it across the board.
How many times does the letter R appear in the word "strawberry"?
Try that
Works across board :I
Weird, worked for me. Let me double check.
😞
Might have been a fluke on my part.
so, it really might be a problem with the tokenizer, as somebody noted yesterday. it looks to me like llama thinks it is returning those symbols
Was a fluke, it's popping up with 2 now.
Yeah, the thought process trick can help with simple logic questions. Also works for the 3.9 or 3.11 being larger question
tbh I don't know how "strawberry" kind tests are useful. what is the use case? you see Rs correctly from the LLMs
Its just testing the basic logic of the LLMs. not much use beyond that
the simple math looks to me much worse, that already bitten someone here on pplx. I think with contents of energy drinks
seems correct. yeah, other models yesterday didnt had this issue
Yeah, for math 4o tends to be best For math. Also, pro search should call Wolfram for math so its more accurate.
oh, I take that back. sonar (older llama) suffers from it as well
or it can use the programming step. both should work
yeah, I'm getting issues with the LLama based models giving answers in japenese. Probably just a limitation with the model. 4o workds, but also provides the english translations which wasn't asked for. Sonnet 3.5 seemed to do it best.
though sonar is based on llama 2 70B if I remember correctly and that maybe didn't really officially didn't support other languages? not sure
I think the l3.1 405B should know asian languages. it can output the symbols, but only rarely or maybe for specific promtps, not sure
I don't typically have it answer in Japnese. For this specific question at least it seems to have trouble.
special symbols are the problem. kanji, katakana and hiragana. but the model obviously knows the words, otherwise it couldnt respond in romaji
Yeah, that's an interesting limitation, even in the newest version. 4o and Sonnet seem just fine.
what
it knows them alright, just doesn't want to output them...
maybe something with system prompt of pplx? possibly the language selection?
weird it won't even write them if I ask for it in a paragraph. I wouldn't be able to say why. I can't imagine its prompt related if it otherwise can display the writing properly outside of this prompt.
How many times does the letter r appear in the word strawBERRY? This works for Sonnet... But only Sonnet
Weirdness
probably because of different tokenizer
Tokens be weird, yo
yeah 😄
even more fun
but without this tokens workaround, we couldn't have any LLMs. it would be computationally impossible task
When I have sonnet do an example, Llama can follow it with the kanji properly
yep, as soon as it sees anywhere any jpn symbol (maybe asian in general), it can continue
True true
it just doesn't know how to start writing symbols on its own, poor llama
I'm legit surprised it's having language issues for such a popular language, albeit tricky.
all asian languages, not just japanese. chinese is huge
Chinese very big, yis
Its just crazy because its clearly about to translate it, and use the language. It just won't display it in answers.
I went to meta.ai and tried it there
and it has the same issue
so its clearly an issue with the LLama model, not a system prompt on perplexity or anything.
even Gemini can do it lol
well, not necessarily the model. could also issue with many things around it, like the tokenizer, supporting code around (lib), template, possibly even default system prompt
What's weird is if I say "show me the character" it will work...
But "what is the character" causes a panic
I know people complained with several open chinese models that they occasionally randomly without asking switch from english to chinese, so possibly meta finetuned a bit too much, to counter this behaviour
I wonder if base model also does this
Possibly. Strange. Hopefully they fix it. At least Perplexity has other models available for the time being.
So using the API straight from open router with no system prompt or changes is has the same issue
yeah, it may not be too common use case. seems to be fixable by collection prompt You can use Japanese symbols for Japanese words, kanji and kana. For example: 貴方
Yeah it seems like if you use a symbol once it works.
Do you think it has to do with llama being hesitant to mix alphabets in a single response? Like it wants to use romanji to keep with the Latin alphabet.
I think it is mostly the prompt language tuning - when user doesn't write in japanese, don't use japanese symbols
but it is way stronger for asian languages, possibly because too different symbols? for example for czech it doesn't have this issue, it can switch freely and use extra non-latin characters https://www.perplexity.ai/search/write-short-magical-poem-in-cz-v7lRjXgxS5.tDX22.g4F2Q
Yeah, works for other languages right off the bat too, like Russian works fine.
but, russian uses very different alphabet, no?
It does, which is why I chose it.
Maybe Asian characters are considered more similar to emojis? Just a random guess
Be gentle, I have smol smooth brain
lol well it answers in emojis just fine though
No idea
you might be onto something. the japanese mini lesson (which doesnt have jpn symbols in my profile) works when asked for emojis 😮

this was an accident, I wanted to try if it would respond with that one "smiling" kana sometimes used as emoji. this one: ツ
lol its just trolling at this point
feels like it. but possibly combination of rare emoji and japanese mini lesson from ai profile with an emoji word in prompt pushed it far enough from that language/symbols post tuning
https://www.perplexity.ai/search/10-animals-in-kanji-only-respo-CdygXAlIQte9iJXH.fXGwg It worked for me 🤔
thats from search
oh yes, sorry 😅

though good to know at least in search it works properly. that's I would wager main use case on pplx
I wonder what model writes code for programming step
Yeah, probably due to the correct symbols in the search results lol Same as you providing an example to it in your other prompt.
I assumed it was their Sonar model
Or wahtever their pro search model is
hmm, now I am thinking about what if search results return japanese/chinese/korean symbols. won't sonar switch response to it? like, I would think there is a reason why meta did this post training
Well, I'm off to bed. Good conversation, good luck and good night lol
second method from this post
@tame current also check https://discord.com/channels/1047197230748151888/1194788138124587128
who the hek uses mobile apps for serious work
deep dive ➤ #💬│general message
Hi everyone, been using Perplexity Pro for some time now and can say it's the best AI platform out there 🙂
Since I've been playing around a lot with it, I have a question. Perplexity can utilize several AI models. Does the tool know which model is the "best" for a given task? Do my answers differ depending on the model I use? This is something I haven't figured out yet.
Hey @jovial cedar!
You can change the model in the settings -> #⚡│ask-community message
The answers will always differ depending on the model.
Here is a guide to help you choose the right model for your use case -> https://discord.com/channels/1047197230748151888/1240356137497530408
I tried your prompt "10 animals in kanji only. respond only with Japanese symbols" with the @south kindle tool in order to take stock of the various LLM results. See the conclusions here: https://app.wordware.ai/share/8c523d8b-c109-4189-a6ce-cc9bfc5d24a2/history/129be5a3-d4ae-4069-85f5-156052669490?share=true
(This prompt processes a question using Sonnet 3.5, Gemini 1.5 Pro, llama 3.1 70B&405B, GPT-4o/mini, Sonar Large (online model), Claude 3 Opus, Claude 3 Sonnet, and lastly Claude 3 Haiku. The app then employs Sonnet 3.5 to review and rank the responses. Upon completion, Claude Sonnet 3.5 initiates a verification process to thoroughly examine all responses, identify errors in its initial ranking, and generate a final, revised ranking of the model outputs, aiding in determining the optimal answer.)
It is the Llama model generating the answer, but there are Perplexity prompts on top, to help with the search part. Is my limited understanding. So through Perplexity, using Llama.
models dont know who they are accurately
if i tell llama that its actually made by apple it will just spit that out
or if you tell it its gpt5
so on
llama 405b has weird grammar on perplexity
missing period in the first two paragraphs
I laughed out loud. You are funny have a good day
okay great, thanks
nice. wait a minute, who is judging this?
- Sonar - Large: The prompt itself, not a valid response.
I see repeated prompt, but also 10 kanjis. did the judge overlooked it, or am I reading it wrong?
edit: I see, at the top, so sonnet 3.5? wouldn't expect a mistake from that one
Will pplx be able to read/watch instagram or tiktok videos in the future like with the youtube videos?
why not? they are ||might be|| the future of search engine
reels and tiktok might not have inherent value with repect to search
For Gen Z, TikTok is the place to turn to when you’re looking for answers
also
TikTok overtakes Google as most popular search engine among Gen Z.
But it is for youtube?
most videos on youtube are informative is what i've felt
tiktokt and reels on the other hand...
Exactly, there's so much data on forums (reddit) and youtube. I can't imagine how much data is on tiktok
Oh bro, tiktok is great for learning how to fix something in the house, finding watermelon at a grocery store, doctor advice. You shoudl check it out
doctor advice?
Yeah there's cardiologists, surgeons, etc.
On the platform that give advice about their field, health, and other things pertaining to it
I would but it's banned where I live, also how are you sure someone's not pretending to be a trained medical professional ?
Because They have their name and credentials in their bio. There's an account called Drterrysimpson. He's a surgeon and gives great advice
oh i see, where do you live, if yo dont mind me asking
India
yes, they might be good. but good intent cannot be expected out of all netizens
Well just like youtube and reddit you have to read between the lines.
The point is TikTok on pplx would be great.
perhaps, only time will tell
hey
Yes, it is indeed Sonnet 3.5 that analyzes, and it correctly states that for "Sonar - Large," it seems to be a prompt rather than a response. It is among the poor performers ranked from 6 to 10, with only a few performing worse in the given task.
i think mistral is worth it more than llama 405b tbh
it gives me better results, hopefully its added on pplx
the part sounds, like it responded only with the prompt. yeah, it continues, but seems a bit biased tbh. sonnet itself also included extra stuff, yet Also provides exactly 10 animals in kanji. does sonnet rank itself higher or something? does it rank worse the markdown list which he didn't include? edit: oh, it's that "in kanji only". seen so many variants of this I overlooked it, my bad 😅
This is based on my wordware template. It's in flux. Also I'm unable to update it currently, and 70B is down, so it stalls out on new searches/inputs.
https://app.wordware.ai/share/999cc252-5181-42b9-a6d3-060b4e9f858d/history/3c76952a-c352-4520-95a2-ccf1a7b2b056?share=true < This one is working though. Their embeds aren't "great" yet, but it's a search on Mistral Large 2, with a PPLX style article at the end.
Use the power of ScratchPad-Think for every day web searches. Export refined search queries in JSON format. The scratchpad is a powerful tool that helps you maintain coherence and accuracy, especially when dealing with long, complex prompts. Use it diligently to showcase your chain-of-thought reasoning abilities.
2 things. Highly recommend making an account on Wordware, and hitting the "try it yourself" button. This will copy the entire template to your account and you can tweak it, break it, or do whatever. it's basically a IDE for building LLM based apps, using complex logic and prompt chaining.
They give you plently of free credits to play around with all the tools/models/settings/templates.
Mistral large 2
i'm confused
what even is this benchmark
I think its this: https://github.com/nuprl/MultiPL-E
Benchmarks vary so heavily with AI... One test says Sonnet is the best at coding, another says 4o-mini is better in every category. Others show 405b winning across the board...
Yeah, its a mess
Honestly, I ignore benchmarks these days. I used to watch them but they are just so inconsistent that they're almost subjective, like a movie rating score, e.g., rotten tomatoes or IMDb.
I've loved movies that score terribly and hated movies that score well. I have to watch them myself to actually find out.
Sucks tho because it would make it easier to chose the best model for the task, if you need have it write java vs python vs bash.
Yeah, it's really unfortunate.
spaming this speed image because I feel it's relevant:
Wasn't Llama supposed to stand out? Seems kinda average like the other models
LLama 3.1, from what I understand, isn't unique in that its more powerful than other models. Its 405b version is meant to target a similar level as 4o and Sonnet 3.5. What is meant to make it special is that its open source, and it can be downloaded and used locally by anyone with the hardware to run it, unlike Sonnet or 4o which are currently only available thorugh the cloud via Open AI, Microsoft, and Anthropic with no option to run locally, even if you have access to the needed hardware.
Llama 8b and 70b really stand out imo, but 405b is impressive because it is close to GPT4o while being open source
I dont like this: https://www.perplexity.ai/search/google-is-the-only-search-engi-awmBq7utRw6FpNqjhY3szA
Yeah that doesn’t bode well for Reddit search mode
Unless perplexity can pony up the money
Dose Reddit search mode use api or via general search?
Why is grok by XAI not in hugging face chat?
It shouldn't be size issue because hugging chat hosts llama3 400b model while grok-1 is 314b model. Is it some licensing issue?
Hmmm... GPT4 Turbo never gets it wrong, i tried 10 times, but now GPT4o is geting it right too, maybe it was a fluke when i tried it before
Perplexity now uses gpt4o mini btw
For what?
What makes you say that, what did you see?
This is news to me about mini
And regular o I assume?
damn, what do you think perplexity is going to do?
https://www.perplexity.ai/page/mistral-unveils-large-2-00GRlebXQQiufg1mtooQxg
llama3.1 is too slow, and can not calcute 3.11 and 3.9. i perfer to Mistral Large 2.
Supporting dozens of languages, including English, French, German, Spanish, Italian, Chinese, Japanese, Korean, Portuguese, Dutch, Polish, Arabic, and Hindi, the model demonstrates impressive multilingual capabilities
Breadth of language support: Mistral Large 2 appears to support a wider range of languages, with "dozens" mentioned, while Llama 3.1 405B has added 8 new languages to its previous capabilities
###################################
Mistral Large 2 has a 128k context window and supports dozens of languages including French, German, Spanish, Italian, Portuguese, Arabic, Hindi, Russian, Chinese, Japanese, and Korean, along with 80+ coding languages including Python, Java, C, C++, JavaScript, and Bash.
vs
Llama 3.1 models are conversant in additional languages including Spanish, Portuguese, Italian, German and Thai.
Meh
Can someone explain if i get a paid subscription to perplexity pro, will i have full access to every of its ai models like if i had individual paid subscriptions to each of them ?
Yes
yeah, the direction of llama is always not try to beat claude or gpt, it focuses on achieve similar performance like claude or gpt, thus remain opensource and can run locally, which then can be fined tuned with custom data, similar to dolphin. llama 405 is said to beat gpt, but I find its response is pretty much very slow. So for the sake of daily use, I prefer only gpt4o and sonnet
Is there any information about the context size of the sonnet 3.5 used in Perplexity? In Poe there are two version, one is 8K, the other is 200K.
perplexity only have 32k context size window (only when processing pdf). For normal search, it is lower, but we dont know how much (they r not very transparent about this)
hello
Llama 4 multimodal coming around December boys!
Man i really enjoy using perplexity lyrics
Thank you for the information.
omg llama 405gb actually follows instructions from your profile, never seen this before on perplexity
sonnet 3.5 also follows instructions
Correct! Sonnet adheres remarkably well to the instructions provided in the system prompt, and it can be significantly better fine-tuned than GPT-4o.
i feel like all LLMs will eventually diverge at the same level of intelligence....
i meant converge
lol
Hey. Is there a direct way to copy and format the conversation into Notion.? and have it structured in a given way.?
Looking to improve the way information is taken out into other text writing apps
No just mini for the free search
The free version
Tokenization is openAI-like now and the model itself admits to being openai-made. Previously, they were using claude 3 haiku only (free) and I have since been testing every couple few days since the drop of 4o mini to see if the model changes. It used to admit being made by anthropic. Now it's constantly admitting being made by openai
Sure but that's something else. If for a long period after confirmed implementation of haiku, the model states CONSISTENTLY that's its made by anthropic, and a few days after the release of gpt4o mini it starts saying it's by openai
Isn't that a significant indication that there was some change?
considering that it makes total sense for them to move to gpt4o mini due to cost
And that the confirmed model claude 3 haiku was consistently choosing to spit the tokens corresponding to anthropic instead of openai
Good morning! I just got Perplexity Pro. I can only seem to upload 4 documents per thread. But I read that I was supposed to upload more than 4 documents.
Hi all, I just published my first page, and shared a link. But I wanted to see how many views or clics I had. I don't find how to do it. Does a sort of analytics exist ?
Ah, but not for Pro I assume
it says enterprise sub but it doesnt give pro or other models, is that normal?
Hm, so I assume getting Pro doesn't actually get a person more than 4 file uploads. That's sad. I'll cancel and come back when I can reference more than 4 files. 😄
it's probably the default one for pro if you dont choose a better model
yeah, it felt o mini to me #💬│general message . though since pplx can use different (default) model depending on user, it makes it a bit harder
not normal, try clear browser cache + log out and log back in
if not then contact support
also check youre on the correct account
same issue different browser (tried in edge)
its the account i clicked the invite link on
hmm
check with your account manager once that theyve paid for the multiple subs
if they have then id contact pplx support
since this isnt intended behaviour
ah maybe its not fully setup yet then
possibly, yeah
alright ty, i'll wait for it to get setup
i'm so stoked to use unlimited pro searches on claude 3.5 lol
600/24 hours* (at 32k context only)
but close enough!
well 600 is all marketing its basically unlimited lol
thats still 25 searches an hour, or one search every 2 minutes
never seen anyone actually hit 600
at most i hit like 200
but average like 25-50 honestly
you get access to the model through Perplexity. You could for instance go to Openai and use your Perplexity login.
@waldoh Just clarifiying for him. I think some people seem to think that your perplexity sub will allow you to use the models directly from the models website.
Reddit now blocking all searches from AI and search engines? 😦
Where did it say you can upload more than 4?
yeah, my "best" was remaining 100 uses. I was using pplx whole day for programming and research. I believe it was the project where ai was writing all the code and I wasn't allowed to manually edit anything. it was not an effective use of my time 😅
Google bought rights to have exclusive search access to Reddit
Maybe Reddit will be replaced one day, Reddit is useful but not worth switching back to Google imo
is llama better at reasoning and claude 3.5 sonnet better at writing human-like?
this is starting to get a bit convoluted 😄
not sure about llama and reasoning (🦙 didn't look better at programming), but sonnet is great at writing (handles Czech very well, and I would expect it to know more languages than llama). but on pplx sonnet had (has?) low temp, so for writing is a bit dumpbed down here
I initially read that announcement for database maintenance as a 10-minute warning for a two-day-long maintenance window! I’m glad I read that a second time.
I wonder how much money I've cost Perplexity in API fees
So far my favorite thing about Llama 405b is that it never says the same thing after a rewrite, it always seems to tackel the problem in a new way
I wish Claude was like that... https://discord.com/channels/1047197230748151888/1253515579743015073
https://openai.com/index/searchgpt-prototype/
damn looks cool
most of their models arent paid for on api anyway so dont worry!
this is literally what im building...bruh
man
i mean this was teased a few months ago
we thought it was going to be a gpt-5 or smth and then gpt-4o came out which wasnt that
lol it's over for perplexity
o7 I enjoyed my stay
its so over for you
u have speed pfp

