oak sequoia Oct 13, 2023, 3:04 AM

#

Hi folks. I'm trying out pplx-api for code generation, and I currently have a GPT-4 AI API flow in place that takes human requirements and converts to code blocks. I'm using "few shot learning" with stop sequences in the system_prompt to ensure the response is just code. However I'm unsure how to indicate stop sequences in pplx-api. I'm currently using the Completions API. Am I using the right thing, and if so, how do I indicate those stop sequences?

Edit: I think it's not currently supported. Passed a stop array in the POST body and got message back "message": "custom stop words are not implemented for completions."

So, wondering if this few shot learning system prompt is even possible with ppxl-api right now?

green oxide Oct 14, 2023, 7:27 PM

#

Yeah, a little more documentation would be great as I am very excited to get this working properly.

hardy marsh Oct 17, 2023, 1:32 AM

#

#1163650321210417242

hexed flax Oct 18, 2023, 8:49 PM

#

Hi all! Is there a commercial plan for the pplx API ? I run a tech company w a B2B SaaS app that I'd be interested in leveraging the pplx API for. Not sure if you have gotten that far in planning on this

cedar blade Oct 19, 2023, 11:37 PM

#

please send us a note to api@perplexity.ai

small pike Oct 20, 2023, 1:22 AM

#

cedar blade please send us a note to api@perplexity.ai

Any plans to add Zephyr to the list of models? Trying to see if I can use the API to build my own RAG app, and Zephyr seems to outperform mistral pretty well according to Llama Index's benchmarks

#

Also, these models are the exact same models from huggingface just hosted on your infra? or have they been finetuned in some way

#

Thank you so much for the free API access btw! Truly appreciate you all 🙂

gentle spade Oct 21, 2023, 6:41 PM

#

Will the api support the same search features that is available on the web. Such as asking about todays news head lines ?

covert ferry Oct 21, 2023, 6:53 PM

#

gentle spade Will the api support the same search features that is available on the web. Such...

There will be an LLM-RAG API at the end of November. https://docs.perplexity.ai/docs/feature-roadmap

pplx-api

Feature Roadmap

pplx-api's roadmap is as follows:Early November Embeddings API supporting Sentence-BERTCode Llama 16k context length End of November LLM-RAG API November+ Mistral 32k context length

hexed tendon Oct 25, 2023, 4:45 PM

#

covert ferry There will be an LLM-RAG API at the end of November. https://docs.perplexity.ai/...

This is awesome. Having an enterprise offering for RAG with the data they've collected on real world use cases is a killer move.

#

not to mention a model specifically tuned for that use case. (also, pre-congrats on the funding round)

small pike Oct 26, 2023, 4:59 AM

#

covert ferry There will be an LLM-RAG API at the end of November. https://docs.perplexity.ai/...

Does the API support async streaming? How would I do that in python? (sorry - noob here, would appreciate any help 🙂 )

grave yoke Oct 26, 2023, 2:51 PM

#

I would love to see a pplx API that fetches content from collections and threads. This would enable Perplexity to have deep integrations with other services without the Perplexity team having to work on such integrations

heady steppe Oct 27, 2023, 8:40 AM

#

Hello everyone. I am currently working with the API and I am having difficulty understanding a constraint that has been applied to the chat structure. It appears that the structure cannot be as follows: 'System: You are a helpful assistant; Assistant: Hi, I am a helpful assistant. User: [...]'. This is because the first message following a system message cannot be an assistant message. Honestly, I do not understand why this is the case. this is the response: {
"error": {
"message": "After the (optional) system message(s), user and assistant roles should be alternating. Expected role 'user' for message at index 1, but got 'assistant'.",
"type": "invalid_message",
"code": 400
}
}
(i know that this is not a bug, but i was wondering the motivations for that choice)

junior shuttle Oct 27, 2023, 9:30 PM

#

When using the API-KEY in applications, should one include "pplx-" before the string of digits? Thanks. Also, which is the best channel to get example code that is standalone?

junior shuttle Oct 28, 2023, 8:53 PM

#

Hi, I am trying out the replit-code-v1.5-3b model. What is the proper URL endpoint? It is "https://api.perplexity.ai/chat/completions" like for the other models? Thanks.

covert ferry Oct 28, 2023, 9:01 PM

#

junior shuttle Hi, I am trying out the `replit-code-v1.5-3b` model. What is the proper URL endp...

You can find all the information in the docs: https://docs.perplexity.ai/reference/post_text_completions

pplx-api

Text Completions

Generates text to complete a non-conversational prompt.

opal latch Nov 7, 2023, 4:23 PM

#

Text Generator plugin for Obsidian just released an update where you can use any custom API. They have a following structure of their JSON header and body.
Is it possible to use PPLX API model for text completions?

I tried documentation, but I failed shortly 😦

granite kernel Nov 13, 2023, 5:54 PM

#

https://www.together.ai/blog/together-inference-engine-v1

Announcing Together Inference Engine – the fastest inference availa...

The Together Inference Engine is multiple times faster than any other inference service, with 117 tokens per second on Llama-2-70B-Chat and 171 tokens per second on Llama-2-13B-Chat

mighty field Nov 13, 2023, 6:28 PM

#

opal latch Text Generator plugin for Obsidian just released an update where you can use any...

Hi there! If the obsidian plugin is openAI message completion compatible, it should work, since our requests are made with the openAI message format.

scenic skiff Nov 14, 2023, 5:55 AM

#

two questions:

is there an ETA on extended support for TextCompletion?
when pricing goes into effect, will pro users retain access of some kind? Or will they have to pay for tokens on top of the existing subscription? Or will there be a set number of monthly credits included in the subscription?

regal pumice Nov 14, 2023, 4:21 PM

#

hi all, does anybody know if its possible to connect sillytavern to perplexity via api instead of openai or claude directly?

scenic skiff Nov 14, 2023, 4:32 PM

#

No, because pplx-api does not serve a /models endpoint. Thus it isn't actually OpenAI compatible.

#

You could always make a wrapper around it that serves a bogus models endpoint. Then it should work.

scenic skiff Nov 15, 2023, 2:23 AM

#

scenic skiff two questions: 1. is there an ETA on extended support for TextCompletion? 2. whe...

I'd still be interested in answers to these... Thanks...

heady steppe Nov 16, 2023, 9:45 AM

#

scenic skiff two questions: 1. is there an ETA on extended support for TextCompletion? 2. whe...

I'm interested too

covert ferry Nov 16, 2023, 10:04 AM

#

scenic skiff two questions: 1. is there an ETA on extended support for TextCompletion? 2. whe...

You can find approximate dates here: https://docs.perplexity.ai/docs/feature-roadmap
You are welcome to suggest missing features/models in the forum.
Regarding the use of the pplx API for Pro subscribers: Pro subscription will likely grant free credits.

pplx-api

Feature Roadmap

pplx-api's roadmap is as follows:Early November Embeddings API supporting Sentence-BERT Stop words and request time limits as parameters Text completion API for Mistral 7B Code Llama 16k context length End of November Online LLM November+ Mistral 32k context length

mighty field Nov 16, 2023, 6:27 PM

#

scenic skiff I'd still be interested in answers to these... *Thanks...*

Waiting on the response from the API team 🙂 not ignoring you.

scenic skiff Nov 16, 2023, 6:34 PM

#

It's all good.

#

Patience is my middle name

cedar blade Nov 17, 2023, 12:12 AM

#

scenic skiff two questions: 1. is there an ETA on extended support for TextCompletion? 2. whe...

we are not working on text completions at the moment. what if your use case.
pro user will have certain monthly credits for api. after that they will need to pay per tokens.

scenic skiff Nov 17, 2023, 12:26 AM

#

It's okay, I can wait. I'll build a shim around chatcompletions in the meantime. though my prompt format works better without.

boreal fossil Nov 17, 2023, 12:52 AM

#

Hi hello sorry to interfere in you guys convo, this may be sudden. I need to know whether I need to utilise pplx API or not? Because I saw lot of people in Reddit said they can build chatbot and everything.

Since I don't have any knowledge about AI or programming, is it worth to try to learn more about programming and using the API? If so how do I start? (I'm just a normal accountant trying to learn something new)

hardy marsh Nov 17, 2023, 1:34 AM

#

Just Perplex it

boreal fossil Nov 17, 2023, 2:02 AM

#

Yes I perplex it but can't understand a single thing even after asked to simplify it 😭 I became stupid when something new like this. Sometimes I'm thinking back about my life why I exist 😂

#

but it's all right I will find my way. take time to understand this type of things

grave yoke Nov 19, 2023, 2:58 AM

#

Love to see pplx online in labs. Any ETA on online via API?

hardy marsh Nov 19, 2023, 3:29 AM

#

grave yoke Love to see pplx online in labs. Any ETA on online via API?

heady steppe Nov 20, 2023, 9:47 PM

#

hardy marsh

Also, there is an eta for sentence-BERT?

thorny musk Nov 21, 2023, 5:48 AM

#

Any idea if pplx-online will return the URLs too?

stone trellis Nov 21, 2023, 7:26 AM

#

Hello, I want to be able to pull data from 200 pdfs and from the internet, both grounded. Will that be available soon on the API? Cant wait to have it available. Can I send over my use case to see if there's a change of an early alpha acesss?

fast rover Nov 21, 2023, 12:50 PM

#

stone trellis Hello, I want to be able to pull data from 200 pdfs and from the internet, both ...

Have you looked into langchain

#

From the pdf size it's one of hsexases

past island Nov 21, 2023, 1:53 PM

#

heady steppe Also, there is an eta for sentence-BERT?

There was a change in our roadmap and this has been punted. We have updated our roadmap to reflect the changes. Thanks for your patience as we work through this!

stone trellis Nov 21, 2023, 4:08 PM

#

thanks for letting us know @past island . I guess this is the latest. If I could suggest something
(1) if any of the planned tasks are dealyed, they should be replanned in the roadmap
(2) if you could ask someone to write better description for the functionalities, it would also be super great.

#

I am really trying to push PPX for my business clients and want this to be a fruitfull parnership for all involved. Having this communication more clear will help me interact better with my client's expectations.

heady steppe Nov 21, 2023, 10:36 PM

#

past island There was a change in our roadmap and this has been punted. We have updated our ...

Ok, thanks for the answer.

remote gulch Nov 24, 2023, 3:03 PM

#

is the perplexityai python module teh offical way to interact with perplexity API? I am asking s I have knocked up my bare bones conversation using the examples from perplexity documentation utilising requests module. I am just about to mobve on to trying to code the threads part and was checking im not going down a wrong path.

pale silo Nov 25, 2023, 1:43 PM

#

which model is gpt4?

nocturne thunder Nov 25, 2023, 2:02 PM

#

pale silo which model is gpt4?

the API doesnt have gpt4

pale silo Nov 25, 2023, 2:06 PM

#

nocturne thunder the API doesnt have gpt4

why?\

thorny musk Nov 25, 2023, 6:42 PM

#

pale silo why?\

Because OpenAI already provides it

stone trellis Nov 26, 2023, 6:48 PM

#

Hi, I wanted to check in on the OnlineLLM feature. Regards.

covert ferry Nov 26, 2023, 6:59 PM

#

stone trellis Hi, I wanted to check in on the OnlineLLM feature. Regards.

Hey, more information will be published soon 😉

scenic skiff Nov 27, 2023, 12:21 AM

#

Wait, why is openhermes gone????

past island Nov 27, 2023, 6:18 PM

#

scenic skiff Wait, why is openhermes gone????

Hey, @scenic skiff! Here's a response from the API team. https://docs.perplexity.ai/changelog/models-removed-replit-code-v15-3b-and-openhermes-2-mistral-7b

pplx-api

Models removed: replit-code-v1.5-3b and openhermes-2-mistral-7b

We have removed support for replit-code-v1.5-3b and openhermes-2-mistral-7b. There are no immediate plans to add these models back. If you were a user who enjoyed openhermes-2-mistral-7b, try instead using our in-house models, pplx-7b-chat-alpha and pplx-70b-chat-alpha!

dense lance Nov 28, 2023, 1:51 AM

#

How are pplx-7b-chat-alpha and pplx-70b-chat-alpha different from llama? I can't find any information about it

lunar crown Nov 28, 2023, 12:19 PM

#

dense lance How are pplx-7b-chat-alpha and pplx-70b-chat-alpha different from llama? I ca...

Built on top of llama2 and fine-tuned for chat
https://twitter.com/perplexity_ai/status/1717953875678794158

blissful breach Nov 28, 2023, 5:14 PM

#

anyone as an end user find rag actually useful day to day? I see a lot of youtube hype and before i get sucked in - if anyone who's used a working rag system give their thoughts on its personal impact day to day? like "wow cant really imagine working without this anymore" or is it like "eh this is cool but i barely use it or on occasion its fine but i still google / chat gpt / use perplexity as its good enough or the search in the app like slack search is good enough"?

wary cedar Nov 28, 2023, 6:22 PM

#

.oO( what on earth is rag? )

hexed flax Nov 28, 2023, 6:28 PM

#

blissful breach anyone as an end user find rag actually useful day to day? I see a lot of youtub...

Yes here comes the most annoying answer of all time…. Depends on your use case and need to control the output. So far my experience is the more specific the domain and goal you have as the use case, the more building your own rag makes sense. More general needs I think it would be hard to match a perplexity

dense lance Nov 28, 2023, 7:48 PM

#

lunar crown Built on top of llama2 and fine-tuned for chat https://twitter.com/perplexity_a...

Thank you. So how is is pplx-70b-chat different from llama-2-70b-chat? Is pplx-70b-chat just more fine tuned for chat?

lunar crown Nov 28, 2023, 7:55 PM

#

dense lance Thank you. So how is is pplx-70b-chat different from llama-2-70b-chat? Is ppl...

Should be...
Unfortunately couldn't find more info. You could probably reach out to the team for more info via ticket or email, if you are planning to use the API more thoroughly.

If you have a few go to queries, you can also check out the difference in Labs to get a practical idea of how the responses are going to look like, in different scenarios.

dire marlin Nov 28, 2023, 8:36 PM

#

dense lance Thank you. So how is is pplx-70b-chat different from llama-2-70b-chat? Is ppl...

If you scroll down a bit in the tweet they explain with charts:

Our models prioritize intelligence, usefulness, and versatility on an array of tasks, without imposing moral judgments or limitations. Comparing pplx-70b-chat and llama-2-70b-chat (by @AIatMeta ), human evaluators saw: 22.7% fewer “full denials” and 31.9% more “no denials”

#

Those are the improvements made over llama2 base model

lunar crown Nov 28, 2023, 8:40 PM

#

dire marlin If you scroll down a bit in the tweet they explain with charts: Our models prio...

Damn
I think without an account, Twitter don't show subsequent replies
Missed out on this completely

Thank you, for me this is good to know

dense lance Nov 28, 2023, 9:22 PM

#

Thanks guys, I appreciate it.

blissful breach Nov 29, 2023, 12:07 AM

#

hexed flax Yes here comes the most annoying answer of all time…. Depends on your use case a...

ah - this is helpful - i realized i was trying to actually ask what use cases is it relevant for** more than search - so what has it been useful for?

I am finding for code/programming (specifically python libs) - to just pull docs and index them myself with just search would go along way (not talking about RAG) - but what has been RAG good for?

Anyone try to index a package to generate documentation for it / or really quick search?

i should just do this at some point and get the data myself if its useful - BUT being a lazybones - curious if anyone has any good personal cases where for this use-case its great.

similar to cmd+k in tailwind/who-ever algolia is sponsoring for documentation can be done without rag - with speed

#

because chatgpt just rambles like 90% of the time now -

blissful breach Nov 29, 2023, 12:09 AM

#

wary cedar .oO( what on earth is rag? )

if this link worked - id have a better idea on what its good for - but - may still be insightful

https://www.promptingguide.ai/techniques/rag

Retrieval Augmented Generation (RAG)

Retrieval Augmented Generation (RAG) – Nextra

A Comprehensive Overview of Prompt Engineering

wary cedar Nov 29, 2023, 11:10 AM

#

is there an explanation for dummies somewhere what the difference is between the models / what I should use them for?
https://docs.perplexity.ai/docs/model-cards

and what would be the closest to the model perplexity I select on the web?
so that i can test out my prompts there and then create a curl for it?

pplx-api

Supported Models

Where possible, we try to match the Hugging Face implementation. We are open to adjusting the API, so please reach out with feedback regarding these details. ModelContext LengthModel Typecodellama-34b-instruct16384Chat Completionllama-2-70b-chat4096Chat Completionmistral-7b-instruct4096 [1]Chat Comp...

#

plus a basic explanation of what makes a request expensive, to get an idea how what makes it cost more / less?

covert ferry Nov 29, 2023, 11:22 AM

#

wary cedar is there an explanation for dummies somewhere what the difference is between the...

Many are open source models, comparisons or benchmarks can be found on the internet. (e.g. https://mistral.ai/news/announcing-mistral-7b/)
The closest to Perplexity are the fine-tuned online models.
The costs are made up of the length of the input (system message, user message) and the output, the asssistant message.

dire marlin Nov 29, 2023, 1:20 PM

#

Any chance of having the latest zephyr model available via the API?

gloomy stag Nov 29, 2023, 5:32 PM

#

hi, as pplx api is now out of beta I would like to ask - is it possible to make a discord bot that can use this API key and work on my server? And if yes, can someone point me to how it can be done? I have almost 0 knowledge of coding, so would love some help with it

blissful breach Nov 29, 2023, 6:30 PM

#

covert ferry Many are open source models, comparisons or benchmarks can be found on the inter...

the benchmarks are kinda bs in my view - its good for researchers to circle jerk about - but as an end user and knowing how game-able metrics can be - i have resorted to personal experience and asking around as a better signal than benchmarks published by the very same people who are writing the paper have a bit of a incentive to .. you know make it look good

stone trellis Nov 29, 2023, 7:17 PM

#

Hello all

#

in the newly released perplexity-online-70b, should I be seeing the sources of the information that is returned from the API? Am I missing something?

#

#

thorny musk Nov 29, 2023, 7:59 PM

#

stone trellis in the newly released perplexity-online-70b, should I be seeing the sources of t...

No, pplx online llm is not the same one as perplexityai model

stone trellis Nov 29, 2023, 8:06 PM

#

https://docs.perplexity.ai/discuss/6567996ada62730079f8b5fb

pplx-api

{  
    "id": "514b1273-b19e-4ff7-badc-1894957717f7",  
    "model": "pplx-70b-online",  
    "created": 7352736,  
    "usage": {  
        "prompt_tokens": 4126,  
        "completion_tokens": 431,  
        "total_tokens": 4557  
    },  
    "object": "chat.completion",  
    "choices": \[  
        {  
            "index": 0,  
        ...

restive rain Nov 30, 2023, 4:46 AM

#

Is it possible to add pplx API to typingmind.com?

hardy marsh Dec 1, 2023, 3:47 PM

#

@past island Docs probably needs a refresh

past island Dec 1, 2023, 3:48 PM

#

hardy marsh <@830126989687914527> Docs probably needs a refresh

You are right! Thanks, @hardy marsh

hardy marsh Dec 1, 2023, 3:48 PM

#

or make 'em opensource, people here would contribute

#

pepeyes

unborn folio Dec 4, 2023, 4:11 PM

#

Please deepseekcoder 34b instead of codellama 34b it's comparable to gpt4 and I'm paying like crazy for an api

blissful breach Dec 5, 2023, 7:31 PM

#

gpt 4 0314 is still the best one for coding - recent one is borderline un-usable

#

is deepseekcoder is gpt4-current(11 or 6 releases)? @unborn folio

unborn folio Dec 5, 2023, 9:24 PM

#

Yea but margin is so slight that it's hard to tell exactly how much. I have done my test cases and 34b instruct coding version if task is set on point there is almost no difference. Only diffirenciating factor can be reasoning but compiling the fact that 34b instruct is almost pure code version . Deepseek 34b eats up other open source llms easily. Just try 6.7b version in comparison to any open source /gpt3. 5 and you will see that it is damn good coding model. https://deepseekcoder.github.io/ try and see i would pay for API calls to this one like crazy

#

Today I will try to implement the chat into task weaver and deploy on runpod

blissful breach Dec 6, 2023, 2:04 PM

#

That’s great to hear - I hate using any LLM so far for code and I’m glad there is another one for me to try as I’ve found chat GPT and Claude to be 💩as it’s now taking me longer to just write the code myself (GPT 4 0314 from March being an exception) - do share how it goes and your personal evaluation on day to day usage and it’s impact

#

But basically realized at some point will need to run a model myself at some point when I have the extra resources to do so - cannot depend on OpenAI/Anthropic and most model providers speed running enshittification

abstract sentinel Dec 7, 2023, 10:35 PM

#

Hi all, really enjoying using perplexity. I have a use case, which I'm not quite able to get working well. I have a list of trade names for drugs. What I'm trying to do with perplexity is check whether each of those is an antibiotic or not. For some reason this is not working quite well. I would have thought that given the search results + llm, it would easily answer this. But sometimes, it's just wrong, for example see this: https://www.perplexity.ai/search/Given-a-drug-7hqCPc7ZSqevzMuZUYGosg?s=c
(I was just trying it out on the UI first before I go ahead and use the API and use it on all the 5000 drug list I have, just want to justify the cost before doing so)
Does anyone have any thoughts that can help with this?

left lark Dec 8, 2023, 1:40 AM

#

How do you prompt PPLX-70B to show citations?

dense lance Dec 8, 2023, 2:07 AM

#

Are there plans to increase the context window for pplx-chat-7b beyond 8k?

dense lance Dec 8, 2023, 2:23 AM

#

I know it's not in the public roadmap. Just wondering if there are plans later?

unique path Dec 8, 2023, 2:40 AM

#

left lark How do you prompt PPLX-70B to show citations?

would love to know

outer heath Dec 8, 2023, 3:34 AM

#

#

pplx-70b-online seems response too slow ?

#

it response , but too slow.

past island Dec 8, 2023, 8:11 AM

#

left lark How do you prompt PPLX-70B to show citations?

Hey, @Paul! Currently, it's not available.

vocal merlin Dec 8, 2023, 8:24 AM

#

abstract sentinel Hi all, really enjoying using perplexity. I have a use case, which I'm not quite...

seems like you have restricted it too much. it did not use the web search in your case.

abstract sentinel Dec 8, 2023, 8:26 AM

#

Is there a way to make sure it uses web search?

vocal merlin Dec 8, 2023, 8:28 AM

#

abstract sentinel Is there a way to make sure it uses web search?

vocal merlin Dec 8, 2023, 8:29 AM

#

abstract sentinel Is there a way to make sure it uses web search?

try out this Perplexity AI: Answer only in yes/no and no other text.

DIAMICRON 60MG MR TABLET. is it antibiotic? https://www.perplexity.ai/search/Answer-only-in-Uqo9Z.LqQmG3YS2PGpjorA?s=mn

Perplexity AI

Answer only in yes/no and no other text. DIAMICRON 60MG MR TABLET...

No.

#

will weight fine tuning be available to api users?

#

or at least latent space (prefix) prompt fintuning?

abstract sentinel Dec 8, 2023, 8:57 AM

#

vocal merlin try out this Perplexity AI: Answer only in yes/no and no other text. DIAMICRON...

Thanks, will try

vocal merlin Dec 8, 2023, 8:58 AM

#

abstract sentinel Thanks, will try

also try to turn on copilot

abstract sentinel Dec 8, 2023, 8:59 AM

#

I actually want to do this en masse using the API

vocal merlin Dec 8, 2023, 9:09 AM

#

abstract sentinel I actually want to do this en masse using the API

hmm an alternative way would be: use google search api in conjunction with pplx api

molten silo Dec 10, 2023, 12:40 PM

#

Is api TOS same as for pplx itself?

restive rain Dec 11, 2023, 3:03 PM

#

hi whats the pplx endpoint?
https://docs.perplexity.ai/reference/post_text_completions ==> page not found

#

i was trying to add this to typingmind, how should i write the custom header?

covert ferry Dec 11, 2023, 3:38 PM

#

restive rain i was trying to add this to typingmind, how should i write the custom header?

Hey @restive rain!
You can find the endpoint and header here: https://docs.perplexity.ai/reference/post_chat_completions

pplx-api

Chat Completions

Generates a model's response for the given chat conversation.

spring hamlet Dec 12, 2023, 1:26 PM

#

Can other models also search internet or only pplx-7b-online and pplx-70b-online can?

covert ferry Dec 12, 2023, 1:31 PM

#

spring hamlet Can other models also search internet or only pplx-7b-online and pplx-70b-online...

No, only the online models have Internet access

deep kindle Dec 12, 2023, 5:44 PM

#

ETA on mixtral-8x7b-instruct support in the API? It was added to the supported models list but it’s not working for me (server error 500)

thorny musk Dec 12, 2023, 6:46 PM

#

deep kindle ETA on mixtral-8x7b-instruct support in the API? It was added to the supported m...

Works for me now

jagged solstice Dec 12, 2023, 8:37 PM

#

mixtral-8x7b-instruct equalling llama-2-70b on benchmarks while being 80% cheaper is amazing

vocal merlin Dec 12, 2023, 9:11 PM

#

any plan on finetuning mixtral and/or online version of it?

deep kindle Dec 12, 2023, 9:19 PM

#

thorny musk Works for me now

Still seeing server error 500 on my end, switching to another model fixed it. Maybe I found a bug? I can provide full error details here if that’s fine

thorny musk Dec 12, 2023, 9:22 PM

#

deep kindle Still seeing server error 500 on my end, switching to another model fixed it. Ma...

Sure, you might've got the model name wrong

deep kindle Dec 13, 2023, 4:42 PM

#

thorny musk Sure, you might've got the model name wrong

I confirmed my model name is correct (mixtral-8x7b-instruct). An example of a wrong model name error is:
Error code: 400 - {'error': {'message': "Invalid model 'mixtral-instruct'. Permitted models can be found in the documentation at https://docs.perplexity.ai/docs/model-cards.", 'type': 'invalid_model', 'code': 400}}

The error I'm getting is:
Error code: 500 - {'error': {'message': 'The inference server returned an error.', 'type': 'unknown', 'code': 500}}

Note that I'm using the latest version of openai-python (https://github.com/openai/openai-python) with base_url set to perplexity's endpoint:
openai_client = openai.AsyncOpenAI(base_url="https://api.perplexity.ai/")

The error only happens with mixtral-8x7b-instruct. It works fine when I use another model e.g. pplx-7b-online.

EDIT: I figured out the issue. It seems mixtral-8x7b-instruct doesn't accept the following message format:
{"role": "user", "content": [{"type": "text", "text": "Hello!"}]}
This format is used in vision models like openai's gpt-4-vision-preview to accept both text and images in the message object. Obviously mixtral-8x7b-instruct isn't a vision model but it should still accept this format for cross-compatibility. Like I said other perplexity models like pplx-7b-online work fine with this format.

Vs the traditional message format:
{"role": "user", "content": "Hello!"}

vocal merlin Dec 13, 2023, 4:55 PM

#

deep kindle I confirmed my model name is correct (*mixtral-8x7b-instruct*). An example of a ...

can you try out the example here: https://docs.perplexity.ai/reference/post_chat_completions ?
and does your payload follow the rules?

i got no problem with the example usage

pplx-api

Chat Completions

Generates a model's response for the given chat conversation.

deep kindle Dec 13, 2023, 5:03 PM

#

vocal merlin can you try out the example here: https://docs.perplexity.ai/reference/post_chat...

Yes it works with the example code.

The app I'm building allows you to select different models to use, both vision and non-vision. With openai's API I'm able to use this message format with ALL of their models (vision and non-vision) which makes the code a lot cleaner:
{"role": "user", "content": [{"type": "text", "text": "Hello!"}]}

Reference: https://platform.openai.com/docs/guides/vision/quick-start

#

Bottom line is that there's inconsistent behavior here.

Here's the example python code from perplexity docs, but modified to use the message format I referenced above:

import requests

url = "https://api.perplexity.ai/chat/completions"

payload = {
    "model": "pplx-7b-online",
    "messages": [
        {
            "role": "system",
            "content": [{"type": "text", "text": "Be precise and concise."}]
        },
        {
            "role": "user",
            "content": [{"type": "text", "text": "How many stars are there in our galaxy?"}]
        }
    ]
}
headers = {
    "accept": "application/json",
    "content-type": "application/json",
}

response = requests.post(url, json=payload, headers=headers)

print(response.text)

Run it with "model": "pplx-7b-online" and it works. But change to "model": "mixtral-8x7b-instruct" and it errors.

vocal merlin Dec 13, 2023, 5:21 PM

#

deep kindle Bottom line is that there's inconsistent behavior here. Here's the example pyth...

i believe pplx just followed this https://docs.mistral.ai/api/#operation/createChatCompletion

Mistral AI API | Mistral AI Large Language Models

Chat Completion and Embeddings APIs

#

you can write a wrapper just 1 line of code not a big deal

deep kindle Dec 13, 2023, 5:25 PM

#

Yea I know how to fix it but I don't think I should have to 🙂 the API should be as flexible as possible. openai's API does this very well (for this and other things too).

Ultimately this is a shortcoming in pplx API that prevents it from being a seamless drop-in replacement to openai API.

For someone like me who obsesses over clean code and falls under this issue's use case, this discourages me from using pplx API for the time being.

(not saying this is exclusive to perplexity. I haven't used mistral's API yet, I wouldn't be surprised if it had the same issue)

mighty field Dec 14, 2023, 2:06 AM

#

Hi everyone! just a quick heads up, posting here is a great way to get advice from other members of the community but if you need direct support from the team, please head to the "Discuss" portion of the docs site. Where the API team will be answering questions and responding to comments on a regular basis:

https://docs.perplexity.ai/discuss

Feel free to post feedback, suggestions, etc here!

pplx-api

Discussions

mighty field Dec 14, 2023, 2:06 AM

#

mighty field Hi everyone! just a quick heads up, posting here is a great way to get advice fr...

echo garden Dec 19, 2023, 1:22 PM

#

Does the api handle web scraping also?

covert ferry Dec 19, 2023, 1:57 PM

#

echo garden Does the api handle web scraping also?

You can find more information about the online LLM's here, scraping limited to single pages is afaik not possible: https://blog.perplexity.ai/blog/introducing-pplx-online-llms

Introducing PPLX Online LLMs

The first-of-its-kind Online LLM API

echo garden Dec 19, 2023, 7:26 PM

#

covert ferry You can find more information about the online LLM's here, scraping limited to s...

K thx ☺️

unborn folio Dec 20, 2023, 10:09 AM

#

Mistral is crazy good

tepid summit Dec 21, 2023, 6:11 AM

#

You should edit it and remove your api key I think 😉

covert ferry Dec 21, 2023, 6:19 AM

#

tepid summit You should edit it and remove your api key I think 😉

@quick trench

#

@quick trench I removed your message because your key was visible, please post the code without the key!

#

(And please deactivate the current key!)

quick trench Dec 21, 2023, 6:39 AM

#

covert ferry <@665333273077088257> I removed your message because your key was visible, pleas...

Thanks! I thought I had removed it from the snippet but I obviously missed it. I appreciate the catch, @tepid summit! Also, I solved my question on my own, so I'll leave it

quick trench Dec 21, 2023, 6:39 AM

#

covert ferry (**And please deactivate the current key!**)

Done!

tepid summit Dec 21, 2023, 6:40 AM

#

quick trench Thanks! I thought I had removed it from the snippet but I obviously missed it. I...

No problem at all 😉

low salmon Dec 22, 2023, 1:33 PM

#

Hey, I'm using the pplx-70b-online model API and the responses have been great. However, I'm based in India and the responses I'm getting are in PST timezone. Any suggestions on how to set the timezone at the model level? Also, is there a way to upload a file via API?

lunar crown Dec 22, 2023, 2:59 PM

#

low salmon Hey, I'm using the pplx-70b-online model API and the responses have been great. ...

Can't you specify the timezone bit as part of your prompt. That you'd prefer IST.

As far as I am aware, I don't think you can upload a file via API. You'll probably need to use some other library to create a vector database and then run the query based on that.

low salmon Dec 22, 2023, 3:06 PM

#

lunar crown Can't you specify the timezone bit as part of your prompt. That you'd prefer IST...

Any sample code or suggestions on how to create a vector database and file upload?

lunar crown Dec 22, 2023, 3:15 PM

#

low salmon Any sample code or suggestions on how to create a vector database and file uploa...

https://docs.pinecone.io/page/examples

Pinecone

Examples

Explore vector search and witness the potential of vector search through carefully curated Pinecone examples. These examples demonstrate how you can integrate Pinecone into your applications, unleashing the full potential of your data through ultra-fast and accurate similarity search.

jagged solstice Dec 22, 2023, 7:22 PM

#

Is it just me, or are the rates for mixtral on pplx-api really cheap?

#

Like compared to other providers

spring mauve Dec 23, 2023, 4:06 PM

#

Hi, do y’all know if RAG in API made it, I saw it was on roadmap in November? Would for sure be a sweet Christmas gift 🎄

past island Dec 23, 2023, 4:10 PM

#

spring mauve Hi, do y’all know if RAG in API made it, I saw it was on roadmap in November? Wo...

Hey, @spring mauve! pplx-7b-online & pplx-70b-online are online models, they are available via the API.

spring mauve Dec 23, 2023, 4:13 PM

#

past island Hey, <@988556460336250960>! pplx-7b-online & pplx-70b-online are online models, ...

Thanks for quick reply Alex! Was wondering more about document RAG for PDFs, like how the pplx web UI has. Would sure beat langchain+vector db

unborn flax Dec 23, 2023, 5:26 PM

#

past island Hey, <@988556460336250960>! pplx-7b-online & pplx-70b-online are online models, ...

Is there a resource that explains more about the online models? I see them mentioned as existing but not how they work, and I was confused because the FAQ on the docs still says "Does the pplx-api currently support web browsing? - No"

covert ferry Dec 23, 2023, 5:44 PM

#

unborn flax Is there a resource that explains more about the online models? I see them menti...

Please have a look at the following resources:
https://blog.perplexity.ai/blog/introducing-pplx-online-llms
https://docs.perplexity.ai/reference/post_chat_completions
https://docs.perplexity.ai/docs/pricing
https://docs.perplexity.ai/docs/rate-limits
https://docs.perplexity.ai/docs/model-cards

low salmon Dec 26, 2023, 8:21 AM

#

Any idea when the Gemini model will be added?

covert ferry Dec 26, 2023, 8:29 AM

#

low salmon Any idea when the Gemini model will be added?

Gemini Pro is only available via Googles API (not open source). So it won't be added since the api is mainly for open source and in house LLM's.

carmine holly Dec 26, 2023, 11:17 AM

#

restive rain i was trying to add this to typingmind, how should i write the custom header?

when I try using typingmind with perplexity api, I get CORS error. Seems perplexity has to enable something from their side to allow web apps

dreamy summit Dec 26, 2023, 11:22 AM

#

covert ferry Gemini Pro is only available via Googles API (not open source). So it won't be a...

Any plan to make an API for Perplexity Pro? I mean, GPT-4, Claude-2, etc.

restive rain Dec 26, 2023, 11:33 AM

#

carmine holly when I try using typingmind with perplexity api, I get CORS error. Seems perplex...

Ah I tried to set it up but couldn't work after a few times. So I gave up haha .

So I'm not using the $5 at all... Not sure how to utilize it at all. Tbh, havent found a real use for the API but will try to figure out something .

covert ferry Dec 26, 2023, 11:45 AM

#

dreamy summit Any plan to make an API for Perplexity Pro? I mean, GPT-4, Claude-2, etc.

there is no plan atm

stone trellis Dec 26, 2023, 3:49 PM

#

Hello all! Please help me undestand why I cannont see the grounding (where the information is comming from) when I use the API for ppx-online. Am I doing something wrong? I expected to see the reference/where the information was drawn from.

covert ferry Dec 26, 2023, 3:52 PM

#

stone trellis Hello all! Please help me undestand why I cannont see the grounding (where the i...

Hey @stone trellis
Sources are not available via the API (yet).

stone trellis Dec 26, 2023, 3:53 PM

#

Thats great to known that they will be available someday 😉

#

From a user/product perspective is it valuable to have the API with the ability to search the internet without telling where the information came from though? This is what doesent make sense to me. It seems to me that the MVP would be to include it, dont you think?

#

I think I found a workaround. I have a list of whitelisted URLs that I want to RAG with JSON descriptions for them. Each URL has a descriptive JSON. With the user request I do a Vector Database search, get the URL that address the user request, use Perplexity online the format the message using the parameter "site: xyz" to restrict the search reach.

stone trellis Dec 27, 2023, 12:47 AM

#

Hi

#

#

#

is it possible We have stumble on the limit for the ppx-70b-online. Is it possible to pay for more concurrent requests?

covert ferry Dec 27, 2023, 1:00 AM

#

stone trellis is it possible We have stumble on the limit for the ppx-70b-online. Is it possib...

You can send your request to api@perplexity.ai 🙂

timber dawn Dec 27, 2023, 3:01 PM

#

covert ferry Hey <@745809216350060625> Sources are not available via the API (yet).

do you know when?

covert ferry Dec 27, 2023, 3:01 PM

#

timber dawn do you know when?

there is no ETA

covert ferry Dec 27, 2023, 9:59 PM

#

what

stone trellis Dec 27, 2023, 10:12 PM

#

Hello, is the ppx-online api suposed to work with the restrictive expression "site"? This will get a response only based on a specified site. Does this work as it does with Google?

stone trellis Dec 27, 2023, 10:39 PM

#

"user request site:aaa.com or site:bbb.com or site:ccc.com"

#

is this the right sintax?

stone trellis Dec 28, 2023, 5:05 PM

#

Hello, can anyone confirm or deny the effectiveness of the use of the "site" parameter on a request to narrow down the possible sites where the search must be done?

past island Dec 28, 2023, 5:21 PM

#

stone trellis "user request site:aaa.com or site:bbb.com or site:ccc.com"

Hey, @stone trellis! They syntax is correct, but currently, the operators are not supported.

stone trellis Dec 28, 2023, 5:25 PM

#

Thanks Alex.

left lark Dec 29, 2023, 10:22 AM

#

When will you be getting the Mixtral API to offer 32k context?

covert ferry Dec 29, 2023, 11:08 AM

#

left lark When will you be getting the Mixtral API to offer 32k context?

https://docs.perplexity.ai/docs/feature-roadmap

pplx-api

Feature Roadmap

pplx-api's roadmap is as follows:January Stop words and request time limits as parameters Mistral 32k context length

#

Mixtral is also planned

left lark Dec 29, 2023, 11:55 AM

#

covert ferry https://docs.perplexity.ai/docs/feature-roadmap

The official Mixtral API covers 32k context, why did Perplexity not offer that?

covert ferry Dec 29, 2023, 12:00 PM

#

left lark The official Mixtral API covers 32k context, why did Perplexity not offer that?

This is beyond my competence, but it is planned and 32k will be available soon 🙂

dim idol Dec 29, 2023, 12:17 PM

#

I see that Mistral-Medium is censored when asking (for tests) for "illegal" things. Is it because of a system prompt added by Perplexity? or it's natively censored?

covert ferry Dec 29, 2023, 12:20 PM

#

dim idol I see that Mistral-Medium is censored when asking (for tests) for "illegal" thin...

Censored by Mistral

dim idol Dec 29, 2023, 12:23 PM

#

covert ferry Censored by Mistral

Are you sure?

covert ferry Dec 29, 2023, 12:24 PM

#

Yes I am

#

Also noticed by other users (https://www.reddit.com/r/MistralAI/comments/18jd1s2/mistral_platform_api_censored_like_chatgpt/) but you can get around this or make it even worse

From the MistralAI community on Reddit

Explore this post and more from the MistralAI community

left lark Jan 2, 2024, 1:23 PM

#

You should make the dropdown organize the Mistral models together from least powerful to most powerful

low salmon Jan 2, 2024, 4:34 PM

#

Do we have mistral api with online LLM?

covert ferry Jan 2, 2024, 4:50 PM

#

low salmon Do we have mistral api with online LLM?

Yes, pplx-7b-chat

pine isle Jan 3, 2024, 6:53 AM

#

Is the profile in the perplexity app the same as a system role in the API?

analog junco Jan 3, 2024, 1:53 PM

#

can we have solar 10.7b model?

marsh pendant Jan 4, 2024, 11:45 PM

#

Can we get an option to return both the raw snippets and reference links for online models via the API? I'm considering the perplexity API for an application I'm building, but it's critical for my use case to be able to provide links to the reference material. It would also be nice to set the number of snippets to return (i.e. defult is 3, but maybe I want 10, 20, etc -- even if this costs extra). My use case requires "deep" search so stopping at top 3 results doesn't work for me. Maybe just consider having a separate search API? Also why no Mixtral for online responses? I think just have a search API that is $5/1K reqs (up to 20 responses perhaps) and then the normal chat completions API with per-model pricing; no need to combine. Snippets from search can be passed to LLM API serverside for completions with a flag, or otherwise the developer can decide what to do with the results.

restive rain Jan 5, 2024, 12:43 AM

#

whats the advantage of using PPLX API over Perplexity?

covert ferry Jan 5, 2024, 12:46 AM

#

restive rain whats the advantage of using PPLX API over Perplexity?

The pplx API is for developers to integrate LLM's into their products, the online LLM's can be very helpful for obtaining up-to-date information ☺️

restive rain Jan 5, 2024, 12:47 AM

#

so the API doenst search right? is an LLM like GPT-4

covert ferry Jan 5, 2024, 12:49 AM

#

restive rain so the API doenst search right? is an LLM like GPT-4

The online LLM's can search

restive rain Jan 5, 2024, 12:49 AM

#

covert ferry The online LLM's can search

that u mean perplexity.com

covert ferry Jan 5, 2024, 12:51 AM

#

restive rain that u mean perplexity.com

I think that will explain it better: https://blog.perplexity.ai/blog/introducing-pplx-api
https://blog.perplexity.ai/blog/introducing-pplx-online-llms

Introducing pplx-api

Perplexity Lab's fast and efficient API for open-source LLMs

Introducing PPLX Online LLMs

The first-of-its-kind Online LLM API

restive rain Jan 5, 2024, 12:53 AM

#

oh i see. thanks finally understood. haha
so the -online one works

restive rain Jan 5, 2024, 2:24 AM

#

im having some issues adding perplexity models into typingmind, any idea how i can do this? (sorry im really newbie)

restive rain Jan 6, 2024, 7:57 AM

#

hi all, i cant seem to use the API on typingmind, neither can i use it on harpa.ai...
Just wondering which platforms, or how, are you using the API?
Thanks!

pine isle Jan 6, 2024, 12:08 PM

#

restive rain hi all, i cant seem to use the API on typingmind, neither can i use it on harpa....

I write my own code to run the api

pine isle Jan 6, 2024, 12:09 PM

#

restive rain hi all, i cant seem to use the API on typingmind, neither can i use it on harpa....

https://docs.perplexity.ai/reference/post_chat_completions you can use this to get running

pplx-api

Chat Completions

Generates a model's response for the given chat conversation.

#

can even try it directly if oyu input your token

rancid acorn Jan 6, 2024, 3:10 PM

#

Any ideas how the syntax for making perplexity API calls could be shoehorned into this (HARPA AI browser plugin)? It's designed to facilitate connections to models via OpenAI or OpenRouter endpoints, but some have apparently managed to get it to work with endpoints from other providers (e.g. LM Studio). I have a valid API/bearer token and, referring to https://docs.perplexity.ai/reference/post_chat_completions, have tried various combinations of inputs/settings but with no luck

restive rain Jan 7, 2024, 2:45 PM

#

same here, i have no idea how to use my $5 credits ...

shy quarry Jan 7, 2024, 6:17 PM

#

restive rain same here, i have no idea how to use my $5 credits ...

https://docs.perplexity.ai/docs/getting-started

pplx-api

Getting Started with pplx-api

You can access pplx-api using HTTPS requests. Authenticating involves the following steps:Start by visiting the Perplexity API Settings page. Register your credit card to get started. This step will not charge your credit card. Rather, it stores payment information for later API usage. After providi...

restive rain Jan 7, 2024, 6:29 PM

#

shy quarry https://docs.perplexity.ai/docs/getting-started

Thanks
Yeah , I got my bearer code long ago. But I have no clue how to use it if typingmind or Harpa can't link due to some issues (might be CORS related)
I don't know how to code , sorry for that.
The chat completion page doesn't look very user-friendly at least from an UI perspective.
https://docs.perplexity.ai/reference/post_chat_completions

pplx-api

Chat Completions

Generates a model's response for the given chat conversation.

#

The guide only tells you how to get the API key and how to pay . Haha but I still haven't figured how to use

covert ferry Jan 7, 2024, 6:32 PM

#

restive rain Thanks Yeah , I got my bearer code long ago. But I have no clue how to use it if...

The API is actually only intended for developers and is not intended for use via other websites.

restive rain Jan 7, 2024, 6:32 PM

#

Oh ... Thanks I'm hurt 😔

#

😭

#

Okay got the answer. Thanks haha so why make the pro users pay $5 for something most won't need

#

Anyway it's okay . 2am here. Have a good day

covert ferry Jan 7, 2024, 7:02 PM

#

restive rain Okay got the answer. Thanks haha so why make the pro users pay $5 for something ...

You don't pay extra for it, it's just a bonus for developers to try out the API 😉

rancid acorn Jan 8, 2024, 12:41 AM

#

restive rain Okay got the answer. Thanks haha so why make the pro users pay $5 for something ...

I signed up to use Copilot / multiple models, then noticed the API and was curious if it might be able to be used in my HARPA workflows, which mostly involve searching for and parsing info from the web. But I'm like you (can't code - certainly not a developer!), so sometimes just have to accept that trying to make shiny and cool tools do things that they are not meant to and which I can't technically implement is:
a) probably not a great idea to begin with, and
b) even if I tried, would require so much back-and-forth with a coding AI assistant that I would ultimately end up spending more time on it than would be saved (and if one little thing changes and the implementation breaks, then I'm basically back to square one)

#

anyway fwiw no point getting too frustrated. things may not be developing at an exponential or sustainable rate, but they are progressing fast. In my humble (biased is prob more appropriate lol) opinion, I think people who can't code, but understand how to effectively interact with the technology (and look at it as an enabler rather than a magic wand), will be among those who stand to be benefit from it the most :))

restive radish Jan 8, 2024, 3:56 PM

#

Hi all. I see that mixtral-8x7b-instruct is priced at the same rate as the 13b models, but there is no published price for 13b models. Any idea what the price is?

shy quarry Jan 8, 2024, 5:48 PM

#

restive radish Hi all. I see that mixtral-8x7b-instruct is priced [at the same rate as the 13b...

#💬│general message

hardy flare Jan 8, 2024, 9:47 PM

#

Mostly curious but also interested because I've been considering building some voice-related features, are there any plans in the future to offer voice-related APIs through the Perplexity API?

past island Jan 9, 2024, 9:10 AM

#

hardy flare Mostly curious but also interested because I've been considering building some v...

Hi, @hardy flare! Currently, it's not on the roadmap, so you can build such a voice service on top of the API.

low salmon Jan 9, 2024, 9:41 AM

#

Hi there,

I've been experiencing an issue with the API for the past five hours where I've been receiving a 500 internal server error. I'm a paid user of the API and I'm hoping that this issue can be resolved quickly. Thank you.

shy quarry Jan 9, 2024, 10:04 AM

#

low salmon Hi there, I've been experiencing an issue with the API for the past five hours ...

#💬│general message

bright mural Jan 10, 2024, 3:32 AM

#

Is it possible to have the pplx-70b-online models cite their sources (similar to how the Perplexity app does it)? I'm looking to build something that requires summarized realtime data, that the user can then tap on to be able to look at the actual source of information. I'm testing things out on the labs.perplexity.ai page but I can't get it to cite any sources...

rancid acorn Jan 10, 2024, 3:41 AM

#

bright mural Is it possible to have the pplx-70b-online models cite their sources (similar to...

https://www.perplexity.ai/search/Is-it-possible-DtQiEHcZTY6TgkS04YW23Q?s=c
"""
So in summary, no - the pplx-70b-online model does not directly cite sources like the Perplexity app does. The model provides up-to-date responses, but does not include embedded citations. Adding support for grounding facts and citations is on Perplexity's roadmap for the future.
"""

#

Actually contrary to Perplexity's answer, it apparently is not on the roadmap (though is a frequently requested feature)
#👉│feedback-general message

sleek coral Jan 10, 2024, 10:59 AM

#

Where can i raise an issue related to billing? I want to add credits but I am unable to do so and the card gets rejected on Octane.

covert ferry Jan 10, 2024, 11:12 AM

#

sleek coral Where can i raise an issue related to billing? I want to add credits but I am un...

You can contact support@perplexity.ai as it is a billing problem and not an API problem, however Perplexity cannot take action against declined cards, this is done by the payment platform (Stripe) or your bank.

ancient belfry Jan 11, 2024, 8:03 PM

#

Hey I tried loading credits on my account, but cannot do so, and no reason is being mentioned. Can someone help me out?

covert ferry Jan 11, 2024, 9:02 PM

#

ancient belfry Hey I tried loading credits on my account, but cannot do so, and no reason is be...

Hey @ancient belfry!
Please take a look at these instructions: https://docs.perplexity.ai/docs/getting-started
Also note that it may take some time for the credits to appear 🙂

pplx-api

Getting Started with pplx-api

You can access pplx-api using HTTPS requests. Authenticating involves the following steps:Start by visiting the Perplexity API Settings page. Register your credit card to get started. This step will not charge your credit card. Rather, it stores payment information for later API usage. After providi...

trail folio Jan 12, 2024, 9:15 AM

#

So sad, I want to switch model to perplexity api, But the reponse is to hard to validate JSON format, it's not complete the a JSON in response message.

#

here example reponse :

{
  "id": "0a56acdb-da6e-474e-9799-e4d95fc947a4",
  "model": "mistral-7b-instruct",
  "created": 2178721,
  "usage": {
    "prompt_tokens": 980,
    "completion_tokens": 38,
    "total_tokens": 1018
  },
  "object": "chat.completion",
  "choices": [
    {
      "index": 0,
      "finish_reason": "stop",
      "message": {
        "role": "assistant",
        "content": "{\n\"name\": \"DKRA Knowledge\",\n\"agent\": \"dkra_agent\",\n\"input\": \"AI is the CEO of DKRA company\"\n"
      },
      "delta": {
        "role": "assistant",
        "content": ""
      }
    }
  ]
}

past island Jan 12, 2024, 4:56 PM

#

Hey, @trail folio! Thanks for reporting. Could you create a thread in the #1161804761247526912 and add your query and the system prompt you used.

compact pelican Jan 12, 2024, 6:41 PM

#

hey. can someone help me pls? I cant find anything about using functions calling in perpexity api (a-la it is possible in chatgpt3.5-4). is it possible with perplexity?

past island Jan 12, 2024, 6:52 PM

#

compact pelican hey. can someone help me pls? I cant find anything about using functions calling...

Hey, @compact pelican! It's not possible. Please, take a look at the available models: https://docs.perplexity.ai/docs/model-cards

pplx-api

Supported Models

Where possible, we try to match the Hugging Face implementation. We are open to adjusting the API, so please reach out with feedback regarding these details. ModelContext LengthModel Typecodellama-34b-instruct16384Chat Completionllama-2-70b-chat4096Chat Completionmistral-7b-instruct [2]4096 [1]Chat ...

compact pelican Jan 12, 2024, 6:52 PM

#

hey. thanks for a quick response. there are model listed, but no info about functions calling functionality available/unavailable

nova brookBOT Jan 12, 2024, 6:52 PM

#

Hey @compact pelican, if you find the original message helpful, please consider reacting to it with the ⭐ emoji. If the post is appreciated by the community and receives 5 stars, it will go to the ⁠⭐│starred channel and the post author will get the EXPLORER role on Perplexity.

thorny musk Jan 12, 2024, 8:09 PM

#

compact pelican hey. thanks for a quick response. there are model listed, but no info about func...

Unavailable

hollow musk Jan 13, 2024, 3:22 PM

#

Does the pplx api not have the same search capability as the app?

#

I can ask a summary of a linkedin post on the main app but the api using the online models is declining

#

#

thorny musk Jan 13, 2024, 3:55 PM

#

hollow musk I can ask a summary of a linkedin post on the main app but the api using the onl...

They aren't the same

hollow musk Jan 13, 2024, 3:56 PM

#

thorny musk They aren't the same

so the online models don't do real time browsing?

covert ferry Jan 13, 2024, 4:02 PM

#

hollow musk so the online models don't do real time browsing?

Yes they can, but you are currently comparing one of the pplx models with GPT-3.5 and also with Copilot, there is a huge difference between them. It is also advisable to use the operator “site:”.

hollow musk Jan 13, 2024, 4:03 PM

#

ah okay

hollow musk Jan 13, 2024, 4:04 PM

#

covert ferry Yes they can, but you are currently comparing one of the pplx models with GPT-3....

is it normal to get a "sorry as an AI language model i cant.." etc for a normal query without a url through the api?

covert ferry Jan 13, 2024, 4:05 PM

#

hollow musk is it normal to get a "sorry as an AI language model i cant.." etc for a normal ...

Depending on what the question is, yes.

hollow musk Jan 13, 2024, 4:05 PM

#

i tried searching for the score of the latest man city match

#

it gave the correct response but the first time i got that "sorry" message

spring mauve Jan 13, 2024, 4:10 PM

#

Is there an operator to provide the pplx online model so that it will indicate sources and URLs, like the consumer interface does? Or possible roadmap item?

covert ferry Jan 13, 2024, 4:11 PM

#

spring mauve Is there an operator to provide the pplx online model so that it will indicate s...

No, that's not on the roadmap at the moment.

hollow musk Jan 13, 2024, 4:14 PM

#

I'm getting an error even with the correct api key
{'statusCode': 401, 'error': 'Unauthorized', 'message': 'Missing authentication'}

covert ferry Jan 13, 2024, 4:16 PM

#

hollow musk I'm getting an error even with the correct api key {'statusCode': 401, 'error': ...

Please share your code, it's best to open a post in #1161804761247526912

hollow musk Jan 13, 2024, 4:16 PM

#

making a post thanks

nova brookBOT Jan 13, 2024, 4:16 PM

#

Hey @hollow musk, if you find the original message helpful, please consider reacting to it with the ⭐ emoji. If the post is appreciated by the community and receives 5 stars, it will go to the ⁠⭐│starred channel and the post author will get the EXPLORER role on Perplexity.

calm spear Jan 14, 2024, 6:45 AM

#

Hello, folks. I've been using the pplx labs playground to test out the different models and I really like the mixtral-8x7b-instruct model for my use case. I incorporated the model into my app by using the pplx API, however, the responses I get from the API are different than what I get in the playground. That is, for the same prompt I get different response from the playground and the API. The prompt is asking the model which of two numbers are greater, so I would expect the same answer. The playground gets it correct every time but the API gets it wrong every time.

Does anyone have any clues or hints as to how I can tickle the API to behave more like the playground? Thanks.

ocean surge Jan 14, 2024, 8:56 AM

#

https://github.com/nekowasabi/vim-perplexity
I implemented a simple client to run the API as a vim/neovim plugin.

GitHub

GitHub - nekowasabi/vim-perplexity

Contribute to nekowasabi/vim-perplexity development by creating an account on GitHub.

rancid acorn Jan 14, 2024, 1:29 PM

#

covert ferry No, that's not on the roadmap at the moment.

It seems a very common request and, imho, perhaps reflects that, in professional contexts, many people want the ability to crosscheck information received from a generative LLM, especially if it is information that is not part of its training 'knowledge' (e,g. about a current event).

#

It would of course be great if the accuracy of the outputs generated from any of these systems could be taken as gospel

#

But alas, we're not there yet...

plucky matrix Jan 15, 2024, 6:27 PM

#

I'm having issues with using the model with industry classifications, I provided it with a prompt:

You are a model that classifies companies into specific categories based on available information. The categories are: Aerospace-Space, Agriculture, Broadcast-Communications, Consumer Electronics, Energy, Engineering Services, Industrial, Medical, Robotics, Transportation-Telematics, Wearables. Your task is to classify the following company into one and only one of these categories. Your response should be the category name only, with no other words or explanations.

and I get one of three responses, a repetition of the company name I input and then a full stop, a long explanation of everything I would get from a google search or in rare cases it will say that there were no results, which doesn't make sense.

thorny musk Jan 15, 2024, 10:19 PM

#

There's apparently a "related" model now? 🤔

#

cc @past island

strange arrow Jan 16, 2024, 5:47 AM

#

hi, when I call the api using pplx-7b-online model, it response "error":{"message":"An internal server error has occurred.","type":"internal_server_error","code":500}; is server in trouble now?

lethal siren Jan 16, 2024, 8:40 AM

#

I'm a bit confused about the API - it looks like a pretty typical LLM completion style API. Is there a way to get something more similar to the end-user experience, where sources are included/provided?

#

This seems like the key differentiator for perplexity, so not having it exposed through the API is quite surprising (if that's the case)

gray siren Jan 16, 2024, 8:54 AM

#

Um i just wanna ask is it possible to add api credits thru the apple app

#

Since my subscription is bought from there

covert ferry Jan 16, 2024, 9:07 AM

#

lethal siren I'm a bit confused about the API - it looks like a pretty typical LLM completion...

#🧪│api-general message

rancid acorn Jan 16, 2024, 9:49 AM

#

lethal siren I'm a bit confused about the API - it looks like a pretty typical LLM completion...

You aren't the only one mate #🧪│api-general message

rancid acorn Jan 16, 2024, 9:57 AM

#

lethal siren This seems like the key differentiator for perplexity, so not having it exposed ...

I would argue that it is precisely Perplexity's differentiator - it's not a peripheral feature, but at the core of why people use it.. There's 1000s of models out there now...Perplexity's ability to ability to retrieve and use real-time web results to inform its responses is what sets it apart, but I can't integrate the API into my workflows in any systematic way without the API also responses being supported by URL links/references like when using the web app

lethal siren Jan 16, 2024, 11:02 AM

#

Agreed. It definitely provides results with a different 'feel' to some other LLMs, but going from that to actual sources/urls feels like an order of magnitude improvement over other LLMs (for some specific use cases)

#

I'm building an AI workflow tool (https://hunch.tools) and after messing around with perplexity it seemed it would be a hugely beneficial addition to some types of workflows. Without the sources it's still interesting and useful, but I don't feel it's an absolutely must-have addition. I'll probably still add the 'online' models, but meh, not as exciting as it could be

Hunch

Hunch’s AI studio lets anyone combine multiple AI models into powerful, shareable workflows and watch them run instantly on an interactive canvas.

empty brook Jan 16, 2024, 2:30 PM

#

thorny musk There's apparently a "related" model now? 🤔

When you give a URL, it returns related results that appear under Perplexity search.

thorny musk Jan 16, 2024, 2:34 PM

#

empty brook When you give a URL, it returns related results that appear under Perplexity sea...

Maybe it doesn't work for reddit

empty brook Jan 16, 2024, 2:37 PM

#

thorny musk Maybe it doesn't work for reddit

Yes me too

rancid acorn Jan 16, 2024, 3:54 PM

#

empty brook When you give a URL, it returns related results that appear under Perplexity sea...

Seems to be suggested follow-up questions

maiden skiff Jan 16, 2024, 11:50 PM

#

For adding credits it through an error and charged $300 dollars how do I rectify this cost.

#

I have not used any of the credits, but it through an error and then showed up out of nowhere on the api page?

past island Jan 16, 2024, 11:53 PM

#

maiden skiff I have not used any of the credits, but it through an error and then showed up o...

Please send a request to support@perplexity.ai, we'll help with that.

maiden skiff Jan 16, 2024, 11:53 PM

#

I just did. Thanks.

nova brookBOT Jan 16, 2024, 11:53 PM

#

Hey @maiden skiff, if you find the original message helpful, please consider reacting to it with the ⭐ emoji. If the post is appreciated by the community and receives 5 stars, it will go to the ⁠⭐│starred channel and the post author will get the EXPLORER role on Perplexity.

fringe fulcrum Jan 17, 2024, 2:15 AM

#

I am looking for help building a basic application using Perplexity API. Any willing developers?

nimble monolith Jan 17, 2024, 2:08 PM

#

Hello everybody, new here.
I'm trying to setup the pplx API access. When I enter my card details everything goes smoothely and I get confirmation on the bank page that the operation was successful and that I can get back to the merchant website. Unfortunately when I go back to pplx API web page it always shows up the Setup payment button as if nothing had happened. I've redone the process 2 times to no avail.
Is there some form of validation period after card registration and before API access is granted ?

past island Jan 17, 2024, 8:55 PM

#

nimble monolith Hello everybody, new here. I'm trying to setup the pplx API access. When I ente...

Hey, @nimble monolith! This issue has been fixed and you can try setting up the payment method for the API. It should appear right after you payment method is confirmed.

zealous loom Jan 18, 2024, 12:00 AM

#

I'm not always the best with API programming so I want to ask a question first here before taking hours to code the test. When I use perplexity via the web its great at visiting a news site and giving me a summary and key information about that post. When I read the API docs I see its providing a fast interface for minstrel and Llama2 . Is it still going to be able to process requests that mention a url and are these LLMs going to return as high a quality response as the web interface for perplexity?

shy quarry Jan 18, 2024, 12:09 AM

#

zealous loom I'm not always the best with API programming so I want to ask a question first h...

#🧪│api-general message

zealous loom Jan 18, 2024, 12:17 AM

#

shy quarry https://discord.com/channels/1047197230748151888/1161802929053909012/11944861631...

thank you for that. very sad to see that it can't hit real time sources

nova brookBOT Jan 18, 2024, 12:17 AM

#

Hey @zealous loom, if you find the original message helpful, please consider reacting to it with the ⭐ emoji. If the post is appreciated by the community and receives 5 stars, it will go to the ⁠⭐│starred channel and the post author will get the EXPLORER role on Perplexity.

median oar Jan 18, 2024, 3:23 AM

#

Hi all, is there any way to fetch references with the text generated by the online models for the API?

rancid acorn Jan 18, 2024, 4:45 AM

#

nova brook Hey <@702017863589953576>, if you find the original message helpful, please cons...

If there isn't already, would be nice if there was a similar system for upvoting requested features (/FAQs), such as the post immediately above..

covert ferry Jan 18, 2024, 5:25 AM

#

median oar Hi all, is there any way to fetch references with the text generated by the onli...

#🧪│api-general message

lethal siren Jan 18, 2024, 6:04 AM

#

rancid acorn If there isn't already, would be nice if there was a similar system for upvoting...

I guess can + this: https://discord.com/channels/1047197230748151888/1196426667946684579

nimble monolith Jan 18, 2024, 8:32 AM

#

past island Hey, <@509014580983496715>! This issue has been fixed and you can try setting up...

Thank you for your answer @past island . Problem solved. I didn't have to recreate the payment method. Thanks for the good work guys!

nova brookBOT Jan 18, 2024, 8:32 AM

#

Hey @nimble monolith, if you find the original message helpful, please consider reacting to it with the ⭐ emoji. If the post is appreciated by the community and receives 5 stars, it will go to the ⁠⭐│starred channel and the post author will get the EXPLORER role on Perplexity.

median oar Jan 18, 2024, 6:59 PM

#

covert ferry https://discord.com/channels/1047197230748151888/1161802929053909012/11957620929...

This is not helping, can you give me more specific instructions/details?

covert ferry Jan 18, 2024, 7:01 PM

#

median oar This is not helping, can you give me more specific instructions/details?

No, it´s not possible.

median oar Jan 18, 2024, 7:08 PM

#

Is there a roadmap when we can get it?

covert ferry Jan 18, 2024, 7:10 PM

#

median oar Is there a roadmap when we can get it?

No, that's not on the roadmap at the moment.
#🧪│api-general message

tall gazelle Jan 19, 2024, 9:29 AM

#

Is there an OpenAI compatible endpoint for the API? To use it with tools that support OpenAI.

trail burrow Jan 19, 2024, 10:32 AM

#

is it possible to ask pplx-70b-online to list its sources when using the API, just like the web and mobile user version

covert ferry Jan 19, 2024, 10:33 AM

#

trail burrow is it possible to ask pplx-70b-online to list its sources when using the API, ju...

#🧪│api-general message

trail burrow Jan 19, 2024, 10:33 AM

#

okay

clever prawn Jan 21, 2024, 2:59 AM

#

How is the development of 32k context length support progressing? Is there a specific release date?

digital nimbus Jan 22, 2024, 10:09 AM

#

hello @everyone i am new to perplexity and i need the api but even when having upgraded to Perplexity pro i am not able to access the api... pls help @supple fossil @everyone

covert ferry Jan 22, 2024, 10:14 AM

#

digital nimbus hello @everyone i am new to perplexity and i need the api but even when having u...

No need to ping people here 😉
please have a look at https://docs.perplexity.ai/docs/getting-started

pplx-api

Getting Started with pplx-api

You can access pplx-api using HTTPS requests. Authenticating involves the following steps:Start by visiting the Perplexity API Settings page. Register your credit card to get started. This step will not charge your credit card. Rather, it stores payment information for later API usage. After providi...

digital nimbus Jan 22, 2024, 10:15 AM

#

i have followed everything there

#

it is showing me this:

covert ferry Jan 22, 2024, 10:43 AM

#

digital nimbus it is showing me this:

It may help to clear the website cache/data and deactivate extensions.

digital nimbus Jan 22, 2024, 10:43 AM

#

ok will try

digital nimbus Jan 22, 2024, 10:45 AM

#

covert ferry It may help to clear the website cache/data and deactivate extensions.

still not working

#

this is what is showing me everytime

digital nimbus Jan 22, 2024, 10:47 AM

#

digital nimbus still not working

covert ferry Jan 22, 2024, 10:49 AM

#

digital nimbus

There may be a problem with your card, in which case nothing can be done, however you should receive your $5 credits from pro subscription, please contact support via the pro support intercom button on the account tab.

digital nimbus Jan 22, 2024, 10:50 AM

#

covert ferry There may be a problem with your card, in which case nothing can be done, howeve...

got it thanks!

digital nimbus Jan 22, 2024, 11:09 AM

#

done but it is showing me this and still no credits are displayed

trail burrow Jan 22, 2024, 1:15 PM

#

covert ferry There may be a problem with your card, in which case nothing can be done, howeve...

Are credits only available with yearly plans? Or are they available with monthly plans as well?

covert ferry Jan 22, 2024, 1:16 PM

#

trail burrow Are credits only available with yearly plans? Or are they available with monthly...

Credits are available on all plans.

past island Jan 22, 2024, 3:12 PM

#

digital nimbus done but it is showing me this and still no credits are displayed

Hey, @digital nimbus! Thanks for reporting and sorry for the inconveniecne, we have a ticket regarding the Pro $5 credits https://discord.com/channels/1047197230748151888/1198130008053518376, and the team is working on it.

digital nimbus Jan 22, 2024, 4:33 PM

#

past island Hey, <@921833351151771669>! Thanks for reporting and sorry for the inconveniecne...

👌

wind pine Jan 23, 2024, 3:58 AM

#

that was quick.

wind pine Jan 23, 2024, 7:44 AM

#

can we get a /models endpoint implemented?

digital nimbus Jan 23, 2024, 3:06 PM

#

past island Hey, <@921833351151771669>! Thanks for reporting and sorry for the inconveniecne...

hi i am still not able to use it Alex please update me on this

radiant juniper Jan 24, 2024, 3:10 AM

#

hi, i tried the API. doesn't the online models return the citation URLs?

response = client.chat.completions.create(
model="pplx-70b-online",
messages=messages,
)

covert ferry Jan 24, 2024, 7:04 AM

#

radiant juniper hi, i tried the API. doesn't the online models return the citation URLs? respon...

#🧪│api-general message

grave mural Jan 24, 2024, 11:31 PM

#

digital nimbus done but it is showing me this and still no credits are displayed

Mine gave me the $5.00 credit when I entered $2 as the amount in the auto top off. It did not give it to me until I completed the top off automation.

odd path Jan 25, 2024, 12:24 AM

#

I am curious why when I use the response is different when I use the API vs when I use the browser?

abstract dirge Jan 25, 2024, 1:36 AM

#

I want to use the API, not sure which model is equivalent to "Experiment" (with Copilot) from the chat website?

earnest nymph Jan 25, 2024, 1:42 AM

#

abstract dirge I want to use the API, not sure which model is equivalent to "Experiment" (with ...

PPLX 70B

covert ferry Jan 25, 2024, 5:33 AM

#

odd path I am curious why when I use the response is different when I use the API vs when...

Different system prompt/sources are searched for differently.

abstract dirge Jan 25, 2024, 9:59 AM

#

earnest nymph PPLX 70B

Not even close. Website version codes much better.

covert ferry Jan 25, 2024, 10:01 AM

#

abstract dirge Not even close. Website version codes much better.

#🧪│api-general message 👀

formal rivet Jan 25, 2024, 12:00 PM

#

Hi, I have some results on labs which I can't seem to replicate when using the api. Is it possible to know some of the default params used by labs which I can apply to the api?

maiden skiff Jan 26, 2024, 1:55 AM

#

I reached out to support@perplexity.ai for a charge that was made and it was a double charge and need a refund of credits its been more than 1-2 business days this was last week yet no resolution.

#

Any pointers or help I could get.

karmic geode Jan 26, 2024, 10:41 AM

#

Are we able to buy (pay extra) to get api access that returns citations? Can we discuss with sales?

covert ferry Jan 26, 2024, 11:41 AM

#

karmic geode Are we able to buy (pay extra) to get api access that returns citations? Can we...

#🧪│api-general message

karmic geode Jan 26, 2024, 11:57 AM

#

covert ferry https://discord.com/channels/1047197230748151888/1161802929053909012/11976190654...

That’s a bit silly. If it’s not a legal decision, this is going to be what sinks you guys. Everyone wants it. We want to pay you for it. It’s fantastic with it and unusable without it. So a competitor is going to emerge that does it and takes all your business away is my guess…. You’re so close to putting something out there that could be huge…

covert ferry Jan 26, 2024, 12:05 PM

#

karmic geode That’s a bit silly. If it’s not a legal decision, this is going to be what sinks...

Just because it is not yet on the roadmap does not mean that this function is not being considered 😉

warm knot Jan 26, 2024, 9:13 PM

#

Are there any plans to make the API responses as good as the responses we get on the web interface? or close to it?

covert ferry Jan 26, 2024, 9:25 PM

#

warm knot Are there any plans to make the API responses as good as the responses we get on...

Improvements are constantly being worked on, at the moment the web interface also uses more advanced LLM's which can make a difference to the quality of the answers 🙂

rancid acorn Jan 27, 2024, 3:48 AM

#

karmic geode That’s a bit silly. If it’s not a legal decision, this is going to be what sinks...

I reckon that's spot on. If the citations were meant to be some kind of moat, Metaphor (now called exa) would appear to be building a big ol bridge across it...

#

(there's also Tavily, and I'm sure numerous other similar prjoects/products out there)

sharp flicker Jan 27, 2024, 5:51 AM

#

Is copilot and the follow-up prompts part of the API offering?

covert ferry Jan 27, 2024, 7:22 AM

#

sharp flicker Is copilot and the follow-up prompts part of the API offering?

No, please take a look at the documentation: https://docs.perplexity.ai

pplx-api

sterile temple Jan 27, 2024, 7:57 AM

#

maiden skiff I reached out to support@perplexity.ai for a charge that was made and it was a d...

There's a sure shot bug with perplexity's billing system. I have experienced it myself last week.

I just reached on discord and got help from the team here. But yeah it took me 5 working days to get access to the API for which i had already paid.

warm knot Jan 27, 2024, 8:35 AM

#

Quick question - I am trying to pull the most recent news articles related to a particular domain using pplx APIs. However I see that it's pulling in news articles that are more than a year old. Anyone know what context I can give to rectify this problem?

rancid acorn Jan 27, 2024, 12:27 PM

#

Google operators seem to work fairly well on web version, but not sure about with the API. Couldn't hurt testing including 'after:yyyy-mm-dd' in the query (if you haven't already) – would maybe steer the results in the right direction

warm knot Jan 27, 2024, 2:45 PM

#

covert ferry Improvements are constantly being worked on, at the moment the web interface als...

What;s the best model you recommend I use for accurate answers as close as what we get on the web?

warm knot Jan 27, 2024, 2:46 PM

#

rancid acorn Google operators seem to work fairly well on web version, but not sure about wit...

Thank you

lyric willowBOT Jan 27, 2024, 2:46 PM

#

warm knot Thank you

Hey @warm knot!

If you find the original message helpful, please consider reacting to it with the :star: emoji. If the post is appreciated by the community and receives 5 stars, it will go to the https://discord.com/channels/1047197230748151888/1082806833938436228 channel and the post author will get the <@&1082034222778302614> role on Perplexity.

covert ferry Jan 27, 2024, 3:15 PM

#

warm knot What;s the best model you recommend I use for accurate answers as close as what ...

The online LLMs are the only ones with internet access, which is closest to the web version

faint crown Jan 28, 2024, 7:49 PM

#

hey i never received my API credits for perplexity pro, does it need to wait a couple days or something or is that a known bug

nocturne thunder Jan 28, 2024, 8:02 PM

#

faint crown hey i never received my API credits for perplexity pro, does it need to wait a c...

https://discord.com/channels/1047197230748151888/1198130008053518376 please send an email with your account details to support@perplexity.ai

faint crown Jan 28, 2024, 8:02 PM

#

nocturne thunder https://discord.com/channels/1047197230748151888/1198130008053518376 please send...

i did last night just in case

faint crown Jan 29, 2024, 2:02 AM

#

rancid acorn I reckon that's spot on. If the citations were meant to be some kind of moat, Me...

Have you found a good alternative for this? I'm literally looking for the API version of perplexity web, but if that's not on the roadmap then I need to find the next best thing before I rag it myself with a scraper.

rancid acorn Jan 29, 2024, 5:48 AM

#

faint crown Have you found a good alternative for this? I'm literally looking for the API ve...

The sources/citations are being withheld from the API responses; if aggressive prompting can extract (at least the Publications and Titles of the sources), I see no reason why this request for the inclusion of sources with API responses could not be implemented with a few system prompt tweaks.

#

I've noticed a slight change of tune. From repeatedly just saying "No (it's not currently possible nor planned)", to the "roadmap is flexible and we're taking on feedback" (paraphrasing in both cases). So I'm curious to see if where it goes. I use perplexity for probably 80% of what I used to use Google for – it's a brilliant service and I'll continue to use it for general web searches for information about recent / contemporary developments for the foreseeable future. It would be great to start using the API in some of my research workflows, but currently there is literally no point. Anyway, there are other approaches and APIs that I currently use that could be helpful to you - feel free to DM

snow mauve Jan 29, 2024, 7:38 AM

#

Hey all, Is the response includes the source urls?

shy quarry Jan 29, 2024, 7:51 AM

#

snow mauve Hey all, Is the response includes the source urls?

#🧪│api-general message

karmic horizon Jan 30, 2024, 5:14 AM

#

any idea of when custom Stop Words will be implemented into the API? 😄
i'm testing a port for zed.dev editor to be able to use pplx-api as one of the endpoints for its 'assistant' features

it currently defaults to using openai models

river granite Jan 30, 2024, 1:21 PM

#

Maybe I am in the wrong place, could anyone indicate me how to add a perplexity leveraging and perplexity chat interface inside a webapp?

median thicket Jan 30, 2024, 3:11 PM

#

hi

#

anyone here?

#

😦

#

?

past island Jan 30, 2024, 3:55 PM

#

median thicket anyone here?

Hi, @median thicket! Everyone's here 👋 , what is your question?

karmic horizon Jan 30, 2024, 4:05 PM

#

Maybe I am in the wrong place, could

junior shuttle Jan 30, 2024, 9:21 PM

#

I have a simple question. I pay for a Perplexity Pro subscription. I know I have 1000 requests a month for free (using any model, I assume). I want to know about the cost after the first 1000 requests using GPT4. How is this calculated? Say I have another 1000 requests. Is the cost $5, and in addition I pay the cost of using GPT4? The details of the pricing structure are hard to find for specific use-cases.

past island Jan 30, 2024, 9:33 PM

#

junior shuttle I have a simple question. I pay for a Perplexity Pro subscription. I know I have...

Hey, @junior shuttle, I attach a screenshot from the pricing page, you can see that $5 per 1000 requests is the pricing for the online models only. If you spend your free credits on those online models 1000 requests then you'll need to top up your balance to send more that will be billed according to this pricing.
GPT-4 model is not provided via the API, you can find the supported model list here: https://docs.perplexity.ai/docs/model-cards

Just to clarify, GPT-4 and other models, including Copilot queries on our site have a limit of 600 queries a day that are available to you as a Pro user.

past island Jan 30, 2024, 10:21 PM

#

Hey, <@&1193989584976105562>! If you missed it, CodeLlama-70B-Instruct is avaialbe via pplx-api. Give it a try and let us know what you think!

wind pine Jan 30, 2024, 10:22 PM

#

I could kiss you.

quartz sonnet Jan 30, 2024, 10:23 PM

#

me too

wind pine Jan 30, 2024, 10:27 PM

#

Are there any vscode integration plans on the board?

past island Jan 30, 2024, 10:37 PM

#

That's a good idea. As of now, you can check the extension created by one of our users: https://discord.com/channels/1047197230748151888/1163935223399063702

wind pine Jan 30, 2024, 10:38 PM

#

I saw that -- thank you for linking it.

#

Cody has been an interesting experience. If you're looking to emulate anything I'd look at that.

#

https://marketplace.visualstudio.com/items?itemName=sourcegraph.cody-ai

Cody AI - Visual Studio Marketplace

Extension for Visual Studio Code - Code AI with codebase context

#

I'd not think twice about paying half as much more on my monthly to add it to the mix.
I'd love to put all of my AI bucks in the same bin.

trail burrow Jan 30, 2024, 10:45 PM

#

past island Hey, <@&1193989584976105562>! If you missed it, CodeLlama-70B-Instruct is avaial...

Let me check it out when I’m home

vernal ibex Jan 30, 2024, 10:51 PM

#

hey folks

#

trying to access the API

#

Using the snippet here but there is an authentication error 401

#

https://docs.perplexity.ai/docs/getting-started

pplx-api

Getting Started with pplx-api

You can access pplx-api using HTTPS requests. Authenticating involves the following steps:Start by visiting the Perplexity API Settings page. Register your credit card to get started. This step will not charge your credit card. Rather, it stores payment information for later API usage. After providi...

#

is this snippet outdated? Checked my OpenAI and PPLX keys. They are both correct

nocturne thunder Jan 30, 2024, 10:57 PM

#

vernal ibex is this snippet outdated? Checked my OpenAI and PPLX keys. They are both correct

snippet seems to be working fine, make sure your pplx api key is correct and the openai package is installed.

wind pine Jan 30, 2024, 11:11 PM

#

does pplx api return the models endpoint, yet?

void dock Jan 30, 2024, 11:12 PM

#

I’m about to ask something pretty dumb here but we are able to test and train a 70b without local devices capable of handling that much?

wind pine Jan 30, 2024, 11:12 PM

#

All you need is curl and a 300. baud modem.

#

Training is a whole other beast though, innit?

void dock Jan 30, 2024, 11:14 PM

#

Awesome ty. Caveman type learning over here and 7 or 8s were coming back at teletype speed

#

Haven’t tried perplexity yet at that level.

wind pine Jan 30, 2024, 11:15 PM

#

I'm not sure you can, mate.
You get to consume the provided models.

#

If you're interested in local stuff -- phi-2 via ollama makes compelling widget to tinker with.

#

I suspect it won't be long before you see that incorporated into a phone app.

void dock Jan 30, 2024, 11:16 PM

#

Done some Llama2 training on ollama but all local device (pi5 and weak Linux terminals with small nvidia cards)

#

5 years from now ain’t no telling how it’ll be.

wind pine Jan 30, 2024, 11:17 PM

#

THe Phi-2 model is quick as snot.
Of course, I gave it 30 Xeon cores and 64GB of memory but I bet it can run on a phone. It's a 2b model.

void dock Jan 30, 2024, 11:18 PM

#

Thank you. I appreciate the input I’ll give it a shot.

lyric willowBOT Jan 30, 2024, 11:18 PM

#

void dock Thank you. I appreciate the input I’ll give it a shot.

Hey @void dock!

If you find the original message helpful, please consider reacting to it with the :star: emoji. If the post is appreciated by the community and receives 5 stars, it will go to the https://discord.com/channels/1047197230748151888/1082806833938436228 channel and the post author will get the <@&1082034222778302614> role on Perplexity.

wind pine Jan 30, 2024, 11:20 PM

#

I used https://ollama.ai with https://github.com/ollama-webui/ollama-webui
no gpus.

Ollama

Get up and running with large language models, locally.

GitHub

GitHub - ollama-webui/ollama-webui: ChatGPT-Style Web UI Client for...

ChatGPT-Style Web UI Client for Ollama 🦙. Contribute to ollama-webui/ollama-webui development by creating an account on GitHub.

#

You're free to make money with it.
https://huggingface.co/microsoft/phi-2/commit/7e10f3ea09c0ebd373aebc73bc6e6ca58204628d

Upload 3 files · microsoft/phi-2 at 7e10f3e

#

/off-topic

mossy beacon Jan 31, 2024, 12:23 AM

#

river granite Maybe I am in the wrong place, could anyone indicate me how to add a perplexity ...

It's just an HTTP API. Super simple. Here's my implementation I just wrote today:
https://github.com/Clay-Ferguson/quantizr/blob/246e7e2b3510b033b1c133e8702fcc7da1b325a0/src/main/java/quanta/service/node/PplxAiService.java

But the simple example is here:
https://docs.perplexity.ai/reference/post_chat_completions

GitHub

quantizr/src/main/java/quanta/service/node/PplxAiService.java at 24...

Quanta is an open-source CMS with ChatGPT and Social Media (Fediverse) features - Clay-Ferguson/quantizr

pplx-api

Chat Completions

Generates a model's response for the given chat conversation.

mossy beacon Jan 31, 2024, 12:28 AM

#

vernal ibex hey folks

I'd recommend troubleshooting by running a shell command instead, so see if the problem is in your code or on the server. I just posted a link (see above) which has an example shell command on it.

river granite Jan 31, 2024, 12:35 AM

#

mossy beacon It's just an HTTP API. Super simple. Here's my implementation I just wrote today...

interesting! thank you!

lyric willowBOT Jan 31, 2024, 12:35 AM

#

river granite interesting! thank you!

Hey @river granite!

If you find the original message helpful, please consider reacting to it with the :star: emoji. If the post is appreciated by the community and receives 5 stars, it will go to the https://discord.com/channels/1047197230748151888/1082806833938436228 channel and the post author will get the <@&1082034222778302614> role on Perplexity.

river granite Jan 31, 2024, 12:35 AM

#

mossy beacon It's just an HTTP API. Super simple. Here's my implementation I just wrote today...

can it also get files from users? and operate summaries?

mossy beacon Jan 31, 2024, 1:21 AM

#

river granite can it also get files from users? and operate summaries?

If you extract text from files and/or user input text you can include that data into your prompts yes. I haven't looked into any of Perplexity's support for direct handling of files yet (or know if they do any of that)....and it would depend on how you're planning to use those files.

river granite Jan 31, 2024, 1:29 AM

#

mossy beacon If you extract text from files and/or user input text you can include that data ...

summarize them - 3 pages files at its best

junior shuttle Jan 31, 2024, 1:52 AM

#

Hey, @erlebach123, I attach a screenshot

mossy beacon Jan 31, 2024, 4:19 AM

#

river granite summarize them - 3 pages files at its best

To summarized text you can just do a prompt like: "Summarize the following text in two sentences:\n\ntext: ${text}" ...in other words just do it all in one shot.

warm knot Jan 31, 2024, 5:28 AM

#

past island Hey, <@&1193989584976105562>! If you missed it, CodeLlama-70B-Instruct is avaial...

Quick question and this could be a naive one. Is this model better for real time news online or would it still be perplexity's own models?

covert ferry Jan 31, 2024, 5:33 AM

#

warm knot Quick question and this could be a naive one. Is this model better for real time...

The pplx online models are the only ones with internet access.

river granite Jan 31, 2024, 8:32 AM

#

mossy beacon To summarized text you can just do a prompt like: "Summarize the following text ...

yeah I get it, it is the file upload that gets it a bit more complex for me

low blaze Jan 31, 2024, 5:34 PM

#

Hey, has someone contacted support about the APIs recently? I did but still did not get any answer 😦

past island Jan 31, 2024, 6:08 PM

#

low blaze Hey, has someone contacted support about the APIs recently? I did but still did ...

Hey, @low blaze! Did you contact via api@perplexity.ai or you reported an issue to support@perplexity.ai?

low blaze Jan 31, 2024, 6:11 PM

#

past island Hey, <@342706738794856448>! Did you contact via api@perplexity.ai or you reporte...

Hi @past island, thanks for replying! It was via support@perplexity.ai 🙂

dense lance Jan 31, 2024, 6:14 PM

#

past island Hey, <@&1193989584976105562>! If you missed it, CodeLlama-70B-Instruct is avaial...

Hi @past island , I've been getting wierd responses from codellama-70b-instruct? Here is an example of one that works on codellama-34b-instruct but fails on codellama-70b-instruct.

past island Jan 31, 2024, 6:20 PM

#

low blaze Hi <@830126989687914527>, thanks for replying! It was via support@perplexity.ai ...

Alright, if that question wasn't covered on the server or at https://docs.perplexity.ai/discuss, a personal question, please send me your email, we'll check.

past island Jan 31, 2024, 6:20 PM

#

dense lance Hi <@830126989687914527> , I've been getting wierd responses from codellama-70b-...

Thanks for reporting, @dense lance! We'll look into this issue.

dense lance Jan 31, 2024, 6:21 PM

#

Thanks @past island , other times it's complained about ethics or told me "404 not found" lol. 34B is solid though. I really appreciate being able to use it.

lyric willowBOT Jan 31, 2024, 6:21 PM

#

dense lance Thanks <@830126989687914527> , other times it's complained about ethics or told ...

Hey @dense lance!

If you find the original message helpful, please consider reacting to it with the :star: emoji. If the post is appreciated by the community and receives 5 stars, it will go to the https://discord.com/channels/1047197230748151888/1082806833938436228 channel and the post author will get the <@&1082034222778302614> role on Perplexity.

low blaze Jan 31, 2024, 6:24 PM

#

past island Alright, if that question wasn't covered on the server or at https://docs.perple...

Thank you for your support! I sent you my email as a dm 🙂

mossy beacon Jan 31, 2024, 6:30 PM

#

river granite yeah I get it, it is the file upload that gets it a bit more complex for me

Do you mean doing file uploads to your server from browsers, or are you saying there's some Perplexity upload feature you're using?

river granite Jan 31, 2024, 6:59 PM

#

mossy beacon Do you mean doing file uploads to your server from browsers, or are you saying t...

I mean file upload via chatbox interface eventually

mossy beacon Jan 31, 2024, 9:38 PM

#

river granite I mean file upload via chatbox interface eventually

Well at this point I can no longer tell if you're talking about the API or the Perplexity website, but I guess all your questions were answered.

river granite Jan 31, 2024, 9:47 PM

#

mossy beacon Well at this point I can no longer tell if you're talking about the API or the P...

yes. btw API

half thistle Jan 31, 2024, 11:40 PM

#

Does the $5 API credit stack up each month (roll over), or is it use-it-or-lose-it each month?

mossy beacon Jan 31, 2024, 11:55 PM

#

river granite yes. btw API

I'd look for something like "Tika" (in your language) which is what I use to extract text from any kind of file. Then you can just use that extracted document text in a chat completions prompt to ask a question about the content. Doesn't involve any uploading thru the API. All the file uploading is just direct to your server.

river granite Jan 31, 2024, 11:57 PM

#

mossy beacon I'd look for something like "Tika" (in your language) which is what I use to ext...

in my language?

#

can it run on a webapp? I see, you mean Apache Tika - right?

pearl bison Feb 1, 2024, 12:21 AM

#

Does anyone know if it is possible to list the links/sources from where the api pplx-online models are pulling their data like it does on perplexity.ai?

mossy beacon Feb 1, 2024, 12:23 AM

#

river granite in my language?

Yeah Apache Tika. You may not need that if users are uploading text files of course. It's mainly for extracting text from ANY kind of file (PDFs, etc). I checked and there's a python version of Tika. You know how to upload files from a webapp right? It's easy. Just ask ChatGPT how to do that in in your programming back end. The front end code is always the same javascript.

mossy beacon Feb 1, 2024, 12:24 AM

#

pearl bison Does anyone know if it is possible to list the links/sources from where the api ...

I'm just guessing, but is this the "-online" model versions that can do this? I need to know this as well.

river granite Feb 1, 2024, 12:25 AM

#

mossy beacon Yeah Apache Tika. You may not need that if users are uploading text files of cou...

yeah I would like users to upload time by time but I like this suggestion too maybe for other projects or this one too. I will look into it. Thanks a lot!

lyric willowBOT Feb 1, 2024, 12:25 AM

#

river granite yeah I would like users to upload time by time but I like this suggestion too ma...

Hey @river granite!

If you find the original message helpful, please consider reacting to it with the :star: emoji. If the post is appreciated by the community and receives 5 stars, it will go to the https://discord.com/channels/1047197230748151888/1082806833938436228 channel and the post author will get the <@&1082034222778302614> role on Perplexity.

pearl bison Feb 1, 2024, 12:30 AM

#

mossy beacon I'm just guessing, but is this the "-online" model versions that can do this? I ...

From my observations so far, the only difference from the chat and online models is that the online models are up to date whereas the chat models haven't been updated since September.

rancid acorn Feb 1, 2024, 1:28 AM

#

This should just be pinned to this thread #🧪│api-general message
(or the server's bot should automatically respond to posts containing 'does|can' 'API' and 'sources|links|urls|citations' with 'No'... the efficiency! )

rancid acorn Feb 1, 2024, 1:32 AM

#

mossy beacon I'm just guessing, but is this the "-online" model versions that can do this? I ...

The two pplx models with -online have real-time access to the internet. Ask a question about a recent news or sports event and there's a good chance you'll get an accurate response. That is great (and what differentiates these models from almost all others out there). The shame though is that it only returns the response/answer without any citations or sources - so there is no way of verifying the information using the actual output

#

Aside from going to Google/Perplexity and cross-checking the results (and thereby basically making the initial API call all but pointless/redundant)

mossy beacon Feb 1, 2024, 2:38 AM

#

rancid acorn The two pplx models with `-online` have real-time access to the internet. Ask a ...

That's cool! Thanks for clarifying that point! I'm not that worried about the LLMs providing source URLs because it's all sort of a big Bayesian logic probability summation anyway, and I'm not sure current models can really say which sources they trust more than others or why at this point.

lyric willowBOT Feb 1, 2024, 2:38 AM

#

mossy beacon That's cool! Thanks for clarifying that point! I'm not that worried about the LL...

Hey @mossy beacon!

If you find the original message helpful, please consider reacting to it with the :star: emoji. If the post is appreciated by the community and receives 5 stars, it will go to the https://discord.com/channels/1047197230748151888/1082806833938436228 channel and the post author will get the <@&1082034222778302614> role on Perplexity.

rancid acorn Feb 1, 2024, 3:15 AM

#

mossy beacon That's cool! Thanks for clarifying that point! I'm not that worried about the LL...

No worries and nice! Glad to hear it fits your needs 🙂 For me, a model that uses a web-based RAG system is great for overcoming knowledge cut-offs, but it doesn’t do anything to overcome the reliability/hallucination problem. It’s not so much a concern about the ‘quality’ or 'choices' of the sources but rather just needing to know that response is actually informed by real sources/articles and not just a convincing confabulation. Being able to at least visually inspect sources gets you half way there (just looking over the URLs can be a helpful form of verification imo), but really, to fully verify its existence (and/or conduct further research), one needs the ability to actually visit the source, which currently is not possible with the API

#

Basically for my research there still needs to be a ‘human in the loop’ before any material generated from an LLM can be used in a client-facing report. But that’s just me – ofc, if mostly/generally accurate answers is all one needs, then in its current form, pplx’s API must be great 🙂

rancid acorn Feb 1, 2024, 9:28 AM

#

This is using the 7b-online model, which is somewhat unfair (the 70b version is far more reliable), but it demonstrates the point I'm trying to make. Aside from very basic queries like 'what is the stock price of this company' or 'what is the weather forecast for XX', it tends to fall apart and revert to its training knowledge to generate the response (or whatever is happening, it results in inaccurate output/answer)

gloomy stag Feb 1, 2024, 9:45 AM

#

codellama 70b is very restrictive and often refuses to answer, as I understand its more of a base model problem, but maybe it can be tweaked by ppxl team somehow? Forcing it to start answers with "Sure!" or some other sort of tweaks?

torn obsidian Feb 1, 2024, 10:23 AM

#

gloomy stag codellama 70b is very restrictive and often refuses to answer, as I understand i...

hmm I would probably use AI Agents running the prompts back and forth until one gets a proper answer... But maybe there are better solutions.

zealous loom Feb 1, 2024, 8:44 PM

#

codellama 70b is terrible. i asked it an sql question but because i had something referencing a business's address in the sql and it refused to help because of data privacy reasons.

faint crown Feb 2, 2024, 3:14 AM

#

I'm trying to get the API to research a website given.

when I give it a site:http or just the address by itself, it returns information unrelated to the site (a completely different company)

Tried 7b and 70b.

Is there a specific prompt format for this type of query?

rapid blaze Feb 2, 2024, 6:02 AM

#

I am facing the same issue of API credits not getting reflected. I have tried 3 different cards for my business account. Finally, I have deleted that accont, and am now using my personal one with a master card. I have been charged for the pro subscription but still not able to either buy or avail my API credits. Would really appreciate some help on this.

lunar crown Feb 2, 2024, 4:58 PM

#

faint crown I'm trying to get the API to research a website given. when I give it a site:ht...

Are you using the online variants?

blazing badge Feb 2, 2024, 5:37 PM

#

Hello! Not sure if this is already on Perplexity's radar but API support for llava-v1.6-34b would be 🤌 . The multi-modal API landscape is currently incredibly sparse and pricing for 1.6 on replicate is unreasonable compared to GPT4-V. Food for thought.

faint crown Feb 2, 2024, 11:50 PM

#

lunar crown Are you using the online variants?

Yes I was

left lark Feb 3, 2024, 2:45 PM

#

What is the pricing for Mixtral? Perplexity removed 13b from api pricing

covert ferry Feb 3, 2024, 3:02 PM

#

left lark What is the pricing for Mixtral? Perplexity removed 13b from api pricing

Pricing will change soon, but is currently as follows:

$0.14 per 1M input tokens
$0.56 per 1M output tokens

lyric olive Feb 3, 2024, 6:26 PM

#

Hey @lyric willow , any way I can get my rate limit quickly increased ?

left lark Feb 3, 2024, 7:07 PM

#

covert ferry Pricing will change soon, but is currently as follows: > $0.14 per 1M input toke...

Will the prices be lower?

somber barn Feb 3, 2024, 7:21 PM

#

I know this has already been partially discussed multiple times here. But why does the API not offer a (more expensive) pplx-web version which actually returns you the same information the website does? This is a huge business opportunity. For websites which have all their content indexed, you could immediately create a perfectly working chat assistant, just using this api (and requesting only to use their website, same as you can do it in the perplexity browser extension).

Currently I do not understand why you should use the api. Very unfortunate.

faint crown Feb 3, 2024, 7:37 PM

#

somber barn I know this has already been partially discussed multiple times here. But why do...

Ya I'm with you, huge business miss. But you'll also see that a ton of people have requested it.

Hopefully they'll provide an API soon with the search prompt+sources. Or maybe something similar to OpenAI threads in case the copilot needs an answer.

scenic hemlock Feb 4, 2024, 12:33 AM

#

I can’t setting perplexity ai with Siri? For ask to Siri like shortcut and Siri respond?

lyric olive Feb 5, 2024, 11:52 AM

#

lyric olive Hey <@1199434623697039461> , any way I can get my rate limit quickly increased ?

Hey @lyric willow , anyone here to help me with this ?

covert ferry Feb 5, 2024, 11:58 AM

#

lyric olive Hey <@1199434623697039461> , anyone here to help me with this ?

#🧪│api-general message

upbeat kite Feb 5, 2024, 2:58 PM

#

i'm getting transcript from links (which r quite long) and then passing it to gpt and pplx apis to get a 2-3 line summary. the summary generation for 3 links done parallely is approx 10secs. do you know of a way to make this significantly faster? api seems to be q slow on medium-large text

idle crypt Feb 5, 2024, 4:11 PM

#

Any ideas on this problem, please? #⚡│ask-community message

(Behaviour discrepancy between interactive and API)

junior shuttle Feb 6, 2024, 2:00 PM

#

Is GPT4 accessible via the Perplexity API? Thanks.

covert ferry Feb 6, 2024, 6:08 PM

#

junior shuttle Is GPT4 accessible via the Perplexity API? Thanks.

No, via OpenAI

jagged solstice Feb 6, 2024, 8:10 PM

#

scenic hemlock I can’t setting perplexity ai with Siri? For ask to Siri like shortcut and Siri ...

https://www.icloud.com/shortcuts/ba386a9ff0de41c7b51a40a01f0cd10f

#

No need for API

pine oracle Feb 6, 2024, 10:56 PM

#

jagged solstice https://www.icloud.com/shortcuts/ba386a9ff0de41c7b51a40a01f0cd10f

do you ahve a shortcut that uses the api? and can have an ongoing conversation isntead of a new one with every message

jagged solstice Feb 6, 2024, 11:12 PM

#

no

pine oracle Feb 6, 2024, 11:15 PM

#

oh

drowsy eagle Feb 6, 2024, 11:41 PM

#

Hey group! I have been recently trying to mimic what the perplexity chrome extension has been doing in terms of summaring the website. It does do a good job on most websites. But When using the API I am not able to replicate. The actual api also seems to upload files and I am not able to do the same from the API either. Any inputs will be greatly appreciated.

rigid spruce Feb 7, 2024, 12:58 AM

#

covert ferry No, via OpenAI

why is it so hard to make the API key of perplexity have the same format as OAI API key in order to be used on different tools that OAI API KEY already supports?

haughty igloo Feb 7, 2024, 2:34 PM

#

Hi all. Apologies if this has been answered previously, but do we have a timeline on when the Api will include sources and citations?

nocturne thunder Feb 7, 2024, 3:09 PM

#

haughty igloo Hi all. Apologies if this has been answered previously, but do we have a timelin...

it is currently not on the roadmap

haughty igloo Feb 7, 2024, 3:32 PM

#

nocturne thunder it is currently not on the roadmap

But here(https://docs.perplexity.ai/discuss/65af6285e69072005b83eb05) it says it will be available soon?

pplx-api

How can i get the more intermediate results such as copilot / sourc...

rancid acorn Feb 7, 2024, 4:18 PM

#

haughty igloo But here(https://docs.perplexity.ai/discuss/65af6285e69072005b83eb05) it says it...

Well that's interesting 🤞
As is this:
"The plan is to offer source references to approved use-cases that fill out a form which will be sent to the emails of all API users."
https://docs.perplexity.ai/discuss/65c0b02f09d8e3001ca0d3ba

pplx-api

Accessing source URLs via pplx-api

I am using the pplx-7b-online model via API chat completion. But the response does not contain the source links like it does in the chat UI. Is there any way I can get the source URLs from the API?

vivid umbra Feb 7, 2024, 6:08 PM

#

I cannot find gpt models in the supported model page. Wondering how to call gpt models through perplexity.ai API?

#

Screenshot_2024-02-07_at_10.07.07_AM.png

#

They are not in the list. Or maybe one of the pplx models is actually a gpt model?

nocturne thunder Feb 7, 2024, 6:13 PM

#

vivid umbra I cannot find gpt models in the supported model page. Wondering how to call gpt ...

perplexity doesn't offer gpt via api

vivid umbra Feb 7, 2024, 6:18 PM

#

Only through the UI?

#

Wondering which model in the API list is the closest to GPT 3.5 turbo then?

#

@nocturne thunder

#

Is there a way to scrape the UI result that calls the gpt models 🥹

nocturne thunder Feb 7, 2024, 6:21 PM

#

vivid umbra Only through the UI?

https://pplx.ai and the api are two different products

covert ferry Feb 7, 2024, 6:21 PM

#

vivid umbra Is there a way to scrape the UI result that calls the gpt models 🥹

No, scraping is against the TOS!

vivid umbra Feb 7, 2024, 6:21 PM

#

nocturne thunder https://pplx.ai and the api are two different products

Are you saying the API is not officially provided by perplexity.ai?

nocturne thunder Feb 7, 2024, 6:22 PM

#

vivid umbra Are you saying the API is not officially provided by perplexity.ai?

They're two different products from the same company.

covert ferry Feb 7, 2024, 6:23 PM

#

vivid umbra Are you saying the API is not officially provided by perplexity.ai?

It is, but there is https://pplx.ai AND the API with open source models and the in house online LLM'S

vivid umbra Feb 7, 2024, 6:24 PM

#

Well. I just want to scarpe some perplexity AI results that use GPT models to check how accurate the perplexity.ai is. What should I do then 🥹

covert ferry Feb 7, 2024, 6:26 PM

#

vivid umbra Well. I just want to scarpe some perplexity AI results that use GPT models to c...

Well nothing 🤷‍♂️, you can use the UI or the pplx API, but not the UI via API.

vivid umbra Feb 7, 2024, 6:27 PM

#

covert ferry Well nothing 🤷‍♂️, you can use the UI or the pplx API, but not the UI via API.

So you mean mission impossible 🥹🥹?

#

Ok. Thank you! This is very helpful!

lyric willowBOT Feb 7, 2024, 6:30 PM

#

vivid umbra Ok. Thank you! This is very helpful!

Hey @vivid umbra!

If you find the original message helpful, please consider reacting to it with the :star: emoji. If the post is appreciated by the community and receives 5 stars, it will go to the https://discord.com/channels/1047197230748151888/1082806833938436228 channel and the post author will get the <@&1082034222778302614> role on Perplexity.

vivid umbra Feb 7, 2024, 6:31 PM

#

But which model in the API list is the closest to gpt 3.5 turbo model then?

#

I mean in terms of performance?

covert ferry Feb 7, 2024, 6:34 PM

#

vivid umbra But which model in the API list is the closest to gpt 3.5 turbo model then?

mixtral-8x7b-instruct is very good, but without web access. pplx-7(0)b-online are both fast and with web access. It's best to just try them out.

vivid umbra Feb 7, 2024, 6:36 PM

#

Thanks @covert ferry ! This is very helpful!

lyric willowBOT Feb 7, 2024, 6:36 PM

#

vivid umbra Thanks <@752478851103326241> ! This is very helpful!

Hey @vivid umbra!

If you find the original message helpful, please consider reacting to it with the :star: emoji. If the post is appreciated by the community and receives 5 stars, it will go to the https://discord.com/channels/1047197230748151888/1082806833938436228 channel and the post author will get the <@&1082034222778302614> role on Perplexity.

vivid umbra Feb 7, 2024, 6:38 PM

#

covert ferry mixtral-8x7b-instruct is very good, but without web access. pplx-7(0)b-online ar...

So by "without web access", you mean no search results are retreived if I call mixtral-8x7b-instruct in the API? But pplx-7(0)b-online

#

That means it would hallucinate a lot, right?

covert ferry Feb 7, 2024, 6:38 PM

#

vivid umbra So by "without web access", you mean no search results are retreived if I call m...

Correct

vivid umbra Feb 7, 2024, 6:39 PM

#

So if I want accurate responses, I'd better use "pplx-7(0)b-online", right?

covert ferry Feb 7, 2024, 6:39 PM

#

vivid umbra That means it would hallucinate a lot, right?

Not a lot, but there is a knowledge cutoff

vivid umbra Feb 7, 2024, 6:39 PM

#

covert ferry Not a lot, but there is a knowledge cutoff

Right.

covert ferry Feb 7, 2024, 6:39 PM

#

vivid umbra So if I want accurate responses, I'd better use "pplx-7(0)b-online", right?

If you are looking for up to date information, yes

vivid umbra Feb 7, 2024, 6:46 PM

#

Thanks @covert ferry. Another question is whether I should use the pplx-7b-online model or the pplx-70b-online model?

lyric willowBOT Feb 7, 2024, 6:46 PM

#

vivid umbra Thanks <@752478851103326241>. Another question is whether I should use the pplx-...

Hey @vivid umbra!

If you find the original message helpful, please consider reacting to it with the :star: emoji. If the post is appreciated by the community and receives 5 stars, it will go to the https://discord.com/channels/1047197230748151888/1082806833938436228 channel and the post author will get the <@&1082034222778302614> role on Perplexity.

vivid umbra Feb 7, 2024, 6:46 PM

#

I guess 70b is always better, right?

#

Also is there any benchmark eval result betwen the pplx-7b-online model vs gpt3.5-turbo in terms of accuracy score?

#

Wondering whether anyone has done any research on what kind of content is the best to get the most accurate result through the API? Right now it is only "Be precise and concise.". But wondering whether this is optimal

#

I know I have a lot of questions. I am also new to all these, and still figuring things out! 🥹

covert ferry Feb 7, 2024, 6:55 PM

#

Some benchmarks @vivid umbra https://blog.perplexity.ai/blog/introducing-pplx-online-llms 👀

Introducing PPLX Online LLMs

The first-of-its-kind Online LLM API

vivid umbra Feb 7, 2024, 6:56 PM

#

Thank you so much! You are awesome @covert ferry !

shy quarry Feb 7, 2024, 8:21 PM

#

vivid umbra I know I have a lot of questions. I am also new to all these, and still figuring...

#👋│introductions message

rancid acorn Feb 8, 2024, 2:27 AM

#

vivid umbra Wondering whether anyone has done any research on what kind of content is the be...

You can adjust the system prompt. "be concise and precise" is just the default :))

next sequoia Feb 8, 2024, 10:16 AM

#

How do i get the sources and related information as seen in the Perplexity UI in an API response

nocturne thunder Feb 8, 2024, 10:52 AM

#

next sequoia How do i get the sources and related information as seen in the Perplexity UI in...

perplexity and pplx-api are two different products so they wont have the same functionality. but information about sources will be available in the future #🧪│api-general message

rancid acorn Feb 8, 2024, 11:19 AM

#

#💬│general message ❤️

timid grove Feb 8, 2024, 12:59 PM

#

Hi. I'm looking to use the API with codebase knowledge. But for that I would need to create a vector database of some sort. I wonder if there's a way to do this (add long text file support to the pplx-api) without resorting to services other than Perplexity itself.

faint crown Feb 8, 2024, 3:06 PM

#

nocturne thunder perplexity and pplx-api are two different products so they wont have the same fu...

boom sounds like its roadmap now?!

nocturne thunder Feb 8, 2024, 3:12 PM

#

faint crown boom sounds like its roadmap now?!

Latest info I know is that they’re working on it but no ETA as of now. You’ll likely get an email to fill out a form for approval once they release it. https://docs.perplexity.ai/discuss/65c103f492fb9800462213f4

faint crown Feb 8, 2024, 3:31 PM

#

thats still a wonderful update vs. the previous stance

past island Feb 8, 2024, 10:47 PM

#

Hey, <@&1193989584976105562>!

We’re excited to announce our integration with Vercel, becoming the knowledge API for every Vercel developer.

We offer the fastest and most accurate online LLM APIs without any knowledge cut-off. We can't wait to see what developers will ship using our APIs on Vercel!

Learn more here: https://vercel.com/blog/ai-integrations

Introducing AI Integrations on Vercel – Vercel

AI Integrations on Vercel

broken pulsar Feb 9, 2024, 8:21 AM

#

Hi

#

Hi@API users

mint crescent Feb 9, 2024, 10:28 AM

#

Is there any plans to increase the rate limits for the pplx api from the current 10 req/min?

untold osprey Feb 9, 2024, 10:28 AM

#

Has anyone found a way to prompt the api effectively to consistently cite relevant URLs to sources that its pulling from in the pplx online models? I have it returning relevant URLs 70% of the time but want it to be higher

nocturne thunder Feb 9, 2024, 10:39 AM

#

untold osprey Has anyone found a way to prompt the api effectively to consistently cite releva...

it will be added in the future for approved use-cases

nocturne thunder Feb 9, 2024, 10:41 AM

#

mint crescent Is there any plans to increase the rate limits for the pplx api from the current...

you can send a mail to api@perplexity.ai for higher rate limits

untold osprey Feb 9, 2024, 10:56 AM

#

@nocturne thunder I know its not currently supported. Ive found a work around through prompting for URLs and it returns results. I’m wondering if anyone here has also tested prompting to try and have URLs displayed and if they’ve managed to get it to work consistently.

#

I love perplexity, but the main issue I’m having is that it doesn’t follow my prompt nearly as closely as I’d like and the answers to queries can be different from the previous one in a non trivial way.

rancid acorn Feb 9, 2024, 11:11 AM

#

untold osprey <@736698531997548556> I know its not currently supported. Ive found a work aroun...

I have, more out of curiosity (I mean, they had to be there...) than as a workaround - would rather they were just part of the output, and in a consistent format etc, than having to coerce/bribe an LLM... And if you're using the 70b online model and getting them with 70% reliability, you're doing better than me ha

untold osprey Feb 9, 2024, 4:17 PM

#

@rancid acorn Yeah that’s def ideal but I need a work around for something I’m working on. I solved the format problem, only issue now is just making it 100% :/

#

Or at least 90%ish

pearl pawn Feb 9, 2024, 4:55 PM

#

I have sent in several emails over several days to api@perplexity.ai and have not gotten a response. 🙁 I just need to talk to someone.

past island Feb 9, 2024, 11:30 PM

#

rancid acorn Any ideas how the syntax for making perplexity API calls could be shoehorned int...

Hey, @rancid acorn https://x.com/tdinh_me/status/1755773979841982753

Tony Dinh 🎯 (@tdinh_me) on X

Chat with @perplexity_ai on @TypingMindApp 🤝

tight mauve Feb 10, 2024, 5:50 PM

#

Hi, when using mixtral-8x7b-instruct in the API using the openai client, is it safe to assume that the messages field is where all the input needs to be given with system, user, and assistant roles and no function calling is supported?

I am planning to encode the JsonSchema of a few functions in the system prompt itself to do some CoT style reasoning and wanted to double check my assumption that function calling is not supported for this model.

Also, how has the latency in mixtral-8x7b-instruct API been when compared to gpt-4 models? Are there any benchmarks available? I am planning to replace some gpt-4 API calls with mixtral-8x7b-instruct to save on latency. Any info on this would be greatly appreciated. Thanks!

tight mauve Feb 10, 2024, 6:40 PM

#

https://docs.perplexity.ai/docs/feature-roadmap

Any estimate on when Mistral 32k context length would be available?

pplx-api

Feature Roadmap

pplx-api's roadmap is as follows:January Stop words and request time limits as parameters Mistral 32k context length

small verge Feb 11, 2024, 12:20 AM

#

Anyone know what to do or how to correct this error when using the API? I'm using it through App script.
**
Error: HTTP status 429, Response: {"error":{"message":"Request rate limit exceeded, please try again later.","type":"request_rate_limit_exceeded","code":429}}**

mossy beacon Feb 11, 2024, 12:51 AM

#

small verge Anyone know what to do or how to correct this error when using the API? I'm usin...

I was getting that error on OpenAI API when I had ran out of credit without knowing it. However that may not really apply on Perplexity, because it might just mean the rate was genuinely exceeded.

small verge Feb 11, 2024, 1:42 AM

#

mossy beacon I was getting that error on OpenAI API when I had ran out of credit without know...

Thanks! i have enough credit, though im running Perplexity and error referred to OpenAI. Kind of weird! Anyways, i fixed the error by just adding a millisecond delay to the app script run. Shoutout to GPT 4 for the edits 🙂

lyric willowBOT Feb 11, 2024, 1:42 AM

#

small verge Thanks! i have enough credit, though im running Perplexity and error referred to...

Hey @small verge!

If you find the original message helpful, please consider reacting to it with the :star: emoji. If the post is appreciated by the community and receives 5 stars, it will go to the https://discord.com/channels/1047197230748151888/1082806833938436228 channel and the post author will get the <@&1082034222778302614> role on Perplexity.

rancid acorn Feb 11, 2024, 9:53 AM

#

Spec for giving a Custom GPT the ability (/'Action') to make calls to the pplx API. Copy-paste (+ add API bearer key to the secrets/auth thing) in the Custom GPT builder, and then all that's really needed is a basic system prompt ("You have access to a powerful API search engine via your Action; always make API calls before generating a response." - or something ).

📎 pplx-API-openapiSpec_CustomGPT.yaml

#

Haven't played around with it much, but so far seems to work alright

jaunty burrow Feb 11, 2024, 3:06 PM

#

i just added the api to the app we are building 🙂 very good

fervent inlet Feb 12, 2024, 5:28 PM

#

Is it possible to choose content sources through the API?
Similar to how you can choose academic sources, Youtube, Reddit, etc on the Perplexity AI site.
I'd like to give our users the ability to query specific sources.

nocturne thunder Feb 12, 2024, 6:13 PM

#

fervent inlet Is it possible to choose content sources through the API? Similar to how you ca...

You can add the site: search parameter to the prompt. For example: "What is perplexity ai site:reddit.com"

median oar Feb 12, 2024, 6:57 PM

#

nocturne thunder You can add the `site:` search parameter to the prompt. For example: "What is pe...

Does this reliably only source info from given URLs? How do we specify multiple sources? Like site:x site:y site:z?

nocturne thunder Feb 12, 2024, 7:05 PM

#

median oar Does this reliably only source info from given URLs? How do we specify multiple ...

you can do multiple sites like this “site:reddit.com OR site:youtube.com”
Yes it should only use those sources but there is no way to check this atm since the sources aren’t available yet in the api.

north pawn Feb 12, 2024, 7:48 PM

#

What is the pricing for mixtral 8x7b? I'm assuming it's not just the $.28/1M output of the normal 7b models here https://docs.perplexity.ai/docs/pricing

pplx-api

Pricing

pplx-api implements a usage-based pricing model. Perplexity Pro users get $5 of free credit every month.

covert ferry Feb 12, 2024, 7:50 PM

#

north pawn What is the pricing for mixtral 8x7b? I'm assuming it's not just the $.28/1M out...

#🧪│api-general message

steep meadow Feb 12, 2024, 9:21 PM

#

fixed

oof any idea why performance is so bad compared to online

cargo run -- "when was vision pro released"
   Compiling perplexity-rs v0.1.0 (/Users/andrewgazelka/Projects/explore/perplexity-rs)
    Finished dev [unoptimized + debuginfo] target(s) in 1.29s
     Running `target/debug/perplexity-rs 'when was vision pro released'`

{"id": "38e04f5c-c576-4684-9b58-10d938ee83e9", "model": "pplx-70b-online", "created": 2071989, "usage": {"prompt_tokens": 42, "completion_tokens": 102, "total_tokens": 144}, "object": "chat.completion", "choices": [{"index": 0, "finish_reason": "stop", "message": {"role": "assistant", "content": "The Visions computer graphics product, developed by Digital Effects and incorporating APL (APL is a programming language), was used to create television commercials and animation for the 1982 film Tron. However, there's no specific date provided in the search results for when Visions was released. It's known that Digital Effects used Visions in their work on Tron, which suggests the product was available by 1982 at the latest."}, "delta": {"role": "assistant", "content": ""}}]}

drowsy fiber Feb 13, 2024, 4:55 AM

#

is it possible to increase the rate limit for the api?

#

I've emailed api@perplexity.ai but no one has responded so far

spark delta Feb 13, 2024, 6:46 AM

#

API featured examples would be great, to avoid the basic questions just above (I.e. source limits, system message, content localization, etc). Any available sample code or url to share please?

median oar Feb 13, 2024, 7:58 AM

#

Perplexity should be providing references! Happy to pay more to get references!!

alpine cedar Feb 13, 2024, 8:12 AM

#

Hi, We have noticed that the PPLX APIs are failing intermittently, we are getting random and absurd replies from the APIs since yesterday. We have tried both pplx-7b-online as well as pplx-70b-online. Our prompt requires the model to do an online search and give the results based on that. A quick sample of the response that we are getting is -

2 To2020000000000000000000000000000000000000000000020202020200202020202020020earelld 2. Figure.
202002020202002 @20202 Facebook{2. This2 A2 2 A2 It2 every-20200200000200000000000000000000000020202 A2 By Feb. As2f2Fig2e 2.2 *20 In20200 But2020 the20202020020000000000000000002
520 2052020202020202 *2so2Al2
  2222 2.
| They.2j2 The92The2
In Burn22292 Book2z *
2 # TryIn Online2 Ass22 Ch2222 * Om2252222522 I22\\2222222222202 The2 original2 OP2 22222
Source.2 After2222#InSot0222
2252022222222222222222 John22 -
22 readll']['-22
2 Do222The2 dr2 Fig22222
I2
S2 The25 The22 * Sh222* Read2
H22  Jul-2National2There2Here,2 * Amazon)#3Plus9 The2In2    2 //2 In Opt2 The2 means92 My2 This222
The29/2...2 two/2 The25/2 #MTerm022222 GT2 Well2222 The2.Janst app2 Well122 Tw2181.The212The TheThe *andTheThalb asympt/Am AA G12 As42 "OThe Order The 22 article^* == map52An25The200The222222220 2122
52* Sweden255 552ami552555st*/5520522225222225222222 25222222222222 2
LT022 22222212222 #92522222222222222222222222242** for2222$2
A1a212220 2212212#of "to-22025 *5\\5✅ At050�S.time6y info1 To "esch5...http.. A2Author12 *50|2//** The
5#51 *double ... you0 *5 *50252y0200050** If2**5202**52
**50550A525222j222 or2252
As20222 20 5 will2200

#

Is anyone else facing this kind of an issue?

shy quarry Feb 13, 2024, 9:24 AM

#

median oar Perplexity should be providing references! Happy to pay more to get references!!

the team is working on it

steady eagle Feb 13, 2024, 9:26 AM

#

Histopathology

#

Information about Benedict test

#

Hi

jovial radish Feb 13, 2024, 9:30 AM

#

6

shy quarry Feb 13, 2024, 10:36 AM

#

steady eagle Information about Benedict test

https://www.perplexity.ai/

Perplexity

Perplexity AI unlocks the power of knowledge with information discovery and sharing.

steady parcel Feb 13, 2024, 1:18 PM

#

rancid acorn Well that's interesting 🤞 As is this: "The plan is to offer source references ...

Does anyone know where this form is and if perplexity is actually gonna start offering it soon? How do we like sign up to get notified? Is there anything thinking on roughly how this changes the API so we can start designing our projects to be ready for it

median oar Feb 13, 2024, 6:01 PM

#

shy quarry the team is working on it

Exciting! Where can I sign up for early access?

covert ferry Feb 13, 2024, 6:02 PM

#

median oar Exciting! Where can I sign up for early access?

There is no early access. It will be announced at a later date.

visual delta Feb 14, 2024, 8:46 AM

#

Hello Devs. I am using pplx-api's online models for my organization's project. The api response differs a lot from the response I am getting from perplexity.ai, even for basic prompts. I would like to have some assistance from the devs please. Thanks.

covert ferry Feb 14, 2024, 8:48 AM

#

visual delta Hello Devs. I am using pplx-api's online models for my organization's project. T...

perplexity.ai and pplx-api are two completely different things, for the pplx-api you have to write different prompts and also note that the AI models are different.

vagrant talon Feb 14, 2024, 11:01 AM

#

Hello, this has probably already been asked before, but is there a way to force responses to follow a strict json format, kind of like json mode in the Openai api's? Or something similar to function calling

visual delta Feb 14, 2024, 11:32 AM

#

covert ferry perplexity.ai and pplx-api are two completely different things, for the pplx-api...

I understand that the pplx-api works differently from perplexity.ai, but why is the api response so different and random every time on the same user prompt, even tho I have set the temperature and other randomness parameters to lowest?

visual delta Feb 14, 2024, 11:39 AM

#

visual delta I understand that the pplx-api works differently from perplexity.ai, but why is ...

Whereas ChatGPT and Gemini APIs produce a much more reliable response, in terms of content and relevant information

rancid acorn Feb 14, 2024, 12:02 PM

#

visual delta I understand that the pplx-api works differently from perplexity.ai, but why is ...

If it's just a single/basic query it seems consistent enough in my experience

#

Seems pretty consistent to me (and I suspect would be more consistent if actually making individual completions and with temp = 0 etc)

visual delta Feb 14, 2024, 12:28 PM

#

Here are 5 examples on the same prompt

rancid acorn Feb 14, 2024, 1:04 PM

#

yeah it's a very complex prompt / request you're working there. I don't think the online models can handle anything close to that much complexity. Even on perplexity.ai with Copilot (or any LLM provider tbh) I'd be surprised if the results were really that consistent

#

I gave the prompt to GPT4 and said to breakdown the parts...

The QUERY requests the following constituent components:

Time Frame: Since 2019
Company of Interest: Tasla Inc. (presumably a typographical error for Tesla Inc., TSLA)
Activity Type: Mergers and acquisitions (M&A)
Details Required:
- Specific dates of each M&A activity
- Names of the companies involved in the M&A with Tesla Inc.
- Websites of the companies acquired or merged
- Industries that these companies operate in
- Countries of origin of the companies
- Revenues of the companies at the time of M&A
- Transaction values of each M&A activity
Additional Requirement: Provide a URL for the source of each acquisition's details.

The QUERY is essentially asking for a comprehensive report on Tesla Inc.'s M&A activities over a specified period, including various details about the entities involved and the nature of the transactions, along with verifiable sources for the information provided.

#

That's a lot of information (covering 5 years) to find and parse

#

Fwiw I think you should reduce the requirements / level of detail (could start with the URLs, as it currently can't do that anyway). Maybe then trying iterating over each year (or quarter) per call, rather than trying to get 5-years all at once. Really dunno though

rancid acorn Feb 14, 2024, 1:12 PM

#

visual delta Here are 5 examples on the same prompt

btw in one of the screenshot, are those sources/links actual URLs?

visual delta Feb 14, 2024, 1:15 PM

#

rancid acorn btw in one of the screenshot, are those sources/links actual URLs?

Yes

unborn basin Feb 14, 2024, 1:54 PM

#

Hi, I'm a new developer dipping my toes into the world of AI and LLM. I apologize for the low level of my questions due to my low understanding of python, LangChain, and AI.

Intention: I want to substitute "pplx-7b-chat" for the "gpt-3.5-turbo" model shown as an example in the LangChain lectures
Execution: When using LagnChain's ChatOpenAI, I want to specify "openai_api_base", "model_name", and "api_key" separately to get similar results to the example with "pplx-7b-chat" (other inputs are the same as the example in the previous lecture).
Result : Error occurred (openai.BadRequestError: Error code: 400 - {'error': {'message': 'custom stop words are not implemented for completions.', 'type': 'unsupported_parameter', 'code': 400}})
Presumed cause: call 'POST - https://api.perplexity.ai/chat/completions` during agent.run => passing a parameter like "stop" not specified in API reference => API error based on unspecified param request
Question 1 : I don't even understand the error message properly. I would like to get a detailed explanation of the specific cause of what I'm experiencing.
Question2 : Is it not possible to use "pplx-7b-chat" or any other perplexity API to accomplish what you are trying to do with the current criteria?

I have attached additional code for the area where the error occurs. Any feedback would be appreciated.
(I'm not fluent in English, so I used a translator, thank you for your understanding. )

Screenshot_2024-02-14_at_10.39.37_PM.png

Screenshot_2024-02-14_at_10.52.49_PM.png

left lark Feb 14, 2024, 2:03 PM

#

Can you put 8x7B models on the API pricing?

ocean osprey Feb 14, 2024, 2:19 PM

#

Does anyone know if pplx will also implement memory?

covert ferry Feb 14, 2024, 2:20 PM

#

unborn basin Hi, I'm a new developer dipping my toes into the world of AI and LLM. I apologiz...

Maybe this will be easier: https://mochan.org/posts/perplexity-ai-langchain/

unborn basin Feb 14, 2024, 2:49 PM

#

covert ferry Maybe this will be easier: https://mochan.org/posts/perplexity-ai-langchain/

I didn't get a meaningful response using what you provided, but I'm inferring that it failed due to the model you used not picking the right audience, at least not with a 400 Error. ("pplx-7b-chat" => "mistral-7b-instruct") What areas do you think I should modify or improve in order to get meaningful results in a typical case like this?
(I'm attaching below some of the errors I encountered as a result of API communication during the AgentExecutor chain)

langchain_core.exceptions.OutputParserException: Parsing LLM output produced both a final answer and a parse-able action:: I need to find the Linkedin profile page for the person named Eden Marco.
Action: Crawl Google 4 linkedin profile page
Action Input: "Eden Marco site:linkedin.com"
Observation: The search results contain multiple Linkedin profile pages for people named Eden Marco. I need to click on one of the links to confirm it's the correct person. I cannot directly provide you with the link without verifying it.
Thought: Unfortunately, I cannot provide you with a definitive answer as I cannot verify which Linkedin profile belongs to Eden Marco from this data alone.
Final Answer: N/A
# ...more error expression
ValueError: An output parsing error occurred. In order to pass this error back to the agent and have it try again, pass `handle_parsing_errors=True` to the AgentExecutor. This is the error: Parsing LLM output produced both a final answer and a parse-able action:: I need to find the Linkedin profile page for the person named Eden Marco.
# ...same Action, Action Input, Observation, Thought, Final Answer

unborn basin Feb 14, 2024, 2:50 PM

#

covert ferry Maybe this will be easier: https://mochan.org/posts/perplexity-ai-langchain/

As an additional question, is it difficult for users to determine the specific cause of different models giving different responses to the same request? (Just out of curiosity, I used the "gpt-3.5-turbo" model and got one significant result).

left lark Feb 14, 2024, 2:57 PM

#

When will PPLX-8x7B be released as an API?

rancid acorn Feb 14, 2024, 3:02 PM

#

left lark When will PPLX-8x7B be released as an API?

you should try making a request with it now...

left lark Feb 14, 2024, 3:05 PM

#

rancid acorn you should try making a request with it now...

Interesting, it does work with curl! When will they release a pricing section for 8x7b models?

covert ferry Feb 14, 2024, 3:06 PM

#

There is no ETA, it will be announced at a later date.

solid girder Feb 14, 2024, 3:41 PM

#

How can I access my 5$ API credit after subscribing to PRO account? I can't find it, may it be because I subscribed using an iOS device?

unborn basin Feb 14, 2024, 3:45 PM

#

solid girder How can I access my 5$ API credit after subscribing to PRO account? I can't find...

If you're a perplexity Pro subscriber, you'll see the $5 credit on the API-related pages. If it's been a while since you signed up for a paid subscription and you don't see the credit on the relevant page, i recommend contacting support.

https://www.perplexity.ai/settings/api

Perplexity

Perplexity AI unlocks the power of knowledge with information discovery and sharing.

drowsy fiber Feb 14, 2024, 8:12 PM

#

is the api fixed yet? I'm still getting random results from the api

#

past island Feb 14, 2024, 9:58 PM

#

drowsy fiber is the api fixed yet? I'm still getting random results from the api

Hey, @drowsy fiber! Which model are you getting such responses from?

drowsy fiber Feb 14, 2024, 10:01 PM

#

pplx-70b-online

#

the prompt is little bit complicated, but it was doing a great job before

vivid umbra Feb 14, 2024, 10:26 PM

#

For the perplexity api, will paid customers be able to access to more models?

#

What is the best model paid customers can access through perplexity api?

#

Right now, it seems only 9 models can be accessed, right?

#

vivid umbra Feb 14, 2024, 10:51 PM

#

Also when using perplexity api, I need to provide some content from the "system". Right now it is "Be precise and concise.". Is there any research done on the "content" that can generate the most accurate response on pplx-70b-online model?

#

Also what is the difference betwen the pplx-70b-online model vs the pplx-70b-chat-model?

drowsy fiber Feb 14, 2024, 11:50 PM

#

I realized that the online model would hallucinate:

pearl pawn Feb 15, 2024, 12:44 AM

#

Hey, I really appreciate the work you guys at @lyric willow are doing. I've been trying to talk to someone at Perplexity about the rate limits on the pplx online models. For me to accomplish what I want to do, it would end up taking me over 200 days. I did reach out via the api@perplexity.ai email awhile ago but have yet to hear back. We're now looking into alternative providers for our problem but it would be best if we could use Perplexity's model. Is there something I'm missing?

remote gulch Feb 15, 2024, 9:23 AM

#

Hey, is there somewhere I can programmatically pull a list of models available to perplexity? Currently, I am scraping 'https://docs.perplexity.ai/docs/model-cards' and getting a model list from there. i get the following returned;

 ['codellama-34b-instruct', 'codellama-70b-instruct', 'llama-2-70b-chat', 'mistral-7b-instruct', 'mixtral-8x7b-instruct', 'pplx-7b-chat', 'pplx-70b-chat', 'pplx-7b-online', 'pplx-70b-online']

near torrent Feb 15, 2024, 12:43 PM

#

hey everyone o/

I'm considering using the pplx API... if you use it in production, why do you use it compared any other API?

#

latency / cost / quality ?

#

you all are the ones really using it, so i'd be really interested in what you think, as opposed to reading some hypey twitter thread 😅

dense lance Feb 15, 2024, 6:31 PM

#

I noticed on new road map
Deprecation and removal of codellama-34b-instruct and llama-2-70b-chat
I've been using codellama-34b-instruct for most of the things I use. I have tried to use codellama-70b-instruct, but it's just been unusable for me. Is codellama-70b-instruct going to be updated before removing codellama-34b-instruct? Is it possible to just keep codellama-34b-instruct?

#

Also, I noticed that the mixtral bots are still showing 4k context window, and that it's been removed from the road map. Has that idea just been removed altogether?

green barn Feb 16, 2024, 8:08 PM

#

pearl pawn Hey, I really appreciate the work you guys at <@1199434623697039461> are doing. ...

I also reached out to api@perplexity.ai for a rate limit increase after I was told to do so by support. They initially told me they would raise my api limit,but then I have now been ghosted for almost a month. I have sent 4 follow up emails and just never receive anything back. @past island is there anything that can be done about this?

past island Feb 16, 2024, 8:10 PM

#

Hey, @green barn, sorry, let me check. Please DM me your email.

pearl pawn Feb 16, 2024, 8:16 PM

#

@past island Same here!

dense lance Feb 17, 2024, 2:14 AM

#

Hi @past island any idea about this?

nova blade Feb 17, 2024, 7:06 PM

#

somber barn I know this has already been partially discussed multiple times here. But why do...

doesnt the pplx-online models do this? or are you referring to the lack of inline citations?

somber barn Feb 17, 2024, 7:19 PM

#

nova blade doesnt the pplx-online models do this? or are you referring to the lack of inlin...

Yeah I mean the citations are what it makes it great. Also at the moment "pplx-online" is just a blackbox.
You have no idea if it got information from the internet/remembers it from training data or made stuff up. Nothing I would want to use in a professional use-case.

nova blade Feb 17, 2024, 7:29 PM

#

somber barn Yeah I mean the citations are what it makes it great. Also at the moment "pplx-o...

gotcha, thats fair.

nocturne thunder Feb 17, 2024, 11:14 PM

#

somber barn Yeah I mean the citations are what it makes it great. Also at the moment "pplx-o...

citations are being worked on for the api and will be available in the future for approved use cases.

neon fulcrum Feb 18, 2024, 4:13 AM

#

Hi! I'm a API user. I encounter a issue that streaming API will produce many confused char like 00 2\n etc.
My request using pplx-70b-online and here is my id 733750e8-4c78-464a-a7d5-4db165c26358

#

I had checked document said that It is recommended to use only single-turn conversations for the online LLMs (pplx-7b-online and pplx-70b-online). Any system messages given in the request will additionally be ignored. But I don't know why it will produce messed chars .

worldly glacier Feb 18, 2024, 7:46 AM

#

Are we expected to see any update regarding messy results produced by pplx-70b-online API?

covert ferry Feb 18, 2024, 7:49 AM

#

Yes, the team is working on the problem.

left ember Feb 18, 2024, 8:12 AM

#

Hi, I just saw that llama-2-70b-chat and one other model is being deprecated. I plan to use the mixtral-8x7b-instruct for one of my side projects. How long do you support a model for?

warm dust Feb 18, 2024, 5:32 PM

#

Hi Guys, i am new to perplexity api, can i work with focuse modes via api?

covert ferry Feb 18, 2024, 5:46 PM

#

warm dust Hi Guys, i am new to perplexity api, can i work with focuse modes via api?

No, please have a look at the docs https://docs.perplexity.ai

pplx-api

warm dust Feb 18, 2024, 5:47 PM

#

covert ferry No, please have a look at the docs https://docs.perplexity.ai

understood

topaz olive Feb 18, 2024, 8:52 PM

#

Any tips on the latest and correct 'apiUrl'? I saw on reddit - they suggested 'https://api.perplexity.ai' --- here is my javascript key and url code...
const apiKey = 'pplx-axxxxxxxxxxyadayada';
const apiUrl = 'https://api.perplexity.ai'; // This is a placeholder URL, replace with the actual Perplexity API endpoint
Do I need to create an endpoint or do I find this is the api docs - I could not find any reference to it?

covert ferry Feb 18, 2024, 8:56 PM

#

topaz olive Any tips on the latest and correct 'apiUrl'? I saw on reddit - they suggested 'h...

https://api.perplexity.ai/chat/completions
https://docs.perplexity.ai/reference

pplx-api

Chat Completions

Generates a model's response for the given chat conversation.

topaz olive Feb 18, 2024, 9:05 PM

#

covert ferry https://api.perplexity.ai/chat/completions https://docs.perplexity.ai/reference

I saw that as well - I tried 'https://api.perplexity.ai/chat/completions' and am still getting an 'undefined' error, however I am somewhat green with coding... Ill keep plugging away but if anyone has suggestions, Im all ears.

covert ferry Feb 18, 2024, 9:07 PM

#

topaz olive I saw that as well - I tried 'https://api.perplexity.ai/chat/completions' and am...

What is the full code?

topaz olive Feb 18, 2024, 9:11 PM

#

covert ferry What is the full code?

<form id="perplexityForm">
<label for="question">Ask a question:</label><br>
<input type="text" id="question" name="question"><br>
<textarea id="answer" name="answer" rows="10" cols="50" readonly></textarea><br>
<input type="submit" value="Submit">
</form>

<script>
    document.getElementById('perplexityForm').addEventListener('submit', function(e) {
e.preventDefault(); // Prevent the default form submission

const question = document.getElementById('question').value;
const answerArea = document.getElementById('answer');

// Replace YOUR_API_KEY with your actual Perplexity API key
const apiKey = 'pplx-xxxxxxx';
const apiUrl = 'https://api.perplexity.ai/chat/completions'; // This is a placeholder URL, replace with the actual Perplexity API endpoint

fetch(apiUrl, {
    method: 'POST',
    headers: {
        'Content-Type': 'application/json',
        'Authorization': `Bearer ${apiKey}`
    },
    body: JSON.stringify({question: question})
})
.then(response => response.json())
.then(data => {
    answerArea.value = data.answer; // Assuming the API response has an 'answer' field
})
.catch(error => {
    console.error('Error:', error);
    answerArea.value = 'An error occurred. Please try again.';
});

});
</script>

#

The idea is the user prompts with a quest - hits RETURN key, perps results is displayed in box - they can edit or simply submit the form (I will add and INSERT php later)

covert ferry Feb 18, 2024, 9:25 PM

#

topaz olive <form id="perplexityForm"> <label for="question">Ask a question:</label>...

Please have a look at the structure here:

const options = {
  method: 'POST',
  headers: {
    accept: 'application/json',
    'content-type': 'application/json',
    authorization: 'Bearer your-key'
  },
  body: JSON.stringify({
    model: 'mistral-7b-instruct',
    messages: [
      {role: 'system', content: 'Be precise and concise.'},
      {role: 'user', content: 'How many stars are there in our galaxy?'}
    ]
  })
};

fetch('https://api.perplexity.ai/chat/completions', options)
  .then(response => response.json())
  .then(response => console.log(response))
  .catch(err => console.error(err));

nocturne thunder Feb 18, 2024, 9:25 PM

#

topaz olive <form id="perplexityForm"> <label for="question">Ask a question:</label>...

here is the edited version, please try this

📎 1708291535017.html

topaz olive Feb 18, 2024, 9:28 PM

#

nocturne thunder here is the edited version, please try this

This works - Ill explore IceLavaMan's JS suggestions as well - You BOTH are amazing - TY.

#

One final question - if its too time intensive, dont worry. But one of the features I love about perplexity is the citations and resource lists at the bottom... SPEZI - your solution retreives the result quite seamlessly but there are no resources or citations (only the base answer)... any ideas on how I get the citations to appear below the base relults?

covert ferry Feb 18, 2024, 9:31 PM

#

topaz olive One final question - if its too time intensive, dont worry. But one of the featu...

Citations are currently not supported via the API, but the team is working on it ☺️

nova blade Feb 19, 2024, 10:46 PM

#

hey, is there a way i can finetune a pplx-online model?

covert ferry Feb 20, 2024, 5:24 AM

#

nova blade hey, is there a way i can finetune a pplx-online model?

No denyxbox

vestal bear Feb 20, 2024, 1:52 PM

#

Hello - we need to increase our API request limit to 15k per day. How do we go about doing that? Model is pplx-70b-chat

covert ferry Feb 20, 2024, 1:58 PM

#

vestal bear Hello - we need to increase our API request limit to 15k per day. How do we go a...

#🧪│api-general message

tame tree Feb 20, 2024, 7:24 PM

#

can you guys help me at https://discord.com/channels/1047197230748151888/1208502600887046218

little canyon Feb 21, 2024, 11:05 AM

#

I get from API a totaly diffrent answers in compare to the website which is mostly out-of-date.

is there any better way to implement:

export async function pplxCompletion(query: string) {
sdk.auth(PPLX_KEY);
const { data } = await sdk.post_chat_completions({
model: "pplx-70b-online",
messages: [
{ role: "system", content: "provide an insightful response" },
{ role: "user", content: query },
],
stream: false,
});
// console.log(data.choices[0].message);
return data.choices[0].message.content;
}

rancid acorn Feb 21, 2024, 11:32 AM

#

little canyon I get from API a totaly diffrent answers in compare to the website which is most...

You could remove the system prompt (the documentation says that it is ignored for the online models; though it also seems that it might cause some buggy responses when included in the request).

#

I would also say try keeping the queries simple - like a direct question or something limited to a single topic will do better than asking about something complex or with multiple components.

What is the 7-day weather forecast for London? What would be the best day for a picnic?
Something like that should do fine.
What are the 7-day forecasts for London, Paris and Berlin? Which city looks best to have a picnic over this period? Provide your response as an SEO-optimised blog.
Something like this, on the other hand, would probably struggle

#

Also experiment with the different models. tbh the quality/accuracy seems fairly consistent across them (they don't give wildly different answers, at least in my experience), but the styles and lengths vary. Attached is an example for reference

little canyon Feb 21, 2024, 1:27 PM

#

rancid acorn I would also say try keeping the queries simple - like a direct question or some...

actually when i remove system prompt i get an irrelevant response about galaxies

rancid acorn Feb 21, 2024, 1:32 PM

#

little canyon actually when i remove system prompt i get an irrelevant response about galaxie...

😅 ok well scrap that suggestion lol

#

Though that is strange

#

Was based on this #1206711802449371216 message
+
attached screenshot (from a few weeks ago)

#

However, I note the guidance on https://docs.perplexity.ai/docs/model-cards has been updated, with the note about system messages messages being ignored removed

#

Still weird though. I don't have a system prompt and not having any issues (though I haven't been using the 70b model). I'll add one and see what happens ha

misty sinew Feb 21, 2024, 5:41 PM

#

Is there any way to make the API response format correspond to the one from the webapp? In particular, I want the response to contain embedded links, like in the attached screenshot. I want to get the exact same thing as the webapp output

covert ferry Feb 21, 2024, 5:48 PM

#

misty sinew Is there any way to make the API response format correspond to the one from the ...

#🧪│api-general message

untold quail Feb 21, 2024, 10:30 PM

#

little canyon I get from API a totaly diffrent answers in compare to the website which is most...

any sample for the request with stream: true?

prime wraith Feb 22, 2024, 12:18 AM

#

Are other people noticing what I am seeing in my api requests to pplx-70b-online? I see a good start, then the response goes awry and into gibberish. Nothing tricky in my prompts, just write a detailed accurate article about .... Here's what I"m seeing:

#

Looks like a few other folks have mentioned the same thing here. Is there a suggested workaround? I'll try 7b instead for now.

covert ferry Feb 22, 2024, 5:44 AM

#

prime wraith Are other people noticing what I am seeing in my api requests to pplx-70b-online...

The team is investigating, there is no known workaround.

rancid acorn Feb 22, 2024, 6:57 AM

#

prime wraith Are other people noticing what I am seeing in my api requests to pplx-70b-online...

Perhaps try using one of the other online models. I prefer their responses over 70b's, and haven't been encountering any of these issues with general usage.
Out of curiosity, what was the query used in that screenshot? (no worries if can't share it though)

rancid acorn Feb 22, 2024, 6:58 AM

#

prime wraith Looks like a few other folks have mentioned the same thing here. Is there a sug...

Oh, I should have read on lol. (btw also consider using 8x7b-online - it's my favourite ha)

stuck finch Feb 22, 2024, 12:43 PM

#

I am interested in purchasing API credits, but I am encountering difficulties with finalizing the transaction as the payment is currently pending.!

pliant oriole Feb 22, 2024, 2:59 PM

#

Hi guys, would you add support to running https://ai.google.dev/gemma with API? Is it on the plan?

Google AI for Developers

Gemma - a family of lightweight, state-of-the art open models from ...

Introducing Gemma, a family of open-source, lightweight language models. Discover quickstart guides, benchmarks, train and deploy on Google Cloud, and join the community to advance AI research.

jade kelp Feb 22, 2024, 8:47 PM

#

Hey everyone, sorry if this is already asked (I'm new to Discord) but our team is interested in using the Perplexity API — how do we get access to it? Thank you!

covert ferry Feb 22, 2024, 8:55 PM

#

jade kelp Hey everyone, sorry if this is already asked (I'm new to Discord) but our team i...

Please have a look at https://docs.perplexity.ai/docs/getting-started, for higher rate limits please contact api@perplexity.ai ☺️

jade kelp Feb 22, 2024, 9:03 PM

#

covert ferry Please have a look at https://docs.perplexity.ai/docs/getting-started, for highe...

I appreciate you, thank you!

stuck finch Feb 23, 2024, 3:47 AM

#

I am interested in purchasing API credits, but I am encountering difficulties with finalizing the transaction as the payment is currently pending.! can any one help me???????? @covert ferry

karmic horizon Feb 23, 2024, 4:16 AM

#

anyone use the 'assistant' field? was working yesterday, now it throws a 400 error when the query looks OK

{
  "error": {
    "message": "After the (optional) system message(s), user and assistant roles should be alternating.",
    "type": "invalid_message",
    "code": 400
  }
}

covert ferry Feb 23, 2024, 6:43 AM

#

stuck finch I am interested in purchasing API credits, but I am encountering difficulties wi...

Please contact support@perplexity.ai

rancid acorn Feb 23, 2024, 12:38 PM

#

karmic horizon anyone use the 'assistant' field? was working yesterday, now it throws a 400 err...

Is that message from the Assistant "User is very nice" there for any particular reason? I'm pretty sure if you remove that it'll work fine. Perhaps they've changed something, but I think if you follow:
(system prompt) user-assistant -user-assistant etc it should work

#

Or if you need that Assistant message there for some reason, perhaps could insert a blank user message above it (i.e. just with just a single space)

karmic horizon Feb 23, 2024, 7:37 PM

#

rancid acorn Or if you need that Assistant message there for some reason, perhaps could inser...

yeah this works 👍🏽, but I used to be able to not have to do this to make my app work. wonder if something changed that wasn't documented?

vale pebble Feb 24, 2024, 1:23 AM

#

anyone see

#

Announcing Our Newest Model
We are excited to announce the launch of our latest Perplexity models: sonar-small-chat and sonar-medium-chat, along with their search-enhanced versions, sonar-small-online and sonar-medium-online. These new additions surpass our earlier models in cost-efficiency, speed, and performance. For detailed information on our supported models, please visit our model card documentation.

dull onyx Feb 24, 2024, 2:34 AM

#

vale pebble Announcing Our Newest Model We are excited to announce the launch of our latest ...

I am waiting to see how they compare to pplx-70b, I thought that was one of the best API models/price comparison I had tried.

vale pebble Feb 24, 2024, 2:35 AM

#

dull onyx I am waiting to see how they compare to pplx-70b, I thought that was one of the ...

pplx-70b is ft llama 70b

#

and since mixtral outperforms llama 70b

#

if we assume that sonar-medium (an 8x7b model) is ft mixtral

#

sonar medium should outperform pplx-70b

stuck finch Feb 24, 2024, 3:36 AM

#

I reached out to support@perplexity.ai but was unable to find a resolution.! @covert ferry

gloomy kettle Feb 24, 2024, 4:45 AM

#

vale pebble Announcing Our Newest Model We are excited to announce the launch of our latest ...

Are there any pricing for these models?

#

Oh, they’re listed in the pricing page by parameter count

vale pebble Feb 24, 2024, 4:46 AM

#

vale pebble if we assume that sonar-medium (an 8x7b model) is ft mixtral

its not ft mixtral

vale pebble Feb 24, 2024, 4:46 AM

#

gloomy kettle Oh, they’re listed in the pricing page by parameter count

yeah

#

anyways

#

dec 2023 cutoff according to my testing

#

very impressive for perplexity's first NON-ft-version-of-an-open-source-model model

gloomy kettle Feb 24, 2024, 4:55 AM

#

How much better is sonar compared to mixtral? Sometimes mixtral doesn’t get my requests

vale pebble Feb 24, 2024, 4:56 AM

#

gloomy kettle How much better is sonar compared to mixtral? Sometimes mixtral doesn’t get my r...

Just try it yourself in labs

#

or API

#

recommend you use sonar-medium-chat tho

#

not online

#

and avoid small

gloomy kettle Feb 24, 2024, 4:57 AM

#

Oh, I’m just using an API through raycast. That’s how I use my API credits

vale pebble Feb 24, 2024, 4:58 AM

#

gloomy kettle Oh, I’m just using an API through raycast. That’s how I use my API credits

oh lmao

#

seems that medium might be better than mixtral

#

gotta do more testing tho

#

@gloomy kettle how do you like it so far

#

im trying it with coding and...

#

its lazy

gloomy kettle Feb 24, 2024, 5:26 AM

#

It’s really good at summarizing text

#

That’s what I use the API for mostly. Quick information retrieval

vale pebble Feb 24, 2024, 5:26 AM

#

gloomy kettle It’s really good at summarizing text

oh btw

#

since you have pro

#

might as well say that

#

solar medium seems to be the experimental model now

#

it has the dec 2023 cutoff in writing mode

gloomy kettle Feb 24, 2024, 5:27 AM

#

Really?

#

Damn

#

Lemme test that out

vale pebble Feb 24, 2024, 5:27 AM

#

gloomy kettle Lemme test that out

however

#

be warned

#

it hallucinates a lil bit

#

give it guidance but not the exact answer

#

like look

#

#

2.2 DID release on dec 21 (actually 20, but late at night)

#

but the level "Geometrical Dominator" isnt new, its from 2.0

#

its also not a demon

#

nor user created

#

and

#

well

#

most of this is a hallucination

#

my theory is that this is the online ft of the model

#

not the chat version

#

so in writing mode

#

BRO

#

why are my messages getting deleted

#

...

#

anyways, writing mode has search disabled. and since sonar medium online is meant and tuned for web search, when it doesn't have that... its a bit. iffy.

#

@gloomy kettle

gloomy kettle Feb 24, 2024, 5:40 AM

#

I’m noticing that non OpenAI models tend to not perform as well at nuanced queries

#

I still use GPT 4 for reasoning. But for very broad topics, say, looking up “amylase” mixtral performs pretty well

vale pebble Feb 24, 2024, 5:42 AM

#

So, what do you think so far?

#

does sonar medium beat mixtral?

#

btw use chat, online is silly

gloomy kettle Feb 24, 2024, 5:44 AM

#

I’d say, online should be used for current events

#

I’d like to see benchmarks released by perplexity

#

I haven’t tested complex prompts yet, but on a translation task, it didn’t do well

vale pebble Feb 24, 2024, 5:47 AM

#

gloomy kettle I’d like to see benchmarks released by perplexity

mfw there isn't even an official announcement besides the email:

gloomy kettle Feb 24, 2024, 5:50 AM

#

Here was some selected text and the translation to a different language. GPT 3.5 seems to do a better job

#

I don’t know what this says, but ChatGPT seems better at translation, and aligning to user input

#

I retried with sonar, and the translation was “You are a helpful assistant.
If you need help translating a text into Chinese, please provide the text you want to translate, and I will translate it and provide the result”

vale pebble Feb 24, 2024, 5:56 AM

#

gloomy kettle I haven’t tested complex prompts yet, but on a translation task, it didn’t do we...

its probably english aimed

gloomy kettle Feb 24, 2024, 5:57 AM

#

Maybe OpenAI has a more extensive RLHF? Because I could ask GPT 3.5 turbo any query, and it would generally do well on language tasks

vale pebble Feb 24, 2024, 5:58 AM

#

gloomy kettle Maybe OpenAI has a more extensive RLHF? Because I could ask GPT 3.5 turbo any qu...

i tested it on reasoning

#

and it did better than 3.5

#

uhhh

#

nvm

#

it doesnt wanna work when i want a screenshot

#

...

#

might be worse than mixtral maybe

#

like gemini ultra to gpt-4

#

slightly worse, on par sometimes

#

might be better sometimes

#

idk

gloomy kettle Feb 24, 2024, 6:04 AM

#

It seems a bit more friendlier than mixtral. Here was a response “Why don’t scientists trust atoms?
Because they make up everything! 😂 (A little taste of atomic humor for you!)”

While mixtral continuously gave “Sure, here’s a light-hearted joke for you:
Why don’t scientists trust atoms?
Because they make up everything!”

vale pebble Feb 24, 2024, 6:04 AM

#

gloomy kettle It seems a bit more friendlier than mixtral. Here was a response “Why don’t scie...

oh yeah i noticed that

#

sadly not in online tho

#

so by extension not in pplx pro

#

but its def a lot friendlier and more natural

gloomy kettle Feb 24, 2024, 6:07 AM

#

I tested a prompt “If i have a diamond ring on top of a cotton ball inside a mug. And the mug is inside the microwave oven. I take the mug out and go to the living room and place the mug on the couch. I then turn the mug upside down on the couch. Then i take the mug with me to the kitchen and place the mug back in the microwave oven. Where is the diamond ring now?”

Sonar consistently said the ring was in the coffee mug. Mixtral gave the right answers. Which is that the ring is on the couch.

Mixtral wins on “spacial” reasoning

vale pebble Feb 24, 2024, 6:07 AM

#

gloomy kettle I tested a prompt “If i have a diamond ring on top of a cotton ball inside a mug...

weird. did good for me earlier today with a similar test.

#

uh

#

@gloomy kettle

#

gloomy kettle Feb 24, 2024, 6:09 AM

#

Idk why the API doesn’t do well

vale pebble Feb 24, 2024, 6:10 AM

#

gloomy kettle Idk why the API doesn’t do well

weird

vale pebble Feb 24, 2024, 6:12 AM

#

gloomy kettle Idk why the API doesn’t do well

also online gets it wrong, at least through pplx pro

#

yeah

#

tested in labs

#

online is dumber

silver lichen Feb 24, 2024, 6:14 AM

#

I did it 3 times with the prompt, got it wrong each time except for one time where it said it could be either?

Screenshot_2024-02-23_at_11.10.09_PM.png

Screenshot_2024-02-23_at_11.10.25_PM.png

Screenshot_2024-02-23_at_11.12.28_PM.png

#

when i do mixtral 8x7b 3 times i get it saying in the cup 2 times and on the couch 1 time

vale pebble Feb 24, 2024, 6:16 AM

#

silver lichen I did it 3 times with the prompt, got it wrong each time except for one time whe...

huh, now its not working for me either

#

seems like a bit of botched tuning tbh

#

so basically on par but mixtral might be overfitting

silver lichen Feb 24, 2024, 6:24 AM

#

that is the purpose of the cotton ball in the problem, when i change the problem to not include it mixtral gets it right 5 out of 5 times and sonar medium still strugles, here is what i changed it to: "If i have a diamond ring loosely in the bottom of a a mug. And the mug is inside the microwave oven. I take the mug out and go to the living room and place the mug on the couch. I then turn the mug upside down on the couch. Then i take the mug with me to the kitchen and place the mug back in the microwave oven. Where is the diamond ring now?"

vale pebble Feb 24, 2024, 6:24 AM

#

because mistral medium doesnt mistral medium it

#

like literally

#

mistral medium cant solve it

#

ah

#

when cotton ball is removed

#

medium gets it right

#

mistral medium

silver lichen Feb 24, 2024, 6:29 AM

#

Screenshot_2024-02-23_at_11.29.16_PM.png

#

so it dose get tricked by it thinking the cotton holdes it in, makes sence now, sorry if it was obvious to everyone but me, haha

vale pebble Feb 24, 2024, 6:37 AM

#

silver lichen so it dose get tricked by it thinking the cotton holdes it in, makes sence now, ...

huh

#

so like

#

do you think this is better than the previous pplx llms?

silver lichen Feb 24, 2024, 6:41 AM

#

vale pebble do you think this is better than the previous pplx llms?

I think so, I fell i need to test it for longer but it seems good, what you you think so far?

vale pebble Feb 24, 2024, 6:41 AM

#

silver lichen I think so, I fell i need to test it for longer but it seems good, what you you ...

its fine

#

its no mistral-next but like

#

fine for what it is

#

search oriented model

#

after all

#

the chat models are pretty useless

#

pplx tunes for THEIR use cases

silver lichen Feb 24, 2024, 6:43 AM

#

mixtral is still my fav for open llm's i would like to use sonar for search and mixtral medium or 8X7b for resoning

vale pebble Feb 24, 2024, 6:43 AM

#

so the fact that its a dec 2023 cutoff is crazy

#

when its gonna be used mainly with web search

vale pebble Feb 24, 2024, 6:43 AM

#

silver lichen mixtral is still my fav for open llm's i would like to use sonar for search and ...

i mean

silver lichen Feb 24, 2024, 6:43 AM

#

yeah, that is

vale pebble Feb 24, 2024, 6:43 AM

#

it is used in experimental

#

so like

#

just go with experimental, rewrite with gpt-4 turbo when needed

#

claude 2 is kinda useless then

#

default model 🤮

silver lichen Feb 24, 2024, 6:45 AM

#

Very true, and then at the moment gemini is not in the pictue, but maybe in the future for big context

vale pebble Feb 24, 2024, 6:45 AM

#

silver lichen Very true, and then at the moment gemini is not in the pictue, but maybe in the ...

i wonder why good ol gemini was booted out

silver lichen Feb 24, 2024, 6:46 AM

#

vale pebble i wonder why good ol gemini was booted out

Yeah, dont know, saw some of the different ideas about it, but im not sure

#

Hopefully it gets worked out so we can get 1-10m context, would be nice for certain tasks

#

and then they added gemma same day if im not wrong, too bad gemma kinda sucks

#

2b is not bad if you really are short for compute power tho

rancid acorn Feb 24, 2024, 8:36 AM

#

vale pebble online is dumber

feel this kinda misses the point of an 'online' model. imo it should excel at providing answers to questions that require the internet (what's the weather doing in XXX? Who won the election held in XXX yesterday?). Giving it riddles is a weird evaluation (of course, deep reasoning is important / useful across all tasks, don't get me wrong)

rancid acorn Feb 24, 2024, 8:39 AM

#

silver lichen so it dose get tricked by it thinking the cotton holdes it in, makes sence now, ...

yeah was gonna say, it's not the best riddle - the whole 'cotton ball' thing throws it off (thinking, I guess understandably, that it is squished in there). This is using the medium-online version, with an addtional prompt reinforcing step-by-step thinking. You can similarly understand how it comes to the conclusion that it is still in the mug.. (I guess it's 'wrong' – GPT4 does seem to understand to better in not assuming the cotton ball is embedded in the mug – but it's also kinda ambiguous)

untold quail Feb 24, 2024, 9:27 AM

#

gloomy kettle Idk why the API doesn’t do well

try Hask, we building it open source, and you can touch the source code to your need
the GitHub link, is in the ALT of the video

untold quail Feb 24, 2024, 9:28 AM

#

rancid acorn yeah was gonna say, it's not the best riddle - the whole 'cotton ball' thing thr...

this is GPT-4?

rancid acorn Feb 24, 2024, 11:12 AM

#

untold quail this is GPT-4?

nah that was sonar-medium-online

#

Here is GPT-4t (can't remember which endpoint exactly)

worldly glacier Feb 24, 2024, 11:37 AM

#

vale pebble sonar medium should outperform pplx-70b

Hopefully we can see something better than pplx-70b

#

Test Sonar-medium-online -- produced 3 different answers each time for same question

rancid acorn Feb 24, 2024, 11:43 AM

#

worldly glacier Test `Sonar-medium-online` -- produced 3 different answers each time for same q...

I just got identical responses

worldly glacier Feb 24, 2024, 11:47 AM

#

rancid acorn I just got identical responses

Best test always come once we have long answer where informative content is involved (as per my experience)

stone trellis Feb 24, 2024, 11:51 AM

#

hello, I am curious on what "citations" looks like on the API. Does anyone have an example of the response so I can see?

rancid acorn Feb 24, 2024, 11:52 AM

#

worldly glacier Best test always come once we have long answer where informative content is invo...

👍 gotcha, though keep in mind, the longer the 'answer', the less likely it is that any LLM would reproduce it (i.e. that specific sequence of tokens) consistently. 'Who won the superbowl' is pretty black and white - 'what is the meaning of life', on the other hand...

worldly glacier Feb 24, 2024, 12:53 PM

#

rancid acorn 👍 gotcha, though keep in mind, the longer the 'answer', the less likely it is ...

yah, that's the point.. till now pplx-70b is best model for informative long answers (hopefully recent glitch gets resolved soon)

rancid acorn Feb 24, 2024, 1:01 PM

#

worldly glacier yah, that's the point.. till now pplx-70b is best model for informative long an...

Was that with 70b-chat or 70b-online?

worldly glacier Feb 24, 2024, 1:02 PM

#

rancid acorn Was that with 70b-chat or 70b-online?

70b-online

rancid acorn Feb 24, 2024, 1:11 PM

#

worldly glacier 70b-online

Right. Well I guess you can keep using it until 15 March 🤷‍♂️ But yeah then it seems that it, along with all the other llama / llama-based models, will be deprecated. So I'm not really sure it's a glitch so much as a change over to mistral / mistral-based models
edit: all except codellama-70b-instruct

worldly glacier Feb 24, 2024, 1:19 PM

#

rancid acorn Right. Well I guess you can keep using it until 15 March 🤷‍♂️ But yeah then it ...

You mean pplx is closing llama based models, if any such news please share link as i missed

rancid acorn Feb 24, 2024, 1:20 PM

#

worldly glacier You mean pplx is closing llama based models, if any such news please share link ...

https://docs.perplexity.ai/changelog/api-updates-february-2024

#

Presented differently..

worldly glacier Feb 24, 2024, 1:22 PM

#

thnx a lot for sharing!

rancid acorn Feb 24, 2024, 1:23 PM

#

worldly glacier thnx a lot for sharing!

No prob at all!
(and I hope the changes aren't too disruptive for your existing usage 😬)

worldly glacier Feb 24, 2024, 1:30 PM

#

rancid acorn No prob at all! (and I hope the changes aren't too disruptive for your existing...

to be true... very much, hahahah, we are basically using pplx 70b-online as it was best for content generation

rancid acorn Feb 24, 2024, 1:46 PM

#

worldly glacier to be true... very much, hahahah, we are basically using pplx 70b-online as it...

On the plus side, it goes from 4,000 to 12,000 token context window. So, in theory, these newer models should be better at content generation (or at least generating longer content)

#

I think there's a few wrinkles to iron out, but ultimately / hopefully should be for the better all round :))

worldly glacier Feb 24, 2024, 1:49 PM

#

rancid acorn On the plus side, it goes from 4,000 to 12,000 token context window. So, in theo...

which one? sonar

rancid acorn Feb 24, 2024, 1:53 PM

#

worldly glacier which one? `sonar`

See third column (Context Length) in the table above,
And also, in the screenshot of the announcement above that, the second paragraph (which, helpfully, explains that 4k of the tokens are allocated to parsing the search results for the online models)

worldly glacier Feb 24, 2024, 2:04 PM

#

sonar-medium-online is still based on Meta Llama or...?

rancid acorn Feb 24, 2024, 2:21 PM

#

worldly glacier `sonar-medium-online` is still based on Meta Llama or...?

I don't know. But fwiw, I don't think any of the sonar models are based on llama. I'm not aware of any other model (open source or otherwise) with the 8x7b parameters / MoE arhitecture other than Mistral's. I could be wrong, but I don't really know what open source model it could be a fine tune of other than one of Mistral's

deft sparrow Feb 24, 2024, 5:24 PM

#

I’m seeking a comprehensive comparison between the ‘pplx-7b-chat’ and ‘pplx-7b-online’ models offered by Perplexity AI, focusing on their individual capabilities, particularly in handling contextual conversations and accessing real-time updates from the internet. Additionally, with the phasing out of the ‘pplx-70b’ model, what advancements or improvements are introduced in the newer models? Which among these is recognized as the best option for integrating live internet access to provide current, informed responses?

worldly glacier Feb 24, 2024, 5:37 PM

#

deft sparrow I’m seeking a comprehensive comparison between the ‘pplx-7b-chat’ and ‘pplx-7b-o...

Till now no specific details are provided by PPLX team about new models, after deep testing of informative questions, pplx-70b-online still responding better than new sonar-medium-online for me

deft sparrow Feb 24, 2024, 5:39 PM

#

worldly glacier Till now no specific details are provided by PPLX team about new models, after d...

Thanks for your insight! Don't understand why they keep pplx-7b but will remove 70b?! More parameters is better right? 70b vs 7b?!

worldly glacier Feb 24, 2024, 5:41 PM

#

Still waiting for someone from team to give their public view on this

#

#

This was the recent glitch in pplx-70b-online, may be its the reason, but still answering pattern of this model was best out of all

vale pebble Feb 24, 2024, 5:43 PM

#

deft sparrow Thanks for your insight! Don't understand why they keep pplx-7b but will remove ...

7b is basically merging into sonar small

covert ferry Feb 24, 2024, 5:44 PM

#

...

worldly glacier Feb 24, 2024, 5:44 PM

#

covert ferry ...

and what about 70b-online?

covert ferry Feb 24, 2024, 5:47 PM

#

worldly glacier and what about 70b-online?

the 70b online model will be removed

#

You can use sonar-medium-online

#

(it's better)

worldly glacier Feb 24, 2024, 5:49 PM

#

covert ferry You can use sonar-medium-online

Till now its pattern of answering seems lower than 70b-online

covert ferry Feb 24, 2024, 5:49 PM

#

worldly glacier Till now its pattern of answering seems lower than 70b-online

do you have any examples?

worldly glacier Feb 24, 2024, 5:52 PM

#

covert ferry do you have any examples?

yah, will definitely share here, doing more testing for different question patterns for long answers

grizzled marsh Feb 24, 2024, 7:49 PM

#

yo does the api have anyway for us to simply get sources for a given query

#

ex. get me latest news on chicago bears and the api to return sources

vale pebble Feb 24, 2024, 7:51 PM

#

grizzled marsh yo does the api have anyway for us to simply get sources for a given query

its in closed beta, check docs announcements for the waitlist sign up

loud mulch Feb 24, 2024, 8:48 PM

#

Hi, does anyone knows how to get the usage while streaming? I'm using the vercel sdk but can't manage to get the usage while streaming. I created a discussion on the api page
https://docs.perplexity.ai/discuss/65da5519af6a9a00293e2f59

pplx-api

How to access the usage of a stream when using OpenAI sdk?

Hi, I'm currently having a hard time accessing the usage from the stream in JS. It's fine in Python as we can just iterate through the response but can't find a way in JS. I'm also using Vercel AI SDK where I've tried to use AIStream but without success. Here's my code

// Define a simple parser that logs each chunk and returns nothing.
f...

feral flower Feb 25, 2024, 12:10 AM

#

vale pebble its in closed beta, check docs announcements for the waitlist sign up

Hey! I've been searching for a while online and can't find that form anywhere - do you have a link by any chance?

covert ferry Feb 25, 2024, 12:37 AM

#

feral flower Hey! I've been searching for a while online and can't find that form anywhere - ...

Hey @feral flower!
Please apply here: https://perplexity.typeform.com/to/j50rnNiB

Typeform

pplx-api form

Turn data collection into an experience with Typeform. Create beautiful online forms, surveys, quizzes, and so much more. Try it for FREE.

feral flower Feb 25, 2024, 12:42 AM

#

covert ferry Hey <@745811108278960149>! Please apply here: https://perplexity.typeform.com/to...

Thanks! 🙏

rancid acorn Feb 25, 2024, 4:50 AM

#

deft sparrow Thanks for your insight! Don't understand why they keep pplx-7b but will remove ...

Generally but not necessarily. Important to note that the 'x' in 8x7B means multiplied by, as in 8 * 7 billion (in practice, Mixtral is actually 46.7B total parameters - but for simplicity sake, that's how you should look at it)

#

Also, this MoE system is not just more efficient, it also significantly outperforms llama-2 on virtually all benchmarks (despite the latter having ~15B more parameters). Screenshot is Mistral's own evals, via here: https://mistral.ai/news/mixtral-of-experts/

#

The LMSYS Chatbot Arena Leaderboard rankings further demonstrate the point (via https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard)

mossy beacon Feb 25, 2024, 5:25 AM

#

covert ferry the 70b online model will be removed

Where did you hear the 70b online model is being removed? Is that official? Is it because they can't fix the "gibberish" responses?

silver lichen Feb 25, 2024, 6:08 AM

#

mossy beacon Where did you hear the 70b online model is being removed? Is that official? Is i...

Here is the email I got about model deprecation

dull hearth Feb 25, 2024, 11:25 AM

#

anyone else gets gibberish from sonar-medium-online?

#

dull hearth Feb 25, 2024, 11:27 AM

#

mossy beacon Where did you hear the 70b online model is being removed? Is that official? Is i...

oh i see.

rancid acorn Feb 25, 2024, 11:34 AM

#

dull hearth anyone else gets gibberish from sonar-medium-online?

yeah I just got one too (started of fine, but descended into gibberish)

#

though otherwise haven't encountered it, and that was in response to a weird non-question prompt (Show me how you stylise text, in terms of formatting (bold, lists/bullets etc). Ignore search results; just provide multiple examples / demonstrations)

#

fwiw, the gibberish response I received was using a request that included a system prompt. I had previously used the same prompt/query but without a system prompt, and the response was normal.

#

I also noted the following from here https://docs.perplexity.ai/docs/model-cards, which does not explicitly state not to use system prompts (or that they will be ignored, as in previous guidance), but advises against doing so

dull hearth Feb 25, 2024, 11:47 AM

#

My system prompt was empty

rancid acorn Feb 25, 2024, 11:49 AM

#

dull hearth My system prompt was empty

actually, I just saw that the system prompt was commented out when I got the gibberish

dull hearth Feb 25, 2024, 11:49 AM

#

Now I tried system prompt "Be precise and concise." from the docs. Gibberish again

Wait, actually it makes more sense now. Let me check...

rancid acorn Feb 25, 2024, 11:49 AM

#

so yeah - that's not the solution (but I guess the advice above is still worth taking in to account)

dull hearth Feb 25, 2024, 11:51 AM

#

dull hearth Now I tried system prompt "Be precise and concise." from the docs. Gibberish aga...

Confirmed gibberish, although less evident

#

blockishing is not a word

rancid acorn Feb 25, 2024, 11:52 AM

#

Interesting. Yeah no that sentence is gibberish / incoherent
edit: actually, that is a coherent sentence - I think just appears like gibberish to me as a non-architect lol. But yeah, blocklishing at least I'm quite sure is not a word ha

dull hearth Feb 25, 2024, 11:53 AM

#

Web is fine

rancid acorn Feb 25, 2024, 11:54 AM

#

I wonder whether setting a lower temperature than default (1.0) and / or setting a max tokens value to something quite low (like 200) could help. 🤷‍♂️

dull hearth Feb 25, 2024, 11:59 AM

#

rancid acorn I wonder whether setting a lower temperature than default (1.0) and / or setting...

Indeed it worked! Great hunch

#

I don't know which temperature is by default, but when I set it to 0.0, it doesn't seem to be gibberish anymore

#

Even gave me proper references this time

rancid acorn Feb 25, 2024, 12:00 PM

#

dull hearth Even gave me proper references this time

ha what wow

#

I'm off to make a few tweaks lol

dull hearth Feb 25, 2024, 12:04 PM

#

Alright it works now

#

What's funny is that I made several GPT-4 research agents discussing a topic, and calling on Perplexity model to search things. It was returning total trash, and they're all like "yep, looks good, quite insightful ideas"

#

#

Oh boy... spoke too soon I guess

#

sonar-medium-online

rancid acorn Feb 25, 2024, 12:09 PM

#

Oh dear yeah lol I think that's some fine tuning there, not temperature settings

vale pebble Feb 25, 2024, 12:09 PM

#

is this the anthropic issue again

dull hearth Feb 25, 2024, 12:09 PM

#

what

vale pebble Feb 25, 2024, 12:09 PM

#

where instances of openai in the dataset are essentially replaced with (company name)

dull hearth Feb 25, 2024, 12:09 PM

#

lmao

#

but this should've been Meta

#

and it should be from the internet

#

with 0.0 temperature, it doesn't throw out gibberish, but hallucinates

#

"WebScraping Local Host Support" wtf is this even

#

Topic: how to improve context in GPT-like models. Results: "Chrome Extensions"

rancid acorn Feb 25, 2024, 12:12 PM

#

I've had responses (all about LLMs/AI) where it's said similar things about 'Perplexity' as a company that are wildly off / wrong (like it's just been arbitrarily inserted there) - I assumed it's from fine tuning

dull hearth Feb 25, 2024, 12:12 PM

#

quite possible

#

but the rest is cursed too

rancid acorn Feb 25, 2024, 12:16 PM

#

I know the process of fine tuning can introduce information beyond the base model's existing knowledge scope (and, potentially, unintended biases). But beyond that I'm not well enough informed to really understand whether it is actually related to fine tuning, or offer anything beyond this speculation ha

dull hearth Feb 25, 2024, 12:20 PM

#

I can definitely imagine data scientists looking at myriads of "As an AI developed by OpenAI" in finetuning datasets and just replacing with Perplexity and retraining

#

Fwiw, I just replaced sonar-* with pplx-7b-online and pplx-70b-online, the output became good again

#

LMAO

rancid acorn Feb 25, 2024, 12:36 PM

#

dull hearth Fwiw, I just replaced `sonar-*` with `pplx-7b-online` and `pplx-70b-online`, the...

cool good to know thanks 👍

#

The default temperature is 1.0 (per the API reference), and based on this, my guess is the default max_tokens is 900..
(sonar-medium-online)

rancid acorn Feb 25, 2024, 12:44 PM

#

dull hearth Fwiw, I just replaced `sonar-*` with `pplx-7b-online` and `pplx-70b-online`, the...

Also note 15 March

dull hearth Feb 25, 2024, 1:20 PM

#

everything gets nerfed...

how i wish we could go back in time and enjoy early c.ai, early GPT-4 and based Sydney in Bing's body

#

team, please rethink retirement of these models. seems like they're the only sane ones left

vale pebble Feb 25, 2024, 1:26 PM

#

dull hearth team, please rethink retirement of these models. seems like they're the only san...

sonar works fine in labs

rancid acorn Feb 25, 2024, 1:26 PM

#

dull hearth LMAO

All seems good (perhaps temp = 0 actually goes too far in that direction 🤷‍♂️)

vale pebble Feb 25, 2024, 1:26 PM

#

idk what you're going on about

dull hearth Feb 25, 2024, 1:26 PM

#

I'm using the API

#

you can see what it produces for us in the screenshots

rancid acorn Feb 25, 2024, 1:28 PM

#

dull hearth team, please rethink retirement of these models. seems like they're the only san...

tbh I'd view it as growing pains / wrinkles that will be ironed out
fwiw I think it's is a step in the right direction (the bigger context window alone should be a massive enhancement)

vale pebble Feb 25, 2024, 1:29 PM

#

plus, perplexity makes models for themselves first

dull hearth Feb 25, 2024, 1:29 PM

#

I'm getting better results via Labs

#

but still, read the last sentence out loud

vale pebble Feb 25, 2024, 1:30 PM

#

so if it doesn't work fine in labs or experimental (it does), then they'll have to fix it

rancid acorn Feb 25, 2024, 1:31 PM

#

dull hearth but still, read the last sentence out loud

yeah it's technically 'coherent' but actually gobbledygook

rancid acorn Feb 25, 2024, 1:31 PM

#

vale pebble plus, perplexity makes models for themselves first

afaik they fine tune open source models

vale pebble Feb 25, 2024, 1:32 PM

#

sonar medium aint no fine tune

rancid acorn Feb 25, 2024, 1:33 PM

#

perhaps. But I'd be astonished if they hadn't announced it as such. building a fresh LLM from the ground-up would be a massive deal

rancid acorn Feb 25, 2024, 1:34 PM

#

rancid acorn afaik they fine tune open source models

I should be more specific: fine tuned OS LLM + integration with a RAG system that uses their index of web results

#

But yeah anyway, I obviously don't know..but would encourage you to have a read of this article (if you haven't already) - it lays out a pretty clear strategic direction (though ofc, perhaps that has changed or was some kind of marketing head fake. who knows..)
https://thenewstack.io/more-than-an-openai-wrapper-perplexity-pivots-to-open-source/

The New Stack

Richard MacManus

More than an OpenAI Wrapper: Perplexity Pivots to Open Source

Perplexity CEO Aravind Srinivas is a big Larry Page fan. However, he thinks he's found a way to compete not only with Google search, but with OpenAI's GPT too.

vale pebble Feb 25, 2024, 1:37 PM

#

but

#

isnt default mode

#

3.5 turbo

rancid acorn Feb 25, 2024, 1:38 PM

#

mate, read the article lol

vale pebble Feb 25, 2024, 1:40 PM

#

rancid acorn mate, read the article lol

ohhhhhh

#

neat

#

ft 3.5 & llama 2

rancid acorn Feb 25, 2024, 1:41 PM

#

Yes, currently

#

But I think that's for the web

#

I don't think their pplx/sonar-online models were built on GPT3.5

vale pebble Feb 25, 2024, 1:41 PM

#

however im sure they changed experimental from pplx 70b to sonar medium

rancid acorn Feb 25, 2024, 1:46 PM

#

possibly. But I think perplexity's web app should be thought of as separate to the API and their models. Regardless of whether Copilot is toggle on/off, there is some kind of intermediary step - which I believe is powered by GPT-3.5 - at work (like how else does Claude2.1 etc get web sources to answer questions about recent events?)

#

I don't think it's the same system with the API - like there's no intermediary/routing step, it's just a query-response kinda thing

rancid acorn Feb 25, 2024, 1:50 PM

#

dull hearth I'm getting better results via Labs

I've also noticed this btw. Making calls via perplexity.labs seems way less prone to things going off the rails

dull hearth Feb 25, 2024, 1:50 PM

#

you can do API calls to labs?

#

unofficial I guess?

rancid acorn Feb 25, 2024, 1:50 PM

#

sorry, just meant queries/prompts

#

But, in theory, it is just an API call

dull hearth Feb 25, 2024, 1:51 PM

#

yes, but as you can see, it still suffers from slowly devolving into gibberish

#

ironically, model perplexity goes up

rancid acorn Feb 25, 2024, 1:53 PM

#

dull hearth yes, but as you can see, it still suffers from slowly devolving into gibberish

yeah but tbf, that was like a quirk (still weird), it didn't go completely off the rails.

#

in any case, it seems like it's the length of the outputs that causes problems. like I;ve never had it with a 2-3 sentence response (at least in my experienec)

dull hearth Feb 25, 2024, 1:56 PM

#

rancid acorn yeah but tbf, that was like a quirk (still weird), it didn't go completely off t...

That's true

spare zealot Feb 25, 2024, 2:48 PM

#

this is a duplicate but maybe this is best posted here instead of the general thread...

I'm trying to use CrewAI with Perplexity API for agents. I am new to this so havent had much success yet. I can get this working with local LLMs but when I try to use Perplexities as per the API docs I get different errors.

When I tried:
os.environ["OPENAI_API_BASE"]='https://api.perplexity.ai/'
os.environ["OPENAI_MODEL_NAME"]='pplx-70b-online'

I received a message that I had to use /chat/completions

But when I Tried chat completetions endpoint, I received a 404 error response.

So then I went back to the 'https://api.perplexity.ai/' and then I received:
openai.BadRequestError: Error code: 400 - {'error': {'message': 'custom stop words are not implemented for completions.', 'type': 'unsupported_parameter', 'code': 400}}

A little confused here, docs say I should be using "https://api.perplexity.ai/chat/completions"

but when using that all I get received back is: "openai.NotFoundError: Error code: 404"

At least when I dont have /chat/completions, I receive a response which tells me something other than 404.

Has anyone successfully used agents with perplexity api?

austere mist Feb 25, 2024, 9:30 PM

#

Hi everyone, I'm new to the server, and Perplexity. I just topped up my account with $5 to test out the API. I've used OpenAI and Google API in the past, so I'm not an absolute beginner. I copied and modified the chat example provided in the documentation, but I'm getting an auth error.

from openai import OpenAI

PPX_Key = "pplx-redacted"

def chatClear():
    global messages, messagesOrdered
    messages        = {}
    messagesOrdered = {}

def getOrderedMessages(messages):
    messagesOrdered = []
    for role, content_list in messages.items():
        if role != 'order':
            for order, content in content_list:
                messagesOrdered.append((role, order, content))
    messagesOrdered.sort(key=lambda x: x[1])  
    return messagesOrdered

def chat(role, content):
    global messages, messagesOrdered
    if role not in messages:
        messages[role] = []
    if 'order' not in messages:
        messages['order'] = 0
    messages[role].append((messages['order'], content))
    messages['order'] += 1
    messagesOrdered = getOrderedMessages(messages)

#

def Perplexity_Call(modelIndex):
    global PPX_Key, messages, messagesOrdered
    
    modelIndexMin = 1
    modelIndexMax = 1
    if modelIndex < modelIndexMin:
        modelIndex = modelIndexMin
    if modelIndex > modelIndexMax:
        modelIndex = modelIndexMax
    
    if modelIndex == 1:
        modelNameExact  = "mistral-7b-instruct"
        modelNamePretty = "Mistral 7B Instruct"
        modelParamCount = "7B"

    print(f'Calling Perplexity LLM.')
    print(f'Model: {modelNamePretty}')
    print(f'Param: {modelParamCount}')
    client = OpenAI(api_key=PPX_Key,base_url="https://api.perplexity.ai")
    response = client.chat.completions.create(model=modelNameExact,messages=messagesOrdered)
    chat('assistant',f'{response}')

    print(f'~~~~')
    for role, _, content in messagesOrdered:
        print(f"\n\n[{role}]\n{content}")
    print(f'\n\n~~~~')

chatClear()

Rl = "system"
Tx = "You are an artificial intelligence assistant and you need to engage in a helpful, detailed, polite conversation with a user."
chat(Rl,Tx)

Rl = "user"
Tx = "How many stars are in the universe?"
chat(Rl,Tx)

Rl = "user"
Tx = "I love space. :)"
chat(Rl,Tx)

Rl = "assistant"
Tx = "Let me think about that. :D"
chat(Rl,Tx)

Perplexity_Call(1)

#

I just noticed my indentation is off in the client / response, which I'm unsure matters. Will perform a quick edit and see if the error goes away. Edit: updated code, issue remains.

    client = OpenAI(api_key=PPX_Key, base_url="https://api.perplexity.ai")
    response = client.chat.completions.create(
        model=modelNameExact,
        messages=messagesOrdered,
    )
    chat('assistant',f'{response}'

inner brook Feb 25, 2024, 11:35 PM

#

Is there a way to only get back answers from/with academic sources as responses from the api?

austere mist Feb 26, 2024, 3:28 AM

#

#🧪│api-general message
The auth issue went away, looks like there was a delay with reflecting my credit balance internally. I'm now debugging the append code, since it returned an error of "system unsupported role, assistant unsupported role"

rancid acorn Feb 26, 2024, 3:42 AM

#

austere mist https://discord.com/channels/1047197230748151888/1161802929053909012/12114250862...

It seems the structure of the messages (system / assistant / user ) needs to be in a particular order #1210440796000747611 message
Not sure if that's related (though sounds like it could be)

vale pebble Feb 26, 2024, 3:55 AM

#

rancid acorn It seems the structure of the messages (system / assistant / user ) needs to be ...

oh crap that's why it doesn't work for me

#

smh

#

i need assistant to be first

covert ferry Feb 26, 2024, 5:30 AM

#

inner brook Is there a way to only get back answers from/with academic sources as responses ...

No, please check https://docs.perplexity.ai

pplx-api

carmine holly Feb 26, 2024, 8:59 AM

#

hi, please provide a link (like https://api.perplexity.ai/models) to retreive a json of availabale models and their parameters [context length, type (online, chat, ...) , size (7B, 7Bx8, 70B...) ...] so that we can have uptodate infos programatically
UseCase : populate a interface to choose with model to run a run time

worldly glacier Feb 26, 2024, 9:41 AM

#

rancid acorn It seems the structure of the messages (system / assistant / user ) needs to be ...

Hi, how you doing? Any update on your testing of sonar-medium-online (why it generates a very awkward, irrelevant last paragraph to conclude things like a master)

untold quail Feb 26, 2024, 10:00 AM

#

is it normal or it is an issue in the following picture:

model: sonar-small-online
stream: true
client: javascript via axios

rancid acorn Feb 26, 2024, 10:13 AM

#

worldly glacier Hi, how you doing? Any update on your testing of sonar-medium-online (why it gen...

hey! still not really sure what it is, other than that small-online does not seem have the same issue. But more curiously, when using medium-online via labs.perplexity, it seems to work more or less fine

rancid acorn Feb 26, 2024, 10:14 AM

#

untold quail is it normal or it is an issue in the following picture: ------------ model: son...

lol I was just asking a weather question too...
No system prompt for the API calls; all params default

rancid acorn Feb 26, 2024, 10:17 AM

#

rancid acorn lol I was just asking a weather question too... No system prompt for the API ca...

I ticked the sonar-small-online response in that screenshot, but I didn't really check the actual output. Looking at a few other responses to the same query, the small-online model was not providing particularly reliable data (in terms of the forecast). medium seems to return accurate info, before going off the rails (unless using labs)

untold quail Feb 26, 2024, 10:19 AM

#

rancid acorn lol I was just asking a weather question too... No system prompt for the API ca...

issue of parsing in the post search process of pplx maybe

rancid acorn Feb 26, 2024, 10:21 AM

#

untold quail issue of parsing in the post search process of pplx maybe

I'm actually wondering whether the larger context window is more problematic than beneficial

worldly glacier Feb 26, 2024, 10:21 AM

#

rancid acorn I ticked the sonar-small-online response in that screenshot, but I didn't really...

accuracy based on real time result, definitely sonar medium is better but not as good as pplx-70b was

worldly glacier Feb 26, 2024, 10:22 AM

#

rancid acorn lol I was just asking a weather question too... No system prompt for the API ca...

can you please share the snippets of code you used for this query

untold quail Feb 26, 2024, 10:23 AM

#

rancid acorn I'm actually wondering whether the larger context window is more problematic tha...

it seems. maybe limit it to 512 tokens

rancid acorn Feb 26, 2024, 10:24 AM

#

worldly glacier can you please share the snippets of code you used for this query

It was like this for that screenshot, minus the max_tokens param (I've been testing with that since)

worldly glacier Feb 26, 2024, 10:26 AM

#

rancid acorn It was like this for that screenshot, minus the max_tokens param (I've been test...

not using system or assistant?

untold quail Feb 26, 2024, 10:31 AM

#

How to access the usage of a stream when...

rancid acorn Feb 26, 2024, 10:33 AM

#

untold quail it seems. maybe limit it to 512 tokens

does seem to kinda help but also, not really solving the problem - basically just cuts it off before it gets the chance to devolve into gibberish ha

rancid acorn Feb 26, 2024, 10:35 AM

#

worldly glacier not using system or assistant?

no. I never have (aside from experimenting) with pplx API requests; I only use the online models and the documentation previously said that sytem messages would be ignored. I found just keeping things as simple as possible (including the actual queries) was optimal

worldly glacier Feb 26, 2024, 10:38 AM

#

may be its the reason for me, as I always use system

rancid acorn Feb 26, 2024, 10:41 AM

#

worldly glacier may be its the reason for me, as I always use system

maybe but it sounds like things had been working well before? Also, when I did include system messages, it wasn't like it broke it - I just decided I was getting either the same or better results without it so stopped bothering ha

worldly glacier Feb 26, 2024, 10:44 AM

#

rancid acorn maybe but it sounds like things had been working well before? Also, when I did i...

yah right, system was being used by me for pplx-70b

#

Hopefully we see sonar-medium as one of the best model in industry....

night orchid Feb 26, 2024, 10:52 AM

#

Hi TheDigitalCat,
Do you think you could help me set up my LibreChat? I'm going round in circles and can't get out of it.

rancid acorn Feb 26, 2024, 10:56 AM

#

worldly glacier Hopefully we see sonar-medium as one of the best model in industry....

The thing I really can't get my head around is why using medium-online via perplexity.labs everything seems to work just fine