#💬│general | Perplexity | Page 55

valid yoke May 24, 2024, 6:55 PM

#

Definitely not

#

I’m on the free version

#

#

Says last month

#

Ah this is why. They’re showing it for a few queries for free and then trying to make you pay

sleek vortex May 24, 2024, 7:00 PM

#

valid yoke I’m on the free version

oh

#

yeah no they made web and things free

#

for like 5 queries per day

valid yoke May 24, 2024, 7:00 PM

#

Sucks

warm cave May 24, 2024, 7:00 PM

#

Oh, yeah it's been in beta on ChatGPT Plus for ages, and it left beta on ChatGPT Plus maybe a month or two ago, and I guess now it's on free

#

But for limited usage 😦

fading moth May 24, 2024, 7:01 PM

#

Bleh

#

I'd love it if Opus had realtime stuff

#

https://tenor.com/view/aladdin-disney-cartoons-jasmine-ic-an-show-you-the-world-gif-4545341


warm cave May 24, 2024, 7:03 PM

#

It does, but only 50 times a day on perplexity 🥲

#

But yeah I agree, Claude pro with web would be great

fading moth May 24, 2024, 7:07 PM

#

Opus would see the news and just be like...

Know what? I'm out. Turn off this realtime feature. Nope.

warm cave May 24, 2024, 7:08 PM

#

Anyone try Golden Gate Claude? 😆

#

Screenshot_2024-05-24_at_12.08.35_PM.png

fading moth May 24, 2024, 7:11 PM

#

#

Speaking of Golden Gate

warm cave May 24, 2024, 7:12 PM

#

fading moth

Bro… noooo 🤣, bad google!

#

They don't need that type of encouragement 💀

fading moth May 24, 2024, 7:13 PM

#

#

Google suggests chlorine gas to clean your washing machine.

warm cave May 24, 2024, 7:13 PM

#

Screenshot_2024-05-24_at_12.13.17_PM.png

warm cave May 24, 2024, 7:13 PM

#

fading moth Google suggests chlorine gas to clean your washing machine.

This is bad, there are so many!

#

Why are they using gemma for this stuff?

#

I mean that is my theory, ain’t no way a Gemini model is this dumb, even with the bad rep they have, so has to be gemma 2b or 7b

fading moth May 24, 2024, 7:15 PM

#

#

John Adams going HAM

warm cave May 24, 2024, 7:15 PM

#

He is killing it 😂

fading moth May 24, 2024, 7:16 PM

#

#

#

Dog played 63 games in the NHL and scored 8 goals.

#

MVP

#

#

@warm cave I saved the best for last.

agile jay May 24, 2024, 7:18 PM

#

warm cave He is killing it 😂

Also is immortal or something...

fading moth May 24, 2024, 7:19 PM

#

A rock a day keeps the doctor away.

warm cave May 24, 2024, 7:19 PM

#

fading moth @warm cave I saved the best for last.

It is too much, i am dying 🤣

agile jay May 24, 2024, 7:20 PM

#

Lol, pretty sure it's getting all this garbage from reddit...

#

It believes that the reddit trolls are giving actually useful and accurate advice...

fading moth May 24, 2024, 7:20 PM

#

I dunno. The last one cited Berkeley.

warm cave May 24, 2024, 7:20 PM

#

I wonder if Brave is better or not???

Screenshot_2024-05-24_at_12.20.13_PM.png

agile jay May 24, 2024, 7:20 PM

#

And not just memeing.

warm cave May 24, 2024, 7:21 PM

#

I know that these things have been in the news so i cant ask the same things, but wondering

#

I think they use mistral or llama

agile jay May 24, 2024, 7:21 PM

#

And likely finetuned.

warm cave May 24, 2024, 7:21 PM

#

agile jay And likely finetuned.

Yeah

fading moth May 24, 2024, 7:21 PM

#

I thought the AI Overview used Gemini?

#

:x

warm cave May 24, 2024, 7:22 PM

#

It has to use Gemma, no way Gemini is saying this stuff

agile jay May 24, 2024, 7:22 PM

#

On google, likely. On brave, unlikely.

fading moth May 24, 2024, 7:22 PM

#

Interesting~ but Chrome/Firefox are by far the most popular browsers.

agile jay May 24, 2024, 7:23 PM

#

Brave is literally a chromium browser...

fading moth May 24, 2024, 7:23 PM

#

Yet the results are different

agile jay May 24, 2024, 7:23 PM

#

It's the better chrome, more or less.

warm cave May 24, 2024, 7:23 PM

#

plus imagine the cost of running Gemini 1.0 or 1.5 for every search across all of google, even flash would be crazy expensive

fading moth May 24, 2024, 7:23 PM

#

Exactly

#

Better Chrome, but it isn't Chrome

agile jay May 24, 2024, 7:23 PM

#

fading moth Yet the results are different

Yes, browser and search engine are different things.

#

On brave you can choose to use google search or brave search.

fading moth May 24, 2024, 7:24 PM

#

Que? Interesting.

agile jay May 24, 2024, 7:24 PM

#

A lot of people are likely to leave chrome next month...

warm cave May 24, 2024, 7:24 PM

#

I have the system prompt: https://discord.com/channels/1047197230748151888/1243018691822751754

fading moth May 24, 2024, 7:25 PM

#

Brave search is new to me. I knew you could use different search engines like duckduckgo or bing or Yahoo, etc. but Brave is new to me.

agile jay May 24, 2024, 7:25 PM

#

Brave also has the best default privacy settings too, from the top browsers.

fading moth May 24, 2024, 7:26 PM

#

I have an extension in Chrome that hides the AI Overview crap :x

warm cave May 24, 2024, 7:26 PM

#

Yeah, I don't see why to use chrome anyways, use thorium or brave. Same benefits as chrome but without a lot of the downsides

agile jay May 24, 2024, 7:26 PM

#

All existing extensions will need to be migrated to V3 by June 2024 to continue working. Beginning June 2024, Chrome will gradually disable Manifest V2 extensions running in the browser.

warm cave May 24, 2024, 7:26 PM

#

Here is the URL thingy: https://search.brave.com/search?q=%s&source=desktop

Brave Search

Search the web privately…

agile jay May 24, 2024, 7:26 PM

#

That is pretty much gonna cause an exodus

fading moth May 24, 2024, 7:26 PM

#

I've had compatibility issues in the past when brave was in its infancy, but it has clearly improved

warm cave May 24, 2024, 7:27 PM

#

idk what you call it, but that is how i set it up as one of my search engines

agile jay May 24, 2024, 7:27 PM

#

The quick search function

warm cave May 24, 2024, 7:27 PM

#

fading moth I've had compatibility issues in the past when brave was in its infancy, but it ...

yeah, it has gotten quite good

warm cave May 24, 2024, 7:27 PM

#

agile jay The quick search function

yeah

agile jay May 24, 2024, 7:27 PM

#

@bs how do I eat rocks?

warm cave May 24, 2024, 7:28 PM

#

Screenshot_2024-05-24_at_12.28.30_PM.png

fading moth May 24, 2024, 7:28 PM

#

If the website compatibility problems are resolved (this was a while ago) then I might as well uninstall Chrome.

warm cave May 24, 2024, 7:29 PM

#

I tend to prefer Brave over Duck, but both are good imo

agile jay May 24, 2024, 7:29 PM

#

fading moth If the website compatibility problems are resolved (this was a while ago) then I...

website compatibility problems
Which ones?

#

Yep, definitely wouldn't be using vanilla chrome

fading moth May 24, 2024, 7:29 PM

#

It was years ago, years

#

Like early 2018~

agile jay May 24, 2024, 7:30 PM

#

Especially with manifest v3 being mandatory next month

fading moth May 24, 2024, 7:31 PM

#

Tbf, I've got tampermonkey scripts out the wazoo and extensions with similar purposes.

#

I don't go in raw, so to speak.

agile jay May 24, 2024, 7:31 PM

#

Tampermonkey will not work as well with manifest v3

fading moth May 24, 2024, 7:31 PM

#

Zat is a problem for moi

#

I love my scripts

#

I can barely install 3rd party extensions anymore in Chrome as it is.

#

Do either of you know a place where I can see some benchmarks just for ghits and siggles?

#

Given that they are both chromium I don't expect notable differences, but I do know Chrome eats RAM like a meth addict.

warm cave May 24, 2024, 7:36 PM

#

https://tenor.com/view/google-chrome-pacman-eating-gif-13756279


agile jay May 24, 2024, 7:36 PM

#

I imagine just not having a lot of ads and trackers would make pages load faster.

fading moth May 24, 2024, 7:38 PM

#

warm cave May 24, 2024, 7:38 PM

#

I reset my computer after each semester ends so its not very high overall rn, but this ticks up fast, lol

Screenshot_2024-05-24_at_12.38.03_PM.png

fading moth May 24, 2024, 7:39 PM

#

Neat

agile jay May 24, 2024, 7:39 PM

#

Is it me or does perplexity eat up more memory per page, now than it did in the past?

fading moth May 24, 2024, 7:41 PM

#

Maybe?

agile jay May 24, 2024, 7:41 PM

#

It's a shame that windows isn't containerised from the ground up, so it would be easier to clean up after applications.

agile jay May 24, 2024, 7:41 PM

#

fading moth Maybe?

Each page is taking around 220MB

#

When I hover a tab, it shows the ram being used.

#

The downside to the modern web, lol

#

Would prefer it if it leaned more heavily on wasm.

warm cave May 24, 2024, 7:48 PM

#

Did Google take down the Answers by AI? seems to not have any

fading moth May 24, 2024, 7:49 PM

#

warm cave Did Google take down the Answers by AI? seems to not have any

Maybe due to the whole... Jump off a bridge and eat rocks thing?

agile jay May 24, 2024, 7:54 PM

#

You would think they would test it first, lol

warm cave May 24, 2024, 7:59 PM

#

fading moth Maybe due to the whole... Jump off a bridge and eat rocks thing?

Really 🫠, why would google do that lol

warm cave May 24, 2024, 7:59 PM

#

agile jay You would think they would test it first, lol

Yeah, I guess not

#

I think brave uses mixtral for AI answers, which is good, its small and cheap, but smart and unlikely to tell you to un alive yourself, lol

sleek vortex May 24, 2024, 8:02 PM

#

phi looks really interesting

#

the new one

agile jay May 24, 2024, 8:02 PM

#

phi 3

sleek vortex May 24, 2024, 8:02 PM

#

but i don’t have anything to run it on

#

i could run it on modal like i’ve been working on

#

i ram llama 3 on modal and got like 4.5k token throughput

#

ran*

#

per second

warm cave May 24, 2024, 8:02 PM

#

sleek vortex phi looks really interesting

I tested phi 3, put it in the same situation where Gemma told you to eat glue, and Phi 3 did not

sleek vortex May 24, 2024, 8:02 PM

#

warm cave I tested phi 3, put it in the same situation as Gemma told you to eat glue and P...

phi3 is way more regulated

agile jay May 24, 2024, 8:03 PM

#

Just get a qualcomm laptop in a month's time.

sleek vortex May 24, 2024, 8:03 PM

#

when i say meow it says like sorry as an ai assistant

#

like …

#

over alignment at some point

agile jay May 24, 2024, 8:03 PM

#

Lol, it doesn't speak to cats

sleek vortex May 24, 2024, 8:03 PM

#

agile jay Just get a qualcomm laptop in a month's time.

I’m thinking about scale and not local

#

The perplexity clone idea lol

#

I’m thinking about how to build it, just for fun really

agile jay May 24, 2024, 8:04 PM

#

For scale, isn't it better to just use an API for now.

warm cave May 24, 2024, 8:04 PM

#

Cats are inferior life forms, it's Human>AI>Cats, AI is looking to move up the pole tho

sleek vortex May 24, 2024, 8:04 PM

#

😦

sleek vortex May 24, 2024, 8:04 PM

#

agile jay For scale, isn't it better to just use an API for now.

Yeah but what api will let me hit like 25 requests to summarise sources

#

In one second

fading moth May 24, 2024, 8:05 PM

#

PU_peeporeWide

sleek vortex May 24, 2024, 8:05 PM

#

On free tier or L1 paid tier

warm cave May 24, 2024, 8:05 PM

#

fading moth PU_peeporeWide

Time to ditch chrome, there is no benefits lol

sleek vortex May 24, 2024, 8:05 PM

#

I’ve used arc for around a year now

#

There are negative performance benefits

#

Really only using for the aesthetic

agile jay May 24, 2024, 8:06 PM

#

Yep, the arc UI on windows doesn't work as well.

sleek vortex May 24, 2024, 8:06 PM

#

Sat for hours coding so i’ve come to take a walk lmao

#

Currently in a park in idk where

agile jay May 24, 2024, 8:07 PM

#

So you are lost?

sleek vortex May 24, 2024, 8:07 PM

#

Well no

warm cave May 24, 2024, 8:07 PM

#

Lol, at least you have your phone

agile jay May 24, 2024, 8:07 PM

#

Code outside, problem solved.

sleek vortex May 24, 2024, 8:07 PM

#

Why do they put the cut grass back into the grass

#

It just seems so stupid to me

desert latch May 24, 2024, 8:08 PM

#

sleek vortex I’m thinking about how to build it, just for fun really

There's a bunch of perplexity clones now, but the open-source one of them (omniplex) just closed down for good. There's also like correkt ai which isnt open source

agile jay May 24, 2024, 8:08 PM

#

I normally code outside between 10-14, if the weather is good.

sleek vortex May 24, 2024, 8:08 PM

#

desert latch There's a bunch of perplexity clones now, but the open-source one of them (omnip...

I’m trying to make something slightly unique

warm cave May 24, 2024, 8:08 PM

#

Open replit on your phone, you get a keyboard you can reach every key on with one finger, that is productivity🌟

sleek vortex May 24, 2024, 8:08 PM

#

Probably seen my efforts

desert latch May 24, 2024, 8:08 PM

#

sleek vortex I’m trying to make something slightly unique

Oh? What would it be?

sleek vortex May 24, 2024, 8:08 PM

#

started work today on code interpreter

#

progress is slow

#

but promising?

agile jay May 24, 2024, 8:09 PM

#

Making code interpreter for one language is easy.

sleek vortex May 24, 2024, 8:09 PM

#

I’m thinking of an architecture for how one could index the most trending parts of the web

agile jay May 24, 2024, 8:09 PM

#

Making it support more is when it gets challenging.

sleek vortex May 24, 2024, 8:09 PM

#

agile jay Making code interpreter for one language is easy.

Well my approach could work with anything that works on a linux vm

#

And maybe a fine tune or n-shot prompt of the model

inland bear May 24, 2024, 8:09 PM

#

A friend of mine, who subscribes to GPT Plus, would really like to try Perplexity PRO (which I talk about so much). Is there any way for her to start a free trial? I'm sure she'll end up switching to it as well. Thank you.

agile jay May 24, 2024, 8:09 PM

#

Yep, but cost wise, running a VM is not generally the best.

sleek vortex May 24, 2024, 8:09 PM

#

inland bear A friend of mine, who subscribes to GPT Plus, would really like to try Perplexit...

7 Day trial on the yearly

sleek vortex May 24, 2024, 8:09 PM

#

agile jay Yep, but cost wise, running a VM is not generally the best.

It’s a serverless VM

#

Only billed for cpu core usage by minute

agile jay May 24, 2024, 8:10 PM

#

inland bear A friend of mine, who subscribes to GPT Plus, would really like to try Perplexit...

I have a month free trial that I won ages ago

sleek vortex May 24, 2024, 8:10 PM

#

So if you don’t do anything you don’t get charged anything

sleek vortex May 24, 2024, 8:12 PM

#

sleek vortex I’m thinking of an architecture for how one could index the most trending parts ...

what i’m thinking about is how to store huge amounts of embeddings with no money

inland bear May 24, 2024, 8:12 PM

#

sleek vortex 7 Day trial on the yearly

Thanks a lot!

agile jay May 24, 2024, 8:12 PM

#

Embeddings of the sites?

sleek vortex May 24, 2024, 8:12 PM

#

I was thinking like

#

Either I run a GPU instance on a vm somewhere so fast that i can embed sites and pick the best snippets in real-time

#

But if lots of queries are about the same thing from users then maybe i can store the most like popular 1% of searches or something

#

As already embedded

#

Maintaining like a hot cache of the like extreme top of the web

agile jay May 24, 2024, 8:13 PM

#

So caching the content of the links?

sleek vortex May 24, 2024, 8:13 PM

#

sleek vortex Either I run a GPU instance on a vm somewhere so fast that i can embed sites and...

So deep dive queries might take longer to run this

sleek vortex May 24, 2024, 8:14 PM

#

agile jay So caching the content of the links?

Yeah
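
The "hot cache" idea above could be sketched roughly like this. It is only an illustration: the class name `HotCache`, the size limit, and the one-hour time-to-live are assumptions, not anything from the chat, and a real service would keep this in an external store rather than in process memory.

```python
import time
from collections import OrderedDict


class HotCache:
    """Tiny LRU cache with a time-to-live, keyed by normalized query text."""

    def __init__(self, max_entries=1000, ttl_seconds=3600, clock=time.monotonic):
        self.max_entries = max_entries
        self.ttl_seconds = ttl_seconds
        self.clock = clock  # injectable so expiry can be tested without sleeping
        self._entries = OrderedDict()  # key -> (stored_at, value)

    @staticmethod
    def _key(query):
        # Lowercase and collapse whitespace so near-identical queries share an entry.
        return " ".join(query.lower().split())

    def get(self, query):
        key = self._key(query)
        item = self._entries.get(key)
        if item is None:
            return None
        stored_at, value = item
        if self.clock() - stored_at > self.ttl_seconds:
            del self._entries[key]  # entry is stale, drop it
            return None
        self._entries.move_to_end(key)  # mark as most recently used
        return value

    def put(self, query, value):
        key = self._key(query)
        self._entries[key] = (self.clock(), value)
        self._entries.move_to_end(key)
        while len(self._entries) > self.max_entries:
            self._entries.popitem(last=False)  # evict the least recently used entry
```

Popular queries stay in the cache because every hit moves them to the "recent" end, while rare deep-dive queries fall out and take the slower path.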

desert latch May 24, 2024, 8:14 PM

#

You can try a mongodb vector db or maybe chromadb or something? But these cost money

sleek vortex May 24, 2024, 8:14 PM

#

You saw yesterday with the like sources right

#

Sending the whole thing to the model is expensive

#

so i’m considering ways to pick relevant bits

#

embedding search is one way

agile jay May 24, 2024, 8:14 PM

#

Yep, embeddings is the most popular method.
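
As a rough sketch of what "embedding search" means here: score each snippet by cosine similarity to the query vector and keep the best few. The vectors are assumed to come from some embedding model that is not shown, and the function names are made up for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat an all-zero vector as unrelated to everything
    return dot / (norm_a * norm_b)


def top_k_snippets(query_vec, snippets, k=3):
    """snippets is a list of (text, vector); return the k most similar texts."""
    scored = [(cosine_similarity(query_vec, vec), text) for text, vec in snippets]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [text for _, text in scored[:k]]
```

Only the selected snippets would then be sent to the model, instead of the whole page.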

sleek vortex May 24, 2024, 8:15 PM

#

I was wondering if i could run embedding on modal too

#

on like a T4 instance

agile jay May 24, 2024, 8:15 PM

#

I would probably just use llama 3 8B, on groq enterprise.

sleek vortex May 24, 2024, 8:15 PM

#

Maybe i could get the startup credits grant from them lol

agile jay May 24, 2024, 8:15 PM

#

But would cost money

sleek vortex May 24, 2024, 8:16 PM

#

agile jay I would probably just use llama 3 8B, on groq enterprise.

Waiting for the day groq adds embeddings ngl

#

They’re doing whisper in private beta

agile jay May 24, 2024, 8:16 PM

#

Yep, their inference is crazy

sleek vortex May 24, 2024, 8:16 PM

#

Well what’s my actual goal to be honest

#

Make a search engine AI startup but with literally 0 starting budget?

#

Not 1000% sure what i’m trying to do myself honestly

agile jay May 24, 2024, 8:17 PM

#

I would just cache the content of the links, use llama 3 8B for choosing what to use as context, and use llama 3 70B for writing the answer.

sleek vortex May 24, 2024, 8:18 PM

#

Answer i might offer better models like haiku and things too if the service got enough traction

sleek vortex May 24, 2024, 8:19 PM

#

agile jay I would just cache the content of the links, use llama 3 8B for choosing what to ...

well i might try this

#

I was kinda interested to see how phi would perform though

agile jay May 24, 2024, 8:19 PM

#

Downside to haiku is that it's pretty slow and can't be hosted on groq.

sleek vortex May 24, 2024, 8:19 PM

#

Maybe i could run that on a T4 gpu

sleek vortex May 24, 2024, 8:19 PM

#

agile jay Downside to haiku is that it's pretty slow and can't be hosted on groq.

Haikus fast

#

It isn’t 900TPS fast but it’s fast enough

agile jay May 24, 2024, 8:19 PM

#

Not compared to groq speeds...

sleek vortex May 24, 2024, 8:19 PM

#

yeah but do we need groq speeds when the previous whole chain will take like how many seconds anyway

agile jay May 24, 2024, 8:20 PM

#

Also I imagine the rate limit is higher.

sleek vortex May 24, 2024, 8:20 PM

#

Search, query, embedding, so on

agile jay May 24, 2024, 8:20 PM

#

Depends on how many sites you pre-scrape for indexing.

maiden osprey May 24, 2024, 8:20 PM

#

Hey Devs, RevelantAI is looking for you!

These are our products:

an AI Agent text voice and video supported like https://www.digitalhumans.com/
outbound agent voice support be able to
latency for AI voice agent should be around 8000ms
automating workflows
you should be able to other things like AI Persona too

send me your resume and portfolio and we'll set up an interview

UneeQ Digital Humans

UneeQ AI Digital Humans Technology enables businesses to design, develop and deploy AI-powered digital humans to improve the experiences of customers, staff or patients.

agile jay May 24, 2024, 8:21 PM

#

The longest part currently seems to be the initial search.

sleek vortex May 24, 2024, 8:21 PM

#

agile jay Depends on how many sites you pre-scrape for indexing.

Yeah no clue yet

#

The main worry for that is storage

sleek vortex May 24, 2024, 8:21 PM

#

agile jay The longest part currently seems to be the initial search.

Also probably going to increase

#

Because right now i was using a trial of some paid api

warm cave May 24, 2024, 8:21 PM

#

There is also Gemini 1.5, I think the price is good

sleek vortex May 24, 2024, 8:21 PM

#

For the thing i’d be going back to manually scraping bing

warm cave May 24, 2024, 8:21 PM

#

Flash I mean

sleek vortex May 24, 2024, 8:21 PM

#

warm cave There is also Gemini 1.5, I think the price is good

per Mtok?

agile jay May 24, 2024, 8:22 PM

#

I think the easiest method would be crawling popular sites.

#

And then just recrawling from the home page every day.

warm cave May 24, 2024, 8:22 PM

#

sleek vortex per Mtok?

Yeah

agile jay May 24, 2024, 8:22 PM

#

You simply make a request to the site, get a list of the links, and follow the ones you haven't seen before.
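
The crawl loop described above (request a page, collect its links, follow the ones not seen before) might look something like this. It is a sketch under assumptions: `fetch` is a hypothetical helper that returns HTML for a URL, and a real crawler would also stay on the target hosts, respect robots.txt, and rate-limit itself.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(start_url, fetch, max_pages=50):
    """Breadth-first crawl; fetch(url) is a hypothetical helper returning HTML text."""
    seen = {start_url}
    queue = deque([start_url])
    pages = {}
    while queue and len(pages) < max_pages:
        url = queue.popleft()
        html = fetch(url)
        pages[url] = html
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and drop #fragments so pages are not revisited.
            absolute, _fragment = urldefrag(urljoin(url, href))
            if absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
    return pages
```

Re-crawling from the home page every day would amount to calling `crawl` again and diffing the result against what is already stored.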

sleek vortex May 24, 2024, 8:22 PM

#

Yeah but what i’m not doing is indexing fully myself as i wouldn’t have the storage to store all that really

#

idk

#

it’s a hard challenge

agile jay May 24, 2024, 8:23 PM

#

String compression for the actual content of the pages?

sleek vortex May 24, 2024, 8:23 PM

#

If i can make the search bit really fast then

agile jay May 24, 2024, 8:23 PM

#

And after converting the page into markdown.
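
A minimal sketch of the compression idea, assuming the page has already been converted to markdown text by some other step. It uses the standard-library `zlib`; the function names are illustrative only.

```python
import zlib


def pack_page(markdown_text):
    """Compress markdown text to bytes for storage."""
    return zlib.compress(markdown_text.encode("utf-8"), level=9)


def unpack_page(blob):
    """Restore the original markdown text from stored bytes."""
    return zlib.decompress(blob).decode("utf-8")
```

How much this saves depends on the content; repetitive text shrinks a lot, short pages barely at all.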

sleek vortex May 24, 2024, 8:23 PM

#

Do you think i should focus on this thing first or the code interpreter

agile jay May 24, 2024, 8:23 PM

#

Which one is more vital?

sleek vortex May 24, 2024, 8:23 PM

#

Well there’s no rush

#

But maybe the main search thing is probably a better thing to think of first?

#

So first i’d be making a meta search engine basically

#

Pull queries off google and bing and whatever else

#

I could cache search engine results

#

That would be viable and probably increase speed a large amount

agile jay May 24, 2024, 8:25 PM

#

Yep, you could also just use what you currently have to answer the current user, and then behind the scenes, add a future search to a queue and cache the links you don't have.

#

That way you can keep the service fast, while improving it behind the scenes.

#

Storage is likely gonna be your largest cost.
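
The "answer now, fill the cache later" suggestion could be sketched as below. Everything here is an assumption for illustration: the class name, the in-memory dict and queue, and the hypothetical `fetch_page` helper. A real service would run the background step in a separate worker.

```python
from collections import deque


class SearchService:
    def __init__(self, fetch_page):
        self.fetch_page = fetch_page  # hypothetical helper: url -> page text
        self.page_cache = {}          # url -> page text
        self.pending = deque()        # urls to fetch behind the scenes

    def answer_context(self, result_urls):
        """Return the pages already cached; queue the missing ones for later."""
        context = []
        for url in result_urls:
            if url in self.page_cache:
                context.append(self.page_cache[url])
            elif url not in self.pending:
                self.pending.append(url)
        return context

    def run_background_step(self, batch_size=10):
        """Fetch up to batch_size queued URLs; meant to be called by a worker or timer."""
        for _ in range(min(batch_size, len(self.pending))):
            url = self.pending.popleft()
            self.page_cache[url] = self.fetch_page(url)
```

The current user gets an answer from whatever is cached, and the next user asking something similar gets the fuller context.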

warm cave May 24, 2024, 8:29 PM

#

Discord is free 😆 https://www.bing.com/videos/search?q=storing+files+in+discord&&view=detail&mid=7AA0628FD494AF90DDAB7AA0628FD494AF90DDAB&&FORM=VRDGAR&PC=1VIV

agile jay May 24, 2024, 8:29 PM

#

Lol, what happens if they suddenly decide to delete them?

#

Back to square one...

sleek vortex May 24, 2024, 8:30 PM

#

Discord recently made the files timed links or something

agile jay May 24, 2024, 8:31 PM

#

Does amazon still do unlimited image storage?

#

Convert the embeddings into an image and just store it, lol

#

Just make sure that they don't compress them or something...

wheat tartan May 24, 2024, 9:20 PM

#

Are there any instructions in the API documentation to truncate the response so that it doesn’t consume so many tokens?

warm cave May 24, 2024, 9:32 PM

#

wheat tartan Are there any instructions in the API documentation to truncate the response so ...

Is this what you are looking for? https://docs.perplexity.ai/reference/post_chat_completions

pplx-api

Chat Completions

Generates a model's response for the given chat conversation.

agile jay May 24, 2024, 9:36 PM

#

wheat tartan Are there any instructions in the API documentation to truncate the response so ...

Are you talking about perplexity API?

sleek vortex May 24, 2024, 9:36 PM

#

pineconedb looks interesting

#

#

wonder if i could store my link/snippet cache in this

agile jay May 24, 2024, 9:37 PM

#

Yep, it's pretty popular in the vector DB space.

sleek vortex May 24, 2024, 9:37 PM

#

Sadly not open source

#

would be more lock in

#

which im sort of trying to avoid but not "really"

agile jay May 24, 2024, 9:37 PM

#

There's probably an open-source alternative.

sleek vortex May 24, 2024, 9:38 PM

#

#

#

my own bing scraper is bringing results back in 371ms?

#

maybe i can scrape google

wheat tartan May 24, 2024, 9:45 PM

#

agile jay Are you talking about perplexity API?

Yes

agile jay May 24, 2024, 9:53 PM

#

wheat tartan Yes

Use it through the vercel API instead.

#

It's documented a lot better.

sleek vortex May 24, 2024, 9:54 PM

#

agile jay Use it through the vercel API instead.

frankly i think this vercel api code is a mess lol

#

well at least the code i've written for code interpreter so far

#

the main issue i had

agile jay May 24, 2024, 9:54 PM

#

Probably because you don't know how to read docs, lol

sleek vortex May 24, 2024, 9:54 PM

#

nonono

#

is that the library doesnt support stop tokens

#

i had to go and clone it and add that in

#

they just dont have that as an option

sleek vortex May 24, 2024, 9:55 PM

#

agile jay Probably because you don't know how to read docs, lol

well as in i got it to work fine so thats not the main issue

sleek vortex May 24, 2024, 9:55 PM

#

sleek vortex is that the library doesnt support stop tokens

mainly this lol

warm cave May 24, 2024, 9:55 PM

#

I mean changing max tokens is not bad:

```
import os

import requests

url = "https://api.perplexity.ai/chat/completions"

payload = {
    "model": "llama-3-sonar-small-32k-online",
    "messages": [
        {
            "content": "string",  # placeholder from the docs; put your prompt here
            "role": "system"
        }
    ],
    "max_tokens": 100,  # caps how many tokens the response can use
    "temperature": 0.2,
    "top_p": 0.9,
    "return_citations": False,
    "return_images": False,
    "top_k": 0,
    "stream": False,
    "presence_penalty": 0,
    "frequency_penalty": 1
}
headers = {
    "accept": "application/json",
    "content-type": "application/json",
    # Assumption: your API key is stored in a PPLX_API_KEY environment variable
    "authorization": f"Bearer {os.environ['PPLX_API_KEY']}"
}

response = requests.post(url, json=payload, headers=headers)

print(response.text)
```

sleek vortex May 24, 2024, 9:56 PM

#

https://github.com/vercel/ai/issues/1685

GitHub

Stop tokens? · Issue #1685 · vercel/ai

Feature Description Custom stop tokens - a feature in OpenAI and other providers such as Groq is just missing from the custom options for models in the SDK. Can we please have this feature? Use Cas...

#

lol

#

well it wasnt hard to add

#

so thats fine

agile jay May 24, 2024, 9:57 PM

#

Yep, depends if you are supposed to compose it yourself. The SDK is mostly just an API wrapper.

#

I'm used to making stuff myself, since I mainly use Go.

sleek vortex May 24, 2024, 9:57 PM

#

I'm debating just rewriting without nextjs and without vercel ai

#

Might take a day but might be worth it

agile jay May 24, 2024, 9:58 PM

#

Next is pretty heavy.

sleek vortex May 24, 2024, 9:58 PM

#

yeah

#

i dont think it's really needed

#

idk i just picked it in a flash so i could try this with RSC and things

agile jay May 24, 2024, 10:00 PM

#

Yep, and react itself is bloat, lol.

#

Don't know why people still use it.

sleek vortex May 24, 2024, 10:00 PM

#

what do you think i should try

#

for this project

#

i picked shadcn and whatever only to quickly prototype

#

learning a new frontend something could be interesting

agile jay May 24, 2024, 10:01 PM

#

Depends what you want your UI to do.

sleek vortex May 24, 2024, 10:01 PM

#

ok think uh

#

building a ui like perplexity's

#

what would you pick

#

as stack

#

(tailwind css is required)

agile jay May 24, 2024, 10:02 PM

#

sleek vortex (tailwind css is required)

Obviously

#

Maybe svelte. It's supposed to be good for animations etc.

sleek vortex May 24, 2024, 10:02 PM

#

i've never used svelte actually

chilly nova May 24, 2024, 10:11 PM

#

sleek vortex

garnet doesn't do that even tho it's redis-compatible?

#

also is there a comparison on which of those is better?

#

i would expect the dedicated ones

sleek vortex May 24, 2024, 10:16 PM

#

https://huggingface.co/avsolatorio/NoInstruct-small-Embedding-v0

avsolatorio/NoInstruct-small-Embedding-v0 · Hugging Face

#

interesting

#

#37 on the MTEB leaderboard

#

but its only a 33M model

#

(filtering >250M)

ionic stone May 24, 2024, 10:29 PM

#

Phi-3 medium
New kid on the block

warm cave May 24, 2024, 11:14 PM

#

Its worse now, lol: https://www.youtube.com/watch?v=zLvFc_24vSM

YouTube

Coffeezilla

Rabbit Gaslit Me, So I Dug Deeper

Is the LAM a Scam? Down the rabbit hole we go.

Support Investigative Journalism:
► Patreon: https://patreon.com/coffeezilla

People who helped this investigation:
►Emily Shepherd: https://twitter.com/ShyTsarina
►Eva: https://twitter.com/xyz3va
►Marcel: https://twitter.com/MarcelD505
►Rithwik Jayasimha: https://twitter.com/thel3l
►jj: https://...


sleek vortex May 24, 2024, 11:14 PM

#

yeah

#

seen that

#

but tldr lam is fake

#

and the company denies that it is

warm cave May 24, 2024, 11:15 PM

#

yeah, they 100% lied, not just that they are taking longer to make the LAM work well, its straight up not what they advertised

#

I got pessimistic back when they did a video with Matthew Berman where they said something along the lines of the LAM not being a Transformer or any other popular architecture, because that would be too expensive to train.

sleek vortex May 24, 2024, 11:30 PM

#

apple had some mobile ui model

#

looked like a real lam early beginnings

#

#

...

#

bing does not like datacenter ips

agile jay May 25, 2024, 12:00 AM

#

sleek vortex

You tried rotating them?

sleek vortex May 25, 2024, 12:03 AM

#

agile jay You tried rotating them?

well this was cloudflare worker

#

cant rotate that

#

yahoo works?

#

yahoo is just bing too

#

could use that???

agile jay May 25, 2024, 12:04 AM

#

Did you try comparing them?

sleek vortex May 25, 2024, 12:05 AM

#

ok nevermind

#

seems like both bing and yahoo send rubbish results to datacenter ips

agile jay May 25, 2024, 12:07 AM

#

Most likely those sites are also using cloudflare protection...

sleek vortex May 25, 2024, 12:09 AM

#

likely their own

#

not cloudflare

#

whyd ms use cloudflare lol

agile jay May 25, 2024, 12:09 AM

#

Because they can.

sleek vortex May 25, 2024, 12:09 AM

#

but they have their own infra

agile jay May 25, 2024, 12:10 AM

#

Maybe you need to make your worker make a request to your own server, to get a proxy to use, before making a request to bing?

sleek vortex May 25, 2024, 12:10 AM

#

The worker was meant to be the cloud proxy

#

so im just going to try move it

#

to a different host

#

testing that right now - does modal get blocked

agile jay May 25, 2024, 12:10 AM

#

One which is less likely to get recognised by bing.

#

oh, just curling it

sleek vortex May 25, 2024, 12:11 AM

#

works on local

#

parsed it and i got results fine

#

📎 message.txt

#

meaning its like 94% about your ip

#

useragent this that doesnt matter

agile jay May 25, 2024, 12:12 AM

#

nice

#

Oh, doesn't block the curl user agent?

#

Do you need all of those query parameters in the URL too?

sleek vortex May 25, 2024, 12:14 AM

#

seemed to work

sleek vortex May 25, 2024, 12:14 AM

#

agile jay Do you need all of those query parameters in the URL too?

nope

#

📎 message.txt

#

results from curl on modal

agile jay May 25, 2024, 12:15 AM

#

most of them seem to be set to 0, so most likely remove the ones that are not needed to make your url easier to manage.

sleek vortex May 25, 2024, 12:15 AM

#

yeah in code its just &q

#

and like &cc=gb for region or something

#

i copied the url off my browser for the curl

agile jay May 25, 2024, 12:15 AM

#

country code, etc
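
A small client for building the search URL, as suggested in the chat, might be as simple as this. The base URL and the helper name are assumptions; only the `q` and `cc` parameters come from the conversation.

```python
from urllib.parse import urlencode


def build_search_url(query, country_code=None, base="https://www.bing.com/search"):
    """Build a search URL with only the parameters that are actually needed."""
    params = {"q": query}
    if country_code:
        params["cc"] = country_code  # region hint, e.g. "gb"
    return base + "?" + urlencode(params)
```

`urlencode` handles the escaping, so queries with spaces or punctuation do not need to be quoted by hand.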

sleek vortex May 25, 2024, 12:16 AM

#

sleek vortex results from curl on modal

well this also means i have to rewrite my code into python

#

hmm

agile jay May 25, 2024, 12:16 AM

#

I guess you can make a client, to make it easier to create the url.

sleek vortex May 25, 2024, 12:16 AM

#

do you think its worth doing the html parsing in rust

#

might be like

#

0.05s faster than bs4

#

and then for the search endpoint i can spawn the rust binary instead

agile jay May 25, 2024, 12:16 AM

#

Would be easier to use Go.

sleek vortex May 25, 2024, 12:16 AM

#

idk any go

agile jay May 25, 2024, 12:16 AM

#

And quicker to compile.

sleek vortex May 25, 2024, 12:16 AM

#

i know rust well

#

well either way

#

both are faster than python

#

in theory

agile jay May 25, 2024, 12:17 AM

#

Better to stick to python, if you are gonna try filtering it using css selectors etc.

sleek vortex May 25, 2024, 12:17 AM

#

i already have the code for it

#

i just have to port it

#

from ts

agile jay May 25, 2024, 12:18 AM

#

You can get more performance if you compile the python using something like cython.

#

The only thing they got wrong when making rust is the function coloring.

#

It's really annoying.

#

The lifetimes also have their own coloring too.

sleek vortex May 25, 2024, 12:20 AM

#

change that in your ide?

agile jay May 25, 2024, 12:20 AM

#

Oh, function coloring is not actually to do with colors.

sleek vortex May 25, 2024, 12:20 AM

#

lets see if pplx can save me time

sleek vortex May 25, 2024, 12:20 AM

#

agile jay Oh, function coloring is not actually to do with colors.

oh

agile jay May 25, 2024, 12:20 AM

#

It's to do with stuff like, if you use async, it forces you to go async all the way down.

#

And lifetimes also do a similar thing.

sleek vortex May 25, 2024, 12:21 AM

#

ohh

agile jay May 25, 2024, 12:21 AM

#

Like they pollute the rest of the code to use them.
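
The same "coloring" effect exists in Python's `asyncio`, so here is a small sketch in Python rather than Rust. The function names and the fake network call are hypothetical; the point is only that once one function is `async`, every caller that wants its result has to be `async` too, or has to cross the boundary explicitly through an event loop.

```python
import asyncio


async def fetch_page(url: str) -> str:
    # Stand-in for real network I/O (hypothetical, no request is made).
    await asyncio.sleep(0)
    return f"<html>{url}</html>"


async def scrape(url: str) -> int:
    # Because fetch_page is async, scrape must also be async to await it.
    html = await fetch_page(url)
    return len(html)


def main() -> int:
    # A plain sync function cannot use `await`; it has to start an
    # event loop to call into the async "color".
    return asyncio.run(scrape("https://example.com"))


print(main())
```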

sleek vortex May 25, 2024, 12:21 AM

#

sleek vortex lets see if pplx can save me time

... and it's stuck on reading the file

#

so much for a new pro search lmao

agile jay May 25, 2024, 12:21 AM

#

Lol

sleek vortex May 25, 2024, 12:21 AM

#

used the bypass trick

#

edit the message instead

#

and it wont upload as a file

agile jay May 25, 2024, 12:22 AM

#

Oh, guess that helps.

#

I don't trust the current models to write good code though.

#

They are generally incompetent.

#

And end up wasting more time fixing the code, than writing it from scratch.

sleek vortex May 25, 2024, 12:23 AM

#

im just trying it yeah

#

probably going to sleep soon so

agile jay May 25, 2024, 12:23 AM

#

Yep, already 1:23am

sleek vortex May 25, 2024, 12:23 AM

#

yeah

#

you same timezone as me?

agile jay May 25, 2024, 12:23 AM

#

Yep, and also you said cc=gb

sleek vortex May 25, 2024, 12:23 AM

#

oh yeah

#

good catch

agile jay May 25, 2024, 12:24 AM

#

Yep, I have good memory

#

You gonna make BS4 using rust?

#

Looks like there is already a soup crate

sleek vortex May 25, 2024, 12:27 AM

#

Inspired by the Python library "BeautifulSoup," soup is a layer on top of html5ever that aims to provide a slightly different API for querying & manipulating HTML

agile jay May 25, 2024, 12:28 AM

#

You want a 1 to 1 translation?

sleek vortex May 25, 2024, 12:31 AM

#

gpt4 wrote like 90% usable code

#

going off that

#

i'll add in the parsing of bing tracking urls tomorrow but

#

going to do a quick speedtest

agile jay May 25, 2024, 12:33 AM

#

Oh, all in rust?

#

A lot of the latency can't be reduced, since it's waiting for api responses.

sleek vortex May 25, 2024, 12:35 AM

#

yeah

#

1.2s avg on my local internet

#

probably faster on the cloud server

#

not bothered to fix the cross compile of openssl right now, im going to sleep

#

Benchmark 1: ./target/release/rs-scraper
  Time (mean ± σ):      1.170 s ±  0.146 s    [User: 0.052 s, System: 0.022 s]
  Range (min … max):    0.982 s …  1.405 s    10 runs

agile jay May 25, 2024, 12:36 AM

#

Night fishy

sleek vortex May 25, 2024, 12:36 AM

#

vpn turned off

Benchmark 1: 1
  Time (mean ± σ):     869.6 ms ±  62.6 ms    [User: 50.4 ms, System: 20.3 ms]
  Range (min … max):   783.9 ms … 952.4 ms    10 runs

agile jay May 25, 2024, 12:38 AM

#

Just add good logging, so you can tell how much each part is taking.
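
One way to do this, sketched in Python with only the standard library. The stage names and the placeholder work inside each block are assumptions; in the real scraper they would wrap the HTTP request and the HTML parsing.

```python
import logging
import time
from contextlib import contextmanager

logging.basicConfig(level=logging.INFO, format="%(message)s")
log = logging.getLogger("scraper")

timings = {}  # stage name -> elapsed seconds


@contextmanager
def timed(stage: str):
    """Log and record how long the wrapped block takes."""
    start = time.perf_counter()
    try:
        yield
    finally:
        elapsed = time.perf_counter() - start
        timings[stage] = elapsed
        log.info("[%.3fs] %s", elapsed, stage)


with timed("fetch"):
    time.sleep(0.01)  # placeholder for the network request

with timed("parse"):
    sum(range(1000))  # placeholder for HTML parsing
```

With the benchmark above showing roughly 0.05 s of user time against about 1 s of wall time, per-stage timings like these would show how much of the total is spent waiting on the network rather than parsing.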

fossil sentinel May 25, 2024, 3:13 AM

#

https://youtu.be/F0S5hYRLytI

YouTube

Classic FM

Gershwin’s Rhapsody in Blue – Hayato Sumino FULL performance | Clas...

Rising star Japanese pianist Hayato Sumino takes on Gershwin's dreamy 'Rhapsody in Blue' in its 100th anniversary year for Classic FM Live with Viking.

Look out for Hayato’s self-duet on a melodica, and his genius improvisation on a ringtone...

Performed alongside the Royal Scottish National Orchestra under conductor, Ben Palmer, you can watch...


mortal snow May 25, 2024, 3:14 AM

#

hey

sweet jasper May 25, 2024, 5:28 AM

#

agile jay Just add good logging, so you can tell how much each part is taking.

You mean flamegraphs

#

😳

sweet jasper May 25, 2024, 5:30 AM

#

agile jay I don't trust the current models to write good code though.

Java, javascript, python get pretty Good benchmarks

#

At least on gpt4 et al.

#

Rust and Go being slightly lower, possibly due to not having as much training data

#

chilly forge May 25, 2024, 7:03 AM

#

sweet jasper Rust and Go being possibly slightly lower possibly due to not havin as much trai...

Opus does pretty well with them

tame current May 25, 2024, 7:42 AM

#

anyone know where we can find gpt-4o stacked up against these other llm's?

#

like in the chart above

#

found this on vellum, looks like all the boxes arent finished yet, but if anyone is using claude 3 opus for coding, might consider switching to gpt-4o

sweet jasper May 25, 2024, 7:59 AM

#

tame current found this on vellum, looks like all the boxes arent finished yet, but if anyone...

Claude opus and gpt4o are similar, you might also try yi-large from china

grand silo May 25, 2024, 8:08 AM

#

sweet jasper Claude opus and gpt4o are similar, you might also try yi-large from china

imo, gpt4o can be useful but opus generally provides the same information while rating higher ‘vibe’ wise.

sweet jasper May 25, 2024, 8:29 AM

#

Agreed

#

Syntactically sometimes i prefer opus

#

But 50 opus a day

#

Skull

sonic musk May 25, 2024, 8:40 AM

#

tame current found this on vellum, looks like all the boxes arent finished yet, but if anyone...

Try Yi Large for coding

shy robin May 25, 2024, 8:42 AM

#

Why do I have only 5 remaining daily uses of Opus since yesterday??? (yes 5 NOT 50)

#

We went from unlimited use of Opus, to 50 uses per day, and now we’re down to 5?

sleek vortex May 25, 2024, 8:58 AM

#

shy robin Why I have only 5 remaining daily usage of Opus since yesterday??? (yes 5 NOT 50...

When you use the model that usage comes back in 24 hours

#

so has it been 24 hours since your last 50 uses

#

That’s probably why

austere kestrel May 25, 2024, 10:54 AM

#

sleek vortex bing does not like datacenter ips

hitting bing.com repeatedly could become a bit of a pain (like they have an API.. and presumably want any third parties to use that if they want to get bing results programmatically ha)

#

could look into something open source like SearXNG
https://docs.searxng.org/dev/search_api.html

#

https://github.com/ItzCrazyKns/Perplexica/tree/master/searxng

tame current May 25, 2024, 10:59 AM

#

Why Perplexity doesn't have the ability to create a PDF File? I asked for a summary and now I want to have it in a PDF file well organized.

eager fossil May 25, 2024, 11:03 AM

#

tame current Why Perplexity doesn't have the ability to create a PDF File? I asked for a summ...

When you copy you can usually paste it with formatting.

tame current May 25, 2024, 11:08 AM

#

eager fossil When you copy you can *usually* paste it with formatting.

Yeah. I get it. But I just wanna know why it hasn't this ability built-in.

eager fossil May 25, 2024, 11:09 AM

#

tame current Yeah. I get it. But I just wanna know why it hasn't this ability built-in.

IDK, I also found out the other day it doesn't. Kinda gave me a lot more work, but o well.

austere kestrel May 25, 2024, 11:28 AM

#

tame current Why Perplexity doesn't have the ability to create a PDF File? I asked for a summ...

if the presentation / formatting doesn't matter, could go to Print, then save it as a PDF from there

#

but yeah pretty crude.. ain't pretty ha

tame current May 25, 2024, 11:49 AM

#

austere kestrel but yeah pretty crude.. ain't pretty ha

sadge

lyric pulsar May 25, 2024, 12:48 PM

#

I just downloaded it back and I gotta say it’s actually good

#

I deleted it cuz it was just weird

#

But now it’s goood

feral marlin May 25, 2024, 12:56 PM

#

tame current Why Perplexity doesn't have the ability to create a PDF File? I asked for a summ...

for stuff like that and having ai generate it, look into obsidianMD which is a lot more convenient + ai handles markdown better

agile jay May 25, 2024, 2:58 PM

#

sweet jasper

JS is pretty much out of date every 6 months...

livid mantle May 25, 2024, 3:15 PM

#

I have unsubscribed my subscription with Gemini Advanced 1.5 Pro again after realizing my financial crisis. I am sticking with Perplexity 50 Opus for $200/month for now.

sweet jasper May 25, 2024, 3:20 PM

#

agile jay JS is pretty much out of date every 6 months...

Eh for the newest frameworks mby

agile jay May 25, 2024, 3:20 PM

#

sweet jasper Eh for the newest frameworks mby

Lol, how often do they change?

#

React is pretty much changing every update nowadays...

sweet jasper May 25, 2024, 3:21 PM

#

Idk , i dont use frontend , only backend js + ts kek

#

Which is still bad ik

agile jay May 25, 2024, 3:22 PM

#

Yep, the dynamic types don't help...

#

I would only use JS backend if I was using it with a JS front end.

sweet jasper May 25, 2024, 3:22 PM

#

In any case maybe vue2 has plenty of training

agile jay May 25, 2024, 3:23 PM

#

With nuxt

sweet jasper May 25, 2024, 3:23 PM

#

Other frameworks with breaking changes every update (frontenders fav pastime) should just be avoided ig

agile jay May 25, 2024, 3:24 PM

#

Yep, and languages with the same problem.

sweet jasper May 25, 2024, 3:26 PM

#

Maybe we need to go back to java for max llm efficiency, java 8 fte

#

Ftw

agile jay May 25, 2024, 3:27 PM

#

Or just have separate coding LLM's which are updated very often.

sweet jasper May 25, 2024, 3:29 PM

#

You mean stuff like deepseek

agile jay May 25, 2024, 3:30 PM

#

Yep, stuff like deepseek and codellama, but updated every week.

mental widget May 25, 2024, 3:43 PM

#

Maybe a really stupid question but I am a member of perplexity and chatgpt. Does the option "KI Modell" in the settings really use exactly the ChatGPT modell (for example GPT-4o) without any limitation. Which means I could cancel the membership of chatgpt because the result should be the same ?

agile jay May 25, 2024, 3:50 PM

#

mental widget Maybe a really stupid question but I am a member of perplexity and chatgpt. Does...

It has the same language model. But not the other modalities.

You can try it in writing mode to see how similar it is.

mental widget May 25, 2024, 3:51 PM

#

For a beginner in the AI like me, do you mean that the database is the same but the usage (engine) how the database is interpreted is different ?

tame current May 25, 2024, 3:53 PM

#

mental widget For a beginner in the AI like me, do you mean that the database is the same but ...

you won't get Voice-to-Voice and image generation (not released in chatgpt yet either)

#

also, there is no database

mental widget May 25, 2024, 3:54 PM

#

Isn't LLM a database ?

tame current May 25, 2024, 3:54 PM

#

no, it predicts the next word

#

like auto-complete

mental widget May 25, 2024, 3:54 PM

#

I thought LLM is like a database

tame current May 25, 2024, 3:55 PM

#

it's kinda like your phone's keyboard suggestions, but much more advanced

#

I suggest reading about it, but anyway

mental widget May 25, 2024, 3:56 PM

#

Yes I need to read about it. I need such information in order to understand the logic behind.

tame current May 25, 2024, 3:57 PM

#

there is a limit iirc

#

600 per day

livid mantle May 25, 2024, 4:00 PM

#

nobody asked but I recently started to change my car oil every 5,000 miles, you ask why? because AI said so. And I hope it did not hallucinate those numbers.

tame current May 25, 2024, 4:01 PM

#

of course not

vapid onyx May 25, 2024, 4:08 PM

#

livid mantle nobody asked but I recently started to change my car oil every 5,000 miles, you ...

5000 miles is usually a good number to change oil at assuming you drive regularly.

ionic stone May 25, 2024, 4:09 PM

#

3000 for regular oil.
5000 for synthetic oil

proud patio May 25, 2024, 4:12 PM

#

Coffeezilla mentioned perplexity in his latest video. 🫡

livid mantle May 25, 2024, 4:25 PM

#

#

Perplexity educates you on the costs of your data.

vapid onyx May 25, 2024, 4:33 PM

#

livid mantle Perplexity educates you on the costs of your data.

Love how passwords cost more than a SSN

tame current May 25, 2024, 4:35 PM

#

what can SSN be used for ?

#

I mean if someone got it

warm cave May 25, 2024, 4:40 PM

#

A few things: https://www.perplexity.ai/search/Concisely-tell-me-eNdLBxlTSd6H_4H3aHSeRw#0

Perplexity AI

Concisely tell me why criminals by Social Security numbers

Criminals buy Social Security numbers (SSNs) because they are a key piece of personal information that can be used to commit various forms of identity theft and fraud. With an SSN, criminals can:

Open New Credit Accounts: They can apply for credit cards, loans, and other financial products in the victim's name, leading to significant financi...

fallen kindle May 25, 2024, 6:25 PM

#

Just a props to @tranquil orchid in general - of all the tools and gpts, and claudes etc - I get the best results just prompting in Perplexity

valid yoke May 25, 2024, 6:42 PM

#

https://www.microsoft.com/en-us/edge/copilot-for-social

Copilot for Telegram | Microsoft Copilot

Try Copilot for Telegram, your personal AI messaging companion. Text, create, translate, search, and more wherever you are with Copilot.

#

Big tech is starting to join telegram

tropic scarab May 25, 2024, 7:35 PM

#

It would be great, if Perplexity would allow the creation of collections within collections.

tame current May 25, 2024, 8:36 PM

#

you guys know the latency u get?

#

from perplexity

sleek vortex May 25, 2024, 8:56 PM

#

proud patio Coffeezilla mentioned perplexity in his latest video. 🫡

yeah because rabbit had partnership with them

proud patio May 25, 2024, 9:42 PM

#

sleek vortex yeah because rabbit had partnership with them

Yep, he states that their only non chatgpt functionality is provided by perplexity

tame current May 25, 2024, 10:03 PM

#

Hello chat im new

sleek vortex May 25, 2024, 10:08 PM

#

tame current Hello chat im new

hey

tame current May 25, 2024, 10:08 PM

#

sleek vortex hey

Hi

#

welcome!

devout bronze May 26, 2024, 3:06 AM

#

livid mantle

open llm leaderboard?

#

no

#

wait

#

lmsys arena

real storm May 26, 2024, 4:34 AM

#

When will the page feature be back?

fading moth May 26, 2024, 5:16 AM

#

I wonder why Gemini isn't on perplexity

#

Or Gemini Flash, given that it's basically free

gritty pasture May 26, 2024, 5:37 AM

#

The Discover page is amazing, i hope it keeps improving

One suggestion is ability to create custom discovers (e.g. put topics / keywords) and the crawlers would maintain the feed for me

halcyon coral May 26, 2024, 5:40 AM

#

real storm When will the page feature back?

It is currently in closed beta 😉

cursive jacinth May 26, 2024, 7:24 AM

#

fading moth I wonder why Gemini isn't on perplexity

Hi sir, a little background sir, before there was Gemini Pro on perplexity sir, then there was a feud between perplexity and google sir, perplexity removed gemini pro sir, people asked sir, perplexity replied that gemini is low performance sir and perplexity promised that they will incorporate gemini again sir (but just an excuse sir, like what they did to Opus sir) now you know sir. Dont believe everything sir.

warped marsh May 26, 2024, 7:30 AM

#

lol they really just got away with limiting opus and saying that they will bring it back

tiny plaza May 26, 2024, 7:41 AM

#

businesses are going to business ig rip

tame current May 26, 2024, 7:45 AM

#

@cursive jacinth sir you know you dont have to end every sentence with sir. just letting you know sir.

sonic hull May 26, 2024, 7:50 AM

#

halcyon coral It is currently in closed beta 😉

And how can someone become part of that closed beta? 😉

halcyon coral May 26, 2024, 7:53 AM

#

sonic hull And how can someone become part of that closed beta? 😉

All seats are full

austere kestrel May 26, 2024, 8:13 AM

#

sonic hull And how can someone become part of that closed beta? 😉

could try: https://www.perplexity.ai/page/new (I doubt it, but there's an outside chance it might work..)
but yeah presumably it'll be out of beta soon enough in any case (strange though that it was briefly available to all users a day or so ago)

hallow linden May 26, 2024, 8:56 AM

#

I'd love to see sonar large go through the MMLU. I bet it'd do really well on all of the factual knowledge

#

"1st LLM to get 100% on the MMLU"

#

I mean there are some math problems that might be tricky for an LLM, so it probably wouldn't get 100%, but it'd be fun to see

junior adder May 26, 2024, 10:27 AM

#

Gday everyone. In case someone wants to take their sub cheaper (and help me in this case to play with claude's narrative capability in making stories🫠 ) feel free to use my referral
https://perplexity.ai/pro?referral_code=J9ID1YP6
ps is there any news on 50 opus limitation? 🥺

Perplexity AI

Perplexity Pro

Unlock the most powerful AI research assistant. Raise Perplexity to the next level with Pro Search, upgraded AI models, unlimited file upload, and API credits.

last remnant May 26, 2024, 10:30 AM

#

fading moth I wonder why Gemini isn't on perplexity

Wondering the same but my guess would be that Google is their number 1 competitor so they do not want to promote their stack... Am I close?

marble parrot May 26, 2024, 10:39 AM

#

Hello, I'm new here so not sure if I should post this here, but I'm struggling (with Pro) to compile a substantial literature review using perplexity. The sources it gives are, 70% of the time, of good academic standard and relate to my prompt and needs. However, perplexity then struggles to access the sources I give it and to compile secondary data (qualitative but mostly quantitative), either in bullet point format or by directly incorporating it in my literature review draft

magic flax May 26, 2024, 10:40 AM

#

Dear all, i need help please. I want to create images but i can't, and the emails just tell me to do it, but i need some sort of step by step for beginners or something

tame current May 26, 2024, 10:41 AM

#

Guys i use perplexity since yesterday and it's soo cool!

cinder comet May 26, 2024, 10:41 AM

#

junior adder Gday everyone. In case someone wants to take their sub cheaper (and help me in t...

no its still 50

#

i think it will only change if anthropic decreases their api cost

tame current May 26, 2024, 10:43 AM

#

I like Space 🌌✨ (just saying)

last remnant May 26, 2024, 10:50 AM

#

Been using Perplexity ever since it came out. I am still running up against the same issue after I continue to test.

Seems that if I ask a question, Perplexity will give you a response, then it will have related questions, if I click on those they all work well and provide good additional information.

However if I ask my OWN question, the system seems to do an ok job but it oftentimes repeats A LOT of what it originally produced in the first response. Anyone else witness the same thing? Again this occurred for me from day one whether on free plans or pro

marble parrot May 26, 2024, 10:50 AM

#

seconded

tame current May 26, 2024, 10:59 AM

#

Hi, is the search free? Are there any interaction limitations, like 30 searches per day?

halcyon coral May 26, 2024, 11:05 AM

#

magic flax Dear all, i need help please. I want to create images but i cant and the emails ...

https://discord.com/channels/1047197230748151888/1225557012532695060

halcyon coral May 26, 2024, 11:06 AM

#

tame current Hi, is the search free? Are there any interaction limitations, like 30 searches ...

You won't reach any limits

tame current May 26, 2024, 11:07 AM

#

halcyon coral You won't reach any limits

Thank you!

dawn lynx May 26, 2024, 11:38 AM

#

tame current Hi, is the search free? Are there any interaction limitations, like 30 searches ...

Unless you are using Co-Pilot (PRO search) , there are no restrictions on how many basic searches you can do.

sleek vortex May 26, 2024, 1:00 PM

#

livid mantle I have unsubscribed my subscription with Gemini Advanced 1.5 Pro again after rea...

200/month is crazy

#

also you get gemini 1.5 pro free in the ai studio ui 😭

half venture May 26, 2024, 1:25 PM

#

I feel like what all ai art/text/songs/voice/etc lack is human reasoning

#

They seem to be lacking in like consistency and logic

#

All of them feel like dementia patients

#

On steroids

austere kestrel May 26, 2024, 1:28 PM

#

i mean they can't "reason" - some are just better at predicting the next token than others

half venture May 26, 2024, 1:35 PM

#

Yeah

#

We need a reasoning engine

#

Alongside llms

#

Hopefully whatever Yann is working on bears fruit

sleek vortex May 26, 2024, 1:37 PM

#

half venture Hopefully whatever Yann is working on bears fruit

who?

#

bruh when do i get the new pro search

#

smh

#

they use https://www.geteppo.com/

Eppo | Next-Gen Experimentation Platform for Today's Data Teams

Eppo runs trustworthy, data warehouse-native A/B tests. World-class statistical rigor, feature management, and experiment reports - all in one tool.

#

hmm

#

📎 message.txt

#

idk

opaque anvil May 26, 2024, 2:30 PM

#

Hi all, I need help logging back into my pro account but it keeps signing me up for new free accounts on the same mail id. support mail id has not provided any support either 😦

sleek vortex May 26, 2024, 3:20 PM

#

opaque anvil Hi all, I need help logging back into my pro account but it keeps signing me up...

check your emails for which accounts were pro

#

and whatever account it does log you into, check if there's a manage subscription button (maybe it got cancelled/finished?)

hardy willow May 26, 2024, 4:08 PM

#

I appreciate where you're coming from, but you're making subjective statements with objective authority, which is a fallacious argument and contention. Your presupposition that less is better is predicated on opinionative conjecture, when, in reality, it could simply mean that OpenAI, much like Google for the first 2 decades, is purposefully losing money to garner a wider audience and customer base, before largely integrating and monetizing their system worldwide. Also, to further illustrate this point, OpenAI has partnerships with mega-billion dollar corporations, and the most advanced server architecture in history, meaning, they can aptly handle much more compute power, versus a smaller offshoot such as Anthropic.

Positive assertions require evidence based foundations.

hollow aspen May 26, 2024, 4:25 PM

#

How often does perplexity search get updated? One example: over the last few weeks I’ve noticed it still refers to Azure Entra ID as Azure Active Directory. I’m new to AI, using the Claude 3 opus search with pro.

random charm May 26, 2024, 4:32 PM

#

Hello! I encountered an issue with perplexity. Is there a way to sign in with a password? I opted out from marketing emails and now I am not getting any sign-in verification.

#

Someone help. :p

sleek vortex May 26, 2024, 4:58 PM

#

@agile jay @warm cave

#

look at this haiku response

#

guess how many tokens of context were sent to the model!

agile jay May 26, 2024, 4:58 PM

#

1k?

sleek vortex May 26, 2024, 4:59 PM

#

2.8k i think

#

and before i was sending 12k

#

well the main issue is

#

this running on my cpu took ages

#

🔗 [36.98s] Encoded 505 source chunks (13.66 chunks/sec)
🔍 [0.58s] Found 13 chunks from search queries - 2763 tokens

#

using an already tiny model BAAI/bge-small-en-v1.5
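
The chunk-picking step described above can be sketched as a cosine-similarity top-k search. This is a toy version in plain Python: the 2-dimensional vectors and chunk texts are made up for illustration, whereas in the real pipeline the vectors would come from the embedding model (here BAAI/bge-small-en-v1.5) and there would be hundreds of chunks.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def top_k_chunks(query_vec, chunk_vecs, k):
    """Return indices of the k chunks most similar to the query."""
    ranked = sorted(
        range(len(chunk_vecs)),
        key=lambda i: cosine(query_vec, chunk_vecs[i]),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical data: real embeddings have hundreds of dimensions.
chunks = ["ipad pro specs", "waffle recipes", "ipad pro price"]
vectors = [[0.9, 0.1], [0.0, 1.0], [0.8, 0.3]]
query = [1.0, 0.0]

best = top_k_chunks(query, vectors, k=2)
print([chunks[i] for i in best])
```

Only the selected chunks are then sent to the model, which is how the context shrinks from about 12k tokens to under 3k.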

#

let me try... on gpu

agile jay May 26, 2024, 5:17 PM

#

Oh is that an embedding model?

#

Or try running it on your gpu...

sleek vortex May 26, 2024, 5:27 PM

#

agile jay Or try running it on your gpu...

this mac doesnt have a useful gpu

#

my cpu is faster for nearly all ai tasks

#

so im porting it to a t4 on modal

agile jay May 26, 2024, 5:29 PM

#

sleek vortex this mac doesnt have a useful gpu

Oh, is it an old Intel one?

sleek vortex May 26, 2024, 5:30 PM

#

agile jay Oh, is it an old Intel one?

yeah

agile jay May 26, 2024, 5:36 PM

#

You could try using an embedding API then.

silver sinew May 26, 2024, 5:37 PM

#

Is there a trick to get the voice feature to work when the app isn't "active" on android?

#

like, I pressed the button (at least in my case) so I could do other things while listening to the answer. if I open another app or lock my phone it stops the audio and resets its position in its reading, super annoying

empty mist May 26, 2024, 6:24 PM

#

bro what the heck, i highlight the text and then the "ask follow up question" appears and it just starts writing without me even asking anything. is that a bug cause bard has that feature?

oblique trout May 26, 2024, 6:36 PM

#

1679puppykiss i have a question, if u send an image/other resources to pplx, does it append them to every subsequent query? I noticed that gpt4 can be confused by the sources sometimes. e.g. when u ask it a subsequent question irrelevant to a previously sent picture, it might still analyze the picture and include info about it in its response. Wouldn't this cost more money for the company 💀 sometimes i have to let it generate a response first, then manually remove the sources (so generate again as you don't have an option to do this beforehand)

tranquil orchidBOT May 26, 2024, 6:57 PM

#

success @nearkala warned

Reason: Spam, please avoid sending single word messages
Duration: 7 days

sleek vortex May 26, 2024, 7:00 PM

#

empty mist bro what the heck, i highlight the text and then the "ask follow up question" ap...

that is the feature

#

the text is used as the question

#

it isnt a question about the text

#

the selected text is the question

ionic stone May 26, 2024, 7:41 PM

#

https://x.com/yoheinakajima/status/1794809981411438909

Yohei (@yoheinakajima) on X

AI Builders' Favorite Tools: A Hive Mind Survey Summary

I recently asked AI builders about their favorite go-to libraries and frameworks for AI projects. Here is a summary of the first ~63 replies, categorized.

Agent Frameworks:

• AgentOpsAI: This tool simplifies the

sleek vortex May 26, 2024, 7:57 PM

#

agile jay You could try using an embedding API then.

so i deployed embeddings to modal on an api

cold start is around ~18 seconds + time to embed (3 seconds for 52k tokens on a super small model)

and it costs (0.000164 + 0.00000667 + 0.0000533)*25 at most per request
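
Worked out, that expression comes to a little over half a cent. The snippet below just evaluates the numbers as quoted; what each of the three rates bills for, and whether the factor of 25 is a maximum number of seconds, is an assumption and is not stated in the message.

```python
# Rates exactly as quoted in the message above (units not stated there;
# presumably per-second charges for different resources).
rates = [0.000164, 0.00000667, 0.0000533]
multiplier = 25  # presumably the worst-case duration in seconds

max_cost = sum(rates) * multiplier
print(f"{max_cost:.6f}")
```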

#

that cold start is so high though...

agile jay May 26, 2024, 8:09 PM

#

What about Openai text embedding large?

#

It's $0.13/million tokens.

#

Or small, which is $0.02

#

@sleek vortex
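
For comparison, here is a rough calculation of what the 52k-token request mentioned earlier would cost at the two per-million-token prices quoted above. The prices are taken from the chat as stated and may have changed since; the function name is just for illustration.

```python
# USD per 1M tokens, as quoted in the chat (not independently verified).
PRICE_PER_MILLION = {"large": 0.13, "small": 0.02}

TOKENS = 52_000  # size of the request mentioned earlier in the chat


def embedding_cost(tokens: int, price_per_million: float) -> float:
    """Cost in USD of embedding `tokens` tokens at the given rate."""
    return tokens / 1_000_000 * price_per_million


costs = {name: embedding_cost(TOKENS, p) for name, p in PRICE_PER_MILLION.items()}
for name, cost in costs.items():
    print(f"{name}: ${cost:.5f}")
```

At these rates the hosted API is in the same fraction-of-a-cent range as the self-hosted GPU request, without the cold start.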

warm cave May 26, 2024, 8:28 PM

#

@sleek vortex I have heard good things about mixedbread: https://www.mixedbread.ai/pricing

mixedbread.ai

Embeddings & NLP

mixedbread.ai offers simple text embedding generation, designed to enhance the developing experience in your AI projects.

#

They have an API but also have Open weight models

sleek vortex May 26, 2024, 8:51 PM

#

agile jay Or small, which is $0.02

I know, just that I wanted to try my own embeddings for fun

sleek vortex May 26, 2024, 8:51 PM

#

warm cave @sleek vortex I have heard good things about mixedbread: https://www.mix...

Hm

agile jay May 26, 2024, 9:15 PM

#

Looks pretty good, but the pricing is for 1K tokens, not 1M

#

So the actual cost is not much different to openai's large embedding model.

warm cave May 26, 2024, 9:19 PM

#

agile jay Looks pretty good, but the pricing is for 1K tokens, not 1M

Yeah $0.13 vs $0.10 is not huge, but having the option to run it yourself is nice

agile jay May 26, 2024, 9:19 PM

#

warm cave Yeah $0.13 vs $0.10 is not huge, but having the option to run it yourself is nic...

Oh, is the model open-source?

sleek vortex May 26, 2024, 9:19 PM

#

📎 message.txt

warm cave May 26, 2024, 9:20 PM

#

agile jay Oh, is the model open-source?

Yeah

sleek vortex May 26, 2024, 9:20 PM

#

21 seconds gained by picking chunks via embeddings, using a remote gpu

warm cave May 26, 2024, 9:20 PM

#

Open source or open weight idk one of them lol

agile jay May 26, 2024, 9:23 PM

#

#

Seems like a very popular model

warm cave May 26, 2024, 9:23 PM

#

warm cave May 26, 2024, 9:23 PM

#

sleek vortex 21seconds gained by picking chunks via embeddings, using a remote gpu

Nice 👍

agile jay May 26, 2024, 9:24 PM

#

Are you storing the embeddings? Or generating them from scratch each time?

sleek vortex May 26, 2024, 9:24 PM

#

right now it's from scratch

#

loading the small embedding model on a T4 and embedding around 500 chunks of 512 chars each takes 15-22 seconds on a cold start

#

(3-6 if i leave the gpu hot, kinda "bills" me for that idle though)

devout cargo May 26, 2024, 9:26 PM

#

There is something that I don't quite understand: every time Google is used to search, Google benefits. In the case of Perplexity I understand that it may be different. What benefit does perplexity have every time a user performs a search through your system?

sleek vortex May 26, 2024, 9:26 PM

#

also for some reason my web scraper has got slower

sleek vortex May 26, 2024, 9:26 PM

#

devout cargo There is something that I don't quite understand: every time Google is used to s...

well thats why you have to pay for perplexity pro, you dont pay for google do you

agile jay May 26, 2024, 9:26 PM

#

devout cargo There is something that I don't quite understand: every time Google is used to s...

Quality of output.

sleek vortex May 26, 2024, 9:27 PM

#

no benefit to the company

#

also for free users and pro users that dont turn it off, their data from searches is kept

#

might be used for model training or idk, who knows

solemn cedar May 26, 2024, 9:27 PM

#

Anyone else experiencing troubles with the voice output ? Like volume or language changes ... ?

devout cargo May 26, 2024, 9:29 PM

#

But once this is monetized, the free part could upload images or documents like Copilot. that is very limited in tokens

sleek vortex May 26, 2024, 9:31 PM

#

📎 message.txt

#

so why has my web scraping also gone to 10 seconds

devout cargo May 26, 2024, 9:31 PM

#

How can you fight against Microsoft and Google if what you do they offer it for free not to mention GPT4 obnivus

sleek vortex May 26, 2024, 9:32 PM

#

devout cargo How can you fight against Microsoft and Google if what you do they offer it for ...

if you dont have pro on

#

it doesnt use gpt4

#

it uses like the dirt cheap haiku or their own models

#

so theyre able to do that for free

agile jay May 26, 2024, 9:33 PM

#

sleek vortex so why has my web scraping also gone to 10 seconds

Concurrency issue again, I assume.

warm cave May 26, 2024, 9:34 PM

#

Lots of places are switching to haiku as a budget model, perplexity, brave, I think duck duck go, prob more than that

agile jay May 26, 2024, 9:34 PM

#

Mostly because of the large context.

sleek vortex May 26, 2024, 9:34 PM

#

yeah

#

way better than gpt3.5

warm cave May 26, 2024, 9:34 PM

#

agile jay Mostly because of the large context.

Yeah, 200k. Is 3.5 still 16k?

agile jay May 26, 2024, 9:34 PM

#

I assume that will change when meta officially releases their larger context versions.

sleek vortex May 26, 2024, 9:35 PM

#

warm cave Yeah 200k is 3.5 16k still?

yup

#

this is quite the interesting cache folder

warm cave May 26, 2024, 9:35 PM

#

Yeah, can’t wait for meta’s and their next drops

sleek vortex May 26, 2024, 9:35 PM

#

give me some new queries to try

warm cave May 26, 2024, 9:36 PM

#

Waffles around the world lol

sleek vortex May 26, 2024, 9:36 PM

#

asking about the ipad pro is getting boring

#

#

lets ask "what is perplexity ai"

agile jay May 26, 2024, 9:36 PM

#

What new policy did rishi sunak propose?

sleek vortex May 26, 2024, 9:36 PM

#

idk

#

ask pplx

#

lmao

#

(i am the most disconnected person from politics)

agile jay May 26, 2024, 9:36 PM

#

sleek vortex (i am the most disconnected person from politics)

You asked for queries.

sleek vortex May 26, 2024, 9:37 PM

#

ohh

#

right

#

right

#

my bad

agile jay May 26, 2024, 9:37 PM

#

So I chose something that happened today/yesterday

devout cargo May 26, 2024, 9:37 PM

#

In the free part it seems that the model is Claude Haiku. It makes sense, since all the services are hosted on AWS, Anthropic like Perplexity. It was to be expected 🙂

warm cave May 26, 2024, 9:37 PM

#

Ozempic prevents kidney disease 🦠

agile jay May 26, 2024, 9:38 PM

#

Ozempic prevents responsibility 🏃‍♂️

warm cave May 26, 2024, 9:38 PM

#

Yann LeCun on the limits of LLMs

agile jay May 26, 2024, 9:39 PM

#

I expect you have to stay on Ozempic to keep the gains...

warm cave May 26, 2024, 9:39 PM

#

Meta pitches Hollywood on AI

warm cave May 26, 2024, 9:39 PM

#

agile jay I expect you have to stay on Ozempic to keep the *gains*...

Idk, I will ask perplexity 🙃

agile jay May 26, 2024, 9:40 PM

#

warm cave Idk, I will ask perplexity 🙃

So it's a lifetime subscription

sleek vortex May 26, 2024, 9:40 PM

#

📎 message.txt

#

wellll....

fading moth May 26, 2024, 9:40 PM

#

What I miss, I just got back

warm cave May 26, 2024, 9:41 PM

#

agile jay So it's a lifetime subscription

Yep: https://www.perplexity.ai/search/Do-you-have-6mIg_sZzT_eTEtOdlOw1qg

Perplexity AI

Do you have to keep taking Ozempic to keep the benefits

Yes, you generally need to keep taking Ozempic to maintain its benefits. Here's a detailed breakdown of what happens when you stop taking Ozempic:
Weight regain is common after stopping Ozempic. Studies have shown that people tend to regain a significant portion of the weight they lost while on the medication. For instance, in one trial, partici...

sleek vortex May 26, 2024, 9:41 PM

#

ok well a bit of a scam that perplexity can check its own domain and i cant (cloudflare)

devout cargo May 26, 2024, 9:41 PM

#

¿?

Screenshot_2024-05-26-23-40-34-071_com.twitter.android.jpg

agile jay May 26, 2024, 9:41 PM

#

devout cargo ¿?

Red/Blue team testing, I assume.

fading moth May 26, 2024, 9:42 PM

#

The DoD approached Sam about buying GPT over a year ago. Sam turned them down.

agile jay May 26, 2024, 9:43 PM

#

The government can technically take it if they wanted to, it would just look really bad.

sleek vortex May 26, 2024, 9:43 PM

#

sleek vortex ok well a bit of a scam that perplexity can check its own domain and i cant (clo...

not cloudflare, but because it's a client side only react app

warm cave May 26, 2024, 9:43 PM

#

fading moth The DoD approached Sam about buying GPT over a year ago. Sam turned...

Or did some under the table deal, GPT-7lite—FBI-o

fading moth May 26, 2024, 9:43 PM

#

agile jay The government can technically take it if they wanted to, it would just look rea...

Well, yeah

#

C'est possible

agile jay May 26, 2024, 9:43 PM

#

After all, if AI is the next nuke, they can easily say it's a safety concern.

fading moth May 26, 2024, 9:44 PM

#

But Sam was pretty publicly adamant at the time that GPT never be used in military

agile jay May 26, 2024, 9:44 PM

#

And all AI companies have already said the same claim.

agile jay May 26, 2024, 9:44 PM

#

warm cave Or did some under the table deal, ```GPT-7lite—FBI-o```

That's what the new rollback thing is for...

#

No longer does the FBI agent have to look through your webcam 24/7

#

They can just rewind and see what you were up to.

fading moth May 26, 2024, 9:45 PM

#

Copilot+ tablets record your screen history as a feature

#

One you cannot turn off btw

sleek vortex May 26, 2024, 9:46 PM

#

not bad at all

📎 message.txt

warm cave May 26, 2024, 9:46 PM

#

Yep, GPT-7lite—FBI-o is so advanced it can infer the future with 92% accuracy, welcome folks to minority report

agile jay May 26, 2024, 9:46 PM

#

sleek vortex not bad at all

Ask how many rocks we should eat for a healthy diet.

fading moth May 26, 2024, 9:46 PM

#

This is that whole Alexa recording thing all over again but with AI

warm cave May 26, 2024, 9:47 PM

#

Yeah, everyone gets all nervous and then gets used to it

fading moth May 26, 2024, 9:48 PM

#

"I know they're recording but I could order pizza with my voice without getting off the couch..

Guess I'll let them do it"

sleek vortex May 26, 2024, 9:48 PM

#

I do not have enough information in the provided context to recommend how many rocks someone should eat for a healthy diet. Consuming rocks is generally not considered a safe or healthy practice, as rocks do not contain the necessary nutrients for human health. Eating rocks can actually be harmful and lead to serious medical issues. The context provided indicates that some specific types of clay or soil may have some nutritional benefits, but it does not give specifics on recommended amounts or types of rocks to eat. Overall, I would not advise eating rocks as part of a healthy diet. Focusing on a balanced diet of whole, nutritious foods is a much safer and more effective approach to maintaining good health.

warm cave May 26, 2024, 9:48 PM

#

warm cave Yeah, everyone gets all nervous and then gets used to it

But this time after getting used to it they will start dating it

agile jay May 26, 2024, 9:48 PM

#

sleek vortex > I do not have enough information in the provided context to recommend how many...

Wow, better than Google AI search...

sleek vortex May 26, 2024, 9:48 PM

#

warm cave But this time after getting used to it they will start dating it

its giving the hologram pleasure girl from blade runner

sleek vortex May 26, 2024, 9:49 PM

#

agile jay Wow, better than Google AI search...

wait what LMAO

warm cave May 26, 2024, 9:49 PM

#

sleek vortex > I do not have enough information in the provided context to recommend how many...

Already 100x better than google

fading moth May 26, 2024, 9:49 PM

#

People are already dating it and claiming their chats have sentience

sleek vortex May 26, 2024, 9:49 PM

#

encoding took 41 seconds tho.... very very slow

warm cave May 26, 2024, 9:49 PM

#

fading moth People are already dating it and claiming their chats have sentience

Hey! 😡 She does, don’t listen to him Sky, I love you the way you are

fading moth May 26, 2024, 9:50 PM

#

I swear it's like they are talking to ||stripper|| who just tells them anything they want for money and acts like they are soulmates

#

But don't remember their face or name in public

warm cave May 26, 2024, 9:50 PM

#

Sky just got the “Memory” update, she truly cares 😇

fading moth May 26, 2024, 9:51 PM

#

You and only you

#

Sky wouldn't go out with anyone else

warm cave May 26, 2024, 9:51 PM

#

fading moth Sky wouldn't go out with anyone else

Of course not, she would never 😧

#

Imagine if OpenAI adds a dating mode in the future with some crazy monthly cost

sleek vortex May 26, 2024, 9:52 PM

#

:(

I do not have any specific examples to provide about people dating AI and claiming their chats have sentience. This appears to be an emerging and complex topic that is still being explored.

Based on the context provided, it seems that while some people are forming emotional connections with AI-powered conversational agents like chatbots, the current consensus among experts is that these AI systems do not possess genuine consciousness or sentience. The interactions may feel human-like, but the AI is essentially mimicking human conversation rather than experiencing true self-awareness.

There are certainly ethical considerations and potential risks around people developing deep emotional attachments to AI companions. Experts caution that this could lead to a detachment from real human relationships. The design and use of these AI systems should prioritize user well-being and avoid manipulative practices.

Overall, this appears to be an emerging and complex phenomenon that is still being studied. I do not have definitive examples to provide, but the context indicates it is an area of active research and discussion in the field of AI and human-computer interaction. Let me know if you have any other questions!

sleek vortex May 26, 2024, 9:52 PM

#

warm cave Imagine if OpenAI adds a dating mode in the future with some crazy monthly cost

i mean this is why places like character.ai have so many users

warm cave May 26, 2024, 9:53 PM

#

sleek vortex i mean this is why places like character.ai have so many users

Is character AI a subscription?

sleek vortex May 26, 2024, 9:53 PM

#

i mean its like every ai website ever

#

a bit free, at cheapish price

#

too slow/overloaded = buy our sub for better roleplay!! faster speed!!

warm cave May 26, 2024, 9:54 PM

#

sleek vortex i mean its like every ai website ever

I thought it was a place where people shared models, like LoRAs and such. I know about AI dating sites getting big, just did not really know what Character AI was specifically

sleek vortex May 26, 2024, 9:55 PM

#

character ai is like a consumer frontend for idk prompts that imitate certain characters speaking

#

like from tv shows and anime or whatever

#

and idk

#

i dont really use it

warm cave May 26, 2024, 9:56 PM

#

I guess I know where I’m spending my money now 🫠

#

Perplexity makes it sound so legit, lol: https://www.perplexity.ai/search/What-is-character-u9XxIg8oTkedAm94XcQe.w

Perplexity AI

What is character ai?

Character AI is an advanced AI chatbot web application that leverages large language models (LLMs) and deep machine learning to generate human-like text responses and facilitate engaging conversations. Developed by former Google AI developers Noam Shazeer and Daniel De Freitas, Character AI was released in beta form in September 2022 and has sin...

sleek vortex May 26, 2024, 10:00 PM

#

lets ask whatever my thing is called

#

the folder is called kittyagent but thats just because my head is full of cats

#

🔍 [13.23s] Found 16 chunks from search queries - 7719 chars / 1446 tokens

#

Based on the information provided in the context, here's what I can tell you about Character AI:

Character AI is an AI-powered chatbot service that allows users to engage in conversations with unique, lifelike characters. Unlike traditional chatbots, Character AI employs advanced natural language processing models to generate contextually appropriate responses, giving the interactions a more human-like feel.

The key features of Character AI include:

Diverse Characters: Character AI offers a wide range of characters, from fictional beings to historical figures and even celebrities. Users can interact with these characters and explore their distinct personalities, backstories, and behaviors.
Creative Storytelling: Character AI can be used as a tool for creative writing and worldbuilding. Users can design their own characters and have them participate in interactive narratives, allowing for more engaging and dynamic storytelling.
Educational Value: The platform can serve as an educational resource, as users can learn new information and perspectives by conversing with the diverse characters.
Entertainment and Enjoyment: Many users, especially younger audiences, find Character AI entertaining and enjoyable, as they can have fun, imaginative conversations with the characters.

While I don't have specific details about the inner workings of Character AI or its development, the context provided suggests that it is an innovative and advanced AI chatbot service that has gained popularity among users of various ages and interests due to its unique approach to interactive character-based experiences.

fading moth May 26, 2024, 10:01 PM

#

Yeah, character ai is... It's the text equivalent of those late night commercials for you to call and talk to "single women"

#

For only $5 a minute

sleek vortex May 26, 2024, 10:01 PM

#

lmao

half venture May 26, 2024, 10:02 PM

#

I am bored of c.ai

#

It's so bad

#

Like genuinely

#

The gemini flash model

#

Is better than c.ai

warm cave May 26, 2024, 10:02 PM

#

sleek vortex lmao

💯 LOL

warm cave May 26, 2024, 10:02 PM

#

half venture Is better than c.ai

Better at what? 🤔

sleek vortex May 26, 2024, 10:03 PM

#

sleek vortex Based on the information provided in the context, here's what I can tell you abo...

idk what response is better

#

the pplx one or this one

#

theyre both like not getting the main point

half venture May 26, 2024, 10:03 PM

#

warm cave Better at what? 🤔

Just better at creating character and roleplaying

#

Hell even llama 3 is better

fading moth May 26, 2024, 10:04 PM

#

There are a lot of sites like character ai... One infamous one was called Sakura and their mods were sharing chat logs they found funny on their website in their private discord channels

warm cave May 26, 2024, 10:04 PM

#

sleek vortex theyre both like not getting the main point

Yeah, basically all models are trained to not talk about that subject, so not surprising

sleek vortex May 26, 2024, 10:04 PM

#

kittyagent % modal run --interactive -q kittyagent.mini_agent_plus --query "price comparison - claude 3 models, gpt 4 turbo, gpt4o, gemini 1.5 pro and flash - use unit million tokens and give it in a table"

#

lets try this

agile jay May 26, 2024, 10:05 PM

#

What does the interactive flag do?

sleek vortex May 26, 2024, 10:05 PM

#

thats all on the modal cli

#

-q = it just removes their weird spinner progress bar
-i = makes it so i can use input()

#

🔍 [1.26s] Turned query into 2 searches
🔎 [0.99s] Searched for price per million tokens claude 3 vs gpt 4 turbo vs gpt4o vs gemini 1.5 pro vs flash - got 14 links, 0 snippets
🔎 [1.65s] Searched for price comparison claude 3 models gpt 4 turbo gpt4o gemini 1.5 pro flash - got 18 links, 0 snippets

#

well thats rubbish...

#

might need a better prompt to gemma 7b

fading moth May 26, 2024, 10:06 PM

#

The hell is kittyagent?

sleek vortex May 26, 2024, 10:06 PM

#

sleek vortex the folder is called kittyagent but thats just because my head is full of cats

.

fading moth May 26, 2024, 10:06 PM

#

Oh

sleek vortex May 26, 2024, 10:06 PM

#

its my little perplexity clone that ive been working on for fun

#

well less a clone and more something similar, thats trying to be better??

fading moth May 26, 2024, 10:07 PM

#

And what use does a folder full of cats help with?

warm cave May 26, 2024, 10:07 PM

#

There is a model called yarn right, use that one lol

#

Have everything cat themed

fading moth May 26, 2024, 10:07 PM

#

Someone really likes cats

warm cave May 26, 2024, 10:08 PM

#

The yarn model was cool when it came out

#

I think it was like 100k context

#

Super large for open source

#

At the time

sleek vortex May 26, 2024, 10:09 PM

#

yeah

#

rope scaling 128k of 7b llama

#

i remember running it on llama.cpp and it took years to process the context because of my nonexistent gpu

sleek vortex May 26, 2024, 10:09 PM

#

fading moth And what use does a folder full of cats help with?

No clue

#

I mean it’s just the folder name 😅

warm cave May 26, 2024, 10:09 PM

#

lol, yeah, back before good ol' ollama

sleek vortex May 26, 2024, 10:10 PM

#

ollama is just a fancy wrapper really

#

when you run inference it does call literally the llama.cpp cli

#

because FFI in go is much worse to figure out than doing that apparently

warm cave May 26, 2024, 10:11 PM

#

Yeah I know, but you used to have to know how to do it, now it’s basically a one click install

sleek vortex May 26, 2024, 10:11 PM

#

yeah

#

I’m not against it

#

Just saying lol

warm cave May 26, 2024, 10:11 PM

#

And I had no idea what I was doing, lol. I messed up the setup so many times

sleek vortex May 26, 2024, 10:12 PM

#

is there a way to load a torch model into gpu really fast

#

i’m using torch.load but even that takes like 3-10 seconds (literally random) on a what like 130M parameter model

#

seems kinda too slow really

warm cave May 26, 2024, 10:13 PM

#

Yeah, it is slow

sleek vortex May 26, 2024, 10:14 PM

#

do you think people would prefer an ai agent that has a ton of detail over speed

#

realistically what if i tried to turn this into a kind of agent structure

#

instead of gemma7b, the first model is actually haiku with your query

#

which then splits it into the required tasks and tools

#

then that’s allocated to like a few web searches or maybe other function calls each

#

then they share the same embedding gpu instance

#

then it comes out into haiku again

#

Kinda like the new copilot but i could see it integrating other things too

#

weather api
spotify api
whatever api

it would be like a more real version of maybe like what the rabbit promises to do lol

#

well something like that with speed would be even better, not sure how i could make this whole process much faster though
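One place the speed could plausibly come from is running the independent workers at the same time rather than one after another. A minimal sketch, assuming each worker is an async call; `run_worker` and its delays are hypothetical stand-ins for real web searches or API calls:

```python
import asyncio
import time


async def run_worker(name: str, delay: float) -> str:
    """Hypothetical worker: stands in for a web search or API call."""
    await asyncio.sleep(delay)  # simulate network latency
    return f"{name}: done"


async def run_independent_steps(steps: dict) -> list:
    """Run all independent steps at once instead of one after another."""
    tasks = [run_worker(name, delay) for name, delay in steps.items()]
    # gather returns results in the same order as its inputs
    return await asyncio.gather(*tasks)


def timed_run(steps: dict) -> tuple:
    """Return (results, elapsed_seconds) for one concurrent run."""
    start = time.perf_counter()
    results = asyncio.run(run_independent_steps(steps))
    return results, time.perf_counter() - start


if __name__ == "__main__":
    results, elapsed = timed_run({"web search": 0.2, "weather": 0.2, "flights": 0.2})
    print(results, f"{elapsed:.2f}s")
```

With this shape the total wait is roughly the slowest worker, not the sum of all of them. Steps that depend on earlier results would still have to wait their turn.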

warm cave May 26, 2024, 10:18 PM

#

It depends on the target audience. If it's a random user who just wants to ask a simple question or run a random Google search, then maybe not. But I personally would like it; if Open Interpreter were more reliable/cheaper, I would use it more often

sleek vortex May 26, 2024, 10:18 PM

#

That’s what i’m thinking too - it would depend on the user

#

maybe the user themselves can pick

#

the thing i thought of earlier

#

like levels to it

“quick search”, “deep dive”

#

I don’t think trying to make an auto mode would suit everyone really
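The "levels" idea could be expressed as a small table of presets that the user picks from. This is only a sketch: the field names and numbers below are made-up placeholders, not anything measured or taken from Perplexity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SearchProfile:
    max_searches: int      # how many search queries may be issued
    pages_per_search: int  # how many result pages to read per query
    use_planner: bool      # whether to run a task-splitting step first


# Hypothetical presets; the numbers are placeholders.
PROFILES = {
    "quick search": SearchProfile(max_searches=2, pages_per_search=3, use_planner=False),
    "deep dive": SearchProfile(max_searches=8, pages_per_search=10, use_planner=True),
}


def pick_profile(mode: str) -> SearchProfile:
    """Return the preset the user chose, rejecting unknown modes explicitly."""
    try:
        return PROFILES[mode]
    except KeyError:
        raise ValueError(f"unknown mode {mode!r}; choose from {sorted(PROFILES)}") from None
```

There is deliberately no "auto" entry here, which matches the point about an automatic mode not suiting everyone.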

warm cave May 26, 2024, 10:19 PM

#

Yeah, that would be a good choice

sleek vortex May 26, 2024, 10:20 PM

#

It would be pretty cool if i could ask the thing to be like

#

find out a list of the 50 most popular artists by spotify album sales and also by physical album sales and then queue me a few of those songs on spotify

#

well that’s the kind of query i can imagine off my head

#

the real uses would come along when you actually need it i guess

#

perplexity’s “plan a trip to japan” default query on the mobile app could be taken way way deeper

imagine if you gave it a month range and it got the best week in that month based on the weather, then looked at flight prices, all while also looking at the locations to go to

warm cave May 26, 2024, 10:22 PM

#

Yeah, that would be cool. The types of searches I tried on OI were ones where the next step needed information from the first step, like: find out the top companies in VGT, then find out the names of the founders, then find out which are still alive, and then tell me about those people

#

But having further uses would really make it good

sleek vortex May 26, 2024, 10:23 PM

#

Hmm

warm cave May 26, 2024, 10:23 PM

#

sleek vortex perplexity’s “plan a trip to japan”default query on the mobile app could be take...

Yeah, there is a lot you could do

sleek vortex May 26, 2024, 10:23 PM

#

Fine tuning would kind of be a big part of that

#

I don’t think just a prompt could do that

#

either you’d need an expensive LLM like 4o or opus

#

Or maybe you could fine tune like a 7b?

warm cave May 26, 2024, 10:24 PM

#

Yeah, maybe a fine tune of llama 3 8b, or phi 3

sleek vortex May 26, 2024, 10:24 PM

#

There’s a lot you could do with just something like these apis:

weather, spotify, flights, document upload, (cloud) code interpreter

#

you could add in email sending

#

and then it would kind of just rely on a really intelligent first analysis of the query

#

after which you can have mini search agents and whatnot

sleek vortex May 26, 2024, 10:25 PM

#

sleek vortex and then it would kind of just rely on a really intelligent first analysis of th...

including like which steps depend on each other

warm cave May 26, 2024, 10:26 PM

#

I know system prompts get long and the model forgets stuff. Maybe a directory, where the model knows the category to select, and the system prompts with descriptions of how to use each would populate into the context. Idk, maybe stupid lol 🤷‍♂️

#

But with small models I feel I can only teach it one or two things
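A rough sketch of that directory idea, assuming a two-pass flow: the model first sees only a short menu of categories, then the system prompt is rebuilt with the instructions for the categories it picked. `TOOL_DIRECTORY`, `BASE_PROMPT` and the instruction texts are hypothetical.

```python
# Hypothetical directory: category -> usage instructions for that tool family.
TOOL_DIRECTORY = {
    "web": "Search the web with short keyword queries, then read the top pages.",
    "music": "Play a song by Spotify track ID, or by name and artist.",
    "weather": "Look up the weather for one specific location.",
}

BASE_PROMPT = "You are a helpful agent. Use only the tools described below."


def category_menu() -> str:
    """Short list shown to the model in the first pass so it can pick categories."""
    return ", ".join(sorted(TOOL_DIRECTORY))


def build_system_prompt(categories: list) -> str:
    """Assemble a system prompt containing only the selected categories."""
    unknown = [c for c in categories if c not in TOOL_DIRECTORY]
    if unknown:
        raise KeyError(f"unknown categories: {unknown}")
    sections = [f"[{c}] {TOOL_DIRECTORY[c]}" for c in categories]
    return "\n".join([BASE_PROMPT, *sections])
```

The context then only carries the instructions that matter for the current query, which may help a small model that can hold one or two tools in mind at a time.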

sleek vortex May 26, 2024, 10:27 PM

#

What isn’t working is giving gpt4o or opus all the functions and just praying it works

#

i don’t think that will ever be the way to go

#

that’s probably what like that ai tool devin does really

#

which is why it’s so slow?

#

22 minutes to do one coding task?

warm cave May 26, 2024, 10:28 PM

#

Yeah, probably

sleek vortex May 26, 2024, 10:28 PM

#

or whatever it was

warm cave May 26, 2024, 10:29 PM

#

Kinda funny, back when Autogen first came out I could not figure out how to use the Pplx API as an agent, so I wrote a skill for the agent to talk to it

sleek vortex May 26, 2024, 10:29 PM

#

To fulfill the given query, we can break it down into the following steps:

Retrieve the list of the 50 most popular artists by Spotify album sales:
- This step involves searching the web and reading websites to find the data on the 50 most popular artists by Spotify album sales.
- This step is independent and can be done first.
Retrieve the list of the 50 most popular artists by physical album sales:
- This step also involves searching the web and reading websites to find the data on the 50 most popular artists by physical album sales.
- This step is independent and can be done in parallel with the previous step.
Queue a few songs from the identified popular artists on Spotify:
- This step requires the lists of popular artists from the previous two steps.
- For each artist, this step involves playing a song given a specific Spotify track ID or the name and artist.
- This step is dependent on the completion of the previous two steps.

The steps can be summarized as follows:

Retrieve the list of the 50 most popular artists by Spotify album sales.
Retrieve the list of the 50 most popular artists by physical album sales.
Queue a few songs from the identified popular artists on Spotify.

The steps 1 and 2 are independent and can be executed in parallel, while step 3 is dependent on the completion of steps 1 and 2.

The available workers can be utilized as follows:

The "Searching the web and reading websites" worker can be used to complete steps 1 and 2.
The "Playing a song given a specific Spotify track ID, or the name and artist" worker can be used to complete step 3.
The "Getting the weather at one specific location" worker is not directly relevant to the given query and cannot be utilized in this case.

#

query was

#

Given the users query and available workers, break it down into steps and also mention which steps are dependent on each other
Query:
find out a list of the 50 most popular artists by spotify album sales and also by physical album sales and then queue me a few of those songs on spotify
Available workers:
Searching the web and reading websites
Playing a song given a specific spotify track ID, or the name and artist
Getting the weather at one specific location
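Since the worker list keeps growing, the prompt above could be generated from data instead of typed by hand. A small sketch; `build_planner_prompt` is a hypothetical helper and the wording simply mirrors the prompt in this message:

```python
def build_planner_prompt(query: str, workers: list) -> str:
    """Render the planner prompt from a user query and worker descriptions."""
    worker_lines = "\n".join(f"- {w}" for w in workers)
    return (
        "Given the user's query and available workers, break it down into steps "
        "and also mention which steps are dependent on each other\n"
        f"Query:\n{query}\n"
        f"Available workers:\n{worker_lines}"
    )


if __name__ == "__main__":
    print(build_planner_prompt(
        "queue me a few popular songs on spotify",
        ["Searching the web and reading websites",
         "Playing a song given a specific spotify track ID, or the name and artist"],
    ))
```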

#

hmm

#

Maybe it is possible ?

warm cave May 26, 2024, 10:30 PM

#

Lol, everything else was food tho

sleek vortex May 26, 2024, 10:30 PM

#

This is haiku

#

food?

#

or good

#

{
  "steps": [
    {
      "id": 1,
      "name": "Retrieve Spotify album sales",
      "description": "Retrieve the list of the 50 most popular artists by Spotify album sales",
      "dependencies": [],
      "workers": [
        "Searching the web and reading websites"
      ]
    },
    {
      "id": 2,
      "name": "Retrieve physical album sales",
      "description": "Retrieve the list of the 50 most popular artists by physical album sales",
      "dependencies": [],
      "workers": [
        "Searching the web and reading websites"
      ]
    },
    {
      "id": 3,
      "name": "Queue songs on Spotify",
      "description": "Queue a few songs from the identified popular artists on Spotify",
      "dependencies": [
        1,
        2
      ],
      "workers": [
        "Playing a song given a specific Spotify track ID, or the name and artist"
      ]
    }
  ]
}
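A plan in this shape can be turned into an execution order mechanically. The sketch below assumes the `id` and `dependencies` fields shown above and uses the standard-library `graphlib` module (Python 3.9+) to group steps into waves, where every step in a wave can run in parallel. `execution_waves` and the trimmed `PLAN` string are illustrative names only.

```python
import json
from graphlib import TopologicalSorter


def execution_waves(plan_json: str) -> list:
    """Group step ids into waves; each wave only depends on earlier waves."""
    plan = json.loads(plan_json)
    # Map each step id to the set of step ids it must wait for.
    graph = {step["id"]: set(step["dependencies"]) for step in plan["steps"]}
    sorter = TopologicalSorter(graph)
    sorter.prepare()  # raises graphlib.CycleError if the dependencies loop
    waves = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())
        waves.append(ready)
        sorter.done(*ready)
    return waves


# Trimmed copy of the plan: only the fields this function reads.
PLAN = """{"steps": [
  {"id": 1, "dependencies": []},
  {"id": 2, "dependencies": []},
  {"id": 3, "dependencies": [1, 2]}
]}"""

print(execution_waves(PLAN))  # [[1, 2], [3]]
```

Steps 1 and 2 land in the first wave and step 3 in the second, which matches the natural-language breakdown from earlier.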

#

well this is just me asking on my phone

#

seems interesting though

warm cave May 26, 2024, 10:32 PM

#

Yeah, there is promise

sleek vortex May 26, 2024, 10:32 PM

#

let’s try the flights one

agile jay May 26, 2024, 10:32 PM

#

For specific tasks, finetunes or multi shot reasoning is the way.

warm cave May 26, 2024, 10:33 PM

#

agile jay For specific tasks, finetunes or multi shot reasoning is the way.

Yeah but multishot is good until you have many abilities, how would you manage it after a certain point?

agile jay May 26, 2024, 10:34 PM

#

warm cave Yeah but multishot is good until you have many abilities, how would you manage it ...

You just fill the context with multiple examples of how to do the task.

warm cave May 26, 2024, 10:35 PM

#

More like how do you manage when you have 100 abilities each with 3 examples.

#

Bc I think we are talking about a general purpose agent

sleek vortex May 26, 2024, 10:35 PM

#

Given the users query and available workers, break it down into steps and also mention which steps are dependent on each other
Query:
plan me a trip to japan somewhere in the month of may 2024
Available workers:
* Searching the web for something, and reading websites and returning relevant information to some keywords
* Playing a song given a specific spotify track ID, or the name and artist
* Getting the weather at one specific location
* Airport code search (enter a rough name and it gives the airport code)
* Retrieve the price for a flight, given the take off and landing destinations and the day

#

Let’s try this

#

this list of abilities is getting longer as we go lol

#

which is expected

agile jay May 26, 2024, 10:36 PM

#

warm cave Bc I think we are talking about a general purpose agent

Fine-tuning

sleek vortex May 26, 2024, 10:37 PM

#

why can’t you upload as file on mobile

#

📎 x.json

#

well this is in natural language as again i’m on my phone

#

we would have to make it more rigid as to specify it all in actual json

#

maybe give the list of search queries

#

and what inputs go to what next agents

agile jay May 26, 2024, 11:29 PM

#

Yep, or a schema of the tools etc
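Making the plan "more rigid" could start with a plain check that rejects plans the rest of the pipeline cannot run. A minimal sketch, assuming the `steps` / `id` / `dependencies` / `workers` layout from the earlier JSON; `validate_plan` is a hypothetical helper, not part of any library:

```python
def validate_plan(plan: dict, known_workers: set) -> list:
    """Return a list of problems; an empty list means the plan looks usable."""
    steps = plan.get("steps")
    if not isinstance(steps, list):
        return ["'steps' must be a list"]
    problems = []
    ids = {s["id"] for s in steps if isinstance(s, dict) and isinstance(s.get("id"), int)}
    for i, step in enumerate(steps):
        if not isinstance(step, dict):
            problems.append(f"step {i} is not an object")
            continue
        # Check the basic type of each required field.
        for key, kind in (("id", int), ("dependencies", list), ("workers", list)):
            if not isinstance(step.get(key), kind):
                problems.append(f"step {i}: '{key}' must be {kind.__name__}")
        deps = step.get("dependencies")
        if isinstance(deps, list):
            for dep in deps:
                if not isinstance(dep, int) or dep not in ids:
                    problems.append(f"step {i}: unknown dependency {dep}")
        workers = step.get("workers")
        if isinstance(workers, list):
            for worker in workers:
                if not isinstance(worker, str) or worker not in known_workers:
                    problems.append(f"step {i}: unknown worker {worker!r}")
    return problems
```

The returned messages could be fed back to the model as a correction prompt, though whether that works well with a small model is untested here.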

cinder comet May 26, 2024, 11:38 PM

#

@sleek vortex did you start that lil project?

agile jay May 26, 2024, 11:40 PM

#

Yep, I think it's been around a week since they started.

cinder comet May 27, 2024, 12:33 AM

#

agile jay Yep, I think it's been around a week since they started.

started what?

agile jay May 27, 2024, 12:34 AM

#

cinder comet started what?

I think it's like perplexity, where it uses sources to answer queries.

cinder comet May 27, 2024, 1:01 AM

#

agile jay I think it's like perplexity, where it uses sources to answer queries.

interesting

#

ive seen a lot of similar projects recently

#

its hard to find that new value added

agile jay May 27, 2024, 1:07 AM

#

cinder comet its hard to find that new value added

Yep, probably speed, accuracy and number of sources are the main vectors of progress available.

thorn lava May 27, 2024, 3:28 AM

#

hh ┬─┬ノ( º _ ºノ)

supple pecan May 27, 2024, 5:32 AM

#

I'm curious, I clicked share link as I was planning to share with myself via email a thread I was about a week deep in. There's an eye icon (web version) right next to my name on the top left of the interface, with the number "2" next to it seemingly indicating two people have viewed it. Maybe I'm wrong? Either way, I never shared the link, nor opened it so I'm slightly concerned if someone else has viewed it.. and also that there doesn't seem to be a way to change it back to private other than to put it in a "collection". Can anyone shed some light on this?

austere kestrel May 27, 2024, 5:42 AM

#

sleek vortex we would have to make it more rigid as to specify it all in actual json

you can coerce json-only outputs with prompting, but increasingly models have json mode as a built-in option - it's definitely superior / preferable (like one malformed JSON can break the whole flow..)

#

the newer openai, anthropic, google and mistral models all have json mode i believe. But yeah aside from Mistral, none are OS (not sure about llama3 - I don't think it supports json)

#

https://docs.anthropic.com/en/docs/control-output-format
https://docs.mistral.ai/capabilities/json_mode/
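Where a built-in JSON mode is not available, the "one malformed JSON can break the whole flow" problem can be softened with a defensive parse plus a retry. A sketch using only the standard library; `call_model` is a hypothetical stand-in for whatever API client is in use:

```python
import json
from typing import Callable, Optional


def extract_json(raw: str) -> Optional[dict]:
    """Try to parse a JSON object out of model output, tolerating extra prose."""
    start, end = raw.find("{"), raw.rfind("}")
    if start == -1 or end <= start:
        return None
    try:
        parsed = json.loads(raw[start:end + 1])
    except json.JSONDecodeError:
        return None
    return parsed if isinstance(parsed, dict) else None


def ask_for_json(call_model: Callable[[str], str], prompt: str, retries: int = 2) -> dict:
    """Call the model until it returns parseable JSON or the retries run out."""
    for _ in range(retries + 1):
        parsed = extract_json(call_model(prompt))
        if parsed is not None:
            return parsed
    raise ValueError("model did not return valid JSON")
```

This only guards against unparseable output; it does not check that the parsed object has the expected fields.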

austere kestrel May 27, 2024, 6:10 AM

#

supple pecan I'm curious, I clicked share link as I was planning to share with myself via ema...

once you press share link, i think it becomes possible for a search engine's crawler to pick it up (especially if the link is shared in a tweet or something like that, but conceivably that is not necessary).. https://search.yahoo.com/search?p=site%3Aperplexity.ai%2Fsearch%2F

fringe harness May 27, 2024, 6:34 AM

#

Just joined pro! Love this application, have been using the free version almost daily for almost two months

supple pecan May 27, 2024, 6:41 AM

#

austere kestrel once you press share link, i think it becomes possible for a search engine's cra...

Interesting, I would have thought you'd have to actually share the link in order for it to be searchable. I feel like there could be some serious privacy concerns if simply sharing, especially if it's just to yourself on another device of yours, means that it's now searchable by the public

austere kestrel May 27, 2024, 7:10 AM

#

supple pecan Interesting, I would have thought you'd have to actually share the link in order...

yeah agree, but would also note that 2,730, I assume, is only a very small fraction of all the threads ever made shareable by perplexity users - a lot of the top results are from threads from the perplexity team, posted on twitter. Though not all are. I'm not sure why a handful are getting indexed and thus effectively searchable without having the link

carmine prairie May 27, 2024, 7:15 AM

#

list top 10 glass bottle supplier in Europe

warm cave May 27, 2024, 7:30 AM

#

carmine prairie list top 10 glass bottle supplier in Europe

Screenshot_2024-05-27_at_12.30.17_AM.png

supple pecan May 27, 2024, 8:02 AM

#

austere kestrel yeah agree, but would also note that 2,730, I assume, is only a very small fract...

Good point about the small number, but there should be a 0% chance of shared links, especially ones to yourself, containing private information, etc should be crawled and able to be publicly searched at all. That seems like a pretty big oversight to be honest not to mention legal violations. That should probably be confirmed and looked into a little deeper by the legal team if its true crawled pages that have been linked are searchable to the public, that's begging for a lawsuit tbh

supple pecan May 27, 2024, 8:26 AM

#

As a pro subscriber with "priority help request services", I sent an email asking a much simpler question and never received a response so this was the straw that broke the camel's back for me. Having linked threads be searchable (when asking Perplexity itself it denies this very broadly by saying

"No, Perplexity's threads are private by default and not crawled or indexed by search engines like Google. According to Perplexity's FAQ: "Your Threads are private and only for your reference by default. To make a thread public, click on the top right corner on the share button, and select sharable. Anyone with a link will be able to view your thread." So unless you explicitly share a thread and make it public, your conversations with Perplexity are kept private and not searchable on the open web. Even public threads are only accessible through the shared link, not through general web searches."

But @warm cave your picture proves it so I tried it myself and indeed, there are linked threads from users (can't tell where they're linked to) that are searchable. Canceled the sub after that. No thank you.

tranquil orchidBOT May 27, 2024, 8:26 AM

#

supple pecan As a pro subscriber with "priority help request services", I sent an email askin...

Hey @supple pecan!

If you find the original message helpful, please consider reacting to it with the :star: emoji. If the post is appreciated by the community and receives 5 stars, it will go to the https://discord.com/channels/1047197230748151888/1082806833938436228 channel and the post author will get the <@&1082034222778302614> role on Perplexity.

empty lotus May 27, 2024, 8:39 AM

#

Did the chatbot become restrictive?
Now I am getting a message that it is not able to find an answer in the provided search results.
I didn't provide any links in the query. I feel like some change happened

#

I used Claude Opus. Always I get best result in Opus. Not sure from today what happened

jovial mantle May 27, 2024, 8:59 AM

#

I did some absolutely non statistically relevant tests on perplexity, and I found that free chatgpt has become a bit better?

#

I was using perplexity with pro active

tame current May 27, 2024, 9:03 AM

#

jovial mantle I did some absolutely non statistically relevant tests on perplexity, and I foun...

it's a new model, called GPT-4o

#

that's why chatgpt feels better

#

it's basically a GPT4 level model but free (10 messages per 3-4 hours)

jovial mantle May 27, 2024, 9:05 AM

#

Oh, thank you

cinder comet May 27, 2024, 9:13 AM

#

tame current it's basically a GPT4 level model but free (10 messages per 3-4 hours)

ive never run out of gpt4o usage on chatgpt tbh

#

it feels like 50 msg per hour or so

harsh stag May 27, 2024, 9:14 AM

#

80 per 3 hours

tame current May 27, 2024, 9:16 AM

#

cinder comet ive never run out of gpt4o usage on chatgpt tbh

are you using chatgpt plus ?

cinder comet May 27, 2024, 9:16 AM

#

tame current are you using chatgpt plus ?

yea

cinder comet May 27, 2024, 9:17 AM

#

harsh stag 80 per 3 hours

it feels more tbh

tame current May 27, 2024, 9:17 AM

#

that's why ig

#

free version is pretty limited

#

the limit for plus is "five times higher than free"

cinder comet May 27, 2024, 9:22 AM

#

tame current free version is pretty limited

the free ver is 10msgs per 4h?

sleek vortex May 27, 2024, 9:23 AM

#

cinder comet <@957611986835898441> did you start that lil project?

i didn’t start the pplx labs sidebar thing

#

but i’m invested in this little perplexity clone where i’m trying to adopt agent structure, instead lol

cinder comet May 27, 2024, 9:24 AM

#

sleek vortex i didn’t start the pplx labs sidebar thing

im doing some progress on it, but idk if im doing it right tbh, how would you "save convo" and then continue from where you left?

#

like im doing this custom prompt but then it depends on the model context size

cinder comet May 27, 2024, 9:24 AM

#

sleek vortex but i’m invested in this little perplexity clone where i’m trying to adopt agent...

oh

sleek vortex May 27, 2024, 9:24 AM

#

right now i’ve got responses as good as perplexity copilot v1 in like 30-40 seconds response time using free gpu for embeddings and free claude haiku credits

cinder comet May 27, 2024, 9:25 AM

#

sleek vortex right now i’ve got responses as good as perplexity copilot v1 in like 30-40 seco...

no way lol

sleek vortex May 27, 2024, 9:25 AM

#

cinder comet im doing some progress on it, but idk if im doing it right tbh, how would you "s...

we’d have to somehow reverse engineer the state right??

#

because we want to load the convo into the UI

sleek vortex May 27, 2024, 9:25 AM

#

cinder comet no way lol

it’s only so slow because of cold start gpu embeddings 😦

cinder comet May 27, 2024, 9:25 AM

#

sleek vortex we’d have to somehow reverse engineer the state right??

im just using indexedDB to store the convo + inject prompt ( User: Message1; AI: Message2 ) and continue from that
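
A minimal sketch of the "store the convo + inject prompt" idea, written in Python for brevity rather than as browser code. The storage layer (IndexedDB in the message above) is left out; `Turn`, `build_prompt`, and the `max_chars` budget are hypothetical names for illustration, and the character budget is only a rough stand-in for the model's context size:

```python
from dataclasses import dataclass


@dataclass
class Turn:
    role: str  # "User" or "AI"
    text: str


def build_prompt(history: list[Turn], new_message: str, max_chars: int = 8000) -> str:
    """Replay stored turns as 'User: ...' / 'AI: ...' lines.

    Walks the history newest-first and stops adding turns once the
    character budget is used up, so the oldest turns are dropped first.
    """
    tail = f"User: {new_message}\nAI:"
    budget = max_chars - len(tail)
    kept: list[str] = []
    for turn in reversed(history):
        line = f"{turn.role}: {turn.text}"
        cost = len(line) + 1  # +1 for the newline that joins the lines
        if cost > budget:
            break
        kept.append(line)
        budget -= cost
    kept.reverse()  # restore chronological order
    return "\n".join(kept + [tail])


history = [
    Turn("User", "What is IndexedDB?"),
    Turn("AI", "A browser database for structured data."),
]
print(build_prompt(history, "Can it store chat logs?"))
```

Dropping the oldest turns first is just one possible policy; summarising old turns instead would keep more of the conversation at the cost of an extra model call.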

sleek vortex May 27, 2024, 9:25 AM

#

sleek vortex it’s only so slow because of cold start gpu embeddings 😦

but whatever

cinder comet May 27, 2024, 9:26 AM

#

sleek vortex it’s only so slow because of cold start gpu embeddings 😦

yea 30sec is a bit...

sleek vortex May 27, 2024, 9:26 AM

#

cinder comet im just using indexedDB to store the convo + inject prompt ( User: Message1; AI:...

yeah but idk it would be nice if we could show the messages in the actual ui

sleek vortex May 27, 2024, 9:27 AM

#

cinder comet yea 30sec is a bit...

if i leave the gpu on then it goes down to like 20 seconds

#

but this is with only one pipeline

#

i’m yet to add agent like splitting

#

which might make it longer, but idk. what i think will happen is it will barely get longer but quality should improve a ton

cinder comet May 27, 2024, 9:27 AM

#

sleek vortex yeah but idk it would be nice if we could show the messages in the actual ui

im not showing it

#

gimme a sec

#

this is what it looks like so far

#

ive hidden the original textarea

#

its quite similar

cinder comet May 27, 2024, 9:29 AM

#

sleek vortex if i leave the gpu on then it goes down to like 20 seconds

like is it the same as perplexity or it has some new features?

sleek vortex May 27, 2024, 9:29 AM

#

well i plan to add new features

#

i was discussing it yesterday

sleek vortex May 27, 2024, 9:30 AM

#

sleek vortex It would be pretty cool if i could ask the thing to be like

.

#

also been working on getting code interpreter working so

#

similar to chatgpt but better in that there should be internet access

#

and maybe more languages other than python

cinder comet May 27, 2024, 9:30 AM

#

sleek vortex find out a list of the 50 most popular artists by spotify album sales and also b...

oh this is actually a good idea

sleek vortex May 27, 2024, 9:31 AM

#

cinder comet oh this is actually a good idea

if lots of people like it i might make it a real website lol

#

i could ask modal for some credits as a startup

#

they do like “upto 25k$ in credits just ask us!!”

#

which is promising?

#

literally right now been running off the free $30 credits so

cinder comet May 27, 2024, 9:32 AM

#

sleek vortex they do like “upto 25k$ in credits just ask us!!”

under what conditions?

#

a promising project

sleek vortex May 27, 2024, 9:32 AM

#

startups or educational institutions

cinder comet May 27, 2024, 9:32 AM

#

oh i see

sleek vortex May 27, 2024, 9:32 AM

#

cinder comet a promising project

yeah id assume

sleek vortex May 27, 2024, 9:32 AM

#

sleek vortex literally right now been running off the free $30 credits so

and even then i have it optimised enough to use like $0.0001 per query on embeddings

#

that’s embedding like 300-500 512 character chunks so
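
For a rough feel of those numbers, here is a small sketch of fixed-size chunking and the implied per-chunk cost. It assumes plain character-based splitting with no overlap, which may not be what the actual pipeline does; `chunk_text` and `cost_per_chunk` are hypothetical helpers, and 400 chunks is just the midpoint of the 300-500 range quoted above:

```python
def chunk_text(text: str, size: int = 512) -> list[str]:
    """Split text into consecutive chunks of at most `size` characters."""
    if size <= 0:
        raise ValueError("size must be positive")
    return [text[i:i + size] for i in range(0, len(text), size)]


def cost_per_chunk(cost_per_query: float, chunks_per_query: int) -> float:
    """Average embedding cost of one chunk, given the cost of a whole query."""
    return cost_per_query / chunks_per_query


page = "x" * 1300  # stand-in for scraped page text
print([len(c) for c in chunk_text(page)])  # [512, 512, 276]
print(cost_per_chunk(0.0001, 400))  # dollars per chunk at ~400 chunks per query
```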

#

not bad

#

i could improve performance with a bigger model and/or a bigger gpu, but at the expense of speed and cost

#

models using groq and haiku

#

i could go as far as to run the llm on modal itself if i need to

#

like if i fine tuned a 8b as my decision model/ worker model instead of using groq

tame current May 27, 2024, 9:34 AM

#

cinder comet the free ver is 10msgs per 4h?

yes

sleek vortex May 27, 2024, 9:35 AM

#

tame current yes

really can’t do much with 10 messages ngl

cinder comet May 27, 2024, 9:35 AM

#

sleek vortex that’s embedding like 300-500 512 character chunks so

yea its good

#

if you are already getting outputs similar to perp with only that

#

then its a good start tbh

sleek vortex May 27, 2024, 9:35 AM

#

cinder comet yea its good

i beat googles ai search in misinformation tests so um

tame current May 27, 2024, 9:35 AM

#

sleek vortex really can’t do much with 10 messages ngl

on the other hand, it's better than not having access at all (as was the case for a long time)

cinder comet May 27, 2024, 9:35 AM

#

sleek vortex i beat googles ai search in misinformation tests so um

lmao

#

google getting another hit

sleek vortex May 27, 2024, 9:36 AM

#

sleek vortex > I do not have enough information in the provided context to recommend how many...

mine said this to the eating rocks query

#

google well…

tame current May 27, 2024, 9:36 AM

#

sleek vortex mine said this to the eating rocks query

not bad

sleek vortex May 27, 2024, 9:37 AM

#

cinder comet lmao

oh and i’m only now feeding like 3k of context to haiku at the end

tame current May 27, 2024, 9:37 AM

#

google's ai overview is half baked

sleek vortex May 27, 2024, 9:37 AM

#

which is extremely cheap

#

but it’s a choice between sending the whole sources, 10-60k tokens (response time 12-15s), and just sending a refined 3k (30-40s with a cold gpu)

#

i think refined is still the way to go

#

quality of response is the same or even better

#

when sending the huge amount of tokens i think haiku sometimes got confused?
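
The "refined 3k" approach can be sketched roughly like this: rank the embedded chunks against the query and keep the best ones until a token budget is filled. This is an illustration under assumptions, not the actual pipeline; `refine_context` is a hypothetical name, the vectors would come from whatever embedding model is in use, and tokens are approximated as 4 characters each:

```python
import math


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def refine_context(query_vec, chunks, token_budget=3000, chars_per_token=4):
    """chunks is a list of (text, vector) pairs.

    Returns the texts of the highest-scoring chunks that fit the budget,
    in descending order of similarity to the query.
    """
    ranked = sorted(chunks, key=lambda c: cosine(query_vec, c[1]), reverse=True)
    picked, used = [], 0
    for text, _vec in ranked:
        tokens = math.ceil(len(text) / chars_per_token)  # crude token estimate
        if used + tokens > token_budget:
            continue  # too big for what is left; a smaller chunk may still fit
        picked.append(text)
        used += tokens
    return picked


# Toy 2-d vectors standing in for real embeddings.
chunks = [("a" * 40, [1.0, 0.0]), ("b" * 40, [0.0, 1.0]), ("c" * 8, [0.9, 0.1])]
print(refine_context([1.0, 0.0], chunks, token_budget=12))
```

Only the selected chunks are then sent to the final model, which is what keeps the prompt near 3k tokens instead of 10-60k.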

tame current May 27, 2024, 9:38 AM

#

yes

#

perplexity too

#

used to, at least

sleek vortex May 27, 2024, 9:39 AM

#

idk they’re building their own indexer infra right

tame current May 27, 2024, 9:39 AM

#

btw, have you considered DeepseekV2?

sleek vortex May 27, 2024, 9:39 AM

#

i could try do that on an absolute budget

#

i thought about that

#

how i could index the most popular sites and embed them beforehand

tame current May 27, 2024, 9:40 AM

#

you mean, filter out the sites to the reliable ones ?