#ChatPDF.com

64 messages · Page 1 of 1 (latest)

blazing egret
#

I created ChatPDF - the easy and fast way to chat with any PDF using the new ChatGPT API, free and without sign-in. Check it out at https://ChatPDF.com and let me know what you think!

swift jolt
#

Awesome! 👏 How did you build it? Using LangChain or GPT-Index for document embedding?

upper niche
#

yoo, this is useful, thanks!

#

btw, how do you keep it free? I mean, the API is paid, right?

silver tiger
#

Thank you so much @blazing egret. Do you plan to add paid plans? I find it very useful and gave you a vote on Product Hunt! Please keep up the work and post updates here.

weary crescent
#

This is so cool

jaunty current
#

But using your own API key would be better

solemn wharf
#

Open source?

fast badger
#

It's awesome, any plans to make it open source? I want to try it on a 3k-page document

#

upvoted 🙂

nocturne flicker
#

Who has access to the data on PDFs posted there?

#

@blazing egret

neon wave
#

Hi all. I uploaded a 372-page PDF (Italian-language construction code, 10 MB). A few good answers, many wrong ones. However, the idea is good. Thank you for your work @blazing egret

ivory raft
#

I really like the idea. I tested it with a short audio transcript (for my studies); it works well, all answers were correct.

ivory raft
# nocturne flicker Who has access to the data on PDFs posted there?

Personally I would not rely on a question-and-answer format for this. You might want to avoid anything too sensitive: even in the best-case scenario it still goes to OpenAI after the website, and I do believe a good number of their hundreds of employees have somewhat free (or at least relatively attainable) access to it~

dusk anvil
#

Can it read textbooks?

zealous hemlock
#

Good @blazing egret

dusk anvil
#

Nice!!

ivory raft
# blazing egret Thanks for the feedback! Intriguing idea with the diff. Could you give a specifi...

Perhaps with similar content, such as a new contract from the same company (which is what made me think of this in the first place), or perhaps to compare two offers from different companies.
For techniques, it could be a new approach or a new theory on a scientific subject, for example comparing the older backpropagation algorithm to the newer forward-forward algorithm for ML training, or two different medical procedures like LASIK vs. IOL for eyesight,
or two competing papers with different theories on the same subject, like ekpyrotic universe theory versus plasma cosmology, or something like that.

#

maybe even to compare original to a translated copy to check for discrepancies

fresh lance
#

@blazing egret please make the code available on Gitlab or Github for everyone. Your work is excellent and everybody should have this on their computer, also maybe other developers can contribute and make it even more interesting.

latent cairn
#

Can it be done for a large number of related legal documents?

ivory raft
#

but ofc, if mat is comfortable I might end up contributing to a repo, so, either way

upper niche
#

To be honest, I was really hoping there would be an offline version available. I'd pay some bucks for it.

fresh parrot
#

That’s an unlikely scenario

#

you should host it on your own physical server

#

Actually you can build it on Azure or on top of the OpenAI API

ivory raft
#

<Embedding text failed: Request failed with status code 401 > any idea what's up with this? ^^

little mapleBOT
#

@ivory raft

Eslof's message blocked by AutoMod

We are committed to fostering a positive atmosphere and have implemented additional filters to block any inappropriate or disruptive language and topics.

ivory raft
#

if I have some time I'm gonna see if I can try out what ((that dude earlier whose name is content blocked by the server)) mentioned, e.g. combining two PDFs

night dune
#

Hi, great application here. I'm developing my Discord bot to have a similar feature. Would you mind sharing the npm package you use for extracting the PDF content? If my idea is correct, the process involves extracting the text from the PDF, converting it to embeddings, and storing them in a database or locally.
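
The extract, embed, and store process described in this message can be sketched roughly as below. This is only an illustration, not how ChatPDF is actually built: the names `chunk_text`, `toy_embed`, `build_index`, and `search` are hypothetical, the "embedding" is a hashed bag-of-words stand-in for a real embedding API call, the index is a plain in-memory list, and the PDF text is assumed to have been extracted already.

```python
import math
import zlib
from typing import List, Tuple

DIM = 64  # size of the toy embedding vectors (an arbitrary choice)


def chunk_text(text: str, max_words: int = 50) -> List[str]:
    """Split already-extracted PDF text into chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def toy_embed(text: str) -> List[float]:
    """Hypothetical stand-in for a real embedding API call.

    Hashes each word into one of DIM buckets and L2-normalizes the counts.
    """
    vec = [0.0] * DIM
    for word in text.lower().split():
        vec[zlib.crc32(word.encode("utf-8")) % DIM] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def build_index(text: str, max_words: int = 50) -> List[Tuple[str, List[float]]]:
    """Store each chunk next to its embedding (here: a list kept in memory)."""
    return [(chunk, toy_embed(chunk)) for chunk in chunk_text(text, max_words)]


def search(index: List[Tuple[str, List[float]]], question: str, top_k: int = 1) -> List[str]:
    """Return the top_k chunks whose embeddings are most similar to the question."""
    q = toy_embed(question)
    ranked = sorted(
        index,
        key=lambda item: sum(a * b for a, b in zip(q, item[1])),  # cosine similarity of unit vectors
        reverse=True,
    )
    return [chunk for chunk, _ in ranked[:top_k]]


if __name__ == "__main__":
    index = build_index("apple apple apple apple rocket rocket rocket rocket", max_words=4)
    print(search(index, "apple"))
```

In a real bot, the chunks returned by `search` would then be pasted into the prompt sent to the chat model along with the user's question.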

ivory raft
# ivory raft if I have some time I'm gonna see if I can try out what ((that dude earlier whos...

I was able to put in a PDF going over recent findings from JWST and got it to narrow down which alternative to the current Big Bang theory most aligns with the findings. I got a pretty decent explanation for why a cyclic model of the universe might cause a kind of build-up of mass over cycles, which might lead to these abnormally large early formations~ but I had to argue with it a bit to get it to pull comparisons from its training data instead of the PDF.
Pretty cool. I honestly think that with two papers at hand it could do well. I've been trying to find a good example where both fit into the current limit as a single paper, with some kind of divider I could prompt it to work with for testing~

ivory raft
#

Another suggestion: Download conversation button.
Say I had a really fruitful discussion about a paper and want to share the conversation with someone. At the moment I'm doing right click + save as (Web page, Single File) mht/mhtml, but a cleaner export (the height of the chat box gets locked with the aforementioned method), or an image, PDF, or md of the conversation, would be cool.

ivory raft
#

Great work here!

upper niche
#

Hello! Is there any way for us to insert our own API keys as an option? I noticed there has been a change of system?

cursive stone
# blazing egret Super interesting but seems like it would need big-picture understanding of the ...

There is a version of GPT that has 32,000 tokens of memory (context). If I recall correctly, 750 words is about 1,000 tokens, so that's 24,000 words, or roughly 50 pages, of memory that it can work with. I imagine it would be able to give a rather good summary of most non-review papers, but it would be rather expensive compared to the current methods, which consist of summarizing chunks of text.
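
The back-of-the-envelope numbers in this message can be checked with a few lines. The 750-words-per-1,000-tokens ratio is the rule of thumb quoted in the message; the 480 words per page is not stated anywhere in the thread and is only the value that "24,000 words or roughly 50 pages" implies, so treat it as an assumption.

```python
# Rule of thumb quoted in the message: about 750 words per 1,000 tokens.
WORDS_PER_TOKEN = 750 / 1000

# Assumption (not from the thread): about 480 words per page, which is what
# "24,000 words or roughly 50 pages" works out to.
WORDS_PER_PAGE = 480


def context_capacity(tokens: int):
    """Convert a token budget into approximate words and pages."""
    words = round(tokens * WORDS_PER_TOKEN)
    pages = words / WORDS_PER_PAGE
    return words, pages


words, pages = context_capacity(32_000)
print(words, pages)  # prints "24000 50.0"
```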

The issue with the current method is that in the process of summarizing chunks some context might be lost, and summaries of summaries can become opaque (in my experience). There is a mild fix for this, but it is not perfect and it increases computation time and cost, although it might still be cheaper than a 32,000-token model (I did not compare the costs).
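
A minimal sketch of the chunk-then-summarize approach described above, to make the "summaries of summaries" problem concrete. Everything here is illustrative: `hierarchical_summary` and `split_into_chunks` are hypothetical names, and `toy_summarize` is a stand-in for a real model call that simply keeps the first ten words of each chunk. The "mild fix" mentioned in the message is not specified in the thread, so it is not shown.

```python
from typing import Callable, List


def split_into_chunks(text: str, max_words: int) -> List[str]:
    """Split text into consecutive chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def hierarchical_summary(text: str, summarize: Callable[[str], str], max_words: int = 200) -> str:
    """Summarize chunk by chunk, then summarize the joined summaries,
    repeating until the text fits into a single chunk."""
    while len(text.split()) > max_words:
        chunks = split_into_chunks(text, max_words)
        shorter = " ".join(summarize(chunk) for chunk in chunks)
        if len(shorter.split()) >= len(text.split()):
            raise ValueError("summarize() must shorten its input")
        text = shorter
    return summarize(text)


def toy_summarize(chunk: str) -> str:
    """Hypothetical stand-in for a model call: keeps only the first 10 words."""
    return " ".join(chunk.split()[:10])


if __name__ == "__main__":
    document = " ".join(f"w{i}" for i in range(1000))
    print(hierarchical_summary(document, toy_summarize, max_words=200))
```

With the toy summarizer, anything past the first ten words of each chunk never reaches the second round, which is the same kind of context loss the message describes, just in an exaggerated form.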