I added a PDF of the book "Freedom at Midnight", which I have checked through myself, to the configuration of this GPT. However, it struggles to actually parse it when prompted. It can pick up big-picture topics like Gandhi and other historical aspects of the book, but it keeps returning blank outputs and an error when asked to go through the text for chapter summaries or outlines. Any tips on how to work around this?
#GPT not able to parse book pdf file properly?
An example of a page from the book pdf.
The pdf is 672 pages long and its file size is 290 MB
pdf too big i guess
I don't believe that's the problem. According to this link (https://platform.openai.com/docs/assistants/how-it-works), "You can attach a maximum of 20 files per Assistant, and they can be at most 512 MB each. In addition, the size of all the files uploaded by your organization should not exceed 100GB."
But that's the docs for the Assistants API, not custom GPTs
False PDF
are there images in it? 290mb of text?
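A quick back-of-the-envelope check on the numbers posted above (290 MB, 672 pages). This is only a sketch: the 50 KB cutoff is my own rough assumption for what a text-only page might weigh, not a documented limit, and `looks_scanned` is a made-up helper name.

```python
# Numbers taken from this thread.
FILE_SIZE_MB = 290
PAGES = 672

# ASSUMPTION: a plain-text page rarely needs more than ~50 KB.
# This cutoff is a rough guess for illustration, not a standard.
ASSUMED_TEXT_PAGE_KB = 50


def avg_kb_per_page(size_mb: float, pages: int) -> float:
    """Average size per page in KB (1 MB = 1024 KB)."""
    return size_mb * 1024 / pages


def looks_scanned(size_mb: float, pages: int,
                  cutoff_kb: float = ASSUMED_TEXT_PAGE_KB) -> bool:
    """Hypothetical heuristic: heavy pages suggest page images, not text."""
    return avg_kb_per_page(size_mb, pages) > cutoff_kb


per_page = avg_kb_per_page(FILE_SIZE_MB, PAGES)
print(f"{per_page:.0f} KB per page")         # -> 442 KB per page
print(looks_scanned(FILE_SIZE_MB, PAGES))    # -> True
```

Roughly 442 KB per page is far more than text alone would need, so the size by itself already hints that each page is stored as an image.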
There are images in the PDF, e.g.
“True” or digitally created PDFs
Digitally created PDFs, also known as "true" PDFs, are created using software such as Microsoft® Word®, Excel® or via the “print” function within a software application (virtual printer). They consist of text and images.
Both the characters in the text and the meta-information have an electronic character designation. With ABBYY FineReader 15 you can easily search through these PDFs and select, edit or delete text similar to how you would do that in other editable formats such as Microsoft® Word®. The images in digitally created documents can be resized, moved, or deleted.
“Image-only” or scanned PDFs
When scanning hard copy documents on MFPs and office scanners, or when converting a camera image, jpg, tiff or screenshot into a PDF, the content is “locked” in a snapshot-like image.
Such image-only PDF documents contain just the scanned/photographed images of pages, without an underlying text layer. Consequently, image-only PDF files are not searchable, and their text usually cannot be modified or marked up. An “image-only” PDF can be made searchable by applying OCR with which a text layer is added, normally under the page image.
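If you want to check which kind of PDF you have, one way is to try extracting text from every page and see how many pages return anything. A minimal sketch, assuming the third-party `pypdf` package is installed for the extraction step; `text_layer_ratio`, `classify_pdf`, and the 0.5 threshold are hypothetical names and values picked for illustration.

```python
def text_layer_ratio(page_texts):
    """Fraction of pages whose extracted text is not empty or whitespace."""
    if not page_texts:
        return 0.0
    with_text = sum(1 for t in page_texts if t and t.strip())
    return with_text / len(page_texts)


def classify_pdf(page_texts, min_ratio=0.5):
    """Hypothetical rule: mostly empty pages means there is no text layer.

    min_ratio is an assumed threshold, not a standard value.
    """
    if text_layer_ratio(page_texts) >= min_ratio:
        return "has text layer"
    return "image-only"


def extract_page_texts(path):
    """Extract text per page with pypdf (third-party, assumed installed)."""
    from pypdf import PdfReader  # imported here so the rest runs without it

    reader = PdfReader(path)
    return [page.extract_text() or "" for page in reader.pages]


# Usage with a real file (the path is a placeholder):
# print(classify_pdf(extract_page_texts("book.pdf")))

# Stand-in data: a scan returns empty strings for every page.
print(classify_pdf(["", "  ", ""]))              # -> image-only
print(classify_pdf(["Chapter 1", "Some text"]))  # -> has text layer
```

If the result is "image-only", the file needs an OCR pass to add a text layer before anything can read the actual words.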
I just hate pdf
The whole file is pure images
lol