What is this?
This plugin scrapes an entire website and ingests all of its pages and PDFs into the Rabbit Hole.
Usage
After installing the plugin, type "@scrapycat" followed by the URL of the website to scrape.
The URL must be the website's root URL (its homepage). Ingestion may take a long time; wait for the Cat's response, which reports the number of URLs and PDFs ingested.
Settings
In the plugin settings you can set "Ingest PDF". If this setting is enabled, the plugin also ingests the PDFs found on the website.
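To give a rough idea of what scraping a site with an optional PDF step involves, here is a minimal sketch in Python using only the standard library. It is not the plugin's actual code: the function name classify_links, the helper class _LinkCollector, and the ingest_pdf parameter (meant to mirror the "Ingest PDF" setting) are hypothetical and chosen for illustration. The sketch takes the HTML of one page, keeps only links on the same host as the root URL, and separates ordinary pages from PDFs.

```python
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse


class _LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag in an HTML document."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def classify_links(html, page_url, root_url, ingest_pdf=True):
    """Return (pages, pdfs): same-host links found in `html`.

    Hypothetical helper, not part of the plugin. `ingest_pdf` is an
    assumed flag that mimics the "Ingest PDF" setting.
    """
    root_host = urlparse(root_url).netloc
    collector = _LinkCollector()
    collector.feed(html)

    pages, pdfs = [], []
    for href in collector.hrefs:
        # Resolve relative links and drop any "#fragment" part.
        absolute, _fragment = urldefrag(urljoin(page_url, href))
        parsed = urlparse(absolute)
        # Skip mailto:, javascript:, and links to other hosts.
        if parsed.scheme not in ("http", "https") or parsed.netloc != root_host:
            continue
        if parsed.path.lower().endswith(".pdf"):
            if ingest_pdf and absolute not in pdfs:
                pdfs.append(absolute)
        elif absolute not in pages:
            pages.append(absolute)
    return pages, pdfs
```

A full crawler would repeat this for every page it discovers, starting from the root URL and tracking which URLs it has already visited.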
Example
"@scrapycat https://cheshire-cat-ai.github.io/docs/"
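As an illustration of the message format shown above, the following Python sketch checks that a message starts with "@scrapycat" and carries an http(s) URL. The function parse_scrapycat_command is a hypothetical name used here for illustration; the plugin may handle messages differently.

```python
from urllib.parse import urlparse


def parse_scrapycat_command(message):
    """Return the URL from an "@scrapycat <url>" message, or None.

    Hypothetical helper, not taken from the plugin's source.
    """
    parts = message.strip().split(maxsplit=1)
    if len(parts) != 2 or parts[0] != "@scrapycat":
        return None
    url = parts[1].strip()
    parsed = urlparse(url)
    # Accept only absolute http(s) URLs that name a host.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        return None
    return url
```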
Plugin repo:
https://github.com/team-sviluppo/cc_scrapycat