crawlee-js
Cheerio extract function
CheerioCrawler
Unit Tests With Crawlee
Web-Scraping
Memory for only 1 browser is 12gb? How to ensure clean up after pages?
Automation
PuppeteerCrawler
custom logic for status codes
PuppeteerCrawler
Error accessing the requests sometimes when two crawlers are running in parallel
Automation
RequestQueue
CheerioCrawler
Crawlee Proxies
PuppeteerCrawler
Web-Scraping
TargetClosedError
PlaywrightCrawler
Web-Scraping
Want to scrape multiple elements on a web page
Web-Scraping
proxy issues
Automation
PlaywrightCrawler
Web-Scraping
elementHandle.$$: Target page, context or browser has been closed
Automation
PlaywrightCrawler
Web-Scraping
RequestQueue
Scraper testing
Web-Scraping
Web crawler got blocked in Selenium
Automation
Web-Scraping
Requests timing out - best practices?
PlaywrightCrawler
Retry after 30 seconds for failed requests
Automation
Web-Scraping
CheerioCrawler
Encountering net::ERR_TIMED_OUT despite connection being fine
PlaywrightCrawler
scrapy signals equivalent
PuppeteerCrawler
PlaywrightCrawler
HttpCrawler
Web-Scraping
CheerioCrawler
how to make follow_redirects=false in CheerioCrawler
CheerioCrawler
Call for help, Tokopedia Scraper too slow and costly?
PlaywrightCrawler
AdaptivePlaywrightCrawler: programmatically deciding when to render JS
PlaywrightCrawler
Web-Scraping
Does crawlee already implement this fingerprint generator?
Automation
Cannot use Sentry
PlaywrightCrawler
error TS2307: Cannot find module '@apify/log/log' or its corresponding type declarations.
PlaywrightCrawler
Open chrome (not test version) in crawlee
PuppeteerCrawler
Suggestions
PuppeteerCrawler waitForResponse timeout issue. Seems like it skips desired request
PuppeteerCrawler
Web-Scraping
Need serious help scaling crawlee
PuppeteerCrawler
Automation
Web-Scraping
Suggestions
How to "store" and "retrieve" a browser on a per user basis?
Automation
Web-Scraping
WARN PuppeteerCrawler: Reclaiming failed request back to the list or queue. Expected `key` to be of
PuppeteerCrawler
got-scraping vs CheerioCrawler or sendRequest
Web-Scraping
CheerioCrawler
Trying to optimize autoscale options
Web-Scraping
more than one request queue
PuppeteerCrawler
Web-Scraping
Add certificates to Playwright crawler using Chromium
PlaywrightCrawler
Web-Scraping
Failing Dockerfile after updating routes.ts
PuppeteerCrawler
Web-Scraping
useState not working as expected.
HttpCrawler
router.addHandler from a separate function
Web-Scraping
Intercepting requests
PlaywrightCrawler
Web-Scraping
No option for ignoring SSL Errors with PlaywrightCrawler?
PlaywrightCrawler
Crawlee failed creepjs test
PlaywrightCrawler
Automation
Web-Scraping
Suggestions
Unexpected results when currying a value into the request handler
PlaywrightCrawler
Pagination assistance
Web-Scraping
Go-to solution to prevent recrawl?
Data Storage
preNavigationHook needs to listen to response from network and change goToOptions.
PuppeteerCrawler
Automation
Suggestions
Is it better to separate scraper by domains?
Web-Scraping
How to reset crawlee URL cache/add the same URL back to the requestQueue?
Web-Scraping
CheerioCrawler
Strategy to prevent crawling data that has been crawled before
Web-Scraping
RequestQueue
Data Storage
Is there a way to use both PlaywrightCrawler and CheerioCrawler?
PlaywrightCrawler
CheerioCrawler
Strange Sitemaps. What can I do to use them?
Web-Scraping
Suggestions
CheerioCrawler
Anyone with a crawler with a lot of route handlers? Like 100s of route handlers in a single crawler
HttpCrawler
Suggestions
Web-Scraping
CheerioCrawler
Crawler becomes idle after some time (queue not empty)
Suggestions
Already crawled URLs
RequestQueue
Reduce time between "PlaywrightCrawler: Starting the crawler." and the "requestHandler"
Automation
PlaywrightCrawler