crawlee-js
Efficient CSS selectors
PlaywrightCrawler
Web-Scraping
RequestQueue
Data Storage
How to transfer data between PlaywrightCrawler and CheerioCrawler?
Data Storage
Ignore previously crawled URLs
RequestQueue
How to make Puppeteer crawler ignore errors on page?
PuppeteerCrawler
chromium.launchPersistentContext with Crawlee
PlaywrightCrawler
Web-Scraping
Page.goto never resolves in headful (using XVFB) using `apify/actor-node-puppeteer-chrome` Docker
PuppeteerCrawler
Throw error that respects maxRequestRetries
PuppeteerCrawler
Basic Crawlee: how do I use my own proxies?
HttpCrawler
How to run cheerio crawler with Bun?
Web-Scraping
CheerioCrawler
Webscraper.io
Web-Scraping
Playwright crawler failing when element is not found
HttpCrawler
PlaywrightCrawler
Web-Scraping
Multiple queues
PuppeteerCrawler
RequestQueue
How to open multiple browsers?
PlaywrightCrawler
Web-Scraping
TSConfig in Crawlee projects.
Automation
TypeError [ERR_UNKNOWN_FILE_EXTENSION]: Unknown file extension ".ts"
PlaywrightCrawler
Web-Scraping
Target Closed
PuppeteerCrawler
Web-Scraping
Set debug breakpoint in VS Code
Web-Scraping
XVFB fails on server.
Automation
PlaywrightCrawler
Anything special about .php websites?
PlaywrightCrawler
RequestQueue
Handle browser failure
PuppeteerCrawler
Best practices to not crawl links that are already crawled when Actor is run as CRON
Automation
Suggestions
Web-Scraping
Stopping the crawler when done scraping
Web-Scraping
Code refactoring - reusing a common handler in multiple crawlers while keeping code hints
PlaywrightCrawler
Web-Scraping
Crawler skipping Jobs after processing 5,000-6,000 Requests
Automation
Web-Scraping
CheerioCrawler
RequestQueue
Crawlee does not work with cron job
Automation
Web-Scraping
CheerioCrawler
TikTok scraper following list
PlaywrightCrawler
Running crawlee multiple times with the same URL
PuppeteerCrawler
RequestQueue
Passing data to a router/handler
PlaywrightCrawler
Web-Scraping
Crawlee does not work with cron job
Web-Scraping
CheerioCrawler
Re-using the crawler instead of initializing it after each URL?
Automation
Web-Scraping
CheerioCrawler
RequestQueue
'BrowserPool: Page crashed' errors after updating packages
Automation
PuppeteerCrawler
I want to use a created dataset
Suggestions
JSDOMCrawler, website breaks crawlee
HttpCrawler
Web-Scraping
High Volume Scraping
Web-Scraping
enqueueLinks doesn't work.
PlaywrightCrawler
RequestQueue
Proxy authentication bug?
PuppeteerCrawler
Web-Scraping
Memory is critically overloaded. Using 12184 MB of 3883 MB (314%). Consider increasing available mem
PlaywrightCrawler
Getting many 429 status codes when crawling the target site, even with proxies. How to optimise my code?
PlaywrightCrawler
Download Delay
Web-Scraping
Log Proxy IP
PlaywrightCrawler
Web-Scraping
Error: Failed to launch the browser process with Puppeteer
PuppeteerCrawler
How to set different requestHandlerTimeoutSecs for specific handlers?
PuppeteerCrawler
Best practice to stop/crash the actor/crawler on high ratio of errors?
Web-Scraping
How does createSessionFunction create session when parallel requests are being made
Automation
PuppeteerCrawler
Web-Scraping
Suggestions
My Actor works fine locally but seems to get stuck on initializing the crawler, doesn't enter routers
PlaywrightCrawler
Request queue with id Error
PuppeteerCrawler
RequestQueue
How would I build a crawler that accepts API requests to submit forms for a user?
Automation
WARN CheerioCrawler: Reclaiming failed request back to the list or queue. Detected a session error,
CheerioCrawler
Why does the CPU utilization of Crawlee go down until it seems to stop processing any requests?
PuppeteerCrawler
Web-Scraping
Saving data in an Apify Actor and cleaning
PlaywrightCrawler
Web-Scraping
Data Storage