crawlee-js

782 threads · Page 5 of 16

Cheerio extract function 5 messages
CheerioCrawler
Unit Tests With Crawlee 3 messages
Web-Scraping
Memory for only 1 browser is 12 GB? How to ensure clean up after pages? 14 messages
Automation PuppeteerCrawler
custom logic for status codes 3 messages
PuppeteerCrawler
Error accessing the requests sometimes when two crawlers are running in parallel 3 messages
Automation RequestQueue CheerioCrawler
Crawlee Proxies 5 messages
PuppeteerCrawler Web-Scraping
TargetClosedError 15 messages
PlaywrightCrawler Web-Scraping
Want to scrape multiple elements on a web page 5 messages
Web-Scraping
proxy issues 3 messages
Automation PlaywrightCrawler Web-Scraping
elementHandle.$$: Target page, context or browser has been closed 14 messages
Automation PlaywrightCrawler Web-Scraping RequestQueue
Scraper testing 10 messages
Web-Scraping
Web crawler got blocked in Selenium 3 messages
Automation Web-Scraping
Requests timing out - best practices? 5 messages
PlaywrightCrawler
Retry after 30 seconds for failed requests 6 messages
Automation Web-Scraping CheerioCrawler
Encountering net::ERR_TIMED_OUT despite connection being fine 4 messages
PlaywrightCrawler
scrapy signals equivalent 3 messages
PuppeteerCrawler PlaywrightCrawler HttpCrawler Web-Scraping CheerioCrawler
how to make follow_redirects=false in CheerioCrawler 6 messages
CheerioCrawler
Call for help, Tokopedia Scraper too slow and costly? 3 messages
PlaywrightCrawler
AdaptivePlaywrightCrawler: programmatically deciding when to render JS 4 messages
PlaywrightCrawler Web-Scraping
Does crawlee already implement this fingerprint generator? 3 messages
Automation
Cannot use Sentry 6 messages
PlaywrightCrawler
error TS2307: Cannot find module '@apify/log/log' or its corresponding type declarations. 12 messages
PlaywrightCrawler
Open Chrome (not test version) in Crawlee 3 messages
PuppeteerCrawler Suggestions
PuppeteerCrawler waitForResponse timeout issue. Seems like it skips desired request 9 messages
PuppeteerCrawler Web-Scraping
Need serious help scaling crawlee 29 messages
PuppeteerCrawler Automation Web-Scraping Suggestions
How to "store" and "retrieve" a browser on a per user basis? 13 messages
Automation Web-Scraping
WARN PuppeteerCrawler: Reclaiming failed request back to the list or queue. Expected `key` to be of 3 messages
PuppeteerCrawler
got-scraping vs cheerioCrawler or sendRequest 4 messages
Web-Scraping CheerioCrawler
Trying to optimize autoscale options 9 messages
Web-Scraping
more than one request queue 4 messages
PuppeteerCrawler Web-Scraping
Add certificates to Playwright crawler using Chromium 26 messages
PlaywrightCrawler Web-Scraping
Failing Dockerfile after updating routes.ts 2 messages
PuppeteerCrawler Web-Scraping
useState not working as expected. 2 messages
HttpCrawler
router.addHandler from a separate function 2 messages
Web-Scraping
Intercepting requests 3 messages
PlaywrightCrawler Web-Scraping
No option for ignoring SSL Errors with PlaywrightCrawler? 4 messages
PlaywrightCrawler
Crawlee failed CreepJS test 7 messages
PlaywrightCrawler Automation Web-Scraping Suggestions
Unexpected results when currying a value into the request handler 4 messages
PlaywrightCrawler
Pagination assistance 4 messages
Web-Scraping
Go to solution to prevent recrawl? 5 messages
Data Storage
preNavigationHook needs to listen to response from network and change goToOptions. 15 messages
PuppeteerCrawler Automation Suggestions
Is it better to separate scraper by domains? 2 messages
Web-Scraping
How to reset crawlee URL cache/add the same URL back to the requestQueue? 7 messages
Web-Scraping CheerioCrawler
Strategy to prevent crawling data that has been crawled before 6 messages
Web-Scraping RequestQueue Data Storage
Is there a way to use both PlaywrightCrawler and CheerioCrawler? 4 messages
PlaywrightCrawler CheerioCrawler
Strange Sitemaps. What can I do to use them? 5 messages
Web-Scraping Suggestions CheerioCrawler
Anyone with a crawler with a lot of route handlers? Like 100s of route handlers in a single crawler 3 messages
HttpCrawler Suggestions Web-Scraping CheerioCrawler
Crawler becomes idle after some time (queue not empty) 5 messages
Suggestions
Already crawled URLs 3 messages
RequestQueue
Reduce time between "PlaywrightCrawler: Starting the crawler." and the "requestHandler" 4 messages
Automation PlaywrightCrawler