crawlee-js

782 threads · Page 7 of 16

Added "playwright-extra" with "stealthPlugin" and got error "Cannot read properties of undefined" 9 messages
PlaywrightCrawler
enqueueLinks same domain strategy ignoring protocol? 2 messages
PlaywrightCrawler Web-Scraping
Cannot find module `/app` import from `/main.js` 3 messages
PlaywrightCrawler Web-Scraping
Using `transformRequestFunction` in `enqueueLinks` overrides `label` 5 messages
PlaywrightCrawler RequestQueue
running on ARM 4 messages
Automation
retireBrowserAfterPageCount does not work with values above close to 30 for Playwright 13 messages
PlaywrightCrawler
isFinishedFunction, check other crawler? 8 messages
PlaywrightCrawler Web-Scraping
The basic app works, but breaks in docker 12 messages
Automation PlaywrightCrawler
change INFO CheerioCrawler 6 messages
CheerioCrawler
aws eb .ebextensions 2 messages
Automation
How can I modify the amount of RAM an actor should use? 2 messages
HttpCrawler Web-Scraping
Named Request Queues not getting purged 9 messages
PlaywrightCrawler Web-Scraping Suggestions
Help with method enqueueLinksByClickingElements 3 messages
Web-Scraping
RequestRouter not using the `log` instance passed in its respective Playwright crawler 6 messages
PlaywrightCrawler Suggestions Web-Scraping
Stop scraping in the middle of a route handler if some condition is met? 10 messages
PlaywrightCrawler Web-Scraping
Not generating jsons but crawling 4 messages
Suggestions Web-Scraping CheerioCrawler RequestQueue
crawler process not exiting after teardown is called. 3 messages
Suggestions
Scrape redirect links gracefully? 4 messages
PlaywrightCrawler Web-Scraping
Dealing with occasional 2fa dialog 2 messages
Web-Scraping
Memory usage is spiking, even with CRAWLEE_AVAILABLE_MEMORY_RATIO set to .7 4 messages
PlaywrightCrawler Web-Scraping
Avoid sharing same CheerioCrawler instance across multiple calls 5 messages
Web-Scraping CheerioCrawler
Custom LoggerText implementation not handling objects 10 messages
PlaywrightCrawler
Can I deploy to Azure? 13 messages
PlaywrightCrawler Web-Scraping
Problem using express 3 messages
PlaywrightCrawler Web-Scraping
Json Array of strings 3 messages
Data Storage
Is it possible to use express inside of Apify? 2 messages
Web-Scraping Suggestions
Catch and solve captchas 7 messages
Web-Scraping CheerioCrawler
CLI in a Container 4 messages
Web-Scraping
Failed requests - Session closed 'without receiving a SETTINGS frame' or 'NGHTTP2_REFUSED_STREAM' 2 messages
Web-Scraping CheerioCrawler
How to use Playwright locator assertions in Crawlee? 3 messages
Suggestions
How to close the crawler from a RequestHandler? 12 messages
Suggestions
long running scraper, 500+ pages for each crawl 21 messages
Automation PlaywrightCrawler Web-Scraping Suggestions
better error handling? 2 messages
PlaywrightCrawler Web-Scraping
CNCF Dapr 3 messages
RequestQueue Suggestions Data Storage
How to access browser instance in Playwright Crawler? 14 messages
PlaywrightCrawler Suggestions
Make PlaywrightCrawler less unique and avoid blocking? (canvas/fonts/plugins/permissions...) 10 messages
PlaywrightCrawler Web-Scraping
Is it possible to run a crawl within a crawl? 5 messages
Web-Scraping CheerioCrawler
is there a way to have custom variables accessible inside the crawler function? 13 messages
Suggestions
Module not found in NextJs projects 6 messages
PlaywrightCrawler
Set cookies with cheerio 2 messages
Web-Scraping CheerioCrawler
Structure Crawlers to scrape multiple sites 26 messages
Suggestions Web-Scraping
playwright & pdf + error handling 12 messages
PlaywrightCrawler Web-Scraping
EnqueueLInks 5 messages
Web-Scraping
Gracefully closing the crawler with keepalive flag true 2 messages
PuppeteerCrawler Suggestions RequestQueue CheerioCrawler
Python logging equivalent 16 messages
Suggestions
Deterministic screenshotting for automated visual regression testing 4 messages
Automation PuppeteerCrawler
Error Target page, context or browser closed 10 messages
Automation PlaywrightCrawler
What would be a recommended way of scraping content behind 2fa auth? 4 messages
Web-Scraping
How to keep sessions alive while crawling? 3 messages
Automation PuppeteerCrawler Web-Scraping
WARN FLOOD error: Sleeping for undefined seconds before adding more users 5 messages
Automation PlaywrightCrawler Web-Scraping Suggestions RequestQueue