Crawlee is a TypeScript library for Node.js that simplifies web scraping and browser automation. It helps you extract various file types like HTML, PDF, and images, making it ideal for feeding data to AI models. You can use it with Puppeteer, Playwright, Cheerio, and more, supporting both headful and headless browser modes with built-in proxy rotation.
Build reliable web crawlers for data extraction and automation.
Developers needing to build and manage complex web scraping tasks for data analysis or AI model training.