01Rate limiting per domain
02Document processing with fallback support for various formats
03JavaScript-enabled web scraping with Playwright and anti-detection measures
041 GitHub stars
05Concurrent batch processing with configurable limits
06Intelligent caching with SQLite backend