01 Intelligent content extraction using Readability algorithm
02 416 GitHub stars
03 JavaScript Support via Playwright headless browser
04 Resource optimization by blocking unnecessary elements
05 Parallel processing for fetching multiple URLs
06 Supports HTML and Markdown output formats
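
The resource-blocking and parallel-fetching items above can be illustrated with a small sketch. This is not the project's own code: the names `BLOCKED_RESOURCE_TYPES`, `should_block`, and `fetch_all` are hypothetical, the list of blocked types is an assumption, and the actual page fetch is passed in as a plain function so the example stays independent of any browser library.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, Iterable, List

# Assumption: resource types a text-focused fetcher could skip.
# The real tool's list may differ.
BLOCKED_RESOURCE_TYPES = frozenset({"image", "stylesheet", "font", "media"})


def should_block(resource_type: str) -> bool:
    """Return True if a request of this type is not needed for text extraction."""
    return resource_type.lower() in BLOCKED_RESOURCE_TYPES


def fetch_all(
    urls: Iterable[str],
    fetch_one: Callable[[str], str],
    max_workers: int = 4,
) -> List[Dict[str, object]]:
    """Fetch several URLs concurrently and return results in input order.

    `fetch_one` is any callable that takes a URL and returns page content.
    A failure on one URL is recorded in its result and does not stop the others.
    """

    def safe_fetch(url: str) -> Dict[str, object]:
        try:
            return {"url": url, "ok": True, "content": fetch_one(url)}
        except Exception as exc:  # keep going when a single URL fails
            return {"url": url, "ok": False, "error": str(exc)}

    url_list = list(urls)
    if not url_list:
        return []
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map yields results in the same order as the inputs
        return list(pool.map(safe_fetch, url_list))
```

With a headless browser, a predicate like `should_block` would typically be called from a request-interception handler to abort matching requests, and `fetch_one` would wrap the page load plus content extraction. A thread pool is used here only to keep the sketch short; an async implementation would serve the same purpose.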