01Automatic anti-bot bypass, including Cloudflare protection
02Provides rich metadata: title, description, author, Open Graph, Twitter Cards, published time, etc.
03Categorizes internal and external links for structured crawling
04Smart fallback system with multiple retries for reliability
050 GitHub stars
06Extracts clean, LLM-ready markdown with tables preserved