01Recursive link following with configurable depth and multi-domain restrictions
02Automatic directory hierarchy generation that mirrors the source website structure
030 GitHub stars
04Intelligent text extraction that automatically filters navigation, scripts, and boilerplate
05Detailed session metadata tracking including start/end times and error logging
06Concurrent URL processing with adjustable request limits for high-performance crawling