01Deep recursive crawling with configurable depth and page limits
02Semantic guidance using natural language instructions to focus on specific content
03Context-optimized chunking to prevent LLM context window overflow
04Path filtering using regex to include or exclude specific site sections
05Automated URL mapping for rapid site structure discovery
0619 GitHub stars