01 Configurable crawl depth, breadth, and total page limits
02 Automatic conversion of web content to clean markdown
03 Source URL and timestamp metadata included in every file
04 Storage in a flat directory structure for easy file management
05 Natural language guidance to focus the crawler on specific topics
06 1 GitHub star
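
To make the listed features concrete, here is a minimal sketch of how crawl limits and per-file metadata could fit together. It does not reflect the tool's actual interface: `CrawlLimits`, `flat_filename`, `save_page`, and the front-matter field names `source_url` and `fetched_at` are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from datetime import datetime
from pathlib import Path
import re


@dataclass
class CrawlLimits:
    """Hypothetical settings mirroring the limits named in the feature list."""
    max_depth: int = 2       # how many links away from the start URL to follow
    max_breadth: int = 20    # how many links to follow per page
    max_pages: int = 50      # hard cap on total pages crawled
    instructions: str = ""   # natural language guidance, e.g. "only API docs"


def flat_filename(url: str) -> str:
    """Map a URL to a single file name so every page lands in one directory."""
    stem = re.sub(r"^https?://", "", url)
    stem = re.sub(r"[^A-Za-z0-9]+", "_", stem).strip("_")
    return f"{stem}.md"


def save_page(out_dir: Path, url: str, markdown: str, fetched_at: datetime) -> Path:
    """Write one page as markdown, prefixed with source URL and timestamp."""
    out_dir.mkdir(parents=True, exist_ok=True)
    header = (
        "---\n"
        f"source_url: {url}\n"
        f"fetched_at: {fetched_at.isoformat()}\n"
        "---\n\n"
    )
    path = out_dir / flat_filename(url)
    path.write_text(header + markdown, encoding="utf-8")
    return path
```

Under these assumptions, a page fetched from `https://example.com/docs/intro` would be stored as `example_com_docs_intro.md` directly in the output directory, with its origin and fetch time recorded at the top of the file.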