Retrieves and extracts web content, converting HTML to markdown for easier consumption by LLMs.
This server, built for the Model Context Protocol, provides web content fetching capabilities. It allows Large Language Models to retrieve and process content from web pages, automatically converting HTML to markdown. Because the tool truncates long responses, the starting point for content extraction can be specified, which lets models read a webpage in successive chunks.
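The chunked reading described above can be illustrated with a minimal sketch. It is not the server's implementation: the names `read_chunk`, `read_all`, `start_index`, and `max_length`, as well as the default limit of 5000 characters, are assumptions chosen for illustration. It only shows how a truncated response plus a caller-supplied start index is enough to walk through a long page.

```python
from typing import List, Optional, Tuple


def read_chunk(content: str, start_index: int = 0,
               max_length: int = 5000) -> Tuple[str, Optional[int]]:
    """Return one truncated slice of `content` and the index to resume from.

    The second element is None once the end of the content is reached.
    Parameter names and the default limit are hypothetical.
    """
    if start_index < 0 or max_length <= 0:
        raise ValueError("start_index must be >= 0 and max_length must be > 0")
    chunk = content[start_index:start_index + max_length]
    next_index = start_index + len(chunk)
    if next_index >= len(content):
        return chunk, None  # nothing left to read
    return chunk, next_index


def read_all(content: str, max_length: int) -> List[str]:
    """Read the whole content chunk by chunk, as a model would across calls."""
    chunks: List[str] = []
    start: Optional[int] = 0
    while start is not None:
        chunk, start = read_chunk(content, start, max_length)
        chunks.append(chunk)
    return chunks
```

In this sketch, each call returns the next start index, so a caller can stop as soon as it has found the section it needs instead of reading the whole page.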
Key Features
01 Supports pagination through a start index.
02 Provides Markdown conversion.
03 4 GitHub stars
04 Fetches and extracts relevant content from URLs.
05 Offers user-agent customization.
06 Respects `robots.txt` (can be disabled).
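As a rough illustration of how a `robots.txt` check and a custom user agent interact, the sketch below uses Python's standard `urllib.robotparser`. The function name `is_allowed`, the `ignore_robots` flag, and the sample rules are hypothetical; how the server itself performs or disables this check is not stated here.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content used only for this example.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def is_allowed(robots_txt: str, user_agent: str, url: str,
               ignore_robots: bool = False) -> bool:
    """Return True if `user_agent` may fetch `url` under the given rules.

    `ignore_robots` is an assumed switch that stands in for disabling
    the check entirely.
    """
    if ignore_robots:
        return True
    parser = RobotFileParser()
    # parse() accepts the file's lines, so no network request is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)
```

A caller would run this check with its configured user-agent string before requesting the page, and skip the fetch when the result is False.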
Use Cases
01 Enabling LLMs to access and process information from the web.
02 Providing a web service for extracting content from webpages.
03 Facilitating the retrieval of specific sections of a webpage for analysis.