Web Scraping & Data Collection MCP Servers
Discover our curated collection of MCP servers for web scraping & data collection. Browse 1322 servers and find the perfect MCPs for your needs.
CleanWeb
Extracts and cleans web content, filtering ads and irrelevant elements, and converts it to a clean Markdown format for various applications.
Search
Provides a Node.js-based Model Context Protocol (MCP) server for robust web search and intelligent content analysis capabilities.
Puppeteer
Automates browser interactions, enabling LLMs to interact with web pages, capture screenshots, and execute JavaScript.
Strava
Connects Claude with Strava to enable natural language querying of activity data.
Browserai
Provides serverless browser access for AI agents and applications, enabling real-time web data retrieval and interaction.
Simple Google Search
Enables Google searches and webpage content extraction via the Model Context Protocol.
Kagi
Integrates Kagi Search and Summarizer APIs into an MCP server, offering a stable alternative for AI applications.
Food Nutrition
Provides a comprehensive server infrastructure for food and nutrition intelligence, offering tools for data retrieval, meal planning, and dietary analysis.
MoEngage Documentation
Provides AI assistants with direct access to comprehensive MoEngage documentation for enhanced search and retrieval.
Logo
Automatically identifies, extracts, and optimizes logo icons from websites using advanced recognition and selection algorithms.
Youtube Transcript
Retrieves transcripts from YouTube videos using the Model Context Protocol.
MiraiLens
Empowers AI assistants to control and observe web browsers, extending AI workflows with high-level browser automation and data access through a Model Context Protocol (MCP) interface.
Hubble
Facilitates data retrieval and analysis from Google Search and other online sources through API integration with Claude Desktop.
XPath
Evaluates XPath queries on XML and HTML content, both from strings and URLs.
CleanWeb
Extracts and cleans core web content, filtering ads and converting it into a pristine Markdown format.
ArXiv Search
Provides search functionality for arXiv.org papers using the official arXiv API.
Agentic Web Protocol
Facilitates the discovery of websites and APIs for seamless interactions between AI agents.
Firecrawl
Enables web scraping, content searching, site crawling, and data extraction using the Firecrawl API.
Gres API
Provides a minimalist AI command server for agents and developers to snap pages, grab sites, source docs, and ask questions via a single interface.
Websearch
Provides multi-engine web search capabilities with intelligent content extraction, adhering to the Model Context Protocol.