Web Scraping & Data Collection MCP Servers

Discover our curated collection of MCP servers for web scraping & data collection. Browse 1322servers and find the perfect MCPs for your needs.

CleanWeb icon

CleanWeb

Extracts and cleans web content, filtering ads and irrelevant elements, and converts it to a clean Markdown format for various applications.

Search icon

Search

Provides a Node.js-based Model Context Protocol (MCP) server for robust web search and intelligent content analysis capabilities.

Puppeteer icon

Puppeteer

Automates browser interactions, enabling LLMs to interact with web pages, capture screenshots, and execute JavaScript.

Strava icon

Strava

Connects Claude with Strava to enable natural language querying of activity data.

Browserai icon

Browserai

Provides serverless browser access for AI agents and applications, enabling real-time web data retrieval and interaction.

Simple Google Search icon

Simple Google Search

Enables Google searches and webpage content extraction via the Model Context Protocol.

Kagi icon

Kagi

Integrates Kagi Search and Summarizer APIs into an MCP server, offering a stable alternative for AI applications.

Food Nutrition icon

Food Nutrition

Provides a comprehensive server infrastructure for food and nutrition intelligence, offering tools for data retrieval, meal planning, and dietary analysis.

MoEngage Documentation icon

MoEngage Documentation

Provides AI assistants with direct access to comprehensive MoEngage documentation for enhanced search and retrieval.

Youtube Transcript icon

Youtube Transcript

Retrieves transcripts from YouTube videos using the Model Context Protocol.

MiraiLens icon

MiraiLens

Empower AI assistants to control and observe web browsers, extending AI workflows with high-level browser automation and data access through a Model Context Protocol (MCP) interface.

Hubble icon

Hubble

Facilitates data retrieval and analysis from Google Search and other online sources through API integration with Claude Desktop.

XPath icon

XPath

Evaluates XPath queries on XML and HTML content, both from strings and URLs.

CleanWeb icon

CleanWeb

Extracts and cleans core web content, filtering ads and converting it into a pristine Markdown format.

ArXiv Search icon

ArXiv Search

Provides search functionality for arXiv.org papers using the official arXiv API.

Agentic Web Protocol icon

Agentic Web Protocol

Facilitates the discovery of websites and APIs for seamless interactions between AI agents.

Firecrawl icon

Firecrawl

Enables web scraping, content searching, site crawling, and data extraction using the Firecrawl API.

Gres API icon

Gres API

Provides a minimalist AI command server for agents and developers to snap pages, grab sites, source docs, and ask questions via a single interface.

Websearch icon

Websearch

Provides multi-engine web search capabilities with intelligent content extraction, adhering to the Model Context Protocol.

Showing 20 of 1322 results

Scroll for more results...