Discover our curated collection of MCP servers for web scraping and data collection. Browse 2,524 servers and find the right ones for your needs.
Enables browser automation capabilities using Playwright for LLMs to interact with web pages.
Facilitates open-source intelligence by collecting user account information from various public sources.
Enables AI applications to use Apify Actors as tools for performing specific tasks like data extraction and web scraping.
Enables AI assistants to search and access Google Scholar papers through a simple interface.
Retrieves transcripts for YouTube videos given a URL and desired language.
Enables AI models to perform Google searches and analyze webpage content programmatically through an MCP interface.
Enables Claude to access real-time information from the web for enhanced research capabilities.
Conducts in-depth, iterative research on any topic using AI-powered search, web scraping, and source evaluation to generate comprehensive reports.
Provides unified access to multiple search engines, AI tools, and content processing services.
Enables LLM applications to perform in-depth research through the Model Context Protocol.
Enables parallel Google searches with multiple keywords using a Playwright-powered MCP server.
Connects AI models to SEC EDGAR filings through an open-source MCP server, enabling financial research and insights.
Provides comprehensive financial data from Yahoo Finance via the Model Context Protocol.
Provides a Node.js client for interacting with the MediaWiki API and Wikidata.
Evaluates the performance of MCP servers for web search and database query tasks.
Securely reads and extracts text, metadata, and page counts from PDF files (local or URL) for use by AI agents.
Empowers AI agents and LLMs with real-time web access, data extraction, and bot-detection bypass capabilities.
Provides a high-performance backend system for querying China Railway 12306 train ticket information using the Model Context Protocol (MCP).
Enables large language models to read WeChat official account articles by simulating a browser to bypass anti-scraping mechanisms.
Provides web search and robust content retrieval for AI coding tools, optimizing for comprehensive conversational data.