Web Scraping & Data Collection MCP Servers
Discover our curated collection of MCP servers for web scraping & data collection. Browse 1322servers and find the perfect MCPs for your needs.
WebScraping.AI
Extracts data from web pages using a Model Context Protocol (MCP) server implementation that integrates with WebScraping.AI.
Pdf Reader
Extracts text and images from PDF files, with OCR support for scanned documents.
Google CSE
Provides search capabilities using a Google Custom Search Engine (CSE) via the Model Context Protocol.
Dappier
Connects LLMs and Agentic AI to real-time, rights-cleared, proprietary data from trusted sources.
Tushare
Facilitates intelligent stock data analysis through a Model Context Protocol (MCP) server.
Fetch
Enables fetching web content and processing images for use with Claude Desktop or other Model Context Protocol (MCP) clients.
Dumpling AI
Integrates with Dumpling AI to provide data scraping, content processing, knowledge management, AI agent, and code execution capabilities.
Apollo.io
Integrates with the Apollo.io API, enabling AI assistants to enrich data, perform searches, and find job postings.
AutoGen SSE Stdio
Integrates local and remote tools with AI agents using the Model Context Protocol (MCP) within the AutoGen framework.
Rod
Automates browser interactions and provides web interaction capabilities for applications using the Rod browser automation framework.
YouTube Transcript
Extracts transcripts from YouTube videos, enabling content analysis and processing.
Browser
Enables Large Language Models (LLMs) to interact with web pages through Anchor Browser's cloud-based remote browser service.
Crawl4AI
Enables web scraping and crawling capabilities for Large Language Models.
Chrome Extension Bridge
Enables interaction between web pages and a local server by establishing a WebSocket connection, allowing access to browser APIs and DOM elements.
CodingBaby Browser
Automates Google Chrome through AI agents via a WebSocket connection to a browser extension.
Bocha Search
Empowers AI applications with high-quality world knowledge from billions of web pages and diverse content sources.
Telegram
Enables AI models like Claude Desktop to interact with Telegram channels and groups for comprehensive content scraping and analysis.
DocSearch
Crawls websites, generates Markdown documentation, and makes that documentation searchable.
JigsawStack
Enables AI models to interact with JigsawStack models through a Model Context Protocol server.
Automates LinkedIn job applications and feed exploration through an MCP server.
Scroll for more results...