Web Scraping & Data Collection MCP Servers

Discover our curated collection of MCP servers for web scraping & data collection. Browse 1322 servers and find the right MCP servers for your needs.

WebScraping.AI

28

Extracts data from web pages using a Model Context Protocol (MCP) server implementation that integrates with WebScraping.AI.

Pdf Reader

28

Extracts text and images from PDF files, with OCR support for scanned documents.

Google CSE

28

Provides search capabilities using a Google Custom Search Engine (CSE) via the Model Context Protocol.

Dappier

28

Connects LLMs and agentic AI systems to real-time, rights-cleared, proprietary data from trusted sources.

Tushare

27

Facilitates intelligent stock data analysis through a Model Context Protocol (MCP) server.

Fetch

27

Enables fetching web content and processing images for use with Claude Desktop or other Model Context Protocol (MCP) clients.

Dumpling AI

27

Integrates with Dumpling AI to provide data scraping, content processing, knowledge management, AI agent, and code execution capabilities.

Apollo.io

27

Integrates with the Apollo.io API, enabling AI assistants to enrich data, perform searches, and find job postings.

AutoGen SSE Stdio

26

Integrates local and remote tools with AI agents using the Model Context Protocol (MCP) within the AutoGen framework.

Rod

26

Automates browser interactions for applications using the Rod browser automation framework.

YouTube Transcript

26

Extracts transcripts from YouTube videos, enabling content analysis and processing.

Browser

25

Enables Large Language Models (LLMs) to interact with web pages through Anchor Browser's cloud-based remote browser service.

Crawl4AI

25

Enables web scraping and crawling capabilities for Large Language Models.

Chrome Extension Bridge

25

Enables interaction between web pages and a local server by establishing a WebSocket connection, allowing access to browser APIs and DOM elements.

CodingBaby Browser

24

Automates Google Chrome through AI agents via a WebSocket connection to a browser extension.

Bocha Search

24

Empowers AI applications with high-quality world knowledge from billions of web pages and diverse content sources.

Telegram

24

Enables AI clients such as Claude Desktop to interact with Telegram channels and groups for content scraping and analysis.

DocSearch

24

Crawls websites, generates Markdown documentation, and makes that documentation searchable.

JigsawStack

23

Enables AI models to interact with JigsawStack models through a Model Context Protocol server.

LinkedIn

23

Automates LinkedIn job applications and feed exploration through an MCP server.

Showing 20 of 1322 results