Web Scraping & Data Collection MCP Servers
Discover our curated collection of MCP servers for web scraping & data collection. Browse 1322 servers and find the perfect MCPs for your needs.
Firecrawl
Integrates Firecrawl's web scraping and content extraction capabilities into Model Context Protocol (MCP) environments.
AnyDbApp
Enables AI-powered database operations, file management, and web content scraping via natural language, featuring dynamic schema evolution, semantic search, and Retrieval-Augmented Generation (RAG) capabilities with Ollama integration.
ARES
Fetches information about Czech companies from company, beneficial owner, and insolvency registers.
TMD
Provides weather data from the Thai Meteorological Department as an MCP server.
InmoPipeline
Provides an end-to-end pipeline for real estate market analysis and predictive modeling, encompassing data collection, transformation, visualization, and machine learning.
URL Text Fetcher
Provides URL text fetching, web scraping, and web search capabilities for AI models via the Model Context Protocol.
iReader
Extracts content from various online sources, including webpages, YouTube videos, tweets, and PDFs.
DuckDuckGo
Facilitates DuckDuckGo search and web content retrieval via the Model Context Protocol.
Tavily
Enables AI systems to access and interact with real-time web information through search, extraction, mapping, and crawling tools.
Hacker News
Provides AI assistants access to Hacker News data by acting as a bridge to its API.
Browser
Automates web browsing, content extraction, and interactive operations through a Puppeteer-powered server.
OLEXI
Empowers AI chat agents to accurately search and cite Australian legal information from the AustLII database.
LinkedIn Profile Scraper
Scrapes LinkedIn profile data asynchronously using the RapidAPI LinkedIn Profile Scraper API.
Query Table
Scrapes tabular data from websites like Eastmoney, Iwencai, and TDX using Playwright.
Hacker News
Provides AI agents with access to Hacker News data via the Model Context Protocol.
Tavily Web Search
Enables AI models to search the web and retrieve up-to-date information using the Tavily API.
Bibextract
Extracts survey content and bibliography in BibTeX format directly from arXiv papers.
Chrome Browser Assistant
Turns the Chrome browser into an AI-controlled automation tool for content analysis, semantic search, and complex web interactions.
Job URL Analyzer
Analyzes job URLs to extract detailed company information, enriching data through intelligent web crawling and external providers.
Video Downloader
Gives intelligent agents secure video downloading capabilities from over 1000 websites.