Web Scraping & Data Collection MCP Servers

Discover our curated collection of MCP servers for web scraping & data collection. Browse 1322 servers and find the perfect MCPs for your needs.

Firecrawl

Integrates Firecrawl's web scraping and content extraction capabilities into Model Context Protocol (MCP) environments.

AnyDbApp

Enables AI-powered database operations, file management, and web content scraping via natural language, featuring dynamic schema evolution, semantic search, and Retrieval-Augmented Generation (RAG) capabilities with Ollama integration.

ARES

Fetches information about Czech companies from company, beneficial owner, and insolvency registers.

TMD

Provides weather data from the Thai Meteorological Department as an MCP server.

InmoPipeline

Provides an end-to-end pipeline for real estate market analysis and predictive modeling, encompassing data collection, transformation, visualization, and machine learning.

URL Text Fetcher

Provides URL text fetching, web scraping, and web search capabilities for AI models via the Model Context Protocol.

iReader

Extracts content from various online sources, including webpages, YouTube videos, tweets, and PDFs.

DuckDuckGo

Facilitates DuckDuckGo search and web content retrieval via the Model Context Protocol.

Tavily

Enables AI systems to access and interact with real-time web information through search, extraction, mapping, and crawling tools.

Hacker News

Provides AI assistants access to Hacker News data by acting as a bridge to its API.

Browser

Automates web browsing, content extraction, and interactive operations through a Puppeteer-powered server.

OLEXI

Empowers AI chat agents to accurately search and cite Australian legal information from the AustLII database.

LinkedIn Profile Scraper

Scrapes LinkedIn profile data asynchronously using the RapidAPI LinkedIn Profile Scraper API.

Query Table

Scrapes tabular data from websites like Eastmoney, Iwencai, and TDX using Playwright.

Hacker News

Provides AI agents with access to Hacker News data via the Model Context Protocol.

Tavily Web Search

Enables AI models to search the web and retrieve up-to-date information using the Tavily API.

Bibextract

Extracts survey content and bibliography in BibTeX format directly from arXiv papers.

Chrome Browser Assistant

Turns a Chrome browser into an AI-controlled automation tool for content analysis, semantic search, and complex web interactions.

Job URL Analyzer

Analyzes job URLs to extract detailed company information, enriching data through intelligent web crawling and external providers.

Video Downloader

Enables intelligent agents to download videos securely from over 1000 websites.

Showing 20 of 1322 results
