Discover our curated collection of MCP servers for web scraping & data collection. Browse 2524 servers and find the perfect MCPs for your needs.
Provides an MCP interface to search the Marginalia Search engine, which focuses on non-commercial content.
Extracts and transforms web content into various formats, including rendered HTML, Markdown, and text from media files.
Automates browser interactions through the Model Context Protocol (MCP), enabling integration between large language models and web browsing.
Recursively fetches and extracts content from web pages for LLM consumption.
Enables AI assistants to search and access health science preprints from medRxiv.
Interfaces with Biomart databases using the Model Context Protocol (MCP) to provide biological data to large language models.
Aggregates multiple search APIs via the Model Context Protocol (MCP) for enhanced research capabilities.
Retrieves and extracts web content, converting HTML to Markdown for easier consumption by LLMs.
Provides AI assistants access to query and analyze statistical data from the Australian Bureau of Statistics (ABS) via the SDMX-ML API.
Scrapes web pages and extracts targeted content using CSS selectors through the Model Context Protocol.
Fetches or generates YouTube video transcripts using AI, prioritizing official transcripts and falling back to local Whisper transcription.
Integrates CleanShot X on macOS with AI assistants, enabling control over screenshots and screen recordings using natural language commands.
Provides direct access to NCCN (National Comprehensive Cancer Network) clinical guidelines through a Model Context Protocol (MCP) server.
Consolidates academic research from PubMed, Google Scholar, arXiv, and JSTOR through five tools for discovery and analysis.
Accesses the FantasyPros API to retrieve sports data, news, rankings, and projections.
Aggregates and delivers the latest news from a collection of Icelandic RSS sources, serving as a context provider for AI assistants.
Retrieves, processes, and analyzes web content from URLs using an AI-powered Model Context Protocol server.
Enables AI agents to efficiently search Reddit and identify specific leads by leveraging high-performance Apify cloud actors for data scraping.
Serves as an MCP server for robust browser automation enhanced with anti-detection technology.
Provides AI agents with research infrastructure, offering real-time web search, evidence extraction, and structured citations to help reduce hallucinations.