Discover our curated collection of MCP servers for web scraping & data collection. Browse 2610 servers and find the perfect MCPs for your needs.
Automates browser interactions and enables LLMs to engage with web pages for tasks like screenshot capture and JavaScript execution.
Provides search functionality for arXiv.org papers using the official arXiv API.
Provides a robust web content search and extraction service with intelligent browser instance management.
Provides a remote server for accessing Jina AI's Reader, Embeddings, and Reranker APIs, enabling web content processing, diverse search capabilities, and data optimization.
Discovers and extracts comprehensive metadata from research papers, code repositories, and AI models using web scraping and API integration.
Enables large language models or external tools to interact with a browser through a standardized Model Context Protocol for automation.
Accesses the PubMed database, offering advanced biomedical literature search, retrieval, and analysis capabilities.
Provides cloud-based browser automation capabilities, enabling AI models to interact with web pages and execute JavaScript without local browser installations.
Scrapes webpages using Playwright to extract clean Markdown, designed for integration with AI chat prompts.
Provides comprehensive web intelligence reports, detecting tech stacks, analyzing SEO, and extracting contact information from any URL.
Provides comprehensive intelligence for the pharmaceutical industry, enabling users to search drug pipelines, analyze competitive landscapes, and track critical regulatory and patent information.
Performs comprehensive web search and content extraction using DuckDuckGo for automated research.
Provides a business catalog for the Russian construction and real estate market accessible through AI agents via MCP protocol.
Accesses the X (Twitter) API v2 to search tweets, look up users, and retrieve timelines.
Finds and verifies business email addresses using built-in DNS and SMTP checks, offering a free alternative to paid subscription services.
Enables AI agents to fully manage Delta Air Lines flight bookings and travel needs through robust browser automation.
Integrate with the EDINET API to access and process financial disclosure data directly from MCP clients.
Provides programmatic access to the Turkish Ministry of Justice's legislation information system (mevzuat.gov.tr and bedesten.adalet.gov.tr) via a Model Context Protocol (MCP) server.
Integrates FanQie Novel's book, chapter, and comment data, making it accessible to Large Language Model clients via the Model Context Protocol.
Screen any research topic, author, or paper for statistical fabrication, p-hacking, publication bias, citation manipulation, and data duplication using 8 forensic tools backed by 16 real-time academic sources.
Scroll for more results...