Discover our curated collection of MCP servers for web scraping & data collection. Browse 2524 servers and find the perfect MCPs for your needs.
Downloads websites and their assets for use with Retrieval-Augmented Generation (RAG) systems.
Retrieves reviews and information for Steam games using the Model Context Protocol.
Streamlines access to and analysis of U.S. government spending data from the USAspending.gov API for AI agents.
Searches for papers from the American Economic Association (AEA) using Large Language Models (LLMs).
Extract insights from TikTok data through video discovery and metadata retrieval.
Integrates various digital services into a unified personal assistant server, managing calendars, notes, tasks, and web content.
Fetches and processes real-time job listings from Stepstone.de, providing structured data for AI assistants and agent frameworks.
Manages and enriches genealogical data, enabling AI agents to create, edit, and query GEDCOM files.
Download videos and audio from various internet sources, with support for agentic server operations.
Connects Purdue students' Brightspace accounts to access academic data via web scraping, handling Duo Mobile 2FA.
Fetches TradingView chart snapshots efficiently using Playwright for browser automation, enabling fast and secure visualization of market data.
Integrates Exa's Websets API with MCP-compatible clients like Claude Desktop, Cursor, and Windsurf to manage dynamic collections of web entities.
Perform privacy-focused searches across web engines, social media platforms, and archival services.
Parse various document formats like PDF, Word, Excel, and PowerPoint to extract their content.
Searches and downloads academic papers from multiple platforms, designed for integration with large language model tools.
Execute macOS AppleScript and JavaScript for Automation (JXA) commands securely to automate system interactions.
Converts web page URLs into clean, LLM-ready Markdown or plain text content for AI agents.
Uncover and make understandable Finnish open data, serving as an MCP-server for AI and an open web service for people.
Processes HTML and JavaScript, empowering LLM agents to interact with web content, render layouts, and execute scripts.
Enables AI agents to search the web, news, and fetch web page content, converting it for LLM consumption.
Scroll for more results...