Discover Agent Skills for web scraping & data collection. Browse 17 skills for Claude, ChatGPT & Codex.
Enables real-time web research and fact-checking using Google Search grounding within the Claude Code environment.
Analyzes blogs and online publications to extract deep insights into author perspectives, political leanings, and hidden biases.
Downloads high-quality videos and audio from YouTube and other platforms for offline viewing, archiving, and editing.
Automates documentation collection and structured data extraction using Playwright, BeautifulSoup, and Scrapy templates.
Converts complex web pages into clean, LLM-friendly markdown for seamless data extraction and processing.
Extracts audio, subtitles, and cover images from MP4 video files using MCP services and ffmpeg.
Automates video metadata extraction and media downloading by processing structured task lists through MCP services.
Processes, analyzes, and transforms various file formats into structured data or new document types using a standardized CLI.
Conducts real-time, AI-optimized web searches and content extraction to provide up-to-date information beyond Claude's knowledge cutoff.
Ensures rigorous factual accuracy through systematic, multi-pass evidence validation and source tiering.
Crawls entire websites and builds searchable full-text indexes of content converted into Markdown format.
Accesses USPTO APIs to perform comprehensive patent and trademark searches, analyze prosecution history, and track intellectual property assignments.
Downloads videos, audio, and subtitles from YouTube and other online platforms using yt-dlp.
Streamlines the development of Python-based video classification systems with optimized scraping and incremental database management.
Curates specialized AI technology news and technical insights using targeted search strategies and quality filtering rules.
Conducts comprehensive market analysis and trend forecasting across the consumer, technology, healthcare, and finance sectors.
Performs intelligent web searches via the Zhipu search engine with automated relative date resolution.
Extracts and processes comprehensive data from GitHub repositories for ingestion into RAG pipelines and LLM knowledge bases.
Executes a structured, plan-driven implementation workflow that prioritizes context discovery and systematic validation for the kurly-crawler project.
Extracts and analyzes posts, threads, profiles, and media from X (formerly Twitter) directly within your Claude workflow.
Powers Claude Code with semantic search, similar content discovery, and structured research capabilities via the Exa API.
Automates the retrieval and conversion of online framework documentation into local Markdown files for enhanced AI context.
Enables instant web search capabilities using DuckDuckGo to retrieve real-time documentation, news, and technical resources without API keys.
Integrates real-time web search capabilities using the DuckDuckGo engine to find documentation, news, and technical resources without API keys.
Extracts clean, readable text from web articles and blog posts by removing ads, navigation, and clutter.
Fetches and downloads content from any URL using the powerful wget command-line utility.
Scrapes and extracts post data from Threads profiles using automated browser navigation and authentication.
Performs headless web searches and extracts readable markdown content using the Brave Search API without requiring a browser.
Validates blockchain data collection pipelines using a systematic 5-step empirical workflow to ensure data integrity and storage efficiency.
Automates company data enrichment for investment dashboards by fetching employee counts, job postings, and news mentions.
Scroll for more results...