Discover Agent Skills for web scraping & data collection. Browse 17 skills for Claude, ChatGPT & Codex.
Downloads high-quality images and videos from Twitter/X using automated gallery-dl workflows.
Queries the Google Places API to search for locations, retrieve place details, and fetch reviews directly within the terminal.
Integrates Google Places API into your workflow for searching locations, retrieving business details, and analyzing reviews.
Automates the collection, normalization, and deduplication of Request for Proposal (RFP) opportunities from government and private data sources.
Extracts clean, readable text from web articles and blog posts by removing ads, navigation, and clutter.
Fetches and downloads content from any URL using the powerful wget command-line utility.
Performs headless web searches and extracts readable markdown content using the Brave Search API without requiring a browser.
Validates blockchain data collection pipelines using a systematic 5-step empirical workflow to ensure data integrity and storage efficiency.
Empowers Claude with semantic, neural search capabilities and specialized web filtering using the Exa API.
Downloads and converts YouTube videos into high-quality audio files using yt-dlp and ffmpeg.
Accesses and retrieves research papers from the bioRxiv preprint server for literature reviews and trend analysis.
Extracts clean, plain text from EPUB, MOBI, and PDF files for analysis and data processing.
Aggregates and summarizes real-time macroeconomic news on China from premium financial sources into professional magazine-style reports.
Extracts core resources from social media and technical blogs to generate structured Markdown archives with intelligent categorization.
Automates the extraction and parsing of monthly USDA WASDE reports into standardized datasets for agricultural market analysis.
Generates high-performance, robust Python code for scraping and parsing structured API documentation from HTML.
Analyzes API documentation structures to identify data extraction patterns for automated scraper generation.
Fetches and ranks WeChat articles based on research interests with seamless Obsidian integration.
Empowers Claude with multi-domain search, AI-driven answers, content extraction, and comprehensive deep research reporting.
Extracts, transforms, and structures data from complex Excel files into JSON or CSV formats.
Simplifies querying the Google Places API for search, location details, and business reviews directly from the terminal.
Analyzes PDF documents to extract structured data, including tables, section headers, and metadata, while providing automated summaries.
Monitors and manages updates from blogs and RSS/Atom feeds directly through the CLI.
Searches the internet and converts live webpage content into markdown for real-time information retrieval and analysis.
Extracts structured data from complex websites using a robust, three-phase Playwright automation workflow.
Extracts structured requirements and metadata from job descriptions to facilitate automated candidate matching and recruitment analysis.
Analyzes AI tool URLs to extract metadata and automatically categorizes and adds them to the awesome-ai-tools repository.
Automates the end-to-end lifecycle of discovering, validating, building, and publishing Model Context Protocol (MCP) servers and automation tools.
Retrieves real-time information, news, images, and videos from the web using DuckDuckGo to provide up-to-date data and resources.
Automates information gathering from web searches and authoritative sources to generate and save structured research reports.