Discover Agent Skills for web scraping & data collection. Browse 17skills for Claude, ChatGPT & Codex.
Extracts clean, distraction-free text content from web URLs and saves it as readable text files.
Analyzes website structures and debugs web scraping issues using Chrome DevTools to improve data extraction accuracy.
Extracts clean, readable text from blog posts and articles by removing ads, navigation, and clutter.
Downloads videos and audio from YouTube and other platforms for offline viewing, archival, and content repurposing.
Fetches and processes YouTube video transcripts, subtitles, and captions using automated tools and AI transcription.
Overcomes web access restrictions and rate limits by performing federated searches and intelligent content extraction from blocked or challenging URLs.
Converts PDF documents into LLM-friendly Markdown while preserving complex structures like tables, headers, and lists.
Conducts comprehensive cross-platform intelligence gathering with automated cascading research for people, companies, and topics.
Performs deep multi-platform intelligence gathering across LinkedIn, X, Reddit, and GitHub to create actionable networking and sales reports.
Overcomes access restrictions, rate limits, and validation errors to perform reliable web searches and content extraction when standard tools fail.
Performs deep competitive intelligence by synthesizing data from web scraping, social media, and executive leadership profiles.
Scrapes websites, extracts structured data, and automates web data collection pipelines using the Crawl4AI library.
Extracts and structures metadata from PDF form fields into JSON format to facilitate automated document processing and form filling.
Conducts systematic, high-integrity research across diverse information sources with rigorous cross-validation and credibility scoring.
Identifies and captures a subject's authentic voice from social media, blogs, and archives for documentary music projects.
Extracts Twitter posts and comments to organize viewpoints and generate professional narration scripts for content production.
Automates the systematic search, retrieval, and organization of primary source documents from free public archives using browser automation.
Performs journalism-grade investigative research using primary source analysis, triple-source verification, and evidence-chain mapping.
Researches and extracts factual data from official US government agency statements, press releases, and litigation records.
Extracts narrative-rich facts, quotes, and timelines from court documents and indictments for documentary and creative projects.
Analyzes SEC filings, earnings calls, and market data to extract deep corporate insights and financial narratives.
Downloads videos and playlists from YouTube and other platforms in various resolutions and formats for offline viewing and archival.
Extracts YouTube video transcripts, metadata, and chapters into formatted Markdown files for knowledge management systems.
Conducts deep investigative research and source verification for documentary-style creative projects and journalism.
Conducts deep biographical research to extract humanizing details, quotes, and life trajectories for documentary-style music production.
Executes comprehensive web searches using the Gemini command to gather real-time data and detailed information.
Empowers Claude with real-time web search capabilities using the Google Gemini CLI to access up-to-date information and documentation.
Performs intelligent web searches using a prioritized MCP strategy to find the most relevant documentation and live technical data.
Extracts specific data from JSON files efficiently to minimize token usage and improve processing speed.
Extracts, downloads, and cleans YouTube video transcripts and captions for easy reading and analysis.
Scroll for more results...