Discover Agent Skills for web scraping & data collection. Browse 17 skills for Claude, ChatGPT & Codex.
Scrapes and filters job listings from Stepstone Germany with support for direct-employer targeting and real-time data extraction.
Extracts and saves subtitles or transcripts from any YouTube video URL directly to your local workspace.
Extracts and analyzes competitor advertisements from ad libraries to uncover winning messaging, pain points, and creative strategies.
Extracts clean, clutter-free article and blog content from URLs by stripping away ads, navigation, and unnecessary UI elements.
Identifies hiring managers and key decision-makers on LinkedIn to streamline job outreach and lead generation.
Automates the end-to-end recruitment pipeline by orchestrating job board scraping and identifying hiring managers on LinkedIn.
Automates the extraction of real-time job listings from Indeed Germany with advanced filtering and deep-dive capabilities.
Orchestrates parallel research agents across Perplexity, Claude, and Gemini to deliver synthesized, multi-perspective reports with source attribution.
Filters, deduplicates, and scores job listings to isolate high-quality technical roles from direct employers.
Integrates real-time web search and content extraction capabilities into Claude Code using the Tavily API.
Automates web scraping task creation and management across social media, e-commerce, and SEO platforms via Feishu API integration.
Empowers Claude with real-time web search, content extraction, and automated crawling capabilities using the Tavily API.
Extracts fully rendered HTML and dynamic content from JavaScript-heavy websites using headless browser automation.
Empowers Claude with real-time web search, content extraction, and automated crawling capabilities using the Tavily API.
Searches the live web and extracts structured content or code examples using Exa AI's neural search engine.
Extracts and formats transcripts from YouTube videos, playlists, and channels using a single unified command.
Accesses official USPTO APIs to perform comprehensive patent searches, trademark tracking, and patent examination history analysis.
Transforms unstructured text from any source into clean, validated JSON based on specific user-defined fields or schemas.
Queries your Google NotebookLM notebooks directly from Claude Code for source-grounded, citation-backed documentation answers.
Optimizes technical information retrieval by delegating web searches to a specialized agent for structured, high-reliability results.
Extracts clean, distraction-free text content from web articles and blog posts for easy reading and local storage.
Automates the extraction of structured data from websites using optimized scraping patterns.
Extracts high-speed, read-only markdown content from documentation, blogs, and static websites.
Searches and retrieves life sciences preprints from the bioRxiv server with comprehensive metadata and PDF support.
Extracts deep web content, captures screenshots, and parses PDFs using the powerful Firecrawl API.
Extracts content from websites using the Scrape.do API to bypass anti-bot protections and render dynamic JavaScript.
Implements a progressive four-tier scraping strategy to retrieve content from any URL while bypassing bot detection and CAPTCHAs.
Executes a progressive four-tier scraping strategy to reliably extract content from websites while bypassing bot detection and CAPTCHAs.
Extracts and analyzes competitor advertising data from social ad libraries to identify high-performing messaging and creative patterns.
Automates multi-tier web scraping with an intelligent fallback strategy to bypass bot detection and capture content from any URL.
Scroll for more results...