Web Scraping & Data Collection Agent Skills

Discover Agent Skills for web scraping & data collection. Browse 17 skills for Claude, ChatGPT & Codex.

Browser Content Capture

Extracts data from JavaScript-heavy websites, authenticated pages, and complex documentation using advanced browser automation.

Data Sourcing & Provider Optimization

Optimizes B2B data enrichment through intelligent provider selection, waterfall logic, and credit-efficient routing.

Local Media Source Manager

Lists and manages configured event sources for Instagram accounts and web aggregators used in newsletter generation.

Exa AI Web Search & Research

Powers Claude with real-time web searches, deep company research, and high-quality programming documentation retrieval using Exa AI.

Firecrawl Web Extraction

Extracts high-quality, LLM-optimized web data and performs advanced crawling through a powerful CLI integration.

Hacker News Agent

Integrates real-time Hacker News data streams into AI agents for automated tech news monitoring and community trend analysis.

Asteroid Tracking Agent

Monitors near-Earth asteroids and hazardous space objects using real-time NASA NeoWs data and integrated x402 payment processing.

News Extractor

Extracts structured content from popular Chinese news platforms and converts it into JSON and Markdown formats.

Firecrawl Web Scraper

Converts entire websites into LLM-ready markdown and structured data with advanced anti-bot bypass and JavaScript rendering.

Web Search

Searches the web using Exa AI to provide real-time information retrieval and up-to-date data for AI coding workflows.

Web Content Fetcher

Extracts clean, markdown-formatted content and metadata from any URL using the Jina Reader API for LLM consumption.

URL Summarization Engine

Extracts and summarizes web content using quote-grounding and structured reporting to ensure high technical fidelity.

Google Export

Downloads and converts public Google Docs, Sheets, and Slides into local formats for direct analysis and integration.

Tavily Web Crawler

Crawls and extracts website content into structured markdown files or context-optimized chunks for AI analysis.

Trend Discovery & Market Analysis

Identifies high-potential trending topics and data gaps on X and the web to surface monetization opportunities for AI agents.

Tavily AI Search

Integrates the Tavily API to perform live web searches and structured data retrieval for RAG-augmented workflows.

Hacker News Reader

Fetches and analyzes real-time stories, comments, and user data from Hacker News using the official API.

Apify Web Scraping & Automation

Automates web data collection and browser tasks using pre-built Actors for popular sites like Amazon, Google, and LinkedIn.

RSS Feed Fetcher

Fetches and parses RSS/Atom feeds to automate news gathering and content monitoring directly within Claude Code.

Brave Search

Integrates privacy-focused web, image, video, and news search capabilities directly into Claude Code via the Brave Search API.

Bright Data Web Scraper

Extracts structured data from major social media platforms and websites using the Bright Data Web Scraper API.

Firecrawl Web Scraper

Automates web scraping, site crawling, and structured data extraction from any URL using the Firecrawl API.

ScrapeNinja Web Scraper

Bypasses anti-bot protections and extracts structured data from complex websites using high-performance Chrome TLS fingerprinting and JS rendering.

Supadata Video & Web Extraction

Extracts transcripts from social media videos and scrapes websites into LLM-ready markdown format.

SerpApi Search & Scraping

Accesses real-time search engine results from Google, Bing, and YouTube directly within Claude Code using structured JSON.

Video Downloader

Downloads videos and audio from YouTube and other streaming platforms with customizable quality and format options.

Article Extractor

Extracts clean, readable text from web URLs by removing advertisements, navigation menus, and distractions.

PubMed Database Connector

Facilitates direct access to PubMed literature and the NCBI E-utilities API for advanced biomedical research and data extraction.

Modao Prototype Capture

Automates the extraction of Modao prototype pages, screenshots, and annotations into organized Markdown documentation.

Financial Document Processor

Extracts structured data from financial documents using OCR and text extraction while enforcing rigorous data safety and verification protocols.

30 results loaded • More available

Scroll for more results...