Discover Agent Skills for web scraping & data collection. Browse 17 skills for Claude, ChatGPT & Codex.
Adds and configures Instagram accounts and web aggregators as sources for local media event tracking systems.
Extracts event data from Instagram, Facebook, and web aggregators to power local media newsletters.
Converts websites into LLM-ready markdown or structured data using the Firecrawl v2 API.
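For context, a Firecrawl scrape is a single authenticated POST; the sketch below builds such a request against the v2 scrape endpoint. The endpoint path and the `url`/`formats` field names follow Firecrawl's published API, but the markdown-only output choice and the key value are illustrative assumptions.

```python
import json

# Assumed v2 endpoint path; check the Firecrawl docs for your account's base URL.
FIRECRAWL_SCRAPE_URL = "https://api.firecrawl.dev/v2/scrape"

def build_scrape_request(target_url: str, api_key: str) -> tuple[str, dict, bytes]:
    """Build (endpoint, headers, body) for a Firecrawl v2 scrape call.

    Requesting only "markdown" keeps the response LLM-ready and compact;
    the API also supports other formats (e.g. structured extraction).
    """
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    body = json.dumps({"url": target_url, "formats": ["markdown"]}).encode()
    return FIRECRAWL_SCRAPE_URL, headers, body
```

Sending the request (with `urllib.request` or `requests`) requires a real API key, so the builder is kept separate from the network call.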
Discovers related web content, articles, and research papers using AI-powered similarity matching via Exa.ai.
Automates the periodic search and refresh of Exa.ai websets to keep your data collections continuously updated.
Generates fact-based answers and structured data from the web using AI-powered search and synthesis.
Conducts complex, multi-step asynchronous research and deep analysis using Exa's AI-driven search engine.
Checks the archival status and availability of URLs within the Internet Archive's Wayback Machine.
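Archival-status checks like this typically go through the Wayback Machine's public availability endpoint (`https://archive.org/wayback/available?url=<url>`). A minimal sketch of parsing its JSON response; the sample payload below is illustrative, not a live result:

```python
import json
from typing import Optional

# Illustrative response shape from GET https://archive.org/wayback/available?url=<url>
SAMPLE = json.loads("""
{
  "url": "example.com",
  "archived_snapshots": {
    "closest": {
      "status": "200",
      "available": true,
      "url": "http://web.archive.org/web/20240101000000/http://example.com/",
      "timestamp": "20240101000000"
    }
  }
}
""")

def closest_snapshot(payload: dict) -> Optional[dict]:
    """Return the closest archived snapshot, or None if the URL is unarchived."""
    snap = payload.get("archived_snapshots", {}).get("closest")
    if snap and snap.get("available"):
        return snap
    return None
```

An unarchived URL comes back with an empty `archived_snapshots` object, which is why the helper tolerates missing keys.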
Searches for media and automates torrent downloads across multiple sources using a local API.
Conducts comprehensive market intelligence, company analysis, and competitive research using structured methodologies and automated data collection.
Downloads high-quality videos and audio from YouTube and other platforms for offline access and archival.
Searches multiple torrent trackers and automates content downloading via magnet links and WebTorrent.
Downloads YouTube videos and audio with customizable quality and format settings using yt-dlp integration.
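yt-dlp's quality and format control is driven by its `-f`/`--format` selector and `-x` audio-extraction flag; a sketch of assembling a download command around those options. The selector syntax (`bv*[height<=H]+ba/b`) is standard yt-dlp, while the default resolution cap and output template chosen here are illustrative.

```python
def build_ytdlp_command(url: str, audio_only: bool = False, max_height: int = 1080) -> list[str]:
    """Assemble a yt-dlp argv list with a format selector.

    Video mode picks best video up to max_height plus best audio,
    falling back to the best combined stream; audio mode extracts mp3.
    """
    if audio_only:
        fmt = ["-x", "--audio-format", "mp3"]
    else:
        fmt = ["-f", f"bv*[height<={max_height}]+ba/b[height<={max_height}]"]
    return ["yt-dlp", *fmt, "-o", "%(title)s.%(ext)s", url]
```

The same options map one-to-one onto the `YoutubeDL` Python API if the skill embeds yt-dlp as a library rather than shelling out.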
Automates source gathering and note synthesis for the development and validation of Claude Code skills.
Crawls global AI news sources to generate deduplicated, Chinese-language summaries in a structured JSON format.
Extracts and ingests social graph data and content from the AT Protocol and Bluesky into structured formats.
Optimizes data extraction from websites and APIs using specialized Python scripts to maximize performance and minimize token consumption.
Lists and manages archived snapshots from the Wayback Machine to track website history and recover lost content.
Manages local API response caching for Wayback Machine operations to optimize performance and ensure data freshness.
Locates and retrieves the most recent archived version of any URL from the Internet Archive's Wayback Machine.
Retrieves and calculates the full historical archive span for any URL using the Wayback Machine.
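Wayback Machine snapshot identifiers carry a `YYYYMMDDhhmmss` timestamp, so the archive span of a URL is just the distance between its first and last snapshot timestamps. A minimal sketch (the example timestamps are illustrative):

```python
from datetime import datetime

def archive_span_days(first_ts: str, last_ts: str) -> int:
    """Compute whole days between two Wayback timestamps (YYYYMMDDhhmmss)."""
    fmt = "%Y%m%d%H%M%S"
    first = datetime.strptime(first_ts, fmt)
    last = datetime.strptime(last_ts, fmt)
    return (last - first).days
```

The first and last timestamps themselves come from the CDX API (sorted ascending and descending, limit 1), which this helper deliberately leaves out.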
Retrieves comprehensive GitHub user and organization profile data including repository counts, follower statistics, and account metadata.
Retrieves the earliest archived snapshot of any URL from the Wayback Machine to identify a website's original version.
Retrieves and manages historical visual snapshots of websites using the Internet Archive's Wayback Machine.
Converts batches of images and scanned documents into structured markdown files using local DeepSeek-OCR models via Ollama.
Orchestrates a multi-source image pipeline to download, validate, and normalize fighter photos from Wikimedia, Sherdog, and Bing.
Extracts and analyzes large PDF documents locally with semantic chunking to minimize token usage and maximize context efficiency.
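The chunking idea behind such a skill can be sketched as overlap-preserving splitting on paragraph boundaries. This is a simplified stand-in for semantic chunking (real implementations segment on headings, sections, or embedding similarity); the size and overlap parameters are illustrative.

```python
def chunk_text(text: str, max_chars: int = 2000, overlap: int = 200) -> list[str]:
    """Split text into overlapping chunks, preferring paragraph boundaries.

    When a paragraph would push the current chunk past max_chars, the chunk
    is emitted and a tail of it is carried forward so context spans chunks.
    """
    paragraphs = text.split("\n\n")
    chunks: list[str] = []
    current = ""
    for para in paragraphs:
        if current and len(current) + len(para) + 2 > max_chars:
            chunks.append(current)
            # carry a short tail of the previous chunk forward for context
            current = current[-overlap:]
        current = (current + "\n\n" + para) if current else para
    if current:
        chunks.append(current)
    return chunks
```

Overlap trades a little token overhead for continuity: a sentence cut at a chunk boundary still appears whole in the next chunk.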
Automates the collection and organization of AI and data-related job listings from Zighang into Obsidian-compatible markdown.
Performs neural, context-aware web searches and deep research tasks to find high-quality information that keyword matching misses.
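Neural search services of this kind are usually a single POST with an API key header and a query payload; the sketch below builds such a request in the shape of Exa's search API. The endpoint, the `x-api-key` header, and the `query`/`type`/`numResults` field names are assumptions based on Exa's public documentation; verify them against the current docs before relying on this.

```python
import json

# Assumed endpoint; confirm against Exa's current API reference.
EXA_SEARCH_URL = "https://api.exa.ai/search"

def build_exa_request(query: str, api_key: str, num_results: int = 10) -> tuple[str, dict, bytes]:
    """Build (endpoint, headers, body) for a neural search request.

    type="neural" requests embedding-based matching rather than keywords;
    the result count here is an illustrative default.
    """
    headers = {"x-api-key": api_key, "Content-Type": "application/json"}
    body = json.dumps({
        "query": query,
        "type": "neural",
        "numResults": num_results,
    }).encode()
    return EXA_SEARCH_URL, headers, body
```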
Downloads videos, extracts high-quality audio, and generates clean, paragraph-style transcripts from YouTube and other media platforms.