Discover Agent Skills for web scraping & data collection. Browse 17 skills for Claude, ChatGPT & Codex.
Implements robust reliability patterns like circuit breakers, idempotency, and graceful degradation for production-grade Firecrawl integrations.
Resolves complex Firecrawl errors using systematic evidence collection and deep-layer diagnostic techniques.
Executes the primary integration workflow for the Exa search engine to implement core search and data retrieval features.
Automates the primary web crawling and data extraction process using the Firecrawl API to generate LLM-ready content.
Optimizes Firecrawl operational costs through intelligent tier selection, usage monitoring, and budget-aware implementation strategies.
Implements robust rate limiting, exponential backoff, and idempotency patterns for Firecrawl API integrations.
Aggregates real-time cryptocurrency news from over 50 authoritative sources with advanced filtering and relevance scoring.
Extracts and saves YouTube video subtitles or transcripts to local text files using command-line tools or automated browser interaction.
Integrates vision analysis, real-time web search, and GitHub exploration capabilities into Claude Code workflows.
Enhances Claude with real-time web search, vision-based image analysis, and advanced GitHub repository exploration.
Extracts and organizes brand logos for DeFi vault protocols by identifying homepage links and automating asset retrieval.
Normalizes and merges duplicate data from multiple sources using reputation scoring and semantic hash-based grouping.
Fetches Twitter/X post content and metadata into clean Markdown format using the Jina.ai API to bypass JavaScript restrictions.
Transforms web pages into clean, readable Markdown files optimized for AI ingestion and local documentation.
Performs real-time AI web searches with citations using Perplexity models to provide up-to-date information and scientific literature.
Aggregates and synthesizes real-world developer perspectives from Hacker News, Reddit, and major technical communities.
Researches technical solutions and gathers cross-platform evidence to inform architecture and implementation decisions.
Transforms browser traffic into production-ready Python API clients through automated HAR analysis and code generation.
Enables Claude to search the live web and fetch content from specific URLs to provide up-to-date information.
Equips Claude with high-performance web search capabilities and deep content extraction tools powered by the Tavily API.
Converts diverse file formats including PDFs, Office documents, and media into structured, token-efficient Markdown for LLM processing.
Parses and extracts structured content from complex PDF documents using LlamaParse and agentic OCR capabilities.
Transforms unstructured files like PDFs, Word documents, and presentations into structured Pydantic models using LlamaExtract services.
Converts websites into LLM-ready Markdown and structured data using the Firecrawl API.
Downloads high-quality videos and HLS streams from platforms like YouTube, Vimeo, and Mux using optimized workflows for yt-dlp and ffmpeg.
Replicates existing websites into production-ready Next.js 16 and Tailwind CSS v4 codebases using Firecrawl MCP.
Automates web content extraction using a progressive four-tier strategy to bypass bot detection and CAPTCHAs.
Implements a four-tier progressive fallback strategy to reliably extract web content from any URL, regardless of bot detection or JavaScript requirements.
Downloads high-quality video and audio content from YouTube and HLS-based streaming platforms while resolving common authentication and formatting issues.
Conducts structured, multi-threaded web research by coordinating subagents to gather and synthesize complex information into comprehensive reports.