About
The PDF Text Extractor skill is for research workflows where abstracts alone do not give enough detail for analysis. It selectively downloads academic papers from URLs or arXiv IDs, caches them locally, and extracts clean text to support claim verification and evidence-based writing. Configurable evidence modes and support for local PDFs let researchers trade resource consumption against how exhaustively evidence is gathered. The skill also maintains a structured JSONL index of every processed document so that downstream pipeline steps can consume the results.
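
As a rough illustration of the caching and indexing idea described above, the sketch below resolves a source to a download URL, derives a stable cache filename, and appends records to a JSONL index. It is not the skill's actual implementation: the function names (resolve_pdf_url, cache_path, append_index_record, read_index), the record fields, and the hashing scheme are all assumptions made for this example, and the download and text-extraction steps are left out.

```python
import hashlib
import json
from pathlib import Path


def resolve_pdf_url(source: str) -> str:
    """Return a download URL for either a full URL or a bare arXiv ID."""
    if source.startswith(("http://", "https://")):
        return source
    # Assumption: anything that is not a URL is treated as an arXiv ID.
    return f"https://arxiv.org/pdf/{source}"


def cache_path(url: str, cache_dir: Path) -> Path:
    """Derive a stable filename from the URL so repeat requests reuse the cache."""
    digest = hashlib.sha256(url.encode("utf-8")).hexdigest()[:16]
    return cache_dir / f"{digest}.pdf"


def append_index_record(index_file: Path, record: dict) -> None:
    """Append one JSON object per line (JSONL) to the index file."""
    index_file.parent.mkdir(parents=True, exist_ok=True)
    with index_file.open("a", encoding="utf-8") as fh:
        fh.write(json.dumps(record, ensure_ascii=False) + "\n")


def read_index(index_file: Path) -> list[dict]:
    """Load every record from the JSONL index, skipping blank lines."""
    with index_file.open(encoding="utf-8") as fh:
        return [json.loads(line) for line in fh if line.strip()]
```

A caller might check whether cache_path(...) already exists before downloading, then record the source, cached path, and extracted-text path with append_index_record. Because each record is a single line, later pipeline stages can stream the index without loading it whole.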