01Intelligent PDF splitting that preserves all visual content
02Token-aware text extraction for LLMs using tiktoken
03Comprehensive PDF metadata and processing estimates
04Precise extraction of specific page ranges
05Production-ready design with structured logging and robust error handling
060 GitHub stars