01Preservation of document structure including headers, tables, lists, and multi-column layouts
02Aggressive caching system for instant re-processing of previously analyzed PDFs
03Standardized Markdown output with metadata headers and image summary tables
04Automatic image extraction with relative path referencing and visual analysis support
05High-accuracy AI-powered mode for extracting complex and borderless tables
060 GitHub stars