01OCR capabilities for converting scanned documents into searchable text
02High-fidelity text and table extraction with layout preservation
03Programmatic PDF generation and multi-page report building
049,809 GitHub stars
05Integration with command-line tools like qpdf and poppler-utils
06Advanced document manipulation including merging, splitting, and rotation