01 Advanced VLM-based parsing for multi-column academic paper structures
02 Integrated audio transcription and multi-format text-to-markdown conversion
03 0 GitHub stars
04 High-accuracy extraction of complex tables, figures, and mathematical formulas
05 Intelligent tool selection logic based on document complexity and layout
06 Automated batch processing capabilities with quality verification checklists