01. Intelligent table extraction with adaptive handling for long or outlined tables
02. Advanced OCR capabilities for high-resolution text extraction
03. 173 GitHub stars
04. HTML-formatted table extraction to maintain structural integrity
05. Agentic page parsing for high-accuracy document interpretation
06. Standardized markdown output generation optimized for LLM processing