01Supports multiple document formats: PDF, Excel, CSV, TXT, JSON, Markdown, DOCX
02Streaming API for memory-efficient processing of large files
03Smart encoding detection for UTF-8, Latin-1, CP1252, ISO-8859-1
04Configurable process-wide rate limiting
05Docker support for isolated execution with non-root user and read-only mounts
060 GitHub stars