01Audio transcription for WAV and MP3 files
020 GitHub stars
03Advanced OCR for text extraction from images and scanned documents
04Token-efficient output optimized for Claude and other LLMs
05AI-enhanced visual descriptions for technical and scientific figures
06Supports 15+ formats including PDF, DOCX, PPTX, XLSX, and EPub