01Supports 20+ formats including PDF, DOCX, XLSX, PPTX, and EPUB
02Converts structured data like CSV, JSON, and XML into readable Markdown tables
03Integrates with Azure Document Intelligence and GPT-4o for enhanced extraction quality
0481 GitHub stars
05Extracts text from images using OCR and transcribes audio to text
06Extracts YouTube transcripts and web content via URL