Support for multi-format conversion including DOCX, XLSX, images, and web content
Standardized output organization optimized for subsequent qualitative coding stages
Advanced PDF parsing using MinerU VLM-powered extraction for tables and figures
Automated data inventory management with JSON tracking for audit trails
Structured audio-to-markdown workflows for interview transcripts and speaker labeling
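
As a rough illustration of JSON tracking for audit trails, the sketch below appends one record per converted file to an inventory file. It is only a minimal sketch under assumptions: the function names, the field names (`source_sha256`, `converted_at`, and so on), and the `inventory.json` layout are all hypothetical and are not taken from the tool itself.

```python
import hashlib
import json
from datetime import datetime, timezone
from pathlib import Path


def file_sha256(path: Path) -> str:
    """Return the SHA-256 hex digest of a file's contents."""
    return hashlib.sha256(path.read_bytes()).hexdigest()


def record_conversion(inventory_path: Path, source: Path, output: Path, tool: str) -> dict:
    """Append one conversion record to a JSON inventory file and return it.

    Hypothetical schema: the real tool's inventory format may differ.
    """
    # Load the existing inventory, or start a new one if the file is absent.
    if inventory_path.exists():
        inventory = json.loads(inventory_path.read_text(encoding="utf-8"))
    else:
        inventory = {"entries": []}

    entry = {
        "source": str(source),
        # A checksum lets an auditor confirm the source file was not altered.
        "source_sha256": file_sha256(source),
        "output": str(output),
        "tool": tool,  # free-form label for the converter used (assumed field)
        "converted_at": datetime.now(timezone.utc).isoformat(),
    }
    inventory["entries"].append(entry)

    # Rewrite the whole file; fine for small inventories.
    inventory_path.write_text(json.dumps(inventory, indent=2), encoding="utf-8")
    return entry
```

Calling `record_conversion(Path("inventory.json"), Path("interview_01.docx"), Path("interview_01.md"), "docx-converter")` after each conversion would build up a chronological log. Storing a checksum and a UTC timestamp per entry is one simple way to make such a log usable as an audit trail.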