01Text-to-speech generation with customizable voices and instructions.
02Multi-model transcription with OpenAI audio models (Whisper, GPT-4o).
03Interactive audio chat with GPT-4o audio models.
04Parallel batch processing for multiple audio files.
0512 GitHub stars
06Advanced file searching with regex, metadata filtering, and sorting.