01Multi-source extraction: priorities manual subtitles, then auto-captions, then AI transcription
02Whisper AI integration: generates transcripts for videos that have no captions available
03Flexible output formats: supports both raw VTT with timestamps and cleaned plain text files
04Automated environment setup: checks for and installs yt-dlp and Whisper dependencies
05Intelligent post-processing: removes VTT timestamps and deduplicates overlapping lines
060 GitHub stars