01Integrated function calling support within the voice interaction loop
02Advanced Semantic Voice Activity Detection (VAD) for natural turn-taking
032 GitHub stars
04Support for multimodal models including GPT-4o-realtime and Phi-4
05Seamless Azure AI authentication using Microsoft Entra ID or API keys
06Real-time bidirectional WebSocket communication for voice streaming