0159 GitHub stars
02Multi-modal capabilities: Vision, Text-to-Speech, Speech-to-Text, Image and Video Generation
03Full MCP server with 41+ tools for programmatic control and Agentic AI Sidekick
04Five simultaneous AI backends (Apple Intelligence, MLX, llama.cpp, HuggingFace, External AI)
05HuggingFace Explorer for cloud inference, model browsing, and media generation
06On-device inference optimized for Apple Silicon with WhisperKit, MLX, and llama.cpp