About
This skill provides a comprehensive technical framework for integrating privacy-first, on-device AI capabilities directly into iOS apps. It offers production-ready patterns for selecting between Apple’s native Foundation Models and the MLX Swift framework, enabling developers to implement local LLM inference, vision language models, and text embeddings. The skill emphasizes mobile-specific best practices such as memory-aware quantization, asynchronous model loading, and streaming responses to ensure smooth user experiences on Apple Silicon hardware.