01Addresses visual grounding, compositional, and external knowledge-dependent VQA tasks
02Integrates with OpenAI API for enhanced language understanding and interaction
03Compatible with FastMCP streamable-HTTP server for client tooling integration
04Implements a Mixture-of-Experts (MoE) architecture for VQA
051 GitHub stars
06Supports both Dockerized and pure Python server installations