MinerU Tianshu is an enterprise-grade document parsing service engineered for high performance and scalability. It features a modern, intuitive web interface built with Vue 3, TypeScript, and TailwindCSS, backed by a robust FastAPI backend. The service leverages LitServe for advanced GPU load balancing and efficient multi-GPU task handling, ensuring optimal processing for demanding document conversion tasks. With powerful parsing capabilities, it converts diverse file types, including PDFs, images, Word, Excel, PowerPoint, and more, into structured Markdown format. A standout feature is its native support for the Model Context Protocol (MCP), enabling seamless integration with AI assistants like Claude Desktop, allowing them to directly utilize its document parsing functionalities. Additionally, it offers comprehensive task management features, including task queues, prioritization, real-time status tracking, and automatic retries, making it ideal for enterprise environments.
Key Features
01High-performance architecture with GPU load balancing and multi-GPU isolation (LitServe)
02Comprehensive task management including queues, priorities, status tracking, and auto-retry
03Enterprise-grade multi-GPU document parsing (PDF, Office, image to Markdown)
0425 GitHub stars
05Seamless AI assistant integration via Model Context Protocol (MCP)
06Modern web UI for task submission, monitoring, and management (Vue 3, FastAPI)