The Awels PDF Processor is a Model Context Protocol (MCP) server specializing in robust PDF processing. It leverages docling to convert PDF documents into clean Markdown format, with the added capability of extracting images such as page images, tables, and figures. Engineered to run in isolated environments, this server proactively avoids common permission issues, ensuring reliable operation. It provides structured JSON output detailing processing results, file metadata, and statistics, making it ideal for integrating advanced document conversion and data extraction into various systems.
Key Features
01Structured JSON output with detailed processing results
02Batch Processing of multiple PDF files
030 GitHub stars
04Comprehensive Image Extraction from PDFs
05Isolated execution for enhanced security and permission handling
06PDF to Markdown Conversion using docling