Chomper FAQs

Question 1

Which document formats and features does Chomper support?

Accepted Answer

Chomper supports over 36 document formats across 15+ categories, including PDF, DOCX, HTML, Excel, Markdown, code files, and email formats. Key features include embedding-based semantic chunking for RAG, image extraction from PDFs, rich metadata extraction, and built-in MCP prompts for common analysis tasks like summarization and entity extraction.

Question 2

What is semantic chunking and how does it benefit Retrieval Augmented Generation (RAG)?

Accepted Answer

Semantic chunking in Chomper uses embedding-based methods (like sentence-transformers) to divide documents into contextually meaningful segments rather than fixed-size blocks. This benefits RAG systems because each retrieved chunk tends to cover one coherent topic, so the model receives more relevant, self-contained context and can give more accurate responses.
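
To make the idea concrete, here is a minimal Python sketch of embedding-based chunking. It is not Chomper's implementation: the function names (toy_embed, semantic_chunks) and the 0.2 similarity threshold are assumptions made for illustration, and a simple bag-of-words vector stands in for a real embedding model such as one from sentence-transformers. Consecutive sentences stay in the same chunk while they remain similar, and a new chunk starts when similarity drops.

```python
import math
import re
from collections import Counter


def toy_embed(sentence):
    """Stand-in embedding: a bag-of-words count vector (not a real model)."""
    return Counter(re.findall(r"[a-z]+", sentence.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def semantic_chunks(sentences, embed=toy_embed, threshold=0.2):
    """Group consecutive sentences; open a new chunk when similarity drops.

    The threshold is an illustrative assumption, not a Chomper setting.
    """
    chunks = []
    for sentence in sentences:
        # Compare the new sentence with the last sentence of the current chunk.
        if chunks and cosine(embed(chunks[-1][-1]), embed(sentence)) >= threshold:
            chunks[-1].append(sentence)
        else:
            chunks.append([sentence])
    return chunks


sentences = [
    "The cat sat on the mat.",
    "The cat chased the mouse.",
    "Quarterly revenue grew strongly.",
    "Revenue growth beat quarterly forecasts.",
]
chunks = semantic_chunks(sentences)
for chunk in chunks:
    print(chunk)
```

With this input the two cat sentences land in one chunk and the two revenue sentences in another. Swapping toy_embed for a real embedding function would keep the same grouping logic while capturing meaning beyond shared words.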

Question 3

How does Chomper optimize data for AI models and reduce token usage?

Accepted Answer

Chomper uses its proprietary Token-Optimized Object Notation (TOON) output format, which can reduce token usage by approximately 40% compared to standard JSON. It also offers smart token management with summary modes and pagination for efficient processing of large documents.
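
The sketch below illustrates why a compact notation can be cheaper than JSON: JSON repeats every key for every record, while a tabular layout states the field names once. The layout produced by to_compact_table is a hypothetical format invented for this example and should not be read as the actual TOON syntax; the sample rows are also made up. Character count is used only as a rough proxy for tokens, and real savings depend on the tokenizer and the data.

```python
import json


def to_compact_table(name, rows):
    """Encode uniform dicts as one header line plus one comma-separated line per row.

    Hypothetical layout for illustration only; not the actual TOON syntax.
    Assumes every row has the same keys and no value contains a comma.
    """
    fields = list(rows[0].keys())
    header = name + "[" + str(len(rows)) + "]{" + ",".join(fields) + "}:"
    lines = ["  " + ",".join(str(row[f]) for f in fields) for row in rows]
    return "\n".join([header] + lines)


# Made-up sample data standing in for extracted document chunks.
rows = [
    {"page": 1, "type": "heading", "text": "Introduction"},
    {"page": 1, "type": "paragraph", "text": "Chomper parses documents"},
    {"page": 2, "type": "table", "text": "Results"},
]

as_json = json.dumps({"chunks": rows})
as_compact = to_compact_table("chunks", rows)

print(as_compact)
print(len(as_json), "characters as JSON vs", len(as_compact), "in the compact form")
```

The saving grows with the number of records, since the per-record cost of repeated keys, quotes, and braces is paid only once in the header.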

Question 4

Can Chomper be integrated with Claude AI models?

Accepted Answer

Yes. Chomper is built as an MCP server, so it is compatible with Claude. It offers direct integration methods for Claude Code and Claude Desktop, which let users call Chomper's parsing, chunking, and optimization capabilities from within their Claude workflows.

Question 5

What is Chomper and what is its primary purpose?

Accepted Answer

Chomper is an advanced MCP (Model Context Protocol) server designed to parse and extract text, metadata, and images from over 36 document file formats. Its primary purpose is to prepare and optimize this extracted content for use with AI models, especially large language models (LLMs) like Claude.
