01 Native PDF vision processing for documents up to 1,000 pages
02 0 GitHub stars
03 Context-aware summarization and document-based Q&A
04 Automated handling of large files (>20MB) via Google File API
05 Multimodal understanding of charts, diagrams, and complex layouts
06 Structured data extraction with Pydantic and JSON schema validation
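
The size and page limits in the list above imply a routing decision before a PDF is submitted. The following is a minimal sketch of that decision only, not the tool's actual implementation: `plan_pdf_upload`, `UploadPlan`, and the two constants are hypothetical names, the reading of "20MB" as binary megabytes is an assumption, and no Google API is called.

```python
from dataclasses import dataclass

# Limits taken from the feature list; reading "20MB" as 20 * 1024 * 1024
# bytes is an assumption made for this sketch.
INLINE_SIZE_LIMIT_BYTES = 20 * 1024 * 1024
MAX_PAGES = 1000


@dataclass(frozen=True)
class UploadPlan:
    """Hypothetical result type describing how a PDF would be submitted."""
    method: str  # "inline" or "file_api"
    reason: str


def plan_pdf_upload(size_bytes: int, page_count: int) -> UploadPlan:
    """Pick a submission route for a PDF (illustrative helper, not a real API)."""
    if size_bytes < 0 or page_count < 1:
        raise ValueError("size_bytes must be >= 0 and page_count >= 1")
    if page_count > MAX_PAGES:
        raise ValueError(f"document has {page_count} pages; limit is {MAX_PAGES}")
    # Only files strictly larger than the limit are routed to the File API.
    if size_bytes > INLINE_SIZE_LIMIT_BYTES:
        return UploadPlan("file_api", "file exceeds the inline size limit")
    return UploadPlan("inline", "file fits within the inline size limit")
```

A caller would check `plan_pdf_upload(path.stat().st_size, page_count).method` and branch on the result; how the page count is obtained and how each route is carried out is left to the surrounding code.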