docsynthai
Verified Safe
by raahulrawat
Overview
An intelligent document processing server that classifies documents using rule-based and AI (Gemini Vision) methods via the Model Context Protocol (MCP).
Installation
python server.py
Environment Variables
- DOCSYNTH_RULES_FILE
- DOCSYNTH_MAX_BASE64_BYTES
- DOCSYNTH_TRY_HTTP
- DOCSYNTH_HOST
- DOCSYNTH_PORT
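The listing gives only the variable names, so the following is a minimal sketch of how a server might read them at startup. The `DocSynthConfig` class, the `load_config` helper, every default value, and the reading of `DOCSYNTH_TRY_HTTP` as a boolean flag are assumptions made for illustration; they are not taken from the docsynthai source.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class DocSynthConfig:
    """Hypothetical container for the settings named above."""
    rules_file: str
    max_base64_bytes: int
    try_http: bool
    host: str
    port: int


def _as_bool(value: str) -> bool:
    # Treat common "truthy" spellings as True; everything else is False.
    return value.strip().lower() in {"1", "true", "yes", "on"}


def load_config(env=None) -> DocSynthConfig:
    """Build a config from a mapping (defaults to os.environ).

    All fallback values below are assumed, not documented defaults.
    """
    env = os.environ if env is None else env
    return DocSynthConfig(
        rules_file=env.get("DOCSYNTH_RULES_FILE", "rules.json"),
        max_base64_bytes=int(env.get("DOCSYNTH_MAX_BASE64_BYTES", str(10 * 1024 * 1024))),
        try_http=_as_bool(env.get("DOCSYNTH_TRY_HTTP", "false")),
        host=env.get("DOCSYNTH_HOST", "127.0.0.1"),
        port=int(env.get("DOCSYNTH_PORT", "8000")),
    )
```

Passing a plain dictionary instead of the real environment, for example `load_config({"DOCSYNTH_PORT": "9000"})`, makes the parsing easy to check in isolation.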
Security Notes
The server persists rules with `json.load`/`json.dump`, which parse data without executing it. Base64-decoded images are processed with PIL, which is generally robust but not immune to image-based exploits; size limits (`MAX_IMAGE_BYTES`, `MAX_BASE64_BYTES`) are enforced to reduce that exposure. The Google API key is supplied at runtime through a tool rather than hardcoded, and is stored in memory, which is acceptable for server operation. No direct `eval` or `exec` of user-controlled input was found. Overall, the system appears to be designed with reasonable security considerations for its scope.
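As an illustration of the size limits mentioned above, the sketch below checks a base64 payload before and after decoding. The constant names come from the notes, but their values, the `decode_image_payload` function, and the order of the checks are assumptions; the actual server may enforce the limits differently.

```python
import base64
import binascii

# Limit values are assumed for this sketch; only the names appear in the notes.
MAX_BASE64_BYTES = 10 * 1024 * 1024
MAX_IMAGE_BYTES = 7 * 1024 * 1024


def decode_image_payload(b64_text: str) -> bytes:
    """Decode a base64 string, rejecting oversized or malformed input."""
    # Check the encoded length first so huge inputs are never decoded.
    # Valid base64 is ASCII, so the character count equals the byte count.
    if len(b64_text) > MAX_BASE64_BYTES:
        raise ValueError("base64 payload exceeds MAX_BASE64_BYTES")
    try:
        # validate=True rejects characters outside the base64 alphabet.
        raw = base64.b64decode(b64_text, validate=True)
    except binascii.Error as exc:
        raise ValueError("payload is not valid base64") from exc
    # Check the decoded size before handing the bytes to an image library.
    if len(raw) > MAX_IMAGE_BYTES:
        raise ValueError("decoded image exceeds MAX_IMAGE_BYTES")
    return raw
```

Rejecting the payload before it reaches the image library keeps the more complex parsing code away from input that is already known to be out of bounds.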
Similar Servers
kreuzberg
Extracts text, tables, images, and metadata from a wide range of document formats (PDF, Office, images, HTML, etc.), with support for multiple OCR backends and an extensible plugin system. Can be run as a Model Context Protocol (MCP) server.
kreuzberg
Extracts text, tables, images, and metadata from 56 file formats including PDF, Office documents, and images. Supports multiple OCR backends, extensible plugins, and is designed for data preprocessing in AI/ML workflows.
mineru-tianshu
Enterprise-grade AI data preprocessing platform that converts diverse unstructured multi-modal data (documents, images, audio, video, bioinformatics formats) into structured Markdown and JSON. It uses GPU acceleration and provides a task management system with user authentication and MCP integration.
mcp-documentation-server
A local-first MCP server for document management, semantic search, and AI-powered document intelligence.