Archive-Agent
Verified Safe
by shredEngineer
Overview
An intelligent file indexer with powerful AI search (RAG engine), automatic OCR, and a seamless MCP interface for document retrieval and question answering.
Installation
archive-agent
Environment Variables
- OPENAI_API_KEY
- ARCHIVE_AGENT_QDRANT_IN_MEMORY
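A minimal sketch for checking, before launch, which of the variables listed above are present in the environment. The listing does not say whether both are required or what values ARCHIVE_AGENT_QDRANT_IN_MEMORY accepts, so the helper (a hypothetical name, `unset_vars`) only reports presence and does not interpret values.

```python
import os

# Names are taken from the list above. How Archive-Agent interprets their
# values is not stated in this listing, so only presence is checked.
LISTED_VARS = ("OPENAI_API_KEY", "ARCHIVE_AGENT_QDRANT_IN_MEMORY")


def unset_vars(env=None):
    """Return the listed variable names that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in LISTED_VARS if not env.get(name)]


if __name__ == "__main__":
    missing = unset_vars()
    if missing:
        print("Not set:", ", ".join(missing))
    else:
        print("All listed variables are set.")
```

Passing a plain dict instead of the real environment makes the check easy to try without touching the shell.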
Security Notes
The MCP server can be configured to listen on all network interfaces (0.0.0.0), which could expose it to the local area network without authentication. The MCP tools are read-only, but users should still be aware of this configuration option. Sensitive API keys (e.g., OPENAI_API_KEY) are supplied through environment variables, and Qdrant runs in Docker; both are good practice. File processing, including OCR and entity extraction, inherently carries the risk of handling untrusted content, though no direct execution vulnerabilities are apparent in the provided code.
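To illustrate the 0.0.0.0 concern, the sketch below flags a bind address that means "all interfaces". It uses only Python's standard `ipaddress` module; the function name `binds_all_interfaces` is hypothetical, and how Archive-Agent itself reads its host setting is not shown in this listing.

```python
import ipaddress


def binds_all_interfaces(host: str) -> bool:
    """Return True if host is a wildcard address such as 0.0.0.0 or ::."""
    try:
        return ipaddress.ip_address(host).is_unspecified
    except ValueError:
        # Hostnames such as "localhost" are not IP literals, so they are
        # not wildcard addresses.
        return False


if __name__ == "__main__":
    for candidate in ("0.0.0.0", "127.0.0.1", "::", "localhost"):
        label = "all interfaces" if binds_all_interfaces(candidate) else "specific"
        print(f"{candidate}: {label}")
```

A check like this could run before starting the server so that a wildcard bind is a deliberate choice rather than an accident.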
Similar Servers
kreuzberg
High-performance document intelligence platform for extracting text, metadata, and structured information (tables, images, chunks) from over 50 document formats (PDFs, Office, images, HTML, etc.). It offers advanced OCR capabilities, multilingual support, and features like chunking, embeddings, and keyword extraction. Functionality is exposed via multiple language bindings and a Model Context Protocol (MCP) server for flexible integration.
mcp-documentation-server
A local-first MCP server for document management, semantic search, and AI-powered document intelligence.
Context-Engine
A Retrieval-Augmented Generation (RAG) stack for codebases, enabling context-aware AI agents for developers and IDEs through unified code indexing, hybrid search, and local LLM integration.
pageindex-mcp
Provides vectorless, reasoning-based RAG capabilities for LLMs to navigate and retrieve information from hierarchical document structures, primarily for long PDFs.