RAGFileMcpServer
Verified Safeby pkutsch
Overview
Provides a Model Context Protocol (MCP) server for AI agents, offering Retrieval-Augmented Generation (RAG) capabilities with file-based data sources (PDF, TXT, Markdown) and a Streamlit UI for management.
Installation
docker-compose up -dEnvironment Variables
- STREAMLIT_PORT
- MCP_SERVER_PORT
- EMBEDDING_PROVIDER
- OLLAMA_BASE_URL
- OLLAMA_MODEL
- OPENAI_API_KEY
- VECTOR_STORE_TYPE
- CHUNK_SIZE
- CHUNK_OVERLAP
Security Notes
The server employs environment variables for configuration, a recommended security practice, and uses parameter binding for all SQLite operations in the logging module, effectively preventing SQL injection. File parsing, which can be an attack surface for RAG systems, relies on well-known external libraries (pypdf, chardet). The Streamlit UI binds to `0.0.0.0` by default, which is standard for web interfaces but necessitates awareness regarding external exposure if deployed publicly. The MCP server primarily uses secure STDIO transport, though a network-exposed SSE mode is configurable. No direct `eval` calls, code obfuscation, or other overtly malicious patterns were identified in the provided source.
Similar Servers
mcp-local-rag
Provides a local, RAG-like web search tool for Large Language Models to retrieve current information and context.
codebase-RAG
A Retrieval-Augmented Generation (RAG) server designed to assist AI agents and developers in understanding and navigating codebases through semantic search.
mcp-pdf-reader
Exposes local PDFs for reading, semantic search, chunking, and table extraction to MCP-compatible agents or via a CLI.
viberag
Local codebase semantic search (RAG) for AI coding assistants via MCP server.