rag-mcp-server
Verified Safeby Jevintlie
Overview
Provides a Retrieval-Augmented Generation (RAG) backend for semantic search and retrieval of Sunway University programme information, intended to be called by a larger language model as a tool.
Installation
python -m src.rag_mcp.mcp.server --stdioEnvironment Variables
- CHROMA_DIR
- COLLECTION
- EMBED_MODEL
- RERANK_MODEL
- CHUNK_TOKENS
- CHUNK_OVERLAP
- TOP_K
Security Notes
The server primarily operates via standard input/output (stdio) using JSON-RPC. It explicitly parses incoming requests and calls predefined internal functions (`rag_search`, `rag_get`). User inputs for search queries and document IDs are passed to the `chromadb` client and `sentence-transformers` models. There are no direct uses of `eval()` or unvalidated shell commands with user-controlled input. All models are loaded with `local_files_only=True`, preventing unauthorized external downloads during runtime. Configuration values for paths and models are managed via environment variables or sensible defaults, avoiding hardcoded secrets within the operational code. The `eval_baseline_non_rag_20q.py` script contains external LLM configuration (API_KEY, MODEL) but this is for an independent evaluation, not the RAG server itself.
Similar Servers
qdrant-loader
A Model Context Protocol (MCP) server that provides advanced Retrieval-Augmented Generation (RAG) capabilities to AI development tools by bridging a QDrant knowledge base for intelligent, context-aware search.
viberag
Local codebase semantic search (RAG) for AI coding assistants via MCP server.
concept-rag
This MCP server provides conceptual search, document analysis, and library exploration capabilities over a knowledge base using LanceDB and LLM-based concept extraction.
ml-mcp
A production-ready Model Context Protocol (MCP) server for a Retrieval-Augmented Generation (RAG) system, leveraging a Neo4j knowledge graph to answer natural language queries about Wroclaw University of Science and Technology.