Back to Home
Jevintlie icon

rag-mcp-server

Verified Safe

by Jevintlie

Overview

Provides a Retrieval-Augmented Generation (RAG) backend for semantic search and retrieval of Sunway University programme information, intended to be called by a larger language model as a tool.

Installation

Run Command
python -m src.rag_mcp.mcp.server --stdio

Environment Variables

  • CHROMA_DIR
  • COLLECTION
  • EMBED_MODEL
  • RERANK_MODEL
  • CHUNK_TOKENS
  • CHUNK_OVERLAP
  • TOP_K

Security Notes

The server primarily operates via standard input/output (stdio) using JSON-RPC. It explicitly parses incoming requests and calls predefined internal functions (`rag_search`, `rag_get`). User inputs for search queries and document IDs are passed to the `chromadb` client and `sentence-transformers` models. There are no direct uses of `eval()` or unvalidated shell commands with user-controlled input. All models are loaded with `local_files_only=True`, preventing unauthorized external downloads during runtime. Configuration values for paths and models are managed via environment variables or sensible defaults, avoiding hardcoded secrets within the operational code. The `eval_baseline_non_rag_20q.py` script contains external LLM configuration (API_KEY, MODEL) but this is for an independent evaluation, not the RAG server itself.

Similar Servers

Stats

Interest Score0
Security Score9
Cost ClassLow
Avg Tokens2500
Stars0
Forks0
Last Update2025-12-19

Tags

RAGSemantic SearchInformation RetrievalEducationLocal Models