Back to Home
dataML007 icon

simple_local_rag

Verified Safe

by dataML007

Overview

A multi-modal Retrieval-Augmented Generation (RAG) system for querying PDF documents with conversation memory via a Streamlit UI, FastAPI backend, and MCP server integration.

Installation

Run Command
./start_mcp.sh

Environment Variables

  • OPENAI_API_KEY
  • API_HOST
  • API_PORT
  • STREAMLIT_SERVER_PORT
  • MCP_SERVER_PORT

Security Notes

The system uses `os.getenv` for API keys and recommends storing them in a `.env` file excluded from version control, which is good practice. File uploads are handled with temporary files and explicit `.pdf` extension checks, reducing direct path traversal risks. The `VectorStore` uses `pickle.dump` and `pickle.load` for chunk metadata. While typically used for internal data, if an attacker could tamper with the `chunks.pkl` file, this could lead to a deserialization vulnerability. For a 'local' RAG system, this risk is mitigated by assuming trusted local file access. The FastAPI backend uses `allow_origins=["*"]` for CORS, which is noted as acceptable for local development but a security risk for production deployments.

Similar Servers

Stats

Interest Score0
Security Score7
Cost ClassLow
Avg Tokens3000
Stars0
Forks0
Last Update2025-12-06

Tags

RAGLLMFAISSFastAPIStreamlit