pageindex-mcp
Verified Safeby VectifyAI
Overview
This MCP server acts as a bridge, enabling LLM-native, reasoning-based RAG on documents (local or online PDFs) for MCP-compatible agents like Claude and Cursor, without requiring a vector database locally.
Installation
npx -y @pageindex/mcpEnvironment Variables
- PAGEINDEX_API_URL
- DEBUG
- CONNECTION_TIMEOUT
Security Notes
The server uses standard OAuth 2.1 for authentication, involving a temporary local HTTP server for callback and storing tokens with restrictive file permissions (0o600). It handles local and remote PDF uploads, performing validation (size, PDF magic bytes) to mitigate risks, and uploads files via signed URLs to a remote PageIndex API. No 'eval', code obfuscation, or hardcoded secrets were identified. The primary network risks are inherent in fetching external PDFs and relying on the security of the remote PageIndex API, though the local server includes validation steps.
Similar Servers
UltraRAG
An open-source RAG framework for building, experimenting, and evaluating complex Retrieval-Augmented Generation (RAG) pipelines with low-code YAML configurations and native multimodal support.
flexible-graphrag
The Flexible GraphRAG MCP Server integrates document processing, knowledge graph building, hybrid search, and AI query capabilities via the Model Context Protocol (MCP) for clients like Claude Desktop and MCP Inspector.
Matryoshka
Processes large documents beyond LLM context windows using a Recursive Language Model (RLM) that executes symbolic commands for iterative document analysis.
Archive-Agent
An intelligent file indexer with powerful AI search (RAG engine), automatic OCR, and a seamless MCP interface to unlock documents with natural language.