readPDF_mcp_server

by rexfelix

Overview

This server reads PDF documents, extracts text, images, and tables, and provides them to an AI agent in Markdown format.

Installation

Run Command
uv run src/server/main.py

Security Notes

The `read_pdf_resource` tool accepts absolute paths and can therefore read arbitrary files from the server's filesystem (e.g., `pdf:///etc/passwd`). This is a critical information disclosure vulnerability.

Processing untrusted PDFs from local files or URLs (via `read_pdf`) exposes the system to any vulnerabilities in the underlying libraries (PyMuPDF, pdfplumber, pytesseract), and URL-based sources can additionally lead to SSRF. OCR with Tesseract runs an external command, which becomes an attack vector if its inputs are not properly sanitized.

The server does not explicitly implement input sanitization or sandboxing for untrusted PDF content or file paths.
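One common mitigation for the path issue is to confine every `pdf://` URI to a single allowed directory before opening anything. The following is a minimal sketch of that idea, not the server's actual code: `ALLOWED_ROOT` and `resolve_pdf_uri` are hypothetical names, and treating the URI path as relative to the allowed root is an assumption about how such a fix might be designed.

```python
from pathlib import Path
from urllib.parse import unquote, urlparse

# Hypothetical directory that the server is permitted to read from.
ALLOWED_ROOT = Path("/srv/pdfs")


def resolve_pdf_uri(uri: str, allowed_root: Path = ALLOWED_ROOT) -> Path:
    """Map a pdf:// URI to a path inside allowed_root, or raise.

    Hypothetical helper: it only validates the path and does not open the file.
    """
    parsed = urlparse(uri)
    if parsed.scheme != "pdf":
        raise ValueError(f"unsupported scheme: {parsed.scheme!r}")

    # Interpret the URI path as relative to the allowed root, so that
    # "pdf:///etc/passwd" can never address the real /etc/passwd.
    relative = unquote(parsed.path).lstrip("/")
    root = allowed_root.resolve()

    # resolve() collapses ".." segments and follows symlinks, so the
    # containment check below sees the real target location.
    candidate = (root / relative).resolve()
    if not candidate.is_relative_to(root):  # requires Python 3.9+
        raise PermissionError(f"path escapes allowed root: {uri}")

    if candidate.suffix.lower() != ".pdf":
        raise ValueError(f"not a PDF path: {uri}")
    return candidate
```

With this approach, a traversal attempt such as `pdf:///../../etc/passwd` raises `PermissionError`, and `pdf:///etc/passwd` is rejected because it does not name a `.pdf` file under the allowed root. This addresses only the file-path issue; URL fetching and parser vulnerabilities would still need separate controls such as host allowlists and sandboxing.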

Stats

Interest Score: 0
Security Score: 3
Cost Class: High
Avg Tokens: 10000
Stars: 0
Forks: 0
Last Update: 2025-12-13

Tags

PDF processing, Document analysis, OCR, AI tool, Markdown generation