pdf-reader-mcp

Name: pdf-reader-mcp
Author: SylphxAI

Verified Safe

by SylphxAI

View Source

Overview

Provides a robust server for AI agents to extract text, images, and metadata from PDF documents, preserving content order for better comprehension.

Installation

Run Command

npx @sylphx/pdf-reader-mcp

Security Notes

The server correctly uses `path.resolve` to handle file paths, accepting both absolute and relative paths. It has robust input validation via `vex` and implements size limits (100MB) for PDF files to prevent memory exhaustion. Per-page error handling enhances reliability. No direct `eval` or command injection vulnerabilities are apparent. However, as a local file system access tool, it grants the AI agent (and thus, potentially a compromised agent or user) the ability to read any file on the system within the permissions of the running Node.js process. While this is an intended feature for an MCP server, it represents a security boundary that implementers should be aware of regarding the hosting environment and client trust.

Similar Servers

kreuzberg

5420

Extracts text, tables, images, and metadata from a wide range of document formats (PDF, Office, images, HTML, etc.), with support for multiple OCR backends and an extensible plugin system. Can be run as a Micro-Agent Communication Protocol (MCP) server.

Other

$Medium

kreuzberg

5412

Extracts text, tables, images, and metadata from 56 file formats including PDF, Office documents, and images. Supports multiple OCR backends, extensible plugins, and is designed for data preprocessing in AI/ML workflows.

Other

$Medium