mcp-manual-walker
Verified Safeby junkreef
Overview
Provides an MCP server for AI agents to efficiently access structured, token-efficient content from PDF manuals using bookmarks for navigation and semantic search.
Installation
python src/mcp_manual_walker/main.pyEnvironment Variables
- PDF_ROOT_DIR
- DB_FILE_PATH
- CHROMADB_PATH
- MARKDOWN_OUTPUT_DIR
- LOG_LEVEL
- MAX_PAGES_PER_REQUEST
- CHUNK_SIZE
- CHUNK_OVERLAP
- CHUNK_OVERLAP_SEARCH_MARGIN
- HOST
- PORT
- DOCLING_NUM_THREADS
- DOCLING_OCR_BATCH_SIZE
- DOCLING_LAYOUT_BATCH_SIZE
- DOCLING_TABLE_BATCH_SIZE
Security Notes
The server runs locally by default (127.0.0.1:8000) and processes PDF files from a predefined directory, reducing direct external attack surface. It uses SQLAlchemy for database interactions, mitigating SQL injection risks. File operations are constrained by `PDF_ROOT_DIR` to prevent path traversal. There are no obvious hardcoded secrets or 'eval' calls. The primary security consideration arises from the `docling` library's PDF processing, as vulnerabilities in this external dependency could be exploited if malicious PDFs are placed in the configured input directory. However, the `docling` processing is done via a separate `builder` script, not directly by the live server exposed to AI agents.
Similar Servers
pdf-reader-mcp
Provides a robust server for AI agents to extract text, images, and metadata from PDF documents, preserving content order for better comprehension.
mcp-server
Provides a Model Context Protocol (MCP) server for AI agents to search and retrieve curated documentation for the Strands Agents framework, facilitating AI coding assistance.
mcp-pdf-reader
Exposes local PDFs for reading, semantic search, chunking, and table extraction to MCP-compatible agents or via a CLI.
pdflens-mcp
This MCP server provides tools for reading and extracting information from PDF files, including text and images, designed for AI clients.