Back to Home
monamaret icon

datasheet-to-md-mcp

Verified Safe

by monamaret

Overview

Converts PDF datasheets and documents into structured Markdown, extracts images, detects diagrams, and generates PlantUML for integration with AI coding assistants via the Model Context Protocol.

Installation

Run Command
pdf-md-mcp

Environment Variables

  • PDF_INPUT_DIR
  • OUTPUT_BASE_DIR
  • MCP_SERVER_NAME
  • MCP_SERVER_VERSION
  • IMAGE_MAX_DPI
  • IMAGE_FORMAT
  • PRESERVE_ASPECT_RATIO
  • DETECT_DIAGRAMS
  • DIAGRAM_CONFIDENCE
  • PLANTUML_STYLE
  • PLANTUML_COLOR_SCHEME
  • INCLUDE_TOC
  • BASE_HEADER_LEVEL
  • EXTRACT_TABLES
  • EXTRACT_IMAGES
  • LOG_LEVEL
  • MCP_TRANSPORT

Security Notes

The server uses standard I/O for communication (stdio transport) and relies on robust Go libraries for PDF parsing and image processing. Path handling for input and output directories utilizes `filepath.Clean` and `filepath.Join`, mitigating simple path traversal vulnerabilities. Resource limits for image processing are in place to prevent memory exhaustion. There is no direct execution of arbitrary commands or `eval`-like patterns observed. Configuration is loaded from environment variables, which is a secure practice, provided the environment variables themselves are managed securely during deployment. The primary security risk would come from misconfiguring `PDF_INPUT_DIR` or `OUTPUT_BASE_DIR` to sensitive file system locations.

Similar Servers

Stats

Interest Score0
Security Score9
Cost ClassHigh
Avg Tokens50000
Stars0
Forks0
Last Update2025-11-28

Tags

PDF processingMarkdown conversionAI integrationDocument analysisPlantUML generation