md-server
Verified Safeby peteretelej
Overview
Converts various documents, webpages, and media files into markdown format, serving as an HTTP API or an MCP server for AI assistants to read and process content.
Installation
uvx md-server[mcp] --mcp-stdioEnvironment Variables
- MD_SERVER_HOST
- MD_SERVER_PORT
- MD_SERVER_API_KEY
- MD_SERVER_MAX_FILE_SIZE
- MD_SERVER_TIMEOUT_SECONDS
- MD_SERVER_URL_FETCH_TIMEOUT
- MD_SERVER_CONVERSION_TIMEOUT
- MD_SERVER_BROWSER_TIMEOUT
- MD_SERVER_OCR_TIMEOUT
- MD_SERVER_DEBUG
- MD_SERVER_HTTP_PROXY
- MD_SERVER_HTTPS_PROXY
- MD_SERVER_OPENAI_API_KEY
- MD_SERVER_AZURE_DOC_INTEL_ENDPOINT
- MD_SERVER_AZURE_DOC_INTEL_KEY
- MD_SERVER_CRAWL4AI_JS_RENDERING
- MD_SERVER_CRAWL4AI_TIMEOUT
- MD_SERVER_CRAWL4AI_USER_AGENT
- MD_SERVER_LLM_PROVIDER_URL
- MD_SERVER_LLM_API_KEY
- MD_SERVER_LLM_MODEL
- MD_SERVER_ALLOW_LOCALHOST
- MD_SERVER_ALLOW_PRIVATE_NETWORKS
- MD_SERVER_ALLOWED_FILE_TYPES
Security Notes
The server implements robust Server-Side Request Forgery (SSRF) protection by default, blocking access to private IP ranges, localhost (configurable), and cloud metadata endpoints. API key authentication is supported via a Bearer token, which is a good security practice for the HTTP API. Input validation (file size, URL schemes, MIME types) is in place. Reliance on external conversion libraries (MarkItDown, Crawl4AI) introduces inherent risks associated with complex file/web parsing, but the server applies timeouts and input validation to mitigate these.
Similar Servers
html-to-markdown-mcp
Converts HTML content from web pages or raw strings into Markdown format, with options for including metadata, truncating content, and saving to files.
markitdown-mcp
A Model Context Protocol (MCP) server for converting 29+ file formats (e.g., PDF, Office, images, audio) to clean, structured Markdown, designed for integration with AI workflows and MCP clients like Claude Desktop.
lyra-tool-discovery
This MCP server is designed to fetch, parse, and organize documentation from websites implementing the llms.txt standard. It transforms raw documentation into structured, agent-ready formats, exposing tools for AI agents, LLMs, and automation workflows to consume documentation programmatically.
data-extractor
A commercial-grade MCP Server designed for robust web page and PDF content extraction and conversion to Markdown, purpose-built for long-term enterprise deployment.