html-to-markdown-mcp
by levz0r
Overview
Converts HTML content from web pages or raw strings into Markdown format, with options for including metadata, truncating content, and saving to files.
Installation
npx html-to-markdown-mcpSecurity Notes
The server has potential vulnerabilities related to file system access and network requests. The `saveToFile` parameter in `html_to_markdown` and `filePath` in `save_markdown` use `path.resolve` without sanitization, potentially allowing arbitrary file writes outside the intended directory (e.g., via path traversal like `../../etc/passwd`). Additionally, the `url` parameter in `html_to_markdown` fetches content from external URLs without explicit validation, which could expose the server to Server-Side Request Forgery (SSRF) if a malicious URL is provided.
Similar Servers
scrapi-mcp
This MCP server enables AI agents to scrape web pages and retrieve their content as HTML or Markdown, with advanced browser interaction capabilities.
defuddle-fetch-mcp-server
This server allows LLMs to fetch web content, automatically cleaning HTML into markdown, extracting key metadata like title and author, and supporting chunked reading.
md-server
Converts various documents, webpages, and media files into markdown format, serving as an HTTP API or an MCP server for AI assistants to read and process content.
data-extractor
A commercial-grade MCP Server designed for robust web page and PDF content extraction and conversion to Markdown, purpose-built for long-term enterprise deployment.