fetchv2-mcp-server
Verified Safeby praveenc
Overview
Fetches web content, extracts clean markdown using Trafilatura, discovers links, and supports llms.txt for AI agents.
Installation
uvx fetchv2-mcp-server@latestSecurity Notes
The server uses `httpx` for network requests with timeouts, `protego` for robots.txt compliance, and `trafilatura` for content extraction, all of which are well-regarded libraries. It explicitly avoids fetching if `robots.txt` disallows it (unless bypassed manually). The use of `defusedxml` for XML parsing is a good security practice. No `eval` or `exec` calls are present. Regex patterns for link discovery are controlled. No apparent hardcoded secrets or malicious patterns.
Similar Servers
DevDocs
DevDocs is a web crawling and content extraction platform designed to accelerate software development by converting documentation into LLM-ready formats for intelligent data querying and fine-tuning.
mcp
This server provides Hyperbrowser's Model Context Protocol (MCP) interface, offering tools for web scraping, structured data extraction, crawling, and general-purpose browser automation using AI agents like OpenAI's CUA and Anthropic's Claude Computer Use.
kindly-web-search-mcp-server
Provides web search with robust, LLM-optimized content retrieval from various sources (StackExchange, GitHub, Wikipedia, arXiv, and general webpages) for AI coding assistants.
crawlbase-mcp
A Model Context Protocol (MCP) server that enables AI agents and LLMs to fetch fresh, structured, real-time web content (HTML, Markdown, screenshots) via Crawlbase's scraping infrastructure.