webscraping-ai-mcp-server
Verified Safeby webscraping-ai
Overview
Integrates with WebScraping.AI to provide LLM-powered web data extraction, including question answering, structured data extraction, and HTML/text retrieval, with advanced features like JavaScript rendering and proxy management.
Installation
env WEBSCRAPING_AI_API_KEY=YOUR_API_KEY npx -y webscraping-ai-mcpEnvironment Variables
- WEBSCRAPING_AI_API_KEY
Security Notes
The server itself does not execute user-provided code on the host. The 'js_script' parameter is sent to the external WebScraping.AI API for execution within their headless browser environment. The 'ContentSanitizer' is a positive security feature, wrapping scraped content with clear boundaries to protect against prompt injection into connected LLMs. A minor concern is 'JSON.parse(error.message)' for API errors, which could theoretically be a vector if the external API's error messages are crafted maliciously, but its impact is limited to formatting error output, not code execution. Overall, it implements reasonable safeguards for its function.
Similar Servers
DevDocs
Provides intelligent web crawling and documentation extraction, storing content in a Model Context Protocol server for LLM querying and accelerating developer research.
mcp
This server provides Hyperbrowser's Model Context Protocol (MCP) interface, offering tools for web scraping, structured data extraction, crawling, and general-purpose browser automation using AI agents like OpenAI's CUA and Anthropic's Claude Computer Use.
mcp-omnisearch
Provides a unified interface for LLMs to access multiple web search, AI response, content processing, and enhancement tools from various providers through the Model Context Protocol (MCP).
mcp-server
Provides a Model Context Protocol (MCP) server for integrating comprehensive search and data extraction via SerpApi.