Scrapling
Verified Safeby D4Vinci
Overview
Provides adaptive web scraping capabilities to AI chatbots and agents, allowing them to fetch, parse, and extract targeted data from websites, including dynamic content and anti-bot protected sites.
Installation
docker run -i --rm pyd4vinci/scrapling mcpSecurity Notes
The MCP server is designed to fetch content from arbitrary URLs and process user-defined CSS/XPath selectors, which are inherent functions of a web scraping tool. While the underlying parsing (lxml, cssselect) and fetching (curl_cffi, Playwright) libraries are generally robust against direct code injection, exposing such an API, especially to an AI agent, introduces potential risks such as Server-Side Request Forgery (SSRF) if the server has internal network access and is not properly firewalled. Resource exhaustion could also occur with extremely complex selectors or very large pages. There are no clear indications of hardcoded secrets, `eval` of untrusted input, or other direct code vulnerabilities in the provided snippets. The primary security considerations are operational: proper deployment environment, network segmentation, and strong access controls/input validation at the AI agent level controlling the server.
Similar Servers
mcp-server-browserbase
Enables LLMs to perform cloud browser automation tasks such as navigating, interacting with elements, extracting data, and capturing screenshots on web pages.
brightdata-mcp
Enables AI agents to access, search, extract, and navigate the live web in real-time without being blocked.
mcp
This server provides Hyperbrowser's Model Context Protocol (MCP) interface, offering tools for web scraping, structured data extraction, crawling, and general-purpose browser automation using AI agents like OpenAI's CUA and Anthropic's Claude Computer Use.
AgentBoard
Integrates AI agents with browser capabilities, enabling interaction with web pages and external Model Context Protocol (MCP) servers through a suite of specialized tools for tasks like content extraction and web automation.