mcp-browser-use-server
by hubertusgbecker
Overview
An MCP server that enables AI agents to control web browsers using the browser-use library, supporting various transports and persistent sessions.
Installation
docker-compose up -dEnvironment Variables
- OPENAI_API_KEY
- LLM_MODEL
- CHROME_PATH
- PATIENT
- LOG_LEVEL
- BROWSER_HEADLESS
- BROWSER_WINDOW_WIDTH
- BROWSER_WINDOW_HEIGHT
- BROWSER_LOCALE
- TASK_EXPIRY_MINUTES
- ESTIMATED_TASK_SECONDS
- CLEANUP_INTERVAL_SECONDS
- MAX_AGENT_STEPS
- OPENAI_REVERSE_PROXY
- ENABLE_DEFAULT_EXTENSIONS
- WAIT_FOR_NETWORK_IDLE_PAGE_LOAD_TIME
- MINIMUM_WAIT_PAGE_LOAD_TIME
Security Notes
The server includes a 'direct summarizer' feature that fetches content from a user-provided URL using `requests.get()`. This `url_text` parameter is taken directly from the AI agent's input without explicit domain validation or sanitization within this direct path. This creates a potential Server-Side Request Forgery (SSRF) vulnerability, allowing a malicious AI agent to potentially make the server request resources from internal networks or other arbitrary endpoints not intended for public access. Although the `browser-use` agent itself might have `allowed_domains` as a parameter, this direct summarizer bypasses the agent's full execution flow and its associated security controls. Other components, such as `subprocess.Popen` for `mcp-proxy`, execute trusted binaries with arguments controlled by the server's CLI rather than direct AI agent input, which is less of a concern.
Similar Servers
playwright-mcp
Provides a Model Context Protocol (MCP) server for LLMs to automate browser interactions using Playwright's accessibility tree, avoiding pixel-based vision models.
context-engineering
Provides a Model Context Protocol (MCP) server that enables AI agents to control a web browser using Selenium for web automation tasks.
qa-use
Provides comprehensive browser automation and QA testing capabilities, integrating with a backend platform for automated tests, interactive debugging, and batch test execution.
playwright-mcp-server
Provides a robust, token-aware Playwright browser automation server for Large Language Models (LLMs) and coding agents to interact with web content, supporting advanced features like anti-detection and multi-page management.