browser-use-mcp-plus
by Martin-Hausleitner
Overview
Provides a unified interface for browser automation, UI analysis (screenshots to text via LLM), Chrome DevTools interaction, documentation lookup (Context7), and isolated code execution (Docker VM) for agentic workflows.
Installation
~/.browser-use-mcp-plus/bin/unified_mcp.shEnvironment Variables
- BROWSER_USE_MCP_PYTHON
- OPENAI_API_KEY
- OPENAI_BASE_URL
- CONTEXT7_API_KEY
- CHROME_BIN
- CHUTES_API_KEY
- BROWSER_USE_ALLOW_HEADLESS_FALLBACK
Security Notes
The `chrome-devtools.evaluate_script` tool allows arbitrary JavaScript execution in the controlled browser instance, posing a critical XSS/code injection risk if the MCP client input is untrusted. The `docker_vm_run` and `agent_s3_vm_run_task` tools enable arbitrary command execution within a Docker container, and `agent_s3_vm_run_task` explicitly supports `unsafe_exec` for unrestricted code execution, creating potential container escape or resource misuse risks if the LLM or input is compromised. While API keys are externalized, the inherent power of these tools requires a highly trusted execution environment and strictly vetted inputs to prevent malicious use.
Similar Servers
playwright-mcp
Provides a Model Context Protocol (MCP) server for LLMs to automate browser interactions using Playwright's accessibility tree, avoiding pixel-based vision models.
mcp-chrome
Transforms the Chrome browser into an AI-controlled automation tool, enabling large language models to interact with web pages, analyze content, and manage browser functions.
context-engineering
Provides a Model Context Protocol (MCP) server that enables AI agents to control a web browser using Selenium for web automation tasks.
mcp-cli-ent
Orchestrates Model Context Protocol (MCP) servers and their tools on-demand for AI agents, without loading tool definitions directly into the agent's context window.