cua-mcp-server
Verified Safeby taskcrew
Overview
Delegates desktop automation tasks to an autonomous vision-based AI agent controlling cloud-based virtual machine sandboxes.
Installation
npm run devEnvironment Variables
- CUA_API_KEY
- ANTHROPIC_API_KEY
- BLOB_READ_WRITE_TOKEN
Security Notes
The server implements robust input validation for sandbox names and task IDs to prevent injection and path traversal attacks. It includes specific URL validation (`isValidBlobUrl`) to prevent Server-Side Request Forgery (SSRF) when fetching task history from Vercel Blob storage. API keys are sourced from environment variables or request headers, not hardcoded. The agent architecture relies on an LLM generating structured tool calls, not direct shell commands from user input, significantly reducing direct execution risks. The primary security concern, as noted in the `AGENTS.md` and `README.md`, is the default wide-open CORS policy (`Access-Control-Allow-Origin: *`) for broad MCP compatibility, which might need tightening for production deployments with untrusted users, and the implication that the server's Anthropic API key is shared among authenticated CUA API key users.
Similar Servers
UI-TARS-desktop
UI-TARS-desktop is a native GUI Agent application powered by multimodal AI models, enabling users to control their computer and browser through natural language instructions.
Windows-MCP.Net
Enabling AI assistants to automate tasks and interact with the Windows desktop environment.
toolhive-studio
ToolHive is a desktop application (Electron UI) for discovering, deploying, and managing Model Context Protocol (MCP) servers in isolated containers, and connecting them to AI agents and clients.
mcp-vnc
An MCP server for AI agents to remotely control VNC-enabled desktops (Windows, Linux, macOS) through mouse, keyboard, text input, and screen capture commands.