Back to Home
goyaladitya05 icon

omni-parser-mcp-server

Verified Safe

by goyaladitya05

Overview

Automates GUI interactions on any software by 'seeing' the screen with computer vision and executing actions based on LLM reasoning.

Installation

Run Command
python -m osiris.server

Environment Variables

  • GOOGLE_API_KEY

Security Notes

The system processes and executes actions based on LLM output, including mouse clicks and keyboard typing. While the JSON parsing helps mitigate direct code injection, a compromised or poorly prompted LLM could instruct the agent to perform malicious actions on the host machine. No direct 'eval' or hardcoded secrets were found. Temporary screenshot files are used and overwritten.

Similar Servers

Stats

Interest Score0
Security Score8
Cost ClassMedium
Avg Tokens1200
Stars0
Forks0
Last Update2025-11-25

Tags

GUI automationComputer VisionLLM AgentMCPUI interaction