agentic-browser
by tashifkhan
Overview
An adaptive, model-agnostic browser extension that enables AI agents to understand complex web content and perform interactive automation tasks like form filling, navigation, and data extraction.
Installation
python main.py -mEnvironment Variables
- GOOGLE_API_KEY
- OPENAI_API_KEY
- ANTHROPIC_API_KEY
- OLLAMA_BASE_URL
- TAVILY_API_KEY
- VITE_API_URL
Security Notes
The browser extension uses `new Function(action.script)` in its `EXECUTE_SCRIPT` handler (in `extension/entrypoints/background.ts`). This allows arbitrary JavaScript code, generated by the AI agent, to be executed directly within the context of any web page visited by the user. Given that the extension has `<all_urls>` host permissions, this presents a critical security vulnerability. A compromised or misaligned AI model could inject malicious scripts, leading to cross-site scripting (XSS), data exfiltration, or other severe attacks on any website the user is browsing. Additionally, sensitive credentials (like PyJIIT login information) are stored in browser local storage, which, while segregated by extension, is not as secure as dedicated secrets management and could be compromised if the browser profile is breached. A Google OAuth client ID is hardcoded, which is a minor issue as it's typically public, but still less flexible than configuration.
Similar Servers
playwright-mcp
Provides a Model Context Protocol (MCP) server for LLMs to automate browser interactions using Playwright's accessibility tree, avoiding pixel-based vision models.
mcp-chrome
Transforms the Chrome browser into an AI-controlled automation tool, enabling large language models to interact with web pages, analyze content, and manage browser functions.
mcp-server-browserbase
Enables LLMs to perform cloud browser automation tasks such as navigating, interacting with elements, extracting data, and capturing screenshots on web pages.
AgentBoard
Integrates AI agents with browser capabilities, enabling interaction with web pages and external Model Context Protocol (MCP) servers through a suite of specialized tools for tasks like content extraction and web automation.