adk-mcp-multimodal
Verified Safe
by alphinside
Overview
This server implements a multimodal AI agent for product marketing, enabling users to edit product photos and generate marketing videos from images using Google's ADK and Veo 3.1 API.
Installation
uv run adk web
Environment Variables
- GOOGLE_CLOUD_PROJECT
- GOOGLE_CLOUD_LOCATION
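Since both variables must be set before the agent starts, a small pre-flight check can catch a missing one early. The following is a minimal sketch, not part of the project itself: the helper name `missing_env_vars` is hypothetical, and only the two variable names come from the list above.

```python
import os

# Variable names taken from the list above.
REQUIRED_VARS = ("GOOGLE_CLOUD_PROJECT", "GOOGLE_CLOUD_LOCATION")


def missing_env_vars(env=None):
    """Return the required variable names that are unset or empty.

    `env` defaults to os.environ; passing a plain dict makes the
    check easy to exercise without touching the real environment.
    """
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name)]


if __name__ == "__main__":
    missing = missing_env_vars()
    if missing:
        raise SystemExit("Missing environment variables: " + ", ".join(missing))
    print("Environment looks complete.")
```

Running this before `uv run adk web` would report any variable that still needs to be exported.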
Security Notes
The server is built on Google's ADK and GenAI models and relies on the security of those services. The cloud project and location are read from environment variables, so no secrets are hardcoded. Uploaded files are processed and passed to Google APIs rather than executed locally, which limits direct execution risk. There are no obvious `eval` calls or other patterns that allow arbitrary code execution. Local file operations (e.g., `shutil.rmtree`) are confined to a designated 'gradio_artifacts' directory. This assessment assumes that the underlying Google APIs and the ADK framework are themselves secure.
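To illustrate what confining `shutil.rmtree` to one directory can look like, here is a minimal sketch. It is an assumption about the general technique, not the project's actual code: the helper name `remove_within` is hypothetical, and only `shutil.rmtree` and the 'gradio_artifacts' directory name come from the notes above.

```python
import shutil
from pathlib import Path


def remove_within(base_dir, target):
    """Delete `target` only if it is `base_dir` or lies beneath it.

    Both paths are resolved first, so `..` segments and symlinks
    cannot be used to point the deletion outside the base directory.
    """
    base = Path(base_dir).resolve()
    path = Path(target).resolve()
    if path != base and base not in path.parents:
        raise ValueError(f"Refusing to delete {path}: outside {base}")
    shutil.rmtree(path)
```

A caller would pass the artifacts directory as the base, for example `remove_within("gradio_artifacts", "gradio_artifacts/session1")`, where `session1` is a placeholder name. Any target that resolves outside the base raises `ValueError` instead of being deleted.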
Similar Servers
luma-mcp
Provides multi-model vision understanding capabilities to AI assistants that lack native image understanding.
remembrances-mcp
Provides long-term memory, knowledge base, and semantic code indexing capabilities for AI agents.
mcp-game-asset-gen
An MCP server for generating various game development assets, including images, videos, audio, and 3D models using multiple AI providers.
mcp-video-extraction-plus
The server extracts text from various video and audio sources using multiple Automatic Speech Recognition (ASR) providers, including local Whisper and online services from JianYing (CapCut) and Bilibili.