qwen-video-mcp-server
Verified Safeby adamanz
Overview
An MCP (Model Context Protocol) server for AI agents like Claude to analyze videos and images using Qwen3-VL deployed on Modal.
Installation
uv run server.pyEnvironment Variables
- MODAL_WORKSPACE
- MODAL_APP
- QWEN_IMAGE_ENDPOINT
- QWEN_VIDEO_ENDPOINT
Security Notes
The server's primary function is to proxy requests to a Modal backend, processing external video/image URLs. While the server code itself does not show immediate vulnerabilities like 'eval' or hardcoded secrets, the security heavily relies on the Modal backend's ability to safely handle arbitrary URLs and prevent SSRF or other attacks during media processing. Configuration is managed via environment variables, which is a good practice.
Similar Servers
gpt-researcher
The GPT Researcher MCP Server enables AI assistants to conduct comprehensive web research and generate detailed, factual, and unbiased reports. It supports multi-agent workflows, local document analysis, and integration with external tools via the Machine Conversation Protocol (MCP) for various research tasks.
voicemode
Provides robust voice interaction capabilities for Model Context Protocol (MCP) agents, enabling real-time speech-to-text (STT) and text-to-speech (TTS) functionalities, with support for local and cloud-based services. It also includes tools for audio playback (DJ), service management, and diagnostics.
luma-mcp
Provides multi-model vision understanding capabilities to AI assistants that lack native image understanding.
jimeng-mcp-server
An MCP server to provide Claude and other LLMs with image and video generation capabilities via the JiMeng AI service's reverse-engineered API.