mcp-video-extraction-plus
by takereshui
Overview
The server extracts text from various video and audio sources using multiple Automatic Speech Recognition (ASR) providers, including local Whisper and online services from JianYing (CapCut) and Bilibili.
Installation
uvx mcp-video-extractionEnvironment Variables
- ASR_PROVIDER
- ASR_USE_CACHE
- ASR_WORD_TIME_STAMP
- JIANYING_START_TIME
- JIANYING_END_TIME
- WHISPER_MODEL
- WHISPER_LANGUAGE
- AUDIO_FORMAT
- AUDIO_QUALITY
- TEMP_DIR
- DOWNLOAD_RETRIES
- FRAGMENT_RETRIES
- SOCKET_TIMEOUT
Security Notes
The server disables SSL certificate verification for yt-dlp downloads (`nocheckcertificate: True`), which poses a significant Man-in-the-Middle (MiTM) risk. The JianYing ASR implementation relies on an external, unofficial third-party endpoint (`https://asrtools-update.bkfeng.top/sign`) for API signing, introducing a potential supply chain vulnerability. Using online ASR services inherently involves uploading audio data to third-party servers (ByteDance, Bilibili), which has privacy implications. No clear malicious patterns or hardcoded sensitive credentials were found, but the reliance on unverified external services and disabled SSL are critical concerns.
Similar Servers
firecrawl-mcp-server
Provides web scraping, crawling, search, and structured data extraction capabilities to AI models via the Model Context Protocol.
yt-dlp-mcp
Provides AI agents with tools to interact with video platforms for content retrieval, metadata extraction, searching, and transcription.
ffmpeg-mcp
An MCP server designed to work with FFmpeg, providing capabilities for media processing tasks such as speeding up videos and extracting audio.
imgenx
AI image and video generation and processing, functioning as a command-line tool or MCP server.