mcp-video-extraction-plus

by takereshui

Overview

The server extracts text from various video and audio sources using multiple Automatic Speech Recognition (ASR) providers, including local Whisper and online services from JianYing (CapCut) and Bilibili.

Installation

Run Command
uvx mcp-video-extraction

Environment Variables

  • ASR_PROVIDER
  • ASR_USE_CACHE
  • ASR_WORD_TIME_STAMP
  • JIANYING_START_TIME
  • JIANYING_END_TIME
  • WHISPER_MODEL
  • WHISPER_LANGUAGE
  • AUDIO_FORMAT
  • AUDIO_QUALITY
  • TEMP_DIR
  • DOWNLOAD_RETRIES
  • FRAGMENT_RETRIES
  • SOCKET_TIMEOUT
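
The variables above are listed without documented values, so the following is only a minimal sketch of how a launcher script might check which of them are set before starting the server. The helper names `collect_server_env` and `missing_server_env` are hypothetical and are not part of the server; only the variable names come from the list.

```python
import os

# Variable names copied from the list above; accepted values are not documented here.
SERVER_ENV_VARS = (
    "ASR_PROVIDER",
    "ASR_USE_CACHE",
    "ASR_WORD_TIME_STAMP",
    "JIANYING_START_TIME",
    "JIANYING_END_TIME",
    "WHISPER_MODEL",
    "WHISPER_LANGUAGE",
    "AUDIO_FORMAT",
    "AUDIO_QUALITY",
    "TEMP_DIR",
    "DOWNLOAD_RETRIES",
    "FRAGMENT_RETRIES",
    "SOCKET_TIMEOUT",
)


def collect_server_env(environ=None):
    """Return the listed variables that are set to a non-empty value."""
    source = os.environ if environ is None else environ
    return {name: source[name] for name in SERVER_ENV_VARS if source.get(name)}


def missing_server_env(environ=None):
    """Return the listed variables that are unset or empty, in list order."""
    source = os.environ if environ is None else environ
    return [name for name in SERVER_ENV_VARS if not source.get(name)]
```

A launcher could print `missing_server_env()` before running the command above to show which settings will fall back to whatever defaults the server applies.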

Security Notes

The server disables SSL certificate verification for yt-dlp downloads (`nocheckcertificate: True`), which exposes those downloads to man-in-the-middle (MITM) attacks. The JianYing ASR implementation relies on an unofficial third-party endpoint (`https://asrtools-update.bkfeng.top/sign`) for API request signing, which introduces a supply chain risk. The online ASR services upload audio data to third-party servers (ByteDance, Bilibili), which has privacy implications. No clear malicious patterns or hardcoded credentials were found, but the disabled certificate verification and the reliance on an unverified external service are critical concerns.
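
As an illustration of the certificate concern, here is a minimal sketch of a yt-dlp options dictionary that leaves verification enabled. The function name `build_download_options`, the mapping from the environment variables to yt-dlp options, and the fallback values (3, 3, 30) are assumptions made for this example; the server's actual option handling may differ. The option keys themselves (`nocheckcertificate`, `retries`, `fragment_retries`, `socket_timeout`) are standard yt-dlp parameters.

```python
import os


def build_download_options(environ=None):
    """Build a yt-dlp options dict that keeps TLS certificate checks on."""
    source = os.environ if environ is None else environ
    return {
        # False keeps certificate verification enabled (this is yt-dlp's default).
        "nocheckcertificate": False,
        # Fallback values below are hypothetical, not the server's defaults.
        "retries": int(source.get("DOWNLOAD_RETRIES", "3")),
        "fragment_retries": int(source.get("FRAGMENT_RETRIES", "3")),
        "socket_timeout": float(source.get("SOCKET_TIMEOUT", "30")),
    }
```

With yt-dlp installed, a dictionary like this could be passed to `yt_dlp.YoutubeDL(...)`; a download from a host with an invalid certificate would then fail instead of proceeding silently.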

Stats

Interest Score: 48
Security Score: 4
Cost Class: High
Stars: 5
Forks: 1
Last Update: 2025-11-22

Tags

Video Processing, ASR, Speech Recognition, Media, Download