audio-transcriber

by Knuckles-Team

Overview

Provides an MCP micro-service and an A2A agent for audio transcription and processing using OpenAI Whisper models, capable of transcribing from files or live microphone input.

Installation

Run Command
docker-compose up -d

Environment Variables

  • HOST
  • PORT
  • TRANSPORT
  • OPENAI_API_KEY
  • MCP_URL
  • PROVIDER
  • OPENAI_BASE_URL
  • MODEL_ID
  • DEBUG
  • ENABLE_WEB_UI
  • WHISPER_MODEL
  • TRANSCRIBE_DIRECTORY

Security Notes

The `audio_transcriber/utils.py` module uses `pickle.load`. Unpickling is a known deserialization risk: if `load_model` is called with a file from an untrusted source, it can lead to arbitrary code execution.

The `transcribe_audio` MCP tool accepts `audio_file` and `directory` as parameters. Unless these inputs are sanitized and validated, they could expose the server to path traversal or arbitrary file processing, though `whisper.load_model` and `Path.exists()` offer some protection.

The `compose.yml` file hardcodes `OPENAI_API_KEY=llama` for both the MCP and Agent services. The value is likely a placeholder intended for local Ollama-compatible setups, but it is still a hardcoded secret in the configuration.
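To illustrate the path traversal concern for `audio_file` and `directory`, here is a minimal sketch of server-side validation. It is not taken from the project: the function name `resolve_audio_path` and the `ALLOWED_SUFFIXES` set are hypothetical, and the approach (resolve the path, then require it to stay inside an allowed base directory) is just one common way to constrain file parameters.

```python
from pathlib import Path

# Assumption: the set of audio extensions the server is willing to process.
ALLOWED_SUFFIXES = {".wav", ".mp3", ".m4a", ".flac", ".ogg"}


def resolve_audio_path(directory: str, audio_file: str) -> Path:
    """Hypothetical helper: return audio_file as a resolved path inside directory.

    Raises ValueError if the path escapes the directory or has an
    unexpected extension, and FileNotFoundError if it is not a regular file.
    """
    base = Path(directory).resolve()
    # resolve() collapses ".." segments and follows symlinks; joining with an
    # absolute audio_file discards base, which the check below then rejects.
    candidate = (base / audio_file).resolve()

    try:
        candidate.relative_to(base)
    except ValueError:
        raise ValueError(f"{audio_file!r} is outside the allowed directory") from None

    if candidate.suffix.lower() not in ALLOWED_SUFFIXES:
        raise ValueError(f"unsupported file type: {candidate.suffix!r}")

    if not candidate.is_file():
        raise FileNotFoundError(str(candidate))

    return candidate
```

With a base directory such as the one configured through `TRANSCRIBE_DIRECTORY`, a request for `clip.wav` would pass, while `../clip.wav` or an absolute path like `/etc/passwd` would be rejected before any file is opened.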

Stats

Interest Score: 32
Security Score: 5
Cost Class: Medium
Avg Tokens: 1000
Stars: 2
Forks: 0
Last Update: 2026-01-19

Tags

audio-transcription, whisper-ai, mcp-server, ai-agent, speech-to-text