Back to Home
mbailey icon

voicemode

Verified Safe

by mbailey

Overview

Provides robust voice interaction capabilities for Model Context Protocol (MCP) agents, enabling real-time speech-to-text (STT) and text-to-speech (TTS) functionalities, with support for local and cloud-based services. It also includes tools for audio playback (DJ), service management, and diagnostics.

Installation

Run Command
voice-mode serve

Environment Variables

  • OPENAI_API_KEY
  • VOICEMODE_BASE_DIR
  • VOICEMODE_DEBUG
  • VOICEMODE_TTS_BASE_URLS
  • VOICEMODE_STT_BASE_URLS
  • VOICEMODE_VOICES
  • VOICEMODE_TTS_MODELS
  • VOICEMODE_WHISPER_MODEL
  • VOICEMODE_WHISPER_PORT
  • VOICEMODE_KOKORO_PORT
  • VOICEMODE_PRONOUNCE
  • VOICEMODE_SERVICE_AUTO_ENABLE

Security Notes

Extensive use of `subprocess.run`/`Popen` for system integration (package installation, Git cloning, running services) poses a risk for command injection if user-provided input is not rigorously sanitized, though `Path` objects and `shlex.split` offer some protection. The `serve` command exposes the MCP server via HTTP/SSE, requiring explicit configuration of IP allowlisting, secret path, or token authentication to prevent unauthorized access. Trust in external repositories (whisper.cpp, kokoro-fastapi) and their integrity is assumed for installation.

Similar Servers

Stats

Interest Score98
Security Score6
Cost ClassMedium
Avg Tokens250
Stars609
Forks79
Last Update2026-01-19

Tags

Voice AISpeech-to-TextText-to-SpeechConversational AIMCP ServerAudio ProcessingLocal ServicesCloud Integration