converse
Verified Safe
by fwdslsh
Overview
A lightweight Model Context Protocol (MCP) server that provides voice capabilities (TTS/STT) using remote OpenAI-compatible APIs, enabling AI assistants to speak and listen through system audio devices.
Installation
converse
Environment Variables
- STT_API_URL
- TTS_API_URL
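As a rough illustration of how these two variables might be consumed, the sketch below reads them from an environment map and fails early if either is missing or malformed. The `VoiceConfig` type and the `loadVoiceConfig` helper are hypothetical names introduced here; they are not taken from the converse source, and the listing does not say whether each URL is a base URL or a full endpoint.

```typescript
// Hypothetical shape for the parsed configuration (an assumption, not converse's own type).
interface VoiceConfig {
  sttApiUrl: URL;
  ttsApiUrl: URL;
}

// Hypothetical helper: reads STT_API_URL and TTS_API_URL from an env-like map.
function loadVoiceConfig(env: Record<string, string | undefined>): VoiceConfig {
  const read = (name: string): URL => {
    const raw = env[name];
    if (!raw) {
      throw new Error(`${name} is not set`);
    }
    // The URL constructor throws a TypeError on malformed input,
    // so a bad value is reported at startup rather than on first use.
    return new URL(raw);
  };
  return {
    sttApiUrl: read("STT_API_URL"),
    ttsApiUrl: read("TTS_API_URL"),
  };
}
```

In a Node process this would typically be called as `loadVoiceConfig(process.env)`.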
Security Notes
The server uses `child_process.spawn` to run system audio commands (`aplay`, `arecord`, etc.). `spawn` is generally safer than `exec` because it does not invoke a shell by default, but the command itself comes from configuration: if the `PLAY_CMD` or `RECORD_CMD` environment variables are set to malicious values, the server will execute whatever program they name. This is a configuration risk rather than a direct code vulnerability. No other critical security risks, such as use of `eval` or hardcoded secrets, were identified in the provided source code. API keys are supplied through environment variables.
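One way an operator or maintainer could narrow this configuration risk is to check the configured command against an allowlist before spawning it. The sketch below is an assumption about a possible mitigation, not a description of how converse behaves: `ALLOWED_PLAYERS`, `resolvePlayer`, and `playFile` are hypothetical names, and the set of permitted players is illustrative.

```typescript
import { spawn, ChildProcess } from "node:child_process";

// Hypothetical allowlist of playback binaries; the real supported set is an assumption.
const ALLOWED_PLAYERS = new Set(["aplay", "paplay", "afplay"]);

// Returns the configured player if it is on the allowlist, otherwise throws.
function resolvePlayer(configured: string | undefined): string {
  const cmd = (configured ?? "aplay").trim();
  if (!ALLOWED_PLAYERS.has(cmd)) {
    throw new Error(`Refusing to run unrecognized PLAY_CMD: ${cmd}`);
  }
  return cmd;
}

// Spawns the player with the file path as a separate argument.
// With shell: false (the default), the path is passed to the program
// directly and is never interpreted by a shell.
function playFile(path: string, playCmd?: string): ChildProcess {
  return spawn(resolvePlayer(playCmd), [path], { stdio: "ignore", shell: false });
}
```

A caller would pass the environment value through, for example `playFile("/tmp/reply.wav", process.env.PLAY_CMD)`. The same pattern could be applied to `RECORD_CMD`.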
Similar Servers
elevenlabs-mcp
An official ElevenLabs Model Context Protocol (MCP) server that enables AI clients (e.g., Claude Desktop, Cursor) to interact with ElevenLabs Text-to-Speech, voice cloning, transcription, and audio processing APIs.
voicemode
Provides voice interaction capabilities for Model Context Protocol (MCP) agents, enabling real-time speech-to-text (STT) and text-to-speech (TTS) with support for local and cloud-based services. It also includes tools for audio playback (DJ), service management, and diagnostics.
deepl-mcp-server
Provides DeepL translation and linguistic capabilities as a Model Context Protocol (MCP) server for integration with AI clients.
mcp-tts
Provides Text-to-Speech (TTS) capabilities to MCP (Model Context Protocol) clients using various AI and system-level TTS engines.