Back to Home
neosun100 icon

glm-asr

by neosun100

Overview

Provides an all-in-one speech recognition service with Web UI, REST API, and MCP integration.

Installation

Run Command
docker run -d --gpus all -p 7860:7860 neosun/glm-asr:latest

Environment Variables

  • PORT
  • MODEL_CHECKPOINT
  • NVIDIA_VISIBLE_DEVICES
  • HF_HOME

Security Notes

The model loading (`AutoModelForCausalLM.from_pretrained`) and VAD model loading (`torch.hub.load`) use `trust_remote_code=True` and `trust_repo=True` respectively. This allows arbitrary code execution from the specified HuggingFace model or GitHub repository, posing a significant security risk if the external source is compromised or malicious. While common in the ML ecosystem for flexibility, it requires explicit trust in the model/repository maintainers. The MCP server's `transcribe` tool accepts `audio_path` directly, which could lead to path traversal if the MCP client is untrusted or improperly configured, although the web API handles file uploads to a temporary directory.

Similar Servers

Stats

Interest Score55
Security Score4
Cost ClassLow
Avg Tokens128
Stars1
Forks0
Last Update2025-12-14

Tags

Speech RecognitionASRGPUDockerREST APIWeb UIMCP