mineru-tianshu

Verified Safe

by magicyuan876

Overview

Enterprise-grade AI data preprocessing platform that converts unstructured multi-modal data (documents, images, audio, video, and bioinformatics formats) into structured Markdown and JSON. It uses GPU acceleration and a task management system, and provides user authentication and MCP protocol integration.

Installation

Run Command
make setup

Environment Variables

  • JWT_SECRET_KEY
  • RUSTFS_ACCESS_KEY
  • RUSTFS_SECRET_KEY
  • RUSTFS_PUBLIC_URL
  • REDIS_QUEUE_ENABLED
  • REDIS_HOST
  • REDIS_PORT
  • REDIS_DB
  • REDIS_PASSWORD
  • MODEL_DOWNLOAD_SOURCE
  • HF_ENDPOINT
  • HF_TOKEN
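
Before running the setup command, it can help to confirm that these variables are actually present. The following is a minimal sketch, not part of the project: the variable names come from the list above, but treating every one of them as required is an assumption, and the helper name `missing_vars` is hypothetical.

```python
import os

# Names copied from the Environment Variables list above.
# Assumption: every variable is treated as required; the project may
# accept empty or absent values for some of them (e.g. REDIS_PASSWORD).
EXPECTED_VARS = [
    "JWT_SECRET_KEY",
    "RUSTFS_ACCESS_KEY",
    "RUSTFS_SECRET_KEY",
    "RUSTFS_PUBLIC_URL",
    "REDIS_QUEUE_ENABLED",
    "REDIS_HOST",
    "REDIS_PORT",
    "REDIS_DB",
    "REDIS_PASSWORD",
    "MODEL_DOWNLOAD_SOURCE",
    "HF_ENDPOINT",
    "HF_TOKEN",
]


def missing_vars(env=None):
    """Return the expected variable names that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in EXPECTED_VARS if not env.get(name)]


if __name__ == "__main__":
    for name in missing_vars():
        print(f"not set: {name}")
```

Passing a plain dictionary instead of `os.environ` makes the check easy to exercise without touching the real environment.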

Security Notes

The project implements JWT and API key authentication with role-based access control. `JWT_SECRET_KEY` is parameterized for production rather than hardcoded. However, `RUSTFS_ACCESS_KEY` and `RUSTFS_SECRET_KEY` both default to the insecure value `rustfsadmin` in `docker-compose.yml` and must be explicitly overridden in the `.env` file for production deployments. Files uploaded through the MCP server (as Base64 or by URL) are saved to temporary files before being submitted to the internal API, which is a generally safe practice. Command execution through `subprocess.run` (e.g., `ffmpeg`) appears to use fixed commands with internal file paths, which mitigates command injection risks. Restrict `ALLOWED_ORIGINS` in production.
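
The points above can be sketched as a small pre-deployment check plus an example of the argument-list style of `subprocess.run` call that avoids shell interpolation. This is an illustration under stated assumptions, not the project's code: the helper names are hypothetical, the format of `ALLOWED_ORIGINS` (comma-separated, with `*` meaning any origin) is assumed, and the `ffmpeg` flags are illustrative since the listing does not show the actual invocation.

```python
import os

# Default value reported for docker-compose.yml in the notes above.
INSECURE_DEFAULT = "rustfsadmin"
RUSTFS_KEYS = ("RUSTFS_ACCESS_KEY", "RUSTFS_SECRET_KEY")


def production_warnings(env=None):
    """Return human-readable warnings for settings that look unsafe."""
    env = os.environ if env is None else env
    warnings = []
    if not env.get("JWT_SECRET_KEY"):
        warnings.append("JWT_SECRET_KEY is not set")
    for name in RUSTFS_KEYS:
        # Assumption: an unset key falls back to the compose default,
        # so it is flagged the same way as an explicit default.
        if env.get(name, INSECURE_DEFAULT) == INSECURE_DEFAULT:
            warnings.append(f"{name} still uses the default value")
    # Assumption: comma-separated origins, "*" allows every origin.
    raw = env.get("ALLOWED_ORIGINS", "")
    origins = [o.strip() for o in raw.split(",") if o.strip()]
    if not origins or "*" in origins:
        warnings.append("ALLOWED_ORIGINS is not restricted")
    return warnings


def ffmpeg_args(src, dst):
    """Build an argument list suitable for subprocess.run(args).

    Each path is a separate list element and no shell is involved, so
    shell metacharacters in a file name are never interpreted.
    Illustrative flags: overwrite output, drop video, mono, 16 kHz.
    """
    return ["ffmpeg", "-y", "-i", str(src), "-vn",
            "-ac", "1", "-ar", "16000", str(dst)]


if __name__ == "__main__":
    for message in production_warnings():
        print("warning:", message)
```

A caller would pass the list from `ffmpeg_args` directly to `subprocess.run(..., check=True)` without `shell=True`.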

Stats

  • Interest Score: 64
  • Security Score: 8
  • Cost Class: High
  • Avg Tokens: 100000
  • Stars: 367
  • Forks: 48
  • Last Update: 2026-01-19

Tags

AI preprocessing, multi-modal parsing, document understanding, GPU acceleration, task management, API, Docker, FastAPI, Vue.js, MCP protocol, OCR, speech-to-text, watermark removal, object storage