spring-mcps
by HarryReidx
Overview
Ingests documents (PDF, Markdown) into Dify knowledge bases and semantically enhances them along the way: a VLM (vision-language model) analyzes images and an LLM summarizes text. Task status can be monitored through a web frontend.
Installation
mvn clean package -DskipTests && java -jar target/dify-ingest-0.0.1-SNAPSHOT.jar
Environment Variables
- DIFY_API_KEY
- DIFY_BASE_URL
- MINERU_BASE_URL
- MINERU_SERVER_TYPE
- MINERU_TOKEN
- S3_ENDPOINT
- S3_ACCESS_KEY
- S3_SECRET_KEY
- S3_BUCKET_NAME
- S3_REGION
- MINIO_UPLOAD_PATH
- MINIO_IMG_PREFIX
- VLM_PROVIDER
- VLM_BASE_URL
- VLM_API_KEY
- VLM_MODEL
- LLM_ENABLED
- LLM_PROVIDER
- LLM_BASE_URL
- LLM_API_KEY
- LLM_MODEL
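Because unset variables fall back to whatever defaults ship in `application.yml`, it can help to check the environment before launching the jar. The following is a minimal sketch, not part of the project: the class name `EnvCheck` is hypothetical, and it assumes that the `LLM_*` settings only matter when `LLM_ENABLED` is `true`, which the listing above does not confirm.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical pre-flight helper; not part of spring-mcps.
public class EnvCheck {
    // Variable names copied from the list above.
    static final List<String> CORE = List.of(
        "DIFY_API_KEY", "DIFY_BASE_URL",
        "MINERU_BASE_URL", "MINERU_SERVER_TYPE", "MINERU_TOKEN",
        "S3_ENDPOINT", "S3_ACCESS_KEY", "S3_SECRET_KEY",
        "S3_BUCKET_NAME", "S3_REGION",
        "MINIO_UPLOAD_PATH", "MINIO_IMG_PREFIX",
        "VLM_PROVIDER", "VLM_BASE_URL", "VLM_API_KEY", "VLM_MODEL");

    static final List<String> LLM = List.of(
        "LLM_PROVIDER", "LLM_BASE_URL", "LLM_API_KEY", "LLM_MODEL");

    // Returns the names that are absent or blank in the given environment.
    static List<String> missing(Map<String, String> env) {
        List<String> names = new ArrayList<>(CORE);
        // Assumption: LLM_* values are only needed when LLM_ENABLED is "true".
        if ("true".equalsIgnoreCase(env.get("LLM_ENABLED"))) {
            names.addAll(LLM);
        }
        List<String> result = new ArrayList<>();
        for (String name : names) {
            String value = env.get(name);
            if (value == null || value.isBlank()) {
                result.add(name);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> unset = missing(System.getenv());
        if (!unset.isEmpty()) {
            System.err.println("Unset variables: " + String.join(", ", unset));
            System.exit(1);
        }
        System.out.println("All expected variables are set.");
    }
}
```

Running `java EnvCheck.java` in the same shell that will start the server prints the names that still need a value and exits with a non-zero status if any are unset.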
Security Notes
The `application.yml` file contains hardcoded default values for critical credentials: the Dify API key, the PostgreSQL username and password, and the MinIO access key and secret key. Environment variables can override these values, but the defaults are publicly visible in the repository. Any production deployment that does not explicitly override them is open to unauthorized access to Dify, MinIO, and the database. The default URLs for the external services (MinerU, VLM, LLM) are also hardcoded to a specific IP address (`117.50.75.212`), which could expose internal network structure or cause unintended connections to that host if not overridden. On the positive side, the project includes a `RequestCleanupFilter` for cleaning request bodies, which is good practice.
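One way to guard against the hardcoded service address is to check, before startup, whether any service URL is unset or still points at `117.50.75.212`. This is a sketch under stated assumptions: the class `DefaultHostCheck` is hypothetical, and it assumes that `MINERU_BASE_URL`, `VLM_BASE_URL`, and `LLM_BASE_URL` are the variables that override the hardcoded URLs.

```java
import java.net.URI;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical startup guard; not part of spring-mcps.
public class DefaultHostCheck {
    // Default IP address reported in the security notes.
    static final String DEFAULT_HOST = "117.50.75.212";

    // Assumption: these variables override the hardcoded service URLs.
    static final List<String> URL_VARS =
        List.of("MINERU_BASE_URL", "VLM_BASE_URL", "LLM_BASE_URL");

    // True when the URL is unset (so the default applies) or names the default host.
    static boolean pointsAtDefaultHost(String url) {
        if (url == null || url.isBlank()) {
            return true;
        }
        try {
            return DEFAULT_HOST.equals(URI.create(url.trim()).getHost());
        } catch (IllegalArgumentException e) {
            // Malformed URL: it is not the default host, so it is not flagged here.
            return false;
        }
    }

    // Returns the variable names whose effective URL would be the default host.
    static List<String> flagged(Map<String, String> env) {
        List<String> result = new ArrayList<>();
        for (String name : URL_VARS) {
            if (pointsAtDefaultHost(env.get(name))) {
                result.add(name);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> names = flagged(System.getenv());
        if (!names.isEmpty()) {
            System.err.println("Still using the default host: " + String.join(", ", names));
            System.exit(1);
        }
    }
}
```

The check compares only the host part of each URL, so a value such as `http://117.50.75.212:8000/v1` is flagged regardless of port or path.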
Similar Servers
kreuzberg
Extracts text, tables, images, and metadata from 56 file formats including PDF, Office documents, and images. Supports multiple OCR backends, extensible plugins, and is designed for data preprocessing in AI/ML workflows.
DevDocs
DevDocs is a web crawling and content extraction platform designed to accelerate software development by converting documentation into LLM-ready formats for intelligent data querying and fine-tuning.
springboot-mcp-server
An AI-powered Spring Boot server for student management, demonstrating tool-use capabilities with an external large language model.
PairOfCleats
Codebase analysis and intelligence server, providing indexing, search, risk analysis, and vulnerability triage capabilities.