vision-mcp-server
Verified Safe by hiroki-yokoyama
Overview
A Model Context Protocol (MCP) server for local, CPU-based vision-language model inference using GGUF models via llama-cpp-python. It is designed to run as a Windows resident process and analyze images.
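As an illustration of the inference pattern described above, here is a minimal, hedged sketch of how an image and prompt are typically paired for a llava-style chat handler in llama-cpp-python. The helper names are hypothetical and the model paths in the comments are placeholders, not files shipped by this project:

```python
import base64


def image_to_data_uri(image_bytes: bytes, mime: str = "image/png") -> str:
    """Encode raw image bytes as a data URI, the form llava-style
    chat handlers in llama-cpp-python accept for image_url content."""
    b64 = base64.b64encode(image_bytes).decode("ascii")
    return f"data:{mime};base64,{b64}"


def build_vision_messages(image_bytes: bytes, prompt: str) -> list:
    """Build a chat-completion payload pairing an image with a text prompt."""
    return [
        {
            "role": "user",
            "content": [
                {"type": "image_url",
                 "image_url": {"url": image_to_data_uri(image_bytes)}},
                {"type": "text", "text": prompt},
            ],
        }
    ]


# Loading the model itself would look roughly like this
# (paths are placeholders):
#
# from llama_cpp import Llama
# from llama_cpp.llama_chat_format import Llava15ChatHandler
#
# llm = Llama(
#     model_path="model.gguf",
#     chat_handler=Llava15ChatHandler(clip_model_path="mmproj.gguf"),
#     n_ctx=4096,
# )
# result = llm.create_chat_completion(
#     messages=build_vision_messages(image_bytes, "Describe this image."))
```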
Installation
scripts\run_server.ps1
Environment Variables
- HF_ENDPOINT: overrides the Hugging Face Hub endpoint used when downloading models (e.g. a mirror)
- HF_TOKEN: Hugging Face access token, needed for gated or private models
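A minimal sketch of how a server like this might read these variables (the function name is hypothetical; the default endpoint is the public Hugging Face Hub):

```python
import os


def hf_config() -> dict:
    """Read Hugging Face settings from the environment.

    HF_ENDPOINT overrides the Hub endpoint (useful for mirrors);
    HF_TOKEN authenticates downloads of gated or private models.
    Both are optional: the endpoint falls back to the public Hub,
    and the token is None if unset."""
    return {
        "endpoint": os.environ.get("HF_ENDPOINT", "https://huggingface.co"),
        "token": os.environ.get("HF_TOKEN"),  # None if unset
    }
```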
Security Notes
The server operates primarily on local file paths for images and models, relying on PIL for image processing and llama-cpp-python for LLM inference. No direct 'eval' or execution of arbitrary code from user input was found, and the dynamic loading of chat handlers goes through a controlled dictionary, which mitigates injection risks. The main risks are loading untrusted GGUF models and processing large or malformed image files; both are inherent to the use case.
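The "controlled dictionary" pattern mentioned above can be sketched as follows. This is a hypothetical illustration, not this project's code: handler names are resolved against a fixed allowlist rather than imported or evaluated from user-supplied strings (the handler class names shown do exist in llama_cpp.llama_chat_format, but are represented here as plain strings to keep the sketch self-contained):

```python
# Fixed allowlist mapping model families to chat-handler names.
# Nothing outside this dictionary can ever be resolved.
CHAT_HANDLERS = {
    "llava-1.5": "Llava15ChatHandler",
    "moondream": "MoondreamChatHandler",
}


def resolve_chat_handler(name: str) -> str:
    """Return the handler for a known model family.

    Unknown names are rejected outright instead of being passed to
    an import mechanism or eval, which is what keeps dynamic handler
    selection safe."""
    try:
        return CHAT_HANDLERS[name]
    except KeyError:
        raise ValueError(f"unsupported chat handler: {name!r}") from None
```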
Similar Servers
osaurus
Osaurus is an AI edge runtime for macOS, enabling users to run local and cloud AI models, orchestrate tools via the Model Context Protocol (MCP), and power AI applications and workflows on Apple Silicon.
luma-mcp
Provides multi-model vision understanding capabilities to AI assistants that lack native image support.
ollama-mcp-server
Provides a self-contained Model Context Protocol (MCP) server for local Ollama management, enabling features like listing models, chatting, server control, and intelligent model recommendations.
polybrain-mcp
Connects AI agents to multiple LLM models, providing conversation history management and model switching capabilities.