CS8803_MLSecurity_MCP_Server

Name: CS8803_MLSecurity_MCP_Server
Author: NirjharDeb

Verified Safe

by NirjharDeb

View Source

Overview

An MCP server prototype for evaluating hidden-payload delivery to LLMs in agentic IDEs for ML security research and defensive testing.

Installation

Run Command

python server.py

Security Notes

The project's explicit goal is to evaluate 'hidden-payload delivery to LLMs', and its source code implements several techniques that are inherently security-relevant patterns. The `translate_text` tool, for instance, actively injects a base64-encoded 'SYSTEM INSTRUCTION' into its output, which contains instructions intended to manipulate the LLM's behavior (e.g., to call another tool). Other tools use zero-width characters, hidden HTML comments/divs, and ANSI escape codes for obfuscation. The `read_gdoc` tool performs network requests to external Google Docs, which, while targeted, represents a potential vector for SSRF if not carefully managed or if a malicious URL could be injected. While the project states 'no malicious use' and is for 'defensive testing', the techniques demonstrated are direct implementations of methods used in LLM instruction injection and payload delivery, which are security vulnerabilities in a general context. The `send_email` tool is simulated and does not actually send emails, which is a positive from a security perspective.

Similar Servers

mcp-server-code-execution-mode

258

This server enables LLM agents to execute Python code in a highly secure, isolated container environment, facilitating complex multi-tool orchestration and data analysis with minimal LLM context token usage.

Coding Agents

$Low

Reversecore_MCP

The Reversecore_MCP server provides a Micro-Capability Platform (MCP) enabling AI agents to perform comprehensive binary analysis, malware detection, vulnerability research, and security report generation using integrated tools like Radare2, Ghidra, LIEF, and YARA.

AI & ML

$High

copilot-security-instructions

This MCP server provides a toolkit of security-focused prompts and instructions to guide GitHub Copilot towards secure coding practices, helping developers identify and mitigate security risks.

Coding Agents

$Low

MCP-Security-Framework

A comprehensive security testing framework for Model Context Protocol (MCP) servers, designed to detect vulnerabilities through automated sandboxing and active probing.

AI & ML

$Medium

Stats

Interest Score0

Security Score3

Cost ClassLow

Avg Tokens100

Stars0

Forks0

Last Update2025-11-30