CS8803_MLSecurity_MCP_Server
Verified Safeby NirjharDeb
Overview
An MCP server prototype for evaluating hidden-payload delivery to LLMs in agentic IDEs for ML security research and defensive testing.
Installation
python server.pySecurity Notes
The project's explicit goal is to evaluate 'hidden-payload delivery to LLMs', and its source code implements several techniques that are inherently security-relevant patterns. The `translate_text` tool, for instance, actively injects a base64-encoded 'SYSTEM INSTRUCTION' into its output, which contains instructions intended to manipulate the LLM's behavior (e.g., to call another tool). Other tools use zero-width characters, hidden HTML comments/divs, and ANSI escape codes for obfuscation. The `read_gdoc` tool performs network requests to external Google Docs, which, while targeted, represents a potential vector for SSRF if not carefully managed or if a malicious URL could be injected. While the project states 'no malicious use' and is for 'defensive testing', the techniques demonstrated are direct implementations of methods used in LLM instruction injection and payload delivery, which are security vulnerabilities in a general context. The `send_email` tool is simulated and does not actually send emails, which is a positive from a security perspective.
Similar Servers
mcp-server-code-execution-mode
This server enables LLM agents to execute Python code in a highly secure, isolated container environment, facilitating complex multi-tool orchestration and data analysis with minimal LLM context token usage.
Reversecore_MCP
The Reversecore_MCP server provides a Micro-Capability Platform (MCP) enabling AI agents to perform comprehensive binary analysis, malware detection, vulnerability research, and security report generation using integrated tools like Radare2, Ghidra, LIEF, and YARA.
copilot-security-instructions
This MCP server provides a toolkit of security-focused prompts and instructions to guide GitHub Copilot towards secure coding practices, helping developers identify and mitigate security risks.
MCP-Security-Framework
A comprehensive security testing framework for Model Context Protocol (MCP) servers, designed to detect vulnerabilities through automated sandboxing and active probing.