mcpbr

Verified Safe

by greynewell

Overview

A benchmark runner for evaluating Model Context Protocol (MCP) servers by comparing LLM agent performance with and without MCP tools on software engineering tasks.

Installation

Run Command
mcpbr run -c mcpbr.yaml

Environment Variables

  • ANTHROPIC_API_KEY
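
A minimal pre-flight sketch for the run command above, assuming only what this page states: the command is `mcpbr run -c mcpbr.yaml` and `ANTHROPIC_API_KEY` must be set. The helper names (`missing_env`, `build_command`, `run_benchmark`) are hypothetical and not part of mcpbr.

```python
import os
import shutil
import subprocess

# Variables this page lists as required.
REQUIRED_ENV = ("ANTHROPIC_API_KEY",)


def missing_env(env):
    """Return the required variable names that are unset or empty in env."""
    return [name for name in REQUIRED_ENV if not env.get(name)]


def build_command(config_path="mcpbr.yaml"):
    """Return the argv for the run command shown on this page."""
    return ["mcpbr", "run", "-c", config_path]


def run_benchmark(config_path="mcpbr.yaml"):
    """Check prerequisites, then invoke mcpbr and return its exit code."""
    missing = missing_env(os.environ)
    if missing:
        raise RuntimeError("missing environment variables: " + ", ".join(missing))
    if shutil.which("mcpbr") is None:
        raise RuntimeError("mcpbr executable not found on PATH")
    return subprocess.run(build_command(config_path)).returncode
```

Calling `run_benchmark()` from a script fails fast with a readable error before any containers are started, rather than partway through a run.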

Security Notes

The server's core functionality involves executing user-defined MCP server commands and agent-generated code (patches, PoC exploits) within Docker containers. While Docker provides a layer of isolation, malicious configurations for the 'mcp_server' command or arguments could lead to undesirable actions within the container. The Claude Code CLI is run with '--dangerously-skip-permissions', though it is executed by a non-root user ('mcpbr') inside the container. The project's 'SECURITY.md' acknowledges these risks and advises users to only use trusted MCP servers and be aware of network access within containers.
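
The advice to use only trusted MCP servers could be enforced with a small allowlist check before a configuration is run. The sketch below is illustrative only: the allowlist contents and the `is_trusted_command` helper are assumptions, and it does not read or depend on mcpbr's actual configuration schema.

```python
# Hypothetical allowlist; replace with the launchers you actually trust.
TRUSTED_COMMANDS = frozenset({"npx", "uvx", "python"})


def is_trusted_command(command, trusted=TRUSTED_COMMANDS):
    """Accept only bare executable names that appear in the allowlist.

    Anything containing a path separator is rejected, so a look-alike
    binary in an arbitrary directory cannot pass as a trusted name.
    """
    if "/" in command or "\\" in command:
        return False
    return command in trusted
```

A check like this covers only the executable name; the arguments passed to it still need review, since a trusted launcher can fetch and run untrusted packages.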

Stats

Interest Score: 87
Security Score: 7
Cost Class: High
Avg Tokens: 7000
Stars: 14
Forks: 5
Last Update: 2026-01-18

Tags

  • MCP
  • benchmark
  • LLM agents
  • SWE-bench
  • CyberGym