nano-banana-mcp
Verified Safeby Riti0208
Overview
This server provides a Model Context Protocol (MCP) interface for advanced image generation, editing, analysis, and batch processing using Google's Gemini 2.5 Flash Image Preview and Gemini 3 Pro Image Preview models.
Installation
node dist/index.jsEnvironment Variables
- GEMINI_API_KEY
Security Notes
The server uses `@modelcontextprotocol/sdk` with a `StdioServerTransport`, which primarily communicates over standard I/O, limiting direct external network exposure. The Google Gemini API key is correctly managed via environment variables (`GEMINI_API_KEY`) and not hardcoded. File system operations (`fs.readFile`, `fs.writeFile`, `fs.mkdir`) are performed for image input and output. The default output directory is `~/Downloads/nano-banana-images`, but custom `outputDir` and `imagePath` arguments are accepted. While control over these paths could theoretically lead to unintended file access in a highly compromised client environment, this is a common and expected interaction for a local agent-based tool and not an inherent vulnerability in the server logic itself. No 'eval' or other direct code execution vulnerabilities were found.
Similar Servers
gemini-mcp-server
An MCP server providing a suite of 7 AI-powered tools (Image Gen/Edit, Chat, Audio Transcribe, Code Execute, Video/Image Analysis) powered by Google Gemini, featuring a self-learning "Smart Tool Intelligence" system for prompt enhancement and user preference adaptation.
nanobanana-api-mcp
An MCP server providing image generation and editing capabilities via the Google Gemini API, integrable with various AI coding assistants and IDEs.
ultimate-image-gen-mcp
A professional MCP server for Google's Gemini 3 Pro Image Preview, enabling state-of-the-art image generation with advanced reasoning, high-resolution output (1K-4K), up to 14 reference images, Google Search grounding, and automatic thinking mode.
gemini-mcp
The server provides a Model Context Protocol (MCP) interface to Google Gemini AI services, enabling multimodal generation including image creation, image editing, and video production.