lemonade
Verified Safeby lemonade-sdk
Overview
Lemonade Server is a high-performance C++ HTTP server providing local OpenAI-compatible API endpoints for various AI inference tasks including large language models (LLMs), embeddings, reranking, and audio transcription, with a focus on AMD Ryzen AI hardware acceleration.
Installation
./build/lemonade-routerEnvironment Variables
- LEMONADE_API_KEY
- HF_TOKEN
- LEMONADE_OFFLINE
- LEMONADE_LLAMACPP_VULKAN_BIN
- LEMONADE_LLAMACPP_ROCM_BIN
- LEMONADE_LLAMACPP_METAL_BIN
- LEMONADE_LLAMACPP_CPU_BIN
- RYZENAI_SKIP_PROCESSOR_CHECK
- LEMONADE_CACHE_DIR
- LD_LIBRARY_PATH
- OCL_SET_SVM_SIZE
Security Notes
The server binds to `localhost` by default, which is safe; however, if configured to bind to `0.0.0.0`, it would expose the API externally. The API lacks authentication by default if the `LEMONADE_API_KEY` environment variable is not set, posing a risk if exposed. Cross-Origin Resource Sharing (CORS) is enabled for all origins (`*`), a common practice for local development but a potential vulnerability for externally-exposed servers. It automatically downloads model backends and LLM models from trusted GitHub and Hugging Face repositories, which can introduce supply chain risks if these sources are compromised, though it utilizes version pinning and explicit URLs. On Windows, the build incorporates Control Flow Guard, ASLR, and DEP. Linux builds are headless-only and use permissively licensed dependencies.
Similar Servers
osaurus
Osaurus is an AI edge runtime for macOS, enabling users to run local and cloud AI models, orchestrate tools via the Model Context Protocol (MCP), and power AI applications and workflows on Apple Silicon.
claude-prompts-mcp
Enhances AI assistant behavior through structured prompt management, multi-step chains, quality gates, and autonomous verification loops, primarily for development tasks.
remembrances-mcp
Provides long-term memory, knowledge base, and semantic code indexing capabilities for AI agents.
ai-suite
An end-to-end suite for self-hosting AI agents, LLMs, and automation workflows locally or in the cloud.