Back to Home
lemonade-sdk icon

lemonade

Verified Safe

by lemonade-sdk

Overview

Lemonade Server is a high-performance C++ HTTP server providing local OpenAI-compatible API endpoints for various AI inference tasks including large language models (LLMs), embeddings, reranking, and audio transcription, with a focus on AMD Ryzen AI hardware acceleration.

Installation

Run Command
./build/lemonade-router

Environment Variables

  • LEMONADE_API_KEY
  • HF_TOKEN
  • LEMONADE_OFFLINE
  • LEMONADE_LLAMACPP_VULKAN_BIN
  • LEMONADE_LLAMACPP_ROCM_BIN
  • LEMONADE_LLAMACPP_METAL_BIN
  • LEMONADE_LLAMACPP_CPU_BIN
  • RYZENAI_SKIP_PROCESSOR_CHECK
  • LEMONADE_CACHE_DIR
  • LD_LIBRARY_PATH
  • OCL_SET_SVM_SIZE

Security Notes

The server binds to `localhost` by default, which is safe; however, if configured to bind to `0.0.0.0`, it would expose the API externally. The API lacks authentication by default if the `LEMONADE_API_KEY` environment variable is not set, posing a risk if exposed. Cross-Origin Resource Sharing (CORS) is enabled for all origins (`*`), a common practice for local development but a potential vulnerability for externally-exposed servers. It automatically downloads model backends and LLM models from trusted GitHub and Hugging Face repositories, which can introduce supply chain risks if these sources are compromised, though it utilizes version pinning and explicit URLs. On Windows, the build incorporates Control Flow Guard, ASLR, and DEP. Linux builds are headless-only and use permissively licensed dependencies.

Similar Servers

Stats

Interest Score100
Security Score7
Cost ClassLow
Avg Tokens150
Stars2022
Forks169
Last Update2026-01-19

Tags

LLMAI InferenceHTTP ServerLocal AIOpenAI API Compatible