Back to Home
lemonade-sdk icon

lemonade

Verified Safe

by lemonade-sdk

Overview

The Lemonade C++ Server provides a lightweight, high-performance HTTP API for local Large Language Model (LLM) inference and model management, leveraging hardware accelerators like AMD Ryzen AI NPU, integrated GPUs, and discrete GPUs.

Installation

Run Command
./lemonade-router

Environment Variables

  • HF_TOKEN
  • LEMONADE_CACHE_DIR
  • LEMONADE_OFFLINE
  • LEMONADE_CI_MODE
  • LEMONADE_LLAMACPP_BACKEND
  • LEMONADE_LLAMACPP_<BACKEND>_BIN
  • RYZENAI_SKIP_PROCESSOR_CHECK
  • OCL_SET_SVM_SIZE
  • LD_LIBRARY_PATH

Security Notes

The server utilizes `system()` calls for external tool checks (e.g., `where flm`, `vulkaninfo`) and for installing/extracting binaries (e.g., `unzip`, PowerShell `Expand-Archive`). While the commands and paths are largely constructed internally or derived from trusted sources (GitHub/Hugging Face releases), and some user input is validated (e.g., custom llama-server args), any interaction with external processes carries inherent risk. The HTTP server defaults to binding on `localhost`, which mitigates the impact of its permissive CORS policy (`Access-Control-Allow-Origin: *`). Single-instance protection is implemented via system-wide mutexes or file locks.

Similar Servers

Stats

Interest Score100
Security Score8
Cost ClassLow
Avg Tokens150
Stars1822
Forks150
Last Update2025-12-05

Tags

LLM inferenceAI accelerationHybrid AIModel servingC++ backend