Temporary performance degradation

We are currently experiencing service degradation and working on resolving this. Thank you for your patience and understanding.

Lemonade Server

Ken VanDine (ken-vandine) Publisher Star developer Star developer

Install latest/stable of Lemonade Server

Ubuntu 16.04 or later?

Make sure snap support is enabled in your Desktop store.


Install using the command line

sudo snap install lemonade-server

Don't have snapd? Get set up for snaps.

Channel Version Published

Details for Lemonade Server

Package name

  • lemonade-server

License

  • MIT

Last updated

  • 29 April 2026 - latest/stable
  • Yesterday - latest/edge

Websites


Contact


Source code


Report a bug


Report a Snap Store violation

Share this snap

Generate an embeddable card to be shared on external websites.

Local AI server with OpenAI-compatible API

Lemonade Server is a lightweight, high-performance local AI inference server that provides an OpenAI-compatible API for running large language models on your own hardware.

Features:

  • OpenAI-compatible REST API (chat completions, embeddings, etc.)
  • Multiple backend support: Vulkan, ROCm (AMD GPUs), and CPU
  • Automatic model management and caching
  • Support for GGUF models from Hugging Face
  • Low latency local inference
  • Runs as a background service

Supported Hardware:

  • AMD GPUs: RDNA3 (RX 7000), RDNA4 (RX 9000), Strix Point/Halo APUs
  • Any Vulkan-capable GPU
  • CPU fallback for systems without GPU acceleration

Quick Start: The server starts automatically after installation. Access the API at: http://localhost:8000/api/v1

ROCm Support (AMD GPUs): For ROCm GPU acceleration, connect the process-control interface: sudo snap connect lemonade-server:process-control

Documentation: https://lemonade-server.ai/


Install Lemonade Server on your Linux distribution

Choose your Linux distribution to get detailed installation instructions. If yours is not shown, get more details on the installing snapd documentation.