Local LLM REST API using llama-server and Qwen 0.5B
A fully automated deployment of a lightweight, CPU-only Large Language Model.
This snap packages llama.cpp's server with Qwen2.5-0.5B to expose an
OpenAI-compatible REST API.
Enable snaps on Kubuntu and install haproxy-spoe-vibes
Snaps are applications packaged with all their dependencies to run on all popular Linux distributions from a single build. They update automatically and roll back gracefully.
Snaps are discoverable and installable from the Snap Store, an app store with an audience of millions.