I have been using lemonade for nearly a year already. On Strix Halo I am using nothing else - althou...

dennemark • today at 1:10 PM • 2 replies • view on HN

I have been using lemonade for nearly a year already. On Strix Halo I am using nothing else - although kyuz0's toolboxes are also nice (https://kyuz0.github.io/amd-strix-halo-toolboxes/)

Nowadays you get TTS, STT, text & image generation and image editing should also be possible. Besides being able to run via rocm, vulkan or on CPU, GPU and NPU. Quite a lot of options. They have a quite good and pragmatic pace in development. Really recommend this for AMD hardware!

Edit: OpenAI and i think nowaday ollama compatible endpoints allow me to use it in VSCode Copilot as well as i.e. Open Web UI. More options are shown in their docs.

Replies

UncleOxidant • today at 6:19 PM

How much of a speedup might I get for, say, Qwen3.5-122B if I were to run with lemonade on my Strix Halo vs running it using vulkan with llama.cpp ?

➕ show 1 reply

syntaxing • today at 2:30 PM

Have you used it with any agents or claw? If so, which model do you run?

➕ show 2 replies

alt Hacker News

Replies