In my experience using llama.cpp (which ollama uses internally) on a Strix Halo, whether ROCm or Vul...

cpburns2009 • today at 2:58 PM • 1 reply • view on HN

In my experience using llama.cpp (which ollama uses internally) on a Strix Halo, whether ROCm or Vulkan performs better really depends on the model and it's usually within 10%. I have access to an RX 7900 XT I should compare to though.

Replies

metalliqaz • today at 3:38 PM

Perhaps I should just google it, but I'm under the impression that ollama uses llama.cpp internally, not the other way around.

Thanks for that data point I should experiment with ROCm

➕ show 2 replies

alt Hacker News

Replies