
kennywinker today at 6:36 AM

> This creates a recurring pattern on r/LocalLLaMA: new model launches, people try it through Ollama, it’s broken or slow or has botched chat templates, and the model gets blamed instead of the runtime.

Seems like maybe, at least some of the time, you're being underwhelmed by Ollama, not the model.

The better performance alone seems like reason enough to switch away.


Replies

speedgoose today at 6:57 AM

I follow the llama.cpp runtime improvements, and the same is true for that project. They may rush a bit less, but you still have to wait a few days after a model release to get a working runtime with most features.
