girvo • last Sunday at 12:37 PM • 0 replies • view on HN

Aw, that’s a shame; I’m running the official llama.cpp on my Spark-alike, and it works great now. Proper triple head too, which is what it was trained on, and it gets me up to 35-40 tok/s decode.
