logoalt Hacker News

GrinningFooltoday at 12:17 AM1 replyview on HN

That's a huge gap for llama.cpp server - any idea why?


Replies

zambellitoday at 12:47 AM

Best guess is it's native mode. The function calling template is just broken for Nemo.

I did go with an extreme example in the post (but true). Other deltas are smaller but still statistically significant. 30 pt swing between llamserver prompt vs ollama, 4-5pt swing between llamafile and llamaserver prompt.