Hacker News

rubiquity, today at 5:10 PM

llama.cpp and llama-swap do this better than Ollama and with far more control.


Replies

circularfoyers, today at 6:51 PM

Don't even need to use llama-swap anymore now that llama-server supports the same functionality.