logoalt Hacker News

2ndorderthoughttoday at 12:29 PM1 replyview on HN

I see your updated post. Switch over to llamacpp and look up recommended quants and settings. A good place for this info is on /r/localllama


Replies

gchamonlivetoday at 12:37 PM

Yep! I'm currently trying vllm, then I'll give llamacpp a try too