The official Q4_K_S gguf is quite good and generates at a very good 35 tps on an M1 Mac Studio. It should be much faster on recent Macs, especially the M5.
What’s “Q4_K_S gguf” and where do I get it? Is it easy to install and configure on a MacBook?