> then I recommend https://github.com/Mozilla-Ocho/llamafile which ships as a single file with no dependencies and runs on CPU with great performance. Like, such great performance that I've mostly given up on GPU for LLMs. It was a game changer.
First time that I have a "it just works" experience with LLMs on my computer. Amazing. Thanks for the recommendation!