Nice, just tried that with "tell me a long tall tale" as the prompt and got:
Speed: 26.41 tok/s
How fast is it with llama.cpp? A 1B model should be a lot faster on an M2.