Hacker News
am17an
•
today at 4:56 AM
Use llama.cpp? I get 250 tokens/sec on gpt-oss using a 4090; not sure about Mac speeds.