alt
Hacker News
ranger_danger
•
today at 5:03 AM
•
0 replies
•
view on HN
with regular llama.cpp on a 3070ti I get 60tok/s TG with the 9B model, it's quite impressive.