logoalt Hacker News

bravetravelertoday at 6:45 PM0 repliesview on HN

I'm largely 'all natural', any of my little LLM usage is local. 128G Strix system, a not-super-dense Qwen or Gemma variant will get 50-80 tok/s output. Not subscribing to Claude/GPT/etc even in the unlikely event these are the last local models released; simply not needed.