NotSuspicious • today at 5:16 AM

The interesting thing about models this small is that they should fit on a single Taalas chip (the HC1 already runs a Llama 3.1 8B model). We're already at the point where half-decent reasoning could run on an ASIC, and at mind-boggling speeds.

Replies

pants2 • today at 5:53 AM

Yeah, if they can fit an 8B model that's really good at improving its output by thinking, running it at 16K tok/s on Taalas would be mind-blowing.
