logoalt Hacker News

Salgatyesterday at 10:39 PM1 replyview on HN

Local models are much less energy efficient right?


Replies

HDBaseTyesterday at 11:57 PM

It's a good question, although I think hard to quantify.

If you are simply measuring Watt Cost per Token, you are missing the mark drastically. You have to measure quality output per Watt.

It sounds reasonably difficult to benchmark this, maybe I'm wrong though.