logoalt Hacker News

esafaktoday at 3:08 PM2 repliesview on HN

Hear, hear. Even if the model fits, a few tokens per second make no sense. Time is money too.


Replies

hex4def6today at 7:41 PM

If I can start an agent and be able to walk away for 8 hours, and be confident it's 'smart' enough to complete a task unattended, that's still useful.

At 3 tk/s, that's still 100-150 pages of a book, give or take.

tempoponettoday at 3:56 PM

Maybe for a coding agent, but a daily/weekly report on sensitive info?

If it were 2016 and this technology existed but only in 1 t/s, every company would find a way to extract the most leverage out of it.

show 2 replies