logoalt Hacker News

layoriclast Tuesday at 11:47 PM1 replyview on HN

Thanks for posting the performance numbers from your own validation. 6-7 tokens/sec is quite remarkable for the hardware.


Replies

geerlingguylast Tuesday at 11:49 PM

Some more benchmarking, and with larger outputs (like writing an entire relatively complex TODO list app) it seems to go down to 4-6 tokens/s. Still impressive.