Hacker News

drra, last Friday at 7:23 AM

Absolutely. Anyone working on inference at the token level knows how wasteful it all is, especially with multimodal tokens.