Absolutely. Anyone working on inference token level knows how wasteful it all is especially in multi...

drra • last Friday at 7:23 AM • 0 replies • view on HN

Absolutely. Anyone working on inference token level knows how wasteful it all is especially in multimodal tokens.

alt Hacker News