logoalt Hacker News

siliconc0wyesterday at 10:06 PM1 replyview on HN

I reach for OpenCode + Kimi to save tokens on lower priority stuff and because it's quite fast on Fireworks AI.


Replies

polski-gtoday at 12:07 AM

I'm 90% sure Fireworks serves up quantized models.