I reach for OpenCode + Kimi to save tokens on lower-priority stuff, and because Kimi is quite fast on Fireworks AI.
I'm 90% sure Fireworks serves up quantized models.