logoalt Hacker News

kristianptoday at 1:15 AM1 replyview on HN

Openrouter has an "exacto" [1] option to favour higher quality providers for a given model. Have you found any benefits to using that?

Edit: Kimi K2 uses int4 during its training as well as inference [2]. I wonder if that affects the quality if different gguf creators may not convert these correctly?

[1] https://openrouter.ai/docs/guides/routing/model-variants/exa...

[2] https://www.reddit.com/r/LocalLLaMA/comments/1pzfuqg/why_kim...


Replies

gertlabstoday at 1:20 AM

I did not know about this! We've put a lot of effort into probing providers and their offerings and auto-selecting the best options. I wonder how well their exacto option works.

Going to test it out, thanks!