Hacker News

csunoser, today at 3:30 PM (2 replies)

Yes*. At least in my limited usage of deepseek-flash (a few billion tokens on OpenRouter), the cache-hit rate is >95%. I simply pointed the Claude Code harness at OpenRouter's Anthropic-compatible endpoint, with no fluff.
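
For anyone wanting to check a figure like this on their own traffic, here is a minimal sketch of one way a cache-hit rate could be computed. It assumes each response exposes an Anthropic-style `usage` object with `input_tokens`, `cache_creation_input_tokens`, and `cache_read_input_tokens`; whether a given provider or model passes these fields through is an assumption, as is the definition of "hit rate" used here (cached reads divided by all prompt tokens). The function name and the sample numbers are made up for illustration and are not the commenter's data or method.

```python
from typing import Iterable, Mapping


def cache_hit_rate(usages: Iterable[Mapping[str, int]]) -> float:
    """Fraction of prompt tokens that were served from cache.

    Assumes Anthropic-style usage fields, where `input_tokens` counts only
    the tokens that were neither read from nor written to the cache.
    """
    read = written = uncached = 0
    for usage in usages:
        # Missing or null fields are treated as zero.
        read += usage.get("cache_read_input_tokens", 0) or 0
        written += usage.get("cache_creation_input_tokens", 0) or 0
        uncached += usage.get("input_tokens", 0) or 0
    total = read + written + uncached
    return read / total if total else 0.0


# Hypothetical usage records from three consecutive requests in one session.
sample = [
    {"input_tokens": 200, "cache_creation_input_tokens": 1800, "cache_read_input_tokens": 0},
    {"input_tokens": 150, "cache_read_input_tokens": 48000},
    {"input_tokens": 100, "cache_read_input_tokens": 49750},
]
rate = cache_hit_rate(sample)
print(f"cache-hit rate: {rate:.2%}")
```

The first request in a session typically writes the cache and reads nothing, so the rate only climbs once later requests reuse the same long prefix, which is the pattern an agentic coding harness tends to produce.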


Replies

port11, today at 8:16 PM

Did you get proper tool use? Some CC-driven models seem to get a bit off when it comes to MCP usage. For example: I really struggled to get Kimi to use Serena, which I think ended up costing too many tokens.

schaefer, today at 3:40 PM

thank you!