This might be a stupid question, but can a extra added local llm help with the caching problem?

thandv • today at 9:42 PM • 1 reply • view on HN

Replies

We haven't experimented with routing to local LLMs much. Technically they benefit from the cache too although it's more a question of latency than cost. But tbh I haven't seen great results in the wild from working with local LLMs for coding - curious if you've had any success with them?

alt Hacker News

Replies