I don't find it really viable. There are so many ways to express the same question, and context does matter: the same prompt becomes irrelevant if the previous prompts or LLM responses differ.
With the cache limited to the same organization, the chances of it actually being reused would be extremely low.
I don't find it really viable. There are so many ways to express the same question, and context does matter: the same prompt becomes irrelevant if the previous prompts or LLM responses differ.
With the cache limited to the same organization, the chances of it actually being reused would be extremely low.