This seems like a problem that's simply stated but not simply solved. I think Grokipedia, or whatever it was called, was a real exercise in “no one cares about cached LLM output”. The ephemeral nature of LLM output is somehow a core property of its utility. It's kind of like how I never share a Google search with a coworker; I share the link I found.