I can't wait until we get to 100% storage/cost/compute reduction for LLMs. Every thought you could have thought pre-conceived in high-fidelity super-resolution. Every action you could have taken predicted and simulated in advance courtesy of Openthropic and the USA Sovereign Wealth Fund.
100% reduction is impossible for something which should work, because -100% means it is now 0
Reminds me of 'Learning to be me' by Greg Egan
You would obviously be trading storage for compute and time to retrieve the storage.