Conclusion: both are true, which makes sense. KV-cache scaling both yields the emergent power and demands the enormous memory capacity.
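The capacity side of that claim is easy to see with a back-of-envelope calculation. A minimal sketch, assuming illustrative (not official) figures loosely modeled on a 70B-class model with grouped-query attention:

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, seq_len,
                   bytes_per_elem=2, batch=1):
    # Factor of 2 for the separate K and V tensors cached per layer.
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_elem * batch

# Assumed figures: 80 layers, 8 KV heads, head_dim 128,
# a 128k-token context, fp16 (2 bytes/element).
size = kv_cache_bytes(80, 8, 128, 128_000)
print(f"{size / 2**30:.1f} GiB per sequence")
```

The cache grows linearly with context length and batch size, so long contexts at high concurrency dominate serving memory, which is the capacity pressure the conclusion above points at.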
Which does sort of hint at a (power/profitability) ceiling on the LLM line of AI… That should make the industry nervous.