logoalt Hacker News

waynecochrantoday at 1:31 PM1 replyview on HN

Conclusion: both are true which makes sense. The KV cache scaling yields both the emergent power and requires the enormous capacity.


Replies

JKCalhountoday at 1:46 PM

Which does sort of hint at a (power/profitability) ceiling on the LLM line of AI… That should make the industry nervous.

show 1 reply