It's not possible with today's LLM models, but we are not wedded to the current architectu...

airstrike • today at 7:08 PM • 1 reply • view on HN

It's not possible with today's LLM models, but we are not wedded to the current architecture.

SlinkyOnStairs • today at 7:35 PM

Realistically, we are.

This is not some arbitrary design choice, it's the core compromise to make LLMs viable to train at all.

➕ show 1 reply

alt Hacker News