logoalt Hacker News

airstriketoday at 7:08 PM1 replyview on HN

It's not possible with today's LLM models, but we are not wedded to the current architecture.


Replies

SlinkyOnStairstoday at 7:35 PM

Realistically, we are.

This is not some arbitrary design choice, it's the core compromise to make LLMs viable to train at all.

show 1 reply