So why then do we stop training LLMs and keep them stored at a specific state? Is it perhaps because...

datsci_est_2015 • yesterday at 10:58 AM • 1 reply • view on HN

So why then do we stop training LLMs and keep them stored at a specific state? Is it perhaps because the results become terrible and LLMs have a delicate optimal state for general use? This sounds like an even worse case for a model of intelligence.

Replies

stavros • yesterday at 11:04 AM

Nope, it's not that, but it's nice of you to offer a straw man. Makes the argument flow better.

➕ show 1 reply

alt Hacker News

Replies