Hacker News

ActorNightly · yesterday at 7:49 PM

It's not the training/tuning, it's pretty much the nature of LLMs. The whole idea is to give a best guess of the next token. The more complex dynamics behind the meaning of the words, and how those words relate to real-world concepts, aren't learned.


Replies

vidarh · yesterday at 8:33 PM

You're not making any sense. The best guess will often be a refusal if the model sees enough refusals in the training data, so of course it comes down to training.
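The frequency argument can be illustrated with a toy sketch. This is nothing like a real transformer: just a bigram-style frequency table whose "best guess" for a continuation is whatever response dominated its training examples. The prompt and response strings are invented for illustration.

```python
from collections import Counter, defaultdict

# Hypothetical toy "training data": the same prompt paired with different
# continuations, where refusals happen to be more frequent.
training = [
    ("how do I X", "here's how"),
    ("how do I X", "I can't help with that"),
    ("how do I X", "I can't help with that"),
]

counts = defaultdict(Counter)
for prompt, continuation in training:
    counts[prompt][continuation] += 1

def best_guess(prompt):
    # Argmax over observed continuations: the model reproduces whichever
    # response was most frequent in training, refusal or otherwise.
    return counts[prompt].most_common(1)[0][0]

print(best_guess("how do I X"))  # the refusal wins, 2 votes to 1
```

The point of the sketch: if refusals outnumber helpful answers for a class of prompts in the training mix, the statistically "best guess" becomes the refusal, which is a property of the training data rather than of next-token prediction itself.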

And I saw the effect of this first hand: the project I worked on was actively part of training this behaviour into a major model.

As for your assertion that they don't learn the more complex dynamics: that claim was already trite, and untrue, several years ago.