logoalt Hacker News

Cheer2171yesterday at 5:30 PM1 replyview on HN

> LLMs all

Sounds like you don't know how RLHF works. Everything you describe is post-training. Base models can't even chat, they have to be trained to even do basic conversational turn taking.


Replies

jacquesmyesterday at 6:50 PM

> Everything you describe is post-training. Base models can't even chat, they have to be trained to even do basic conversational turn taking.

So, that's still training then, so not 'post-training'. Just a different training phase.

show 1 reply