> LLMs all
Sounds like you don't know how RLHF works. Everything you describe is post-training. Base models can't even chat; they have to be trained to do even basic conversational turn-taking.
> Everything you describe is post-training. Base models can't even chat; they have to be trained to do even basic conversational turn-taking.
So that's still training, then, not 'post-training'. It's just a different training phase.