Hacker News

nomel, yesterday at 9:16 PM

The "alignment tax".


Replies

behnamoh, yesterday at 9:18 PM

Exactly. Even this paper shows that model creativity drops significantly and that the models experience mode collapse, like we saw in GANs, but the companies keep using RLHF...

https://arxiv.org/abs/2406.05587
