Exactly. Even this paper shows how model creativity significantly drops and the models experience mo...

behnamoh • yesterday at 9:18 PM • 1 reply • view on HN

Exactly. Even this paper shows how model creativity significantly drops and the models experience mode collapse like we saw in GANs, but the companies keep using RLHF...

https://arxiv.org/abs/2406.05587

Replies

nomel • yesterday at 9:28 PM

A nice talk about a researcher's experience/benchmarks with raw GPT-4, before and after RLHF:

https://www.youtube.com/watch?v=qbIk7-JPB2c

➕ show 1 reply

alt Hacker News

Replies