OpenAI and the other big players clearly RLHF with different users in mind than professionals. They’...

teaearlgraycold • last Sunday at 7:38 PM • 0 replies • view on HN

OpenAI and the other big players clearly RLHF with different users in mind than professionals. They’re optimizing for sycophancy and general pleasantness. It’s beautiful to finally see a big model that hasn’t been warped in this way. I want a model that is borderline rude in its responses. Concise, strict, and as distrustful of me as I am of it.

alt Hacker News