logoalt Hacker News

Davidzhenglast Tuesday at 7:28 PM1 replyview on HN

I'm in agreement--RLHF won't lead to massively more intelligent beings than humans. But I said RL not RLHF


Replies

benreesmanlast Tuesday at 10:36 PM

Well what you said is:

"On the contrary, I believe in every verifiable domain RL must drive the agent to be the most intelligent (relative to RL award) it can be under the constraints--and often it must become more intelligent than humans in that environment."

And I said it's not that simple, in no way demonstrated, unlikely with current technology, and basically, nope.

show 1 reply