"On the contrary, I believe in every verifiable domain RL must drive the agent to be the most intelligent (relative to RL award) it can be under the constraints--and often it must become more intelligent than humans in that environment."
And I said it's not that simple, in no way demonstrated, unlikely with current technology, and basically, nope.
Well what you said is:
"On the contrary, I believe in every verifiable domain RL must drive the agent to be the most intelligent (relative to RL award) it can be under the constraints--and often it must become more intelligent than humans in that environment."
And I said it's not that simple, in no way demonstrated, unlikely with current technology, and basically, nope.