We can trust the feedback we give it based on the output it provides. | alt Hacker News

alt Hacker News

jack_pp • today at 5:03 PM • 1 reply • view on HN

We can trust the feedback we give it based on the output it provides.

Replies

ambicapter • today at 5:16 PM

What kind of feedback are you giving? What's the reward function?

➕ show 1 reply