Hacker News

echoangle 02/20/2025

Maybe the dog just values the immediate reward more, even though it understands it could get even more later? How would you control for that?