Hacker News

EternalFury, last Monday at 8:59 PM

They learn the value of specific actions in specific contexts, based on the rewards they received during their play time. Those specific actions and contexts don't transfer, for various reasons. John mentioned that varying frame rates and variable latency between action and effect really confuse the models.


Replies

nightpool, last Monday at 9:41 PM

Okay, so fuzz the frame rate and latency? That feels very easy to fix.
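
For concreteness, here is a rough sketch of what "fuzzing" the timing could look like during training. It wraps an environment so that each episode gets a random action latency and each step repeats the action for a random number of frames. Everything in it is an assumption made for illustration: `CounterEnv` is a toy stand-in, `FuzzedTimingEnv`, `skip_range`, and `delay_range` are hypothetical names, and none of this is taken from the setup being discussed. Whether training this way actually fixes the transfer problem is a separate question.

```python
import random
from collections import deque


class CounterEnv:
    """Toy stand-in environment (hypothetical): the observation is the sum of applied actions."""

    def __init__(self):
        self.total = 0

    def reset(self):
        self.total = 0
        return self.total

    def step(self, action):
        self.total += action
        return self.total, float(action), False  # observation, reward, done


class FuzzedTimingEnv:
    """Randomizes frame skip (per step) and action latency (per episode)."""

    def __init__(self, env, skip_range=(1, 4), delay_range=(0, 3), noop=0, seed=None):
        self.env = env
        self.skip_range = skip_range    # inclusive (min, max) frames per agent step
        self.delay_range = delay_range  # inclusive (min, max) steps of action latency
        self.noop = noop                # action applied while the delay queue fills
        self.rng = random.Random(seed)
        self.pending = deque()

    def reset(self):
        # Draw a fresh latency for this episode and pre-fill the queue with no-ops.
        delay = self.rng.randint(*self.delay_range)
        self.pending = deque([self.noop] * delay)
        return self.env.reset()

    def step(self, action):
        # The agent's action goes to the back of the queue; the oldest one is applied.
        self.pending.append(action)
        delayed_action = self.pending.popleft()

        # Repeat the applied action for a random number of underlying frames.
        skip = self.rng.randint(*self.skip_range)
        total_reward = 0.0
        for _ in range(skip):
            obs, reward, done = self.env.step(delayed_action)
            total_reward += reward
            if done:
                break
        return obs, total_reward, done
```

Usage would be something like `env = FuzzedTimingEnv(CounterEnv(), seed=0)`, then training against `env.reset()` and `env.step(action)` as usual, so the agent never sees one fixed frame rate or latency.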
