logoalt Hacker News

CuriouslyClast Thursday at 11:43 PM1 replyview on HN

Game playing is the next frontier. Model economically valuable tasks as games and have the agents play/compete. Alphabench and Vendingbench show the potential of this approach.


Replies

ossa-mayesterday at 2:14 AM

A decade of reinforcement and agentic learning was spent playing games (Google Deepmind AlphaGo, AlphaStar, OpenAI Five), including against each other. So what makes it a new frontier?

show 1 reply