logoalt Hacker News

emp17344yesterday at 9:42 PM1 replyview on HN

This doesn’t make sense. They are fundamentally different things, so an observation made about Alphazero does not help you learn anything about LLMs.


Replies

ordinaryatomyesterday at 9:56 PM

I am not sure, self-play with LLMs self generated synthetic data is becoming a trendy topic in LLMs research.