logoalt Hacker News

cratermoonlast Thursday at 6:45 PM1 replyview on HN

https://thebullshitmachines.com/lesson-16-the-first-step-fal...


Replies

Philpaxlast Thursday at 7:25 PM

This doesn't seem to really address synthetic data, let alone RL-based reasoning.

show 1 reply