logoalt Hacker News

oezilast Sunday at 9:19 PM0 repliesview on HN

I meant scaling the base training before RL.