alt
Hacker News
oezi
•
last Sunday at 9:19 PM
•
0 replies
•
view on HN
I meant scaling the base training before RL.