Hacker News
nolist_policy • today at 7:02 PM
Is distillation or synthetic data used during pre-training? If so, how much?