Hacker News

upbeat_general, today at 7:25 AM

This isn’t quite accurate. Data weighting is quite important in pretraining.