Hacker News

vanuatu yesterday at 6:49 PM

I don't think it's much of an issue

- RL envs + synthetic data + human-annotated data

- Usage data from Codex/Claude Code/Cursor

Most of a model's coding ability comes from post-training, not pretraining


Replies

torben-friis yesterday at 6:54 PM

A better question is what's left for those who don't have access to that. We went from publicly available data to data vacuumed up from private users
