logoalt Hacker News

computerexyesterday at 7:04 PM1 replyview on HN

That’s not remotely true. They did distillation as a cheap solution to the cold start problem. You need data/trajectories to hill climb to higher capabilities. All large Chinese labs do RLAIF.


Replies

sulamyesterday at 7:10 PM

Oh yes, not remotely true. Which is why the frontier labs all have invested heavily in trying to identify and thwart distillers, using known company names / domains to drive their exclusion lists.

/s

show 1 reply