Hacker News

james2doyle, yesterday at 8:57 PM (2 replies)

You should check out "model collapse". It seems that an abundance of content, more and more of which is AI-generated these days, may not be a viable option. There is also a vast amount of data that is increasingly going private or behind paywalls.


Replies

platinumrad, yesterday at 9:20 PM

People love harping on this one, but model collapse hasn't turned out to be an issue in practice.

gruez, yesterday at 9:19 PM

>You should check out "model collapse". It seems that an abundance of content, that is more and more AI generated these days, may not be a viable option.

Doom-saying about "model collapse" is kind of funny when OpenAI and Anthropic are mad at Chinese model makers for "distilling" their models, i.e., using those models' outputs to train their own.
