logoalt Hacker News

ern_aveyesterday at 7:41 PM1 replyview on HN

My guess is that AI training is the main issue.

Data that you can prove was generated by humans is now exceedingly valuable ...and most of that comes from the days before LLMs. The situation is a bit like how steel manufactured before the nuclear age is valuable.


Replies

adamnemecekyesterday at 7:44 PM

But why would people train on excerpts from Google Books when whole books can be downloaded on libgen and such?

show 2 replies