logoalt Hacker News

fc417fc802today at 12:34 AM1 replyview on HN

#1 may well put #2 out of a living but that isn't the same as stealing and doesn't (at least in and of itself) make it unsustainable. The fact that models were trained on scraped content isn't a matter of technical necessity but rather the path of least resistance (lowest cost in this case). Synthetic data is increasingly used for reasons of quantity, quality, and various technical considerations.


Replies

tw04today at 12:37 AM

All of the major players in AI currently, literally stole to build their models. There isn’t one out there that hasn’t. So yes, it is the same as stealing because they were LITERALLY, in the literal sense, stealing.

show 1 reply