#1 may well put #2 out of a living but that isn't the same as stealing and doesn't (at least in and of itself) make it unsustainable. The fact that models were trained on scraped content isn't a matter of technical necessity but rather the path of least resistance (lowest cost in this case). Synthetic data is increasingly used for reasons of quantity, quality, and various technical considerations.
All of the major players in AI currently, literally stole to build their models. There isn’t one out there that hasn’t. So yes, it is the same as stealing because they were LITERALLY, in the literal sense, stealing.