logoalt Hacker News

chrischentoday at 5:36 PM2 repliesview on HN

Yes but all the AI companies took all the public data, so when you pay for an AI model you are paying for the marginal service of building a model off that data, not for the data itself. What we should do is ensure that the data is available to more people to train AI models... but sadly this doesn't seem to be happening. Instead AI companies that were first-movers got to train off public data, and as the companies and businesses that own this data get wise they're going to start charging people to train off the data. This will make it much more difficult for anyone to train a model in the future as it will become expensive, and the companies that did happen to already train off public data will get a bit of incumbent's advantage.


Replies

forbiddenvoidtoday at 6:13 PM

I don't really buy this argument. When you buy a physical product, you are paying the entire product lifecycle, not just the marginal aspect of retail distribution. This is the same thing. The marginal inference has to come FROM somewhere. It doesn't just appear out of nowhere.

Larrikintoday at 6:29 PM

AI companies took public, private, and copyrighted data. Your position is that because these big companies stole so much we should let them get away with it by devaluing it further so everyone can ignore intellectual property law.