logoalt Hacker News

mschuster91today at 8:58 AM2 repliesview on HN

And on top of that - no matter if you develop open-source or proprietary software, who is to guarantee the AI didn't get trained with GPL (or even worse, leaked proprietary) source code? Who is going to pay my lawyer when someone files a copyright lawsuit and all I have as an excuse is that I "AI-laundered" my code?

And some projects like WINE or ReactOS probably have to worry about that even more given they need to guarantee clean-room reverse engineering...


Replies

voidUpdatetoday at 9:22 AM

Given the amount of web scraping LLM providers have been doing, I'd say it's likely that any code that is publicly accessible on the internet has been incorporated into it's training data, whatever its license

cyclotron3ktoday at 10:32 AM

I'm not disagreeing with you, but it's worth noting there were plenty of GPL violations before LLMs existed.

show 1 reply