logoalt Hacker News

rjswtoday at 10:59 AM1 replyview on HN

If the LLM was trained on any GPL licenced code then there is an argument that all output is GPL too, legal departments should be worried.


Replies

graemeptoday at 11:23 AM

I am not aware of any argument for that. Even if the output is a derivative work (which is very doubtful) that would make it a breach of copyright to distribute it under another license, not automatically apply the GPL.

If the output is a derivative work of the input then you would be in breach of copyright if the training data is GPL, MIT, proprietary - anything other than public domain or equivalent.