logoalt Hacker News

tensortoday at 6:30 PM6 repliesview on HN

I still find the idea that "learning" from code is "stealing" kind of ridiculous.


Replies

bohtoday at 7:58 PM

Yes I guess there's also no such thing as stealing in torrents since the computer "learns" the data and returns it in a transcoded fashion so it's technically not a reproduction. Yes LLMs can reproduce passages from copyrighted works verbatim but that's only because it "learned" it and it's just telling you what it "knows".

The mental calisthenics required to justify this stuff must be exhausting.

show 1 reply
nkrisctoday at 8:05 PM

I find it more ridiculous to equate the act of a human learning with for-profit AI training without recompense to the authors of the training material.

MagicMoonlighttoday at 8:42 PM

If I “learned” your essay and handed it in, would you be happy with that?

lo_zamoyskitoday at 7:34 PM

If there were the case, then imagine having to give it back!

pydrytoday at 7:32 PM

If you can set a copyright trap and an LLM reproduces it I think it's pretty clear cut that it's more than just "learning".

I have seen LLMs do all sorts of crap which was clearly reproduction of training material.

This is also why people are most impressed with how much better it is at reproducing boilerplate rather than, say, imaginative new ideas.

show 1 reply
estimator7292today at 6:45 PM

Learning, probably not.

Copy/pasting at scale, yes

show 2 replies