logoalt Hacker News

bcjdjsndontoday at 10:48 AM0 repliesview on HN

Problem is there's a lot more than a single repo in training data, the corpus is massive... Should the author of a blog post on cats also be compensated for simply being in the same training data as the git repo?