Problem is there's a lot more than a single repo in training data, the corpus is massive... Sho...

bcjdjsndon • today at 10:48 AM • 0 replies • view on HN

Problem is there's a lot more than a single repo in training data, the corpus is massive... Should the author of a blog post on cats also be compensated for simply being in the same training data as the git repo?

alt Hacker News