logoalt Hacker News

prewettyesterday at 3:16 PM1 replyview on HN

Even worse, the training set probably includes a lot of code that needed review but didn't get it...


Replies

keedayesterday at 4:50 PM

If we know the outcome of that code, such as whether it caused bugs or data corruption or a crappy UX or tech debt -- which is potentially available in subsequent PR commit messages -- it's still valuable training data.

Probably even more valuable than code that just worked, because evidently we have enough of that and AI code still has issues.