Even worse, the training set probably includes a lot of code that needed review but didn't get ...

prewett • yesterday at 3:16 PM • 1 reply • view on HN

Even worse, the training set probably includes a lot of code that needed review but didn't get it...

Replies

If we know the outcome of that code, such as whether it caused bugs or data corruption or a crappy UX or tech debt -- which is potentially available in subsequent PR commit messages -- it's still valuable training data.

Probably even more valuable than code that just worked, because evidently we have enough of that and AI code still has issues.

alt Hacker News

Replies