logoalt Hacker News

err4ntyesterday at 6:27 PM3 repliesview on HN

How does it know the difference?


Replies

beng-nltoday at 6:20 PM

This might not always work, but whenever possible, a working exploit could be demanded, working in a form that can be automatically verified to work.

scubboyesterday at 6:39 PM

I'm still on the AI-skeptic side of the spectrum (though shifting more towards "it has some useful applications"), but, I think the easy answer is - if different models/prompts are used in generation than in quality-/correctness-checking.

jgalt212yesterday at 10:49 PM

I think Claude, given enough time to mull it over, could probably come up with some sort of bug severity score.