logoalt Hacker News

dotty-last Tuesday at 9:50 PM1 replyview on HN

You joke, but that's a very real approach that AI pentesting companies do take: an agent that creates reports, and an agent that 'validates' reports with 'fresh context' and a different system prompt that attempts to reproduce the vulnerability based on the report details.

*Edit: the paper seems to suggest they had a 'Triager' for vulnerability verification, and obviously that didn't catch all the false positives either, ha.


Replies

tptaceklast Tuesday at 10:04 PM

Can't be any worse than Fortify was!

show 1 reply