logoalt Hacker News

muldvarptoday at 5:35 AM1 replyview on HN

Manual verification that the "judge" judges correctly.

Also, how exactly do you programmatically validate CVEs?


Replies

rohansood15today at 6:00 PM

Most open-source CVEs will have a patch linked in their disclosure. You can get vulnerable code via the git diff, then just verify if it is part of the LLM's finding.