logoalt Hacker News

OsrsNeedsf2Pyesterday at 11:04 PM2 repliesview on HN

Why isn't Claude doing QA testing for you?


Replies

PunchyHamsteryesterday at 11:22 PM

Why isn't it doing it for Anthropic ?

show 1 reply
slopinthebagyesterday at 11:06 PM

I can't tell if this is sarcasm, but if not, you cant rely on the thing that produced invalid output to validate it's own output. That is fundementally insufficient, despite it potentially catching some errors.

show 7 replies