logoalt Hacker News

bityardlast Monday at 4:43 PM2 repliesview on HN

The assertion in the issue report is that Claude saw a sharp decline in quality over the last few months. However, the report itself was allegedly generated by Claude.

Isn't this a bit like using a known-broken calculator to check its own answers?


Replies

nyeahlast Monday at 4:55 PM

If a known-broken calculator claims it's broken, I more or less concur. (Chain of reasoning omitted here.)

itemize123yesterday at 8:42 AM

if it's not broken then we trust the assertion that it's broken. if it's broken then it's broken.

it's analysis of what is broken is probably wrong or at least incomplete though