I don't know; I prompted Opus 4.5 with "Tell me the reasons why this report is stupid" on one of the example slop reports, and it returned a list of pretty good answers.[1]
Give it a presumption of guilt and tell it to make a list, and an LLM can do a pretty good job of judging crap. You could very easily rig up a system that runs this "why is it stupid" pass, grades each report, and only lets humans see the ones that score better than a B+.
If you give them the right structure, I've found LLMs to be much better at judging things than at creating them.
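As a rough sketch of what that grading gate could look like (nothing I've actually built; ask_llm, the prompts, the grade scale, and the B+ cutoff are all placeholders for whatever model API and rubric you'd really use):

    # Sketch of the triage gate described above. ask_llm() is a hypothetical
    # stand-in for whatever LLM API you actually call; prompts and cutoff are
    # illustrative, not tested values.

    def ask_llm(prompt: str) -> str:
        """Placeholder: send `prompt` to your model of choice, return its text reply."""
        raise NotImplementedError

    GRADE_ORDER = ["F", "D", "C", "B", "B+", "A-", "A", "A+"]

    def critique(report: str) -> str:
        # Presumption of guilt: force the model to argue against the report.
        return ask_llm(
            "Tell me the reasons why this report is stupid. "
            "Be specific and cite the report's own claims.\n\n" + report
        )

    def grade(report: str, critique_text: str) -> str:
        # Second pass: grade the report in light of the critique.
        reply = ask_llm(
            "Given this critique, grade the original bug report from F to A+. "
            "Reply with the letter grade only.\n\n"
            f"REPORT:\n{report}\n\nCRITIQUE:\n{critique_text}"
        )
        return reply.strip()

    def needs_human(report: str, cutoff: str = "B+") -> bool:
        # Only reports grading above the cutoff get surfaced to a human.
        g = grade(report, critique(report))
        if g not in GRADE_ORDER:
            return True  # unparseable grade: fail open and let a human look
        return GRADE_ORDER.index(g) > GRADE_ORDER.index(cutoff)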
Opus' judgement in the end:
"This is a textbook example of someone running a sanitizer, seeing output, and filing a report without understanding what they found."
1. https://claude.ai/share/8c96f19a-cf9b-4537-b663-b1cb771bfe3f
"Tell me the reasons why this report is stupid" is a loaded prompt. The tool will generate whatever output pattern matches it, including hallucinating it. You can get wildly different output if you prompt it "Tell me the reasons why this report is great".
It's the same as if you searched the web for a specific conclusion. You will get matches for it regardless of how insane it is, leading you to believe it is correct. LLMs take this to another level, since they can generate patterns not previously found in their training data, and the output seems credible on the surface.
Trusting the output of an LLM to determine the veracity of a piece of text is a bafflingly bad idea.
And if you ask why it's accurate, it'll spaff out another list of pretty convincing answers.
Ok, run the same prompt on a legitimate bug report. The LLM will pretty much always agree with you.