"Surgical "is the kind of wordage that LLMs seem to love to output. I have had to put in m...

polotics • today at 6:09 PM • 2 replies • view on HN

"Surgical "is the kind of wordage that LLMs seem to love to output. I have had to put in my .md file the explicit statement that the word "surgical" should only be used when referring to an actual operation at the block...

Replies

fredmendoza • today at 6:14 PM

you're right, they are tools. that's kind of the point. PAL is a subprocess that runs a python expression. Z3 is a constraint solver. regex is regex. calling them "surgical" is just about when they fire, not what they are. the model generates correctly 90%+ of the time. the guardrails only trigger on the 7 specific patterns we found in the tape. to be clear, the ~8.0 score is the raw model with zero augmentation. no tools, no tricks. just the naive wrapper. the guardrail projections are documented separately. all the code is in the article for anyone who wants to review it.

mrtesthah • today at 6:28 PM

The core issue is that the LLM is using rhetoric to try to convince or persuade you. That's what you need to tell it not to do.

➕ show 1 reply

alt Hacker News

Replies