Yeah. I usually do this by telling it to be adversarial and find gaps and holes. Not fool proof but ...

fridder • today at 3:39 PM • 1 reply • view on HN

Yeah. I usually do this by telling it to be adversarial and find gaps and holes. Not fool proof but it does seem to increase the quality. It has helped when using local models in particular.

Replies

SubiculumCode • today at 4:09 PM

Yeah, you have to shortcut the RL-trained people pleasing

alt Hacker News

Replies