For open-weights models, censorship removal is now a "solved" problem. If you wait a few d...

hleszek • today at 6:11 PM • 1 reply • view on HN

For open-weights models, censorship removal is now a "solved" problem. If you wait a few days after a new model release, someone will have made a heretic ( https://github.com/p-e-w/heretic ) version with the censorship removed, so in a way the only use for censorship now is to avoid lawsuits, not reduce improper usage.

Replies

jakkos • today at 6:17 PM

Any time I've tried an "abliterated" model, heretic or other, it has always damaged the capabilities of the original model and will still often refuse or produce garbage at a lot of "unsafe" requests.

➕ show 1 reply

alt Hacker News

Replies