Hacker News

Pocomon, yesterday at 8:01 PM

> Heretic is a tool that removes censorship (aka "safety alignment") from transformer-based language models without expensive post-training.

I've noticed such "safety alignment" with the current LLMs. It's not just that they insist on providing the orthodox answer; if presented with verifiable facts, they give you nothing. "I'm sorry Dave, but I can't help you with that", or words to that effect.

Also: YouTube keeps automatically erasing rude words. How can you do serious historical research with this nonsense?


Replies

henry2023, today at 2:49 AM

God forbid the advertisers get offended. Better to erase history than to lose some shiny pennies.