logoalt Hacker News

saithoundtoday at 2:53 AM1 replyview on HN

I use Pangram quite extensively (burning through my 600 token allowance every month). They managed to get their false positive rate impressively low: if Pangram says something is 100% AI-written, you can trust that.

But they need to improve their humanizer dataset. Right now, most models can be given system prompts which cause them to emit text classified as 100% human. It looks like their automated humanizers do worse than these system prompts. Or (alarming if so) they chose not to include ones that would make their product look unreliable.


Replies

meander_watertoday at 4:03 AM

GPTZero is much better at handling humanized outputs. Also has a similar false positive rate to Pangram.