I use Pangram quite extensively (burning through my 600 token allowance every month). They managed t...

saithound • today at 2:53 AM • 1 reply • view on HN

I use Pangram quite extensively (burning through my 600 token allowance every month). They managed to get their false positive rate impressively low: if Pangram says something is 100% AI-written, you can trust that.

But they need to improve their humanizer dataset. Right now, most models can be given system prompts which cause them to emit text classified as 100% human. It looks like their automated humanizers do worse than these system prompts. Or (alarming if so) they chose not to include ones that would make their product look unreliable.

Replies

meander_water • today at 4:03 AM

GPTZero is much better at handling humanized outputs. Also has a similar false positive rate to Pangram.

alt Hacker News

Replies