Someone here mentioned a whole ago that the labs deliberately haven't tried to train these char...

sidpatil • yesterday at 9:01 PM • 1 reply • view on HN

Someone here mentioned a whole ago that the labs deliberately haven't tried to train these characteristics out of their models, because leaving them in makes it easier to identify, and therefore exclude, LLM-generated text from their training corpus.

Replies

blymphony • yesterday at 10:24 PM

But it's odd that these characteristics are the same across models from different labs. I find it hard to believe that researchers across competing companies are coordinating on something like that.

alt Hacker News

Replies