logoalt Hacker News

Akranazontoday at 8:26 AM1 replyview on HN

Detecting LLM-generated text is basically solved by modern watermarking techniques (https://arxiv.org/abs/2306.09194). However, the main trouble with watermark-based approaches is that you have to get every LLM provider to adopt it. A student trying to cheat could always opt for some open-weight Chinese model, if the word spreads that the major providers are compromised.


Replies

yorwbatoday at 8:59 AM

Section 6, "Removing Watermarks," of the paper you cite makes it very clear that detecting LLM-generated text is not solved if the user takes measures to avoid detection.