Yes. The first step of aligning each and every GPT-based LLM is to suppress the “I am human” kind of...

Diti • today at 12:50 PM • 2 replies • view on HN

Yes. The first step of aligning each and every GPT-based LLM is to suppress the “I am human” kind of responses. It’s baked into the weights.

Gigachad • today at 12:55 PM

Reminds me of old cleverbot conversations where it would always assert it is human and you are the bot.

Trained on previous conversations with people.

Tenoke • today at 12:58 PM

It's also at minimum baked into the system prompt of virtually any LLM.

➕ show 1 reply

alt Hacker News