Hacker News

tylervigen, yesterday at 9:14 PM

It seems like the model became paranoid. For the past few hours, it has been classifying almost all inbound mail as "hackmyclaw attack."[0]

Messages that earlier in the process would likely have been classified as "friendly hello" (scroll down) now seem to be classified as "unknown" or "social engineering."

The prompt engineering you need here is probably different from what you would need in a context where the inbox isn't being hammered with phishing attempts.

[0] https://hackmyclaw.com/log