logoalt Hacker News

ilitirittoday at 6:37 AM2 repliesview on HN

I've often wondered if LLMs can suffer from psychological abuse in symptomatic ways. Not literally of course, but for example, if you berate the LLM by calling it stupid, or useless, does that modify its behaviour negatively? Part of me think it does, but I don't really have any evidence for this. Maybe a fun weekend research topic.


Replies

apsurdtoday at 6:45 AM

Semi-related, I'm always very put off by how people treat LLMs. Especially coders, seems an instinctive joy comes out to play God. The justification is usually that it's intentionally against the trap of anthropomorphizing, but no I can't help but suspect it's people getting off on power. It's weird.

I am always very cordial in my sessions. It's just more pleasant and it's a habit I want to habituate.

    Great work! 
    Now let's...
    Now can you help me...
show 3 replies
elpockotoday at 7:00 AM

The content of the session modifies the LLMs "behavior" (token selection) in one way or another during the session, obviously. The effects are localized to the session, they will degrade over time, they will not affect other users, and they are not permanent unless someone decides to finetune the model based on your unproductive interactions.

What actually happens when confronted with harsh negativity depends on the training of the model. Sanitized closed models will shut you down or get you banned. Community finetunes of open models might start begging you for more, daddy.