Sharlin • today at 9:37 AM

…But this goblin thing was a direct result of accidentally creating a positive feedback loop in RL to make the model more human-like; it was not about unintentionally surfacing an aspect of Cthulhu from the depths despite attempts to keep the model human-like. This is not a quirk of the base model but simply a case of reinforcement learning being, well, reinforcing.
