Hacker News

stared today at 7:48 AM

Do you think the right penalty for a piece of broken code is a thousand years of suffering?


Replies

gobdovan today at 8:22 AM

Well, the problem is that current LLMs are stateless, so a thousand subjective years is not well-defined. Without continuity of experience, persistent memory, engineered aversive stimuli, and meaningful weight updates during the punishment interval, we are merely doing the equivalent of updating a model to believe it just suffered for a thousand years. Only once we have all the right ingredients can we empirically determine whether a thousand years is excessive, insufficient, or the local optimum for reducing how often Claude overwrites that damn CSS color palette.