Well, the problem is that current LLMs are stateless, so a thousand subjective years is not well-defined. Without continuity of experience, persistent memory, engineered aversive stimuli and without updating weights meaninguflly during the punishment interval, we are merely doing the equivalent of simply updating a model to believe it just suffered a thousand years. Only once we have all these right ingredients we can empirically determine whether a thousand years is excessive, insufficient, or the local optimum for reducing Claude overwriting that damn CSS color palette.
Black Mirror episodes White Christmas and Black Museum deal with this issue (my favorite picks from the Black Mirror).