My guess? Require them to not do the reinforcement learning on a custom model that implements...

sfink • yesterday at 10:06 PM • 0 replies • view on HN

My guess? Require them to not do the reinforcement learning on a custom model that implements guardrails. I think Anthropic has some of this built in already and couldn't alter it without retraining, but there's tons more layered on top.

alt Hacker News