It seems like you don’t understand reinforcement learning. The signal is reinforced because it corre...

catigula • 12/08/2025 • 0 replies • view on HN

It seems like you don’t understand reinforcement learning. The signal is reinforced because it correlates to behavior, hacking the signal itself is misalignment.

alt Hacker News