"Did the vehicle just crash" has a short feedback loop, very amenable to RL. "Did thi...

khafra • today at 10:52 AM • 1 reply • view on HN

"Did the vehicle just crash" has a short feedback loop, very amenable to RL. "Did this product strategy tank our earnings/reputation/compliance/etc" can have a much longer, harder to RL feedback loop.

But maybe not that much longer; METR task length improvement is still straight lines on log graphs.

Replies

dist-epoch • today at 11:03 AM

The AI has read all the business books, blogs and stories.

Unless your CEO is Steve Jobs, it's hard to imagine it being much worse than your average pointy haired boss.

➕ show 2 replies

alt Hacker News

Replies