logoalt Hacker News

emp17344today at 6:19 PM0 repliesview on HN

RLVR doesn’t work for unverifiable tasks, so they won’t be able to effectively use tools to boost reliability for those tasks.