Hacker News

irthomasthomas, today at 11:28 AM

Claude in Claude Code has been shown to perform consistently worse in evals than Claude with a minimal harness.