logoalt Hacker News

bandramitoday at 5:18 AM0 repliesview on HN

Very cool. Claude failed hard on this a few months ago. Gemma and phi have gotten better at it in recent versions, too, though qwen is still confidently getting it wrong.