logoalt Hacker News

XCSmetoday at 3:11 PM0 repliesview on HN

Also Claude/Fable models are quite bad at instructions following: https://artificialanalysis.ai/evaluations/ifbench