Modern reasoning models are actually pretty good at arithmetic and almost certainly would have caugh...

gordonhart • yesterday at 1:29 PM • 1 reply • view on HN

Modern reasoning models are actually pretty good at arithmetic and almost certainly would have caught this error if asked.

Source: we benchmark this sort of stuff at my company and for the past year or so frontier models with a modest reasoning budget typically succeed at arithmetic problems (except for multiplication/division problems with many decimal places, which this isn't).

Replies

RobotToaster • yesterday at 1:42 PM

Interesting, how have you found they have been performing at more complex things like calculus and analysis?

➕ show 1 reply

alt Hacker News

Replies