ifwinterco • yesterday at 7:54 PM • 4 replies

On benchmarks, GPT 5.2 was roughly equivalent to Opus 4.5, but most people who've used both for SWE stuff would say that Opus 4.5 is/was noticeably better.

Replies

CraigJPerry • yesterday at 8:57 PM

There's an extended thinking mode for GPT 5.2; I forget the name of it right at this minute. It's super slow - a 3 minute Opus 4.5 prompt takes circa 12 minutes to complete in 5.2 on that super extended thinking mode - but it is not a close race in terms of results: GPT 5.2 wins by a handy margin in that mode. It's just too slow to be usable interactively though.

elAhmo • yesterday at 8:00 PM

I've mostly used Sonnet/Opus 4.x over the past months, but 5.2 Codex seemed to be on par or better for my use case in the past month. I tried a few models here and there and always went back to Claude, but with 5.2 Codex, for the first time, I felt it was very competitive, if not better.

Curious to see how things will be with 5.3 and 4.6

georgeven • yesterday at 8:04 PM

Interesting. Everyone in my circle said the opposite.

SatvikBeri • yesterday at 10:05 PM

I pretty consistently heard people say Codex was much slower but produced better results, making it better for long-running work in the background, and worse for more interactive development.
