Hacker News

lacoolj last Thursday at 7:01 PM

lol I love how OpenAI just straight up doesn't compare their model to others on these release pages. Basically telling us they know Gemini and Opus are better but they don't want to draw attention to it


Replies

qwesr123 last Thursday at 7:06 PM

Not sure why they don't compare with others, but they are actually leading on the benchmarks they published. See here (bottom) for a chart comparing to other models: https://marginlab.ai/blog/swe-bench-deep-dive/

dbbk last Thursday at 8:14 PM

This was the one thing I scanned for. No comparison against Opus. See ya.
