Is o3 that much better than o1? It can solve that Arc-AGI benchmark thing at huge compute cost, but even with o1, the main attraction (for me) seems to me that it can spit out giant blocks of code, following huge prompts.
I'm kinda ignorant, but I'm not sure in what way is o3 better.
yes o3 is better, but I would argue it is not yet clear for which cases it is absolutely crucial to use o3 instead of o1.
> It can solve that Arc-AGI benchmark thing at huge compute cost
Considering DeepSeek v3 trained for $5-6M and their R1 API pricing is 30x less than o1, I wouldn’t expect this to hold true for long. Also seems like OpenAI isn’t great at optimization.