Less than a year to destroy Arc-AGI-2 - wow.
It's still useful as a benchmark of cost/efficiency.
It's a useless meaningless benchmark though, it just got a catchy name, as in, if the models solve this it means they have "AGI", which is clearly rubbish.
Arc-AGI score isn't correlated with anything useful.
I unironically believe that arc-agi-3 will have a introduction to solved time of 1 month