I can totally see "it's not really AGI because it doesn't consistently outperform those three top 0.000001% outlier human experts yet if they work together".
It'll be a while until the ability to move the goalposts of "actual intelligence" is exhausted entirely.
Well right now, my niece of 7 years outperforms all LLM contenders in drawing a Pelican on a bicycle