Hacker News

pawelk411 today at 7:15 AM

Yeah, but that's literally above ASI, let alone AGI. The average human scores <1% on this bench; Opus scores 97.1% when given actual vision access, which means AGI was achieved long ago.


Replies

vova_hn2 today at 11:16 AM

> Opus scores 97.1% when given actual vision access

Do you have a source for this? I would be very curious to see how top models do with vision.
