> So maybe the AI labs have been paying attention after all! > I think this mainly demonstra...

_puk • today at 4:14 AM • 1 reply • view on HN

> So maybe the AI labs have been paying attention after all!

> I think this mainly demonstrates that the pelican on the bicycle has firmly exceeded its limits as a useful benchmark.

As acknowledged in the article.

kzrdude • today at 4:51 AM

Gemini 3.1 basically takes it home on that benchmark, anyway, it's done.

alt Hacker News