retinaros • yesterday at 10:39 PM

Every AI lab trains on the test set. That is a big part of why we see benchmark scores climb from 1% to 30% after a few model iterations.

latentsea • today at 3:57 AM

Models themselves definitely aren't getting better.
