logoalt Hacker News

stale2002today at 4:11 AM1 replyview on HN

ok! So if someone uses an existing, checkpointed, open source model then the answer is yes the results are valid and it doesn't matter that the tests are public.


Replies

measurablefunctoday at 4:35 AM

Yes, assuming the checkpoint was before the announcement & public availability of the test set.