logoalt Hacker News

lostmsutoday at 2:20 PM1 replyview on HN

How's the reproducibility of the results? Like avg score of 10 runs vs original.


Replies

dnhkngtoday at 2:56 PM

Author here: The code is up on GitHub.

The probes I used seem to help identify good configurations, but are quite noisey. A small probe set was initially used to make the scan tractable, and then the higher ranked models were retested on a set ~10x larger.