
Retr0id, last Monday at 5:56 PM

I'm just saying it's epistemically unrigorous to the point of being equivalent to anecdata.


Replies

gchamonlive, last Monday at 6:13 PM

How should one conduct such a rigorously reproducible experiment when LLMs by nature aren't deterministic, and when you no longer have access to the model from months ago that you are comparing against?
