gchamonlive • last Monday at 6:13 PM • 1 reply

How should one conduct such a rigorously reproducible experiment when LLMs by nature aren't deterministic, and when you no longer have access to the model from months ago that you are comparing against?

Replies

Retr0id • last Monday at 6:18 PM

Something like this: https://marginlab.ai/trackers/claude-code/ (see methodology section)
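
Not necessarily what the linked tracker does, but as a rough sketch of the general idea: since individual runs aren't deterministic, you can run the same fixed task set many times, log pass counts at the time, and later compare pass rates statistically instead of comparing single outputs. The function name and all counts below are hypothetical, and the two-proportion z-test is just one reasonable choice of test.

```python
import math

def two_proportion_z_test(pass_a, n_a, pass_b, n_b):
    """Return (z, two-sided p-value) for H0: both pass rates are equal.

    pass_a / n_a: passes and total runs recorded in the earlier period.
    pass_b / n_b: passes and total runs recorded in the later period.
    """
    p_a = pass_a / n_a
    p_b = pass_b / n_b
    # Pooled pass rate under the null hypothesis of no change.
    pooled = (pass_a + pass_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    if se == 0:
        # All runs passed or all failed in both periods: no evidence of change.
        return 0.0, 1.0
    z = (p_a - p_b) / se
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value

# Hypothetical counts: 80/100 passes logged months ago vs. 60/100 today.
z, p = two_proportion_z_test(80, 100, 60, 100)
print(f"z = {z:.2f}, p = {p:.4f}")
```

This only works if the old results were recorded back then, which sidesteps not having access to the old model: you compare against logged measurements, not against re-runs.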
