danoandco • yesterday at 6:43 PM

Yes, this is the pass@k metric from code generation research. I found the relevant paper, "Evaluating Large Language Models Trained on Code" (Chen et al., 2021), which introduced the metric.
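
For reference, here is a minimal sketch of the unbiased pass@k estimator described in Chen et al. (2021): generate n samples per problem, count the c samples that pass the tests, and estimate the probability that at least one of k randomly drawn samples passes. The function name and the argument checks are my own choices, and this says nothing about how any particular product applies the metric.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k as 1 - C(n - c, k) / C(n, k).

    n: total samples generated for one problem
    c: number of those samples that passed the tests
    k: number of samples we are allowed to draw
    """
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # Fewer than k failing samples exist, so any draw of k contains a pass.
    if n - c < k:
        return 1.0
    # Probability that all k drawn samples fail, subtracted from 1.
    return 1.0 - comb(n - c, k) / comb(n, k)
```

For example, with 10 samples of which 3 pass, `pass_at_k(10, 3, 1)` comes out to about 0.3. To report a benchmark score, you would average this value over all problems.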

Replies

hmokiguess • yesterday at 7:15 PM

Interesting, and how does Twill use it in that feature?
