riku_iki • 04/03/2025

> If f(paper) -> code is the weakest part of the chain, it makes sense to target that.

My point is that LLMs may already have seen the solutions on GitHub, so you can't use that benchmark as a metric unless there is some explanation.

kelseyfrog • 04/03/2025

How does that work with knowledge cutoff?
