Interesting. Would love a comparison to pi.dev (Not Ohmypi) How does this perform in day to day c...

Mashimo • today at 1:18 PM • 1 reply • view on HN

Interesting. Would love a comparison to pi.dev (Not Ohmypi)

How does this perform in day to day coding tasks, outside of benchmarks?

Replies

GodelNumbering • today at 1:24 PM

https://github.com/dirac-run/dirac#-evals

README has eval of 8 tasks over 7 agents (including both pi and omp). Pi-mono costs second lowest across the 8 tasks (after Dirac) but occasionally misses produces incomplete changes.

Interestingly, 2 tasks where pi missed some changes both were the tasks that benefitted from AST symbol understanding (e.g. find all instances of things that refer to this symbol and change those things). Since pi relies on bash type tooling, it missed some occurrences

➕ show 2 replies

alt Hacker News

Replies