logoalt Hacker News

manojldsyesterday at 8:41 PM1 replyview on HN

I was talking about enterprise agents and then realized the question is more about coding agents.


Replies

sanderjdyesterday at 8:53 PM

Ah I see! Yes, I was talking about a coding harness, not an enterprise agent. I entirely agree with you that your suggestion of driving it via evals is the right thing for that use case!