Hacker News

adriancooney, yesterday at 2:10 PM

What's the error rate of the pen guy?

Also, if your AI has a 20% error rate, you're not holding it right. You need to spend more time keeping it on rails: unit tests, integration tests, e2e tests, local dev + browser use, preview deployments, staging environments, phased rollouts, AI PR reviews, rolling releases. With those in place, the error rate will be much closer to 0%.


Replies

davebren, yesterday at 3:21 PM

How does a phased rollout improve LLM error rates exactly?
