logoalt Hacker News

ai_slop_hateryesterday at 9:05 PM1 replyview on HN

Does that mean we should use a larger model as judge for evals, not a smaller one?


Replies

dist-epochyesterday at 9:54 PM

That was always the advice. Use the best model you can afford.

But some problems are easy and you can get away with a smaller model.