Hacker News

tarr11, yesterday at 6:16 PM (1 reply)

What do you think this particular prompt is evaluating for?

The more popular these particular evals become, the more likely it is that the model will be trained on them.


Replies