logoalt Hacker News

cgorllayesterday at 10:29 PM2 repliesview on HN

I checked with the team and it may have been some temporary rate-limiting issue. We've rectified the results, it seems to be an isolated case.

https://www.ctgt.ai/benchmarks


Replies

rancar2today at 2:21 AM

Thanks for the thoroughness! I look forward to the next steps as you all apply this approach in other unique ways to have even better results.

SomaticPirateyesterday at 10:39 PM

Are these benchmarks correct that adding Anthropic's Constitutional AI system prompt lowered results across all the models?