logoalt Hacker News

Rastonburyyesterday at 8:26 PM0 repliesview on HN

Yeah it's COT is interesting, it was supposedly RL on evaluations and gets paranoid that it's being evaluated and in a simulation. I asked it to critique output from another LLM and told it my colleague produced it, in COT it kept writing "colleague" in quotes as if it didn't believe me which I found amusing