logoalt Hacker News

spacebacontoday at 5:27 PM0 repliesview on HN

On problems this close to active research, seeing the model’s internal reasoning at the points of highest effort is more valuable than pass/fail outcomes alone, which is what SRT-Introspect makes possible on frozen models.

https://github.com/space-bacon/SRT