logoalt Hacker News

dominotwyesterday at 10:42 PM0 repliesview on HN

even anthropic uses 'user reports' in alignment system card.

Do they lack "testing strategy" to test their own alignment?

Can you share the you testing strategies that are letting you plug and play models.