logoalt Hacker News

david_shiyesterday at 8:34 PM0 repliesview on HN

> I believe eval startups can work when they're targeting safety benchmarks specifically.

Are there any examples of successful startups doing this?