Hacker News

olalonde, yesterday at 6:56 PM, 2 replies

Would be cool to have a benchmark with actually unsolved math and science questions, although I suspect models are still quite a long way from that level.


Replies

gowld, yesterday at 9:44 PM

Does folding a protein count? How about increasing performance at Go?