emp17344 • today at 5:18 PM • 1 reply

Hold your horses, that’s a long way off. The best math AI tool we currently have, Aletheia, solved only 13 of the 700 open Erdős problems it attempted, and only 4 of those autonomously: https://arxiv.org/html/2601.22401v3

Clearly, these models still struggle with novel problems.

Replies

slibhb • today at 6:17 PM

> Clearly, these models still struggle with novel problems.

Do they struggle with novel problems more or less than humans?
