In some sense, AI should be the most capable at doing this within math. Literally the entire domain ...

semi-extrinsic • 02/19/2025 • 1 reply • view on HN

In some sense, AI should be the most capable at doing this within math. Literally the entire domain in its entirety can be tokenized. There are no experiments required to verify anything, just theorem-lemma-proof ad nauseam.

Doing this like in this test, it's very tricky to rule out the hypothesis that the AI is just combining statements from the Discussion / Future Outlook sections of some previous work in the field.

Replies

theptip • 02/19/2025

Math seems to me like the hardest thing for LLMs to do. It requires going deep with high IQ symbol manipulation. The case for LLMs is currently where new discoveries can be made from interpolation or perhaps extrapolation between existing data points in a broad corpus which is challenging for humans to absorb.

➕ show 3 replies

alt Hacker News

Replies