logoalt Hacker News

miroljubtoday at 2:42 PM5 repliesview on HN

Solves? It's a part of the training set. Nothing more, nothing less.


Replies

rpdillontoday at 2:49 PM

Opening sentences:

> Shock! Shock! I learned yesterday that an open problem I’d been working on for several weeks had just been solved by Claude Opus 4.6— Anthropic’s hybrid reasoning model that had been released three weeks earlier! It seems that I’ll have to revise my opinions about “generative AI” one of these days. What a joy it is to learn not only that my conjecture has a nice solution but also to celebrate this dramatic advance in automatic deduction and creative problem solving.

show 1 reply
allreducetoday at 4:30 PM

I encourage you to look at what the current models with a bit of harnessing are capable of, e.g. Opus 4.6 and Claude Code. Try to make it solve some mathematics-heavy problem you come up with. If only to get a more accurate picture of whats going on.

Unfortunately, these tools generalize way beyond regurgitating the training set. I would not assume they stay below human capabilities in the next few years.

Why any moral person would continue building these at this point I don't know. I guess in the best case the future will have a small privileged class of humans having total power, without need for human workers or soldiers. Picture a mechanical boot stomping on a human face forever.

nemo1618today at 4:29 PM

If this was a joke, it certainly flew over most people's heads...

jcimstoday at 3:08 PM

Prove it.

show 1 reply
mwigdahltoday at 2:46 PM

Did you read the article? It was an open problem.

show 1 reply