A year ago it couldn't do tasks like this at all, what makes you beleive it can progress only this far but no further?
Random number generators can't solve open math problems, but it looks like AI agents can? [1]
[1] https://www-cs-faculty.stanford.edu/~knuth/papers/claude-cyc...