Hacker News

jerpint · today at 2:52 PM

I did a post [0] about this last year, and vanilla LLMs didn't do nearly as well as I'd expected on Advent of Code, though I'd be curious to try this again with Claude Code and Codex.

[0] https://www.jerpint.io/blog/2024-12-30-advent-of-code-llms/


Replies

the_duke · today at 4:27 PM

LLMs, and especially coding-focused models, have come a very long way in the past year.

The difference when working on larger tasks that require reasoning is night and day.

In theory it would be very interesting to go back and retry the 2024 tasks, but those have likely ended up in the training data by now...
