logoalt Hacker News

softwaredougyesterday at 11:03 PM1 replyview on HN

Isn't the study a year old by now? Things have evolved very quickly in the last few months.


Replies

nkmnzyesterday at 11:20 PM

Yes. No agents, no deep research, no tools, and just Sonnet-3.5 and 3.7 - I’d love to see the same study today with Opus-4.6 and Codex-5.3