Hacker News

port11, yesterday at 7:53 PM

I’ve had it go through a 50-page PDF of dense, interconnected specs, and it correctly flagged everything that was done, somewhat done, and missing. It went into a lot of detail and explained where the code deviated from the spec.

It felt, at least for me, like an impressive step up. Opus 4.8 was already very thorough, but sadly verbose and ‘loopy’ when you push back on its plans. Fable is what I’d use all day if I could afford it!


Replies

YumpiLumpus, yesterday at 8:31 PM

How do you know if it was done correctly if it's 50 pages of dense specs?
