logoalt Hacker News

b3ingyesterday at 6:55 PM2 repliesview on HN

For coding it’s really bad. Writing is ok, chat is good. It’ll get better but it’s not that close yet


Replies

nullcyesterday at 11:58 PM

Bad is mystifying. Unassisted but for handing it a pile of PDFs of relevant academic papers and my initial codebase, I had hermes agent based on qwen-3.6 27B implement karatsuba multiplication of characteristic-2 polynomials in C++ in an existing codebase with an internal field arithmetic library. It correctly found the 'obvious' optimizations using the field properties. Then I had it implement the recursive halfgcd algorithm for these polynomials using it.

It wrote extensive test cases and validated them with mutation testing (per my standard instructions)-- took many tries getting the algorithms right but with the tests handy it found and fixed the errors.

It's inconceivable to me to call it bad!

jedisct1yesterday at 8:34 PM

Depends on the language and harness, I guess.

It works really well for me, at least for Python and JavaScript, with swival.dev as a harness.

show 1 reply