For coding it’s really bad. Writing is ok, chat is good. It’ll get better but it’s not that close ye...

b3ing • yesterday at 6:55 PM • 2 replies • view on HN

For coding it’s really bad. Writing is ok, chat is good. It’ll get better but it’s not that close yet

Replies

Bad is mystifying. Unassisted but for handing it a pile of PDFs of relevant academic papers and my initial codebase, I had hermes agent based on qwen-3.6 27B implement karatsuba multiplication of characteristic-2 polynomials in C++ in an existing codebase with an internal field arithmetic library. It correctly found the 'obvious' optimizations using the field properties. Then I had it implement the recursive halfgcd algorithm for these polynomials using it.

It wrote extensive test cases and validated them with mutation testing (per my standard instructions)-- took many tries getting the algorithms right but with the tests handy it found and fixed the errors.

It's inconceivable to me to call it bad!

jedisct1 • yesterday at 8:34 PM

Depends on the language and harness, I guess.

It works really well for me, at least for Python and JavaScript, with swival.dev as a harness.

➕ show 1 reply

alt Hacker News

Replies