LUmBULtERA • yesterday at 1:53 PM • 1 reply

I've been testing M3 for agentic tasks on Hermes and it just gets way too confused. I get really poor results from it compared to GPT-5.4 mini/regular or GLM-5.2 (and even 5.1).

Replies

stevenhubertron • yesterday at 3:01 PM

This has been my experience as well, to the letter.
