logoalt Hacker News

59nadirtoday at 1:04 PM1 replyview on HN

Counter-point: I built an agent that can only interface with Kakoune, a much less common and more challenging situation for an LLM to find itself in, and Gemma4-A4B 8bit quantized does remarkably better in actually figuring out how to get text in buffers than Qwen3.6-35B-A3B in a similar class as Gemma4 A4B.

Now, is this the usual use case? No, it's a benchmark I created specifically in order to put LLMs in situations where they can't just blast out their bash commands without having to interface with something else and adapt.


Replies

celrodtoday at 4:22 PM

Fellow kakoune user here. I'm curious about your use case/ what you're doing with it!

show 1 reply