Stupid question, but are there models worth using that specialize in a particular programming langua...

01100011 • today at 2:25 AM • 2 replies • view on HN

Stupid question, but are there models worth using that specialize in a particular programming language? For instance, I'd love to be able to run a local model on my GPU that is specific to C/C++ or Python. If such a thing exists, is it worth it vs one of the cloud-based frontier models?

I'm guessing that a model which only covers a single language might be more compact and efficient vs a model trained across many languages and non-programming data.

Replies

girvo • today at 3:17 AM

I'm currently experimenting with (trying to) fine tune Qwen3.5 to make it better at a given language (Nim in this case); but I am quite bad at this, and honestly am unsure if it's even really fully feasible at the scale I have access to. Certainly been fun so far though, and I have a little Asus GX10 box on the way to experiment some more!

cpburns2009 • today at 2:27 AM

I'd be interested in this too. I think that's what post-training can achieve but I've never looked into it.

alt Hacker News

Replies