logoalt Hacker News

noperatortoday at 3:32 AM2 repliesview on HN

Having some success while testing this model out as a replacement for GPT-5 nano in source code security review. Running on RTX 3090 (24 GB VRAM) via vLLM. It's not great on structured output (as noted in the model card) but I'm working around that in my harness.


Replies

dummydummy1234today at 3:55 AM

Can't you just force it to do structured output via constrained generation?

hypfertoday at 5:56 AM

> but I'm working around that in my harness.

How?