Surprised models still output tools as text when for ages we’ve been able to constrain the output at...

aetherspawn • yesterday at 11:22 PM • 3 replies

Surprised models still output tool calls as text when for ages we’ve been able to constrain the output at the inference engine level and restrict the model to the tools, parameters, etc. that are actually available.

Edit: found it, it’s called Grammar-Constrained Decoding (GCD)
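
For anyone who hasn't seen it, here is a minimal toy sketch of the idea: at each step, mask every token the grammar doesn't allow, then pick from what's left. The vocabulary, the `GRAMMAR` state machine, and the `fake_logits` stand-in model are all made up for illustration; real engines do this over the full tokenizer vocabulary with a proper grammar or JSON schema.

```python
import math

# Toy vocabulary (assumption); a real tokenizer has tens of thousands of tokens.
VOCAB = ['{"tool":', '"search"', '"calculator"', '}', 'Sure,', 'hello']

# Hypothetical grammar as a state machine: state -> {allowed token: next state}.
GRAMMAR = {
    "start": {'{"tool":': "name"},
    "name": {'"search"': "close", '"calculator"': "close"},
    "close": {'}': "done"},
}

def constrained_decode(logits_fn, max_steps=10):
    """Greedy decoding where grammar-disallowed tokens are masked to -inf."""
    state, out = "start", []
    for _ in range(max_steps):
        if state == "done":
            break
        allowed = GRAMMAR[state]
        logits = logits_fn(out)
        # Keep the model's score for allowed tokens, rule out everything else.
        masked = [l if tok in allowed else -math.inf
                  for tok, l in zip(VOCAB, logits)]
        tok = VOCAB[max(range(len(VOCAB)), key=masked.__getitem__)]
        out.append(tok)
        state = allowed[tok]
    return "".join(out)

def fake_logits(prefix):
    # Stand-in model that would rather chat ('Sure,') than emit tool syntax.
    return [0.1, 0.5, 0.4, 0.2, 2.0, 1.5]

print(constrained_decode(fake_logits))  # {"tool":"search"}
```

Unconstrained, this fake model would open with 'Sure,'. With the mask it can only ever produce a well-formed call to one of the two listed tools.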

Replies

jdiff • today at 1:56 AM

I imagine the challenge comes from recognizing that your model is trying to call a tool before it actually has and only constraining output then. Running a separate pass for an optionally-empty list of tools afterwards may work, but maybe constraining its output like that causes many spurious tool calls.
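
A rough sketch of the "only constrain once it's started a call" approach, assuming the model marks a call with a trigger token. The `<tool_call>` marker, the toy vocabulary, and the `scripted_logits` stand-in model are all hypothetical. Decoding runs freely until the trigger appears, applies the mask for the body of the call, then goes back to free text.

```python
import math

# Toy vocabulary (assumption), including a hypothetical trigger token.
VOCAB = ["Let", " me", " check.", "<tool_call>",
         '{"tool":"search"}', "</tool_call>", "<eos>"]

# Tokens permitted at each position after the trigger has been emitted.
CALL_GRAMMAR = [{'{"tool":"search"}'}, {"</tool_call>"}]

def decode(logits_fn, max_steps=20):
    out, call_pos = [], None  # call_pos is None while decoding freely
    for _ in range(max_steps):
        logits = logits_fn(out)
        if call_pos is not None:
            # Inside a tool call: mask everything the grammar disallows.
            allowed = CALL_GRAMMAR[call_pos]
            logits = [l if t in allowed else -math.inf
                      for t, l in zip(VOCAB, logits)]
        tok = VOCAB[max(range(len(VOCAB)), key=logits.__getitem__)]
        if tok == "<eos>":
            break
        out.append(tok)
        if call_pos is None:
            if tok == "<tool_call>":
                call_pos = 0  # trigger seen: start constraining
        else:
            call_pos += 1
            if call_pos == len(CALL_GRAMMAR):
                call_pos = None  # call finished: back to free text
    return out

def scripted_logits(prefix):
    # Stand-in model: follows a script, but inside the call it "wants" prose.
    script = ["Let", " me", " check.", "<tool_call>",
              " me", "</tool_call>", "<eos>"]
    want = script[min(len(prefix), len(script) - 1)]
    return [2.0 if t == want else (1.0 if t == '{"tool":"search"}' else 0.0)
            for t in VOCAB]

print(decode(scripted_logits))
```

This sidesteps the spurious-call worry, since the model still decides for itself whether to emit the trigger. It does nothing for a model that starts a call without the marker.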

miketery • today at 2:20 AM

Some model providers do constrain the output when you use json_schema: true (e.g. with_structured_output).

CompleteSkeptic • today at 1:39 AM

constrained decoding tends to make models dumber - this is why it's rarely used
