I tried it on OpenRouter with max tokens set to 8192, and every response is truncated, even in non-thinking mode. Maybe there's an issue with the deployment, but your link also shows it generating tons of output tokens.
Oh yeah, I just noticed, like 3x the reasoning tokens.