Most inference engines would return the reasoning tokens though, wouldn't you see that the reasoning_content (or whatever your engine calls it) was filled while content wasn't?
Yeah, I had been ignoring the reasoning tokens for the summarize call
Yeah, I had been ignoring the reasoning tokens for the summarize call