Pretty sure we're talking about the output text, not the tensors.
These LLM replies are really getting annoying.