Instruction Tuned. It indicates that thinking tokens (eg <think> </think>) are not ...

bigyabai • today at 5:57 PM • 2 replies • view on HN

Instruction Tuned. It indicates that thinking tokens (eg <think> </think>) are not included in training.

Replies

That’s not what it means. "-it" just indicates the model is instruction-tuned, i.e. trained to follow prompts and behave like an assistant. It doesn’t imply anything about whether thinking tokens like <think>....</think> were included or excluded during training. Thats a separate design choice and varies by model.

➕ show 1 reply

alt Hacker News

Replies