Hacker News

bigyabai today at 5:57 PM

Instruction-tuned. It indicates that thinking tokens (e.g. <think> </think>) are not included in training.


Replies

flux3125 today at 6:39 PM

That’s not what it means. "-it" just indicates the model is instruction-tuned, i.e. trained to follow prompts and behave like an assistant. It doesn’t imply anything about whether thinking tokens like <think>...</think> were included or excluded during training. That’s a separate design choice and varies by model.
