It means that model was tuned to to act as chat bot. So write a reply on behalf of assistant and sto...

petu • yesterday at 9:15 PM • 0 replies • view on HN

It means that model was tuned to to act as chat bot. So write a reply on behalf of assistant and stop generating (by inserting special "end of turn" token to signal inference engine to stop generation).

Base model (without instruction/chat tuning) just generates text non stop ("autocomplete on steroids") and text is not necessarily even formatted as chat -- most text in training data isn't dialogue, after all.

alt Hacker News