> to simulate a real user typing
The models return a realtime stream of tokens.
This was already the case before LLMs became a thing. This is still the case for no-intelligence step by step bots.
This was already the case before LLMs became a thing. This is still the case for no-intelligence step by step bots.