The models generate a token distribution. Which one to pick is a choice. One can sample from the distribution, hence the randomness.