>DNNs/LLMs can only predict next tokens based on training data.
How do they decide between using 'a' or 'an'?
I don't get the argument; how do you decide between using 'a' or 'an'?
They pick a random top-k next token based on their amazing 4chan/Reddit training data, duh.
I don't get the argument; how do you decide between using 'a' or 'an'?
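Since top-k came up: here is a minimal sketch of what top-k sampling does, assuming a toy set of logits. The numbers, the context "I ate", and the `top_k_sample` helper are all made up for illustration and do not come from any real model or library. The idea it shows is that 'a' vs 'an' is just another next-token choice: the model scores every candidate token given the context so far, and the sampler draws from the k highest-scoring ones.

```python
import math
import random

def top_k_sample(logits, k, rng=random):
    """Sample one token from the k highest-scoring entries of `logits`."""
    # Keep only the k tokens with the largest logits.
    top = sorted(logits.items(), key=lambda kv: kv[1], reverse=True)[:k]
    # Softmax over the survivors (subtract the max for numerical stability).
    m = max(score for _, score in top)
    weights = [math.exp(score - m) for _, score in top]
    tokens = [tok for tok, _ in top]
    return rng.choices(tokens, weights=weights, k=1)[0]

# Hypothetical logits for the token after "I ate"; not from a real model.
toy_logits = {"an": 2.0, "a": 1.5, "the": 1.0, "my": 0.2, "purple": -3.0}

rng = random.Random(0)  # fixed seed so the run is repeatable
samples = [top_k_sample(toy_logits, k=2, rng=rng) for _ in range(1000)]
print({tok: samples.count(tok) for tok in ("a", "an")})
```

With k=2 only "an" and "a" can ever come out, roughly in proportion to their softmax weights. In a real model those scores would depend on the whole preceding context, which is where any preference for one article over the other would have to come from.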