Unless LLMs architecture have changed, that is exactly what they are doing. You might need to...

csomar • today at 8:28 AM • 1 reply • view on HN

Unless LLMs architecture have changed, that is exactly what they are doing. You might need to learn more how LLMs work.

Replies

andy12_ • today at 9:47 AM

Unless the LLM is a base model or just a finetuned base model, it definitely doesn't predict words just based on how likely they are in similar sentences it was trained on. Reinforcement learning is a thing and all models nowadays are extensively trained with it.

If anything, they predict words based on a heuristic ensemble of what word is most likely to come next in similar sentences and what word is most likely to give a final higher reward.

➕ show 2 replies

alt Hacker News

Replies