logoalt Hacker News

nolist_policyyesterday at 7:17 PM0 repliesview on HN

Use the it versions. The other versions are base models without post-training. E.g. base models are trained to regurgitate raw wikipedia, books, etc. Then these base models are post-trained into instruction-tuned models where they learn to act as a chat assistant.