I wouldn't have expected there to be enough text from before 1913 to properly train a model; it seemed like an internet's worth of text was needed to train the first successful LLMs?
This model is more comparable to GPT-2 than to anything we use now.