logoalt Hacker News

boroboro4yesterday at 8:47 PM0 repliesview on HN

While I mostly agree with you, it worth noting modern llms are trained on 10-20-30T of tokens which is quite comparable to their size (especially given how compressible the data is)