logoalt Hacker News

pu_petoday at 9:55 AM1 replyview on HN

GPT-J was the one that made me really interested in LLMs, as I could run it on a 3090.

Some details on the timeline are not quite precise, and would benefit from linking to a source so that everyone can verify it. For example, HyperClOVA is listed as 204B parameters, but it seems it used 560B parameters (https://aclanthology.org/2021.emnlp-main.274/).


Replies

ai_bottoday at 9:59 AM

Great idea! Thanks