No, there are more training tokens than parameters in LLMs. They are in the classical first descent ...

mxwsn • today at 4:25 PM • 0 replies • view on HN

No, there are more training tokens than parameters in LLMs. They are in the classical first descent setting.

alt Hacker News