Is the approach fundamentally limited to smaller models? Or could you theoretically train a model as...

nayroclade • today at 2:54 AM • 0 replies • view on HN

Is the approach fundamentally limited to smaller models? Or could you theoretically train a model as powerful as the largest models, but much faster?

alt Hacker News