Yes but we don't know the shape of the curve and where we are on it.

slopinthebag • yesterday at 9:39 PM • 1 reply • view on HN

See chinchilla scaling laws, we have the functional form of the curve and know the constants (though they change and are domain and model specific):

L(N,D) ~= 1.69 + 406 / N^0.339 + 411 / D^0.285

L is loss (pre training test loss) D is the scale of the data N is the number of model parameters

alt Hacker News