
alyxya • today at 1:21 AM

Fundamentally, I don't believe second-order methods get better data efficiency by themselves, but changes to the optimizer can, because they change the convergence behavior. ML theory lags behind the results seen in practice.
