It is possible to compute the approximate gradient (direction to step) without using the formulas: w...

macleginn • yesterday at 10:45 AM • 0 replies • view on HN

It is possible to compute the approximate gradient (direction to step) without using the formulas: we can change the value of each parameter individually, compute the loss, set the values of all parameters in such a way that the loss is minimized, and then repeat. This means, however, that we have to do number-of-parameters forward passes for one optimization step, which is very expensive. With formulas, we can compute all these values in one backward pass.

alt Hacker News