Yes. Pretraining and fine-tuning use standard Adam optimizers (usually with weight-decay). Reinforce...

samsartor • yesterday at 5:10 PM • 0 replies • view on HN

Yes. Pretraining and fine-tuning use standard Adam optimizers (usually with weight-decay). Reinforcement learning has been the odd-man out historically, but these days almost all RL algorithms also use backprop and gradient descent.

alt Hacker News