So distribute copies of the model in RAM to multiple machines, have each machine update different parts of the model weights, and sync updates over the network
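a toy, single-process sketch of that idea, with everything (machine count, model size, the update rule) made up for illustration — each "machine" holds a full copy of the weights, updates only its assigned shard, then "syncs over the network" (here, plain function calls) so every copy ends up identical:

```python
NUM_MACHINES = 4
MODEL_SIZE = 8  # weights per model copy
SHARD = MODEL_SIZE // NUM_MACHINES

# every machine starts with an identical copy of the model in "RAM"
replicas = [[0.0] * MODEL_SIZE for _ in range(NUM_MACHINES)]

def local_update(machine_id):
    """Each machine updates only its own shard of the weights."""
    start = machine_id * SHARD
    for i in range(start, start + SHARD):
        replicas[machine_id][i] += 1.0  # stand-in for a gradient step

def sync():
    """Broadcast each machine's shard to every other replica."""
    for machine_id in range(NUM_MACHINES):
        start = machine_id * SHARD
        for other in range(NUM_MACHINES):
            replicas[other][start:start + SHARD] = \
                replicas[machine_id][start:start + SHARD]

for machine_id in range(NUM_MACHINES):
    local_update(machine_id)
sync()

# after syncing, every replica is identical and contains all updates
assert all(r == replicas[0] for r in replicas)
```

in a real system the sync step is the expensive part — it's network traffic between machines, not a local copy — which is why the economics change depending on what hardware each machine needs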
decentralized training makes a lot more sense when the required hardware isn't a $40K GPU...