logoalt Hacker News

KeplerBoyyesterday at 12:47 PM0 repliesview on HN

Sure, one could think of some kind of pipeline parallelism where you only need a fast transfer to the next step in the model and that would boost throughput but not increase model size.