Hacker News

wmf today at 8:27 AM

You still need to do a forward pass per token. With massive batching and full pipelining, you might be able to break the dependencies and output one token per cycle, but clearly they aren't doing that.
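To make the dependency argument concrete, here is a toy cycle-count sketch, not anyone's actual design. It assumes a hypothetical decoder split into `num_stages` pipeline stages of one cycle each, where a sequence can only start its next token after the previous token has left the last stage. The name `simulate` and all of its parameters are made up for illustration.

```python
def simulate(num_stages, batch_size, tokens_per_seq):
    """Toy model of a pipelined autoregressive decoder.

    Assumptions (all hypothetical): each stage takes one cycle, stage 0
    accepts at most one token per cycle, and a sequence may not issue its
    next token until its previous token has cleared every stage.
    Returns (total_cycles, tokens_emitted).
    """
    ready_at = [0] * batch_size          # earliest cycle each sequence may issue
    remaining = [tokens_per_seq] * batch_size
    cycle = 0
    emitted = 0
    last_finish = 0
    while any(remaining):
        # Issue one token from the first sequence whose dependency is resolved.
        for i in range(batch_size):
            if remaining[i] and ready_at[i] <= cycle:
                finish = cycle + num_stages   # token leaves the last stage here
                ready_at[i] = finish          # next token must wait for this one
                remaining[i] -= 1
                emitted += 1
                last_finish = finish
                break
        cycle += 1                            # idle cycle (bubble) if nothing issued
    return last_finish, emitted


if __name__ == "__main__":
    for batch in (1, 4):
        cycles, tokens = simulate(num_stages=4, batch_size=batch, tokens_per_seq=8)
        print(f"batch={batch}: {tokens} tokens in {cycles} cycles "
              f"({tokens / cycles:.2f} tokens/cycle)")
```

In this model a single sequence through 4 stages gets 8 tokens in 32 cycles, because the pipeline sits idle while each token waits on the previous one. Interleaving 4 independent sequences gets 32 tokens in 35 cycles, which approaches one token per cycle once the batch is at least as large as the pipeline depth.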


Replies

amelius today at 11:19 AM

More aggressive pipelining will probably be the next step.