Hacker News

pests, yesterday at 9:31 PM

You're doing operations on the data once it's been transferred to GPU memory: either shuffling it around between the various caches and processors, or feeding it into tensor cores or other matrix operations. You don't want the GPU sitting idle while it waits on memory.
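To put a rough number on the "sitting idle" part, here's a back-of-the-envelope sketch. It assumes a simplified model where memory traffic and math overlap perfectly, so a step takes as long as the slower of the two. The function names and all the figures in the example (bytes moved, FLOPs, bandwidth, peak throughput) are made-up placeholders for illustration, not specs of any real GPU.

```python
def step_times(bytes_moved, flops, mem_bandwidth, peak_flops):
    """Return (memory time, compute time) in seconds for one step.

    bytes_moved   -- bytes read/written from GPU memory in the step
    flops         -- floating-point operations performed in the step
    mem_bandwidth -- memory bandwidth in bytes per second (assumed value)
    peak_flops    -- peak compute throughput in FLOP/s (assumed value)
    """
    t_mem = bytes_moved / mem_bandwidth
    t_compute = flops / peak_flops
    return t_mem, t_compute


def compute_idle_fraction(t_mem, t_compute):
    """Fraction of the step the compute units spend waiting on memory,
    assuming memory traffic and math overlap perfectly."""
    total = max(t_mem, t_compute)
    return (total - t_compute) / total


if __name__ == "__main__":
    # Placeholder numbers, chosen only to make the arithmetic easy to follow.
    t_mem, t_compute = step_times(
        bytes_moved=2e9,      # 2 GB touched per step (hypothetical)
        flops=1e12,           # 1 TFLOP of work per step (hypothetical)
        mem_bandwidth=1e12,   # 1 TB/s (hypothetical)
        peak_flops=1e15,      # 1 PFLOP/s (hypothetical)
    )
    idle = compute_idle_fraction(t_mem, t_compute)
    print(f"memory: {t_mem * 1e3:.1f} ms, compute: {t_compute * 1e3:.1f} ms, "
          f"compute idle: {idle:.0%}")
```

With these placeholder inputs the memory side takes twice as long as the math, so the compute units wait for half of every step. If the math takes longer than the memory traffic, the idle fraction drops to zero in this model.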