Also, PCI can have peer to peer connections, skipping the CPU entirely. See e.g. https://developer.nvidia.com/gpudirect