... but not in deep learning or am I missing something important here?
Yes, absolutely in deep learning. Custom fused CUDA kernels everywhere.
Yes, absolutely in deep learning. Custom fused CUDA kernels everywhere.