logoalt Hacker News

antireztoday at 12:02 AM1 replyview on HN

I cut the difference in speed by half by taking the activations on the GPU. Time to sleep but will continue tomorrow.


Replies

Numerlortoday at 2:26 AM

Have you tried e.g. Mojo that can vectorize/do SIMD without having to do intrinsics everywhere?