I've had Opus 4.5 hand rolling CUDA kernels and writing a custom event loop on io_uring lately ...

wild_egg • last Wednesday at 12:46 AM • 1 reply

I've had Opus 4.5 hand-rolling CUDA kernels and writing a custom event loop on io_uring lately, and both were done really well. You need to set up the right feedback loops so it can test its work thoroughly, but then it flies.

Replies

jaggederest • last Wednesday at 1:02 AM

Yeah, I've handed it a naive scalar implementation and said "Make this use SIMD for Mac Silicon / NEON", and it just spits out a working implementation that's 3-6x faster and passes the tests, which are binary-exact specifications.
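For anyone wondering what a "binary exact" test loop might look like, here's a minimal sketch in C. It is not the kernel from the comment above: the saturating byte add is just a stand-in, and `sat_add_scalar`, `sat_add_fast`, and `outputs_match` are hypothetical names made up for illustration. The idea is that the scalar version is the spec, and the fast version has to match it byte for byte on the same inputs, including lengths that aren't a multiple of the vector width.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#if defined(__ARM_NEON)
#include <arm_neon.h>
#endif

/* Reference ("the spec"): naive scalar saturating add of two byte arrays. */
static void sat_add_scalar(const uint8_t *a, const uint8_t *b,
                           uint8_t *out, size_t n) {
    for (size_t i = 0; i < n; i++) {
        unsigned sum = (unsigned)a[i] + b[i];
        out[i] = (uint8_t)(sum > 255u ? 255u : sum);
    }
}

/* Candidate: 16 bytes per iteration with NEON when the target has it.
 * Leftover bytes (or everything, on non-NEON targets) use the scalar path. */
static void sat_add_fast(const uint8_t *a, const uint8_t *b,
                         uint8_t *out, size_t n) {
    size_t i = 0;
#if defined(__ARM_NEON)
    for (; i + 16 <= n; i += 16) {
        uint8x16_t va = vld1q_u8(a + i);
        uint8x16_t vb = vld1q_u8(b + i);
        vst1q_u8(out + i, vqaddq_u8(va, vb)); /* saturating add per lane */
    }
#endif
    sat_add_scalar(a + i, b + i, out + i, n - i);
}

/* Hypothetical harness: fill inputs from a small LCG, run both versions,
 * and return 1 only if the outputs are identical byte for byte. */
static int outputs_match(size_t n, uint32_t seed) {
    enum { MAX_N = 1024 };
    static uint8_t a[MAX_N], b[MAX_N], want[MAX_N], got[MAX_N];
    assert(n <= MAX_N);
    for (size_t i = 0; i < n; i++) {
        seed = seed * 1664525u + 1013904223u;
        a[i] = (uint8_t)(seed >> 24);
        seed = seed * 1664525u + 1013904223u;
        b[i] = (uint8_t)(seed >> 24);
    }
    sat_add_scalar(a, b, want, n);
    sat_add_fast(a, b, got, n);
    return memcmp(want, got, n) == 0;
}
```

Calling `outputs_match` with sizes like 0, 15, 16, and 1000 covers the empty case, the tail-only case, one exact vector, and a mix of both. Integer kernels like this one are the easy case; with floating point, reordering the operations can change the low bits, so a bit-exact test there also pins down the evaluation order.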
