
jb1991 11/04/2025 (2 replies)

It just depends on how the GPU manufacturer handles code written in different languages: what level of API access you get, what level of abstraction, and how the source is compiled, i.e. how well optimized it is. For example, on an Apple GPU, you’ll see benchmarks where OpenCL and Metal performance varies depending on the task.


Replies

embedding-shape 11/04/2025

Right, but that'd depend a lot on the context, task, hardware and so on.

What parent said seemed more absolute and less relative, almost positing that there is no point in using CUDA (since it's "sub-optimal" and people should use SPIR-V, obviously). I was curious about the specifics of that.

sigbottle 11/04/2025

I mean, NVIDIA exposes some pretty low-level primitives, and you can always fiddle with the PTX as DeepSeek did.