logoalt Hacker News

Why isn't AMD's MI300X competitive?

37 pointsby colonCapitalDeelast Tuesday at 1:11 AM26 commentsview on HN

Comments

oneofthosetoday at 11:31 AM

> AMD’s software experience is riddled with bugs [...] AMD’s weaker-than-expected software Quality Assurance (QA) culture and its challenging out of the box experience.

This has anecdotally been true since forever. Back in the day, OpenCL implementations were passing conformance test but performance was poor. They could not turn hardware capabilities into performance for compute users. Drivers were buggy. Documentation was poor compared to NVidia's docs and forum. Offerings were inconsistent (look up Sycl from Codeplay) and ownership of what it is like to develop for AMD was unclear. The notion that it might not have improved or is only now improving is puzzling. It can't be for the lack of recognizing the problem. Intuitively it does not seem so difficult. I'm curious what the reasons are.

show 2 replies
geremiiahtoday at 12:28 PM

I wonder if hiring is a big factor here. I presume, all the really good systems+parallel programmers would rather gain more experience on NVIDIA hardware than AMD, so given the choice, they'd go with NVIDIA. Does AMD do enough to win them over?

Havoctoday at 10:17 AM

Mirrors the geohotz rants about AMD at the time, though as others point out this - 2024 - is ancient news in AI world and not quite sure what value it adds to the current discussions

show 1 reply
andy_ppptoday at 10:23 AM

Please just get everything in PyTorch to work, and work well (and across all graphics cards too). This is the starting point and it doesn't matter how you do it. But the fact you cannot even do some very basic stuff on AMD is going to mean you are left unused by researchers, so getting further up the stack is going to be almost impossible.

show 2 replies
ZiiStoday at 10:28 AM

Correction: Why wasn't it competitive 2 years ago; basically half the AI summer ago.

fancyfredbottoday at 11:10 AM

Please amend the title, this is a December 2024 article and the conclusions are misleading in 2026

pstuartlast Tuesday at 1:54 AM

If AMD's betting the company on their AI compute, they had best follow the advice in the article because the only way to compete with NVIDIA is to meet/exceed not just the performance but also the DevX.

show 2 replies
threeptstoday at 11:24 AM

NVIDIA has such a big moat around their CUDA architecture such that I don't think AMD will ever be able to outcompete them in AI compute unless they somehow find 2-3 nobel prize level breakthroughs today.

pilililo2today at 12:10 PM

This is from more than 2 years ago, why post this now?

DiabloD3last Tuesday at 4:09 AM

I love how they just butcher that article.

I remember when it came out a little over a year ago, and its just as wrong as it is today as it was then.

_aavaa_last Tuesday at 2:36 AM

[2024]

arka2147483647today at 11:06 AM

The important part of Hardware, is Software

After all, if the Software does not work, its just a Paperweight

show 1 reply
agunapaltoday at 11:41 AM

Nvidia had the first movers advantage. Nvidia spent so many years perfecting CUDA to work well with PyTorch. Before ROCM, there was only CUDA. There were so many developers building their use cases on top of PyTorch+CUDA, and bringing all that feedback to PyTorch, this made CUDA battle ready and stable. AMD can get there, especially now with demand for compute, but as someone already said here, the biggest focus needs to be on PyTorch

wewewedxfgdftoday at 11:01 AM

AMD just doesn't seem to be that good at software.