It is very comparable if you work out the $/tok/s on inference. I did some napkin math and it looks like you’re getting roughly 3x the performance for 3x the cost. Red v2 vs Mac Studio M3 Ultra 96GB.
If you compare tokens/kWh efficiency then my math has Mac Studio being about 1.5x more efficient.