Great if you have 16 independent workloads, terrible for things that care about communication between threads.
It has 16 CCD, each with only one thread enabled, latency between CCD is ~150ns.
150ns is actually surprisingly high! I didn't realize it was so bad. That's about 2-3x as much latency as fetching from DRAM based on what I see in peoples AIDA64 results.
Surprise surprise, not every tool is right for every job.
150ns is actually surprisingly high! I didn't realize it was so bad. That's about 2-3x as much latency as fetching from DRAM based on what I see in peoples AIDA64 results.