Not quite. They have 128 GB of RAM, of which up to 96 GB can be allocated to the GPU in the BIOS.
You don't have to statically allocate the VRAM in the BIOS. It can be dynamically allocated. Jeff Geerling found you can reliably use up to 108 GB [1].
[1]: https://www.jeffgeerling.com/blog/2025/increasing-vram-alloc...
Allocation is irrelevant. As an owner of one of these, I can say you can absolutely use the full 128 GB (minus OS overhead) for inference workloads.
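For a rough sense of what these limits mean in practice, here is a back-of-the-envelope sketch in Python. It only estimates the size of the model weights and ignores KV cache and activations, so treat the results as a lower bound. The 8 GB OS overhead and the example model sizes are my own assumptions, not figures from this thread.

```python
def weights_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float, budget_gb: float) -> bool:
    """True if the weights alone fit within the given memory budget."""
    return weights_gb(params_billion, bits_per_weight) <= budget_gb


# Budgets discussed in the thread; the last one assumes 8 GB of OS overhead.
BUDGETS_GB = {
    "static BIOS allocation": 96,
    "dynamic allocation": 108,
    "full RAM minus assumed OS overhead": 128 - 8,
}

if __name__ == "__main__":
    # Hypothetical example: a 200B-parameter model quantized to 4 bits per weight.
    for label, budget in BUDGETS_GB.items():
        print(f"{label} ({budget} GB): fits = {fits(200, 4, budget)}")
```

Under these assumptions, a 4-bit 200B model needs about 100 GB for weights alone, so it would not fit in a 96 GB static allocation but would fit in the larger budgets, before accounting for context.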