logoalt Hacker News

zozbot234yesterday at 5:29 PM0 repliesview on HN

You can cluster Mac Studios using Thunderbolt connections and enable RDMA for distributed inference. This will be slower than a single node but is still the best bang-for-the-buck wrt. doing inference on very-large-sized models.