logoalt Hacker News

storuslast Sunday at 7:55 PM0 repliesview on HN

Does this also run with Exo Labs' token pre-fill acceleration using DGX Spark? I.e. take 2 Sparks and 2 MacStudios and get a comparable inference speed to what 2x M5 Ultras will be able to do?