Does this also run with Exo Labs' token pre-fill acceleration using DGX Spark? I.e. take 2 Spar...

storus • last Sunday at 7:55 PM • 0 replies • view on HN

Does this also run with Exo Labs' token pre-fill acceleration using DGX Spark? I.e. take 2 Sparks and 2 MacStudios and get a comparable inference speed to what 2x M5 Ultras will be able to do?

alt Hacker News