Now we need someone to try running Kimi K2.6 on an old Xeon with DDR3. After all, these platforms do support up to 768GB of RAM.
It’ll work, but it’ll yield about a token per minute. With ancient servers, the limiting factor is throughput, not memory size.
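For anyone who wants to put rough numbers on the throughput point, here is a back-of-envelope sketch. It assumes decoding is limited purely by memory bandwidth, meaning every generated token has to stream the active weights from RAM once. That is a simplification, and real speeds on old hardware would likely land well below this ceiling once compute limits and NUMA effects kick in. The function names are made up for illustration, and the model figures in the usage lines are placeholders, not specs for any particular model.

```python
def peak_bandwidth_gb_s(mega_transfers_per_s: float, channels: int, bus_bytes: int = 8) -> float:
    """Theoretical peak DRAM bandwidth in GB/s.

    Each channel moves bus_bytes (8 for a 64-bit DDR channel) per transfer.
    """
    if mega_transfers_per_s <= 0 or channels <= 0 or bus_bytes <= 0:
        raise ValueError("all inputs must be positive")
    return mega_transfers_per_s * bus_bytes * channels / 1000.0


def max_tokens_per_second(bandwidth_gb_s: float, active_params_b: float, bytes_per_param: float) -> float:
    """Upper bound on decode speed under the bandwidth-bound assumption.

    active_params_b is the number of parameters read per token, in billions.
    bytes_per_param depends on quantization (e.g. 1.0 for 8-bit weights).
    """
    if bandwidth_gb_s <= 0 or active_params_b <= 0 or bytes_per_param <= 0:
        raise ValueError("all inputs must be positive")
    # Billions of parameters times bytes per parameter gives GB read per token.
    gb_read_per_token = active_params_b * bytes_per_param
    return bandwidth_gb_s / gb_read_per_token


# Placeholder inputs: DDR3-1600 on four channels, and a hypothetical model
# that reads 100B parameters per token at 8 bits each.
bandwidth = peak_bandwidth_gb_s(1600, channels=4)
ceiling = max_tokens_per_second(bandwidth, active_params_b=100, bytes_per_param=1.0)
print(f"peak bandwidth: {bandwidth:.1f} GB/s, ceiling: {ceiling:.3f} tokens/s")
```

Note that RAM capacity never appears in the formula. Once the model fits, adding more memory doesn't make it any faster.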