Hacker News

janderland, today at 4:54 PM

Has Kimi found a way to vastly reduce the amount of VRAM required without running at 3 tokens per second? That’s the real concern.