logoalt Hacker News

johndoughyesterday at 9:51 PM0 repliesview on HN

I was wondering whether multiple GPUs make it go appreciably faster when limited by VRAM. Do you have some tokens/sec numbers for text generation?