Hacker News

brookman64k 01/20/2025

Thank you. Which is currently the most capable version running reasonably fast on a 3090 (24GB of VRAM)?


Replies

danielhanchen 01/20/2025

The Llama distilled version at Q4_K_M quantization should be reasonably fast and good!
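For a rough sanity check on whether a given quantized model fits in 24GB, a back-of-the-envelope estimate like the one below can help. It is only a sketch: the bits-per-weight figure for Q4_K_M (about 4.8), the 2 GiB allowance for KV cache and runtime buffers, and the 8B and 70B parameter counts are all assumptions for illustration, not numbers from this thread. The function names are made up for the example.

```python
def estimate_weights_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in GiB."""
    return n_params * bits_per_weight / 8 / 2**30


def fits_in_vram(n_params: float, bits_per_weight: float,
                 vram_gib: float, overhead_gib: float = 2.0) -> bool:
    """True if weights plus an assumed fixed overhead fit in VRAM.

    overhead_gib is a guess covering KV cache and runtime buffers;
    real usage grows with context length.
    """
    return estimate_weights_gib(n_params, bits_per_weight) + overhead_gib <= vram_gib


ASSUMED_BPW = 4.8  # assumption: approximate average bits per weight for Q4_K_M

# Assumed parameter counts, purely for illustration.
for label, n_params in [("8B", 8e9), ("70B", 70e9)]:
    size = estimate_weights_gib(n_params, ASSUMED_BPW)
    ok = fits_in_vram(n_params, ASSUMED_BPW, vram_gib=24)
    print(f"{label}: ~{size:.1f} GiB of weights, fits in 24 GiB: {ok}")
```

Under these assumptions the smaller model fits with plenty of headroom, while the larger one would need offloading to system RAM, which is usually much slower.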