Hacker News

intothemild, today at 9:14 AM

Well, right now buying hardware to run your own models tops out at about 32 GB of VRAM at any price point that's not insane. Sure, you can get a Mac mini or a PC equivalent, but the problem is RAM.

More RAM means bigger models, which means smarter models.

That's why Qwen and Gemma have been so interesting to a lot of us who run our own models. 32 GB of VRAM isn't so bad now, since these models can run on that with decent results.
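
For anyone who wants to sanity-check the "does it fit in 32 GB" question, here's a rough back-of-the-envelope sketch. It only counts the weights (parameters times bits per weight), and the 4 GB headroom for KV cache and runtime overhead is my own guess, not a measured figure. The parameter counts in the loop are illustrative sizes, not the specs of any particular Qwen or Gemma release.

```python
def weight_memory_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (10^9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


def fits(num_params: float, bits_per_weight: float,
         vram_gb: float = 32.0, headroom_gb: float = 4.0) -> bool:
    """True if weights plus an assumed headroom fit in the given VRAM.

    headroom_gb is a hypothetical allowance for KV cache, activations,
    and runtime overhead; real usage depends on context length and runtime.
    """
    return weight_memory_gb(num_params, bits_per_weight) + headroom_gb <= vram_gb


if __name__ == "__main__":
    # Illustrative parameter counts and quantization widths only.
    for params in (7e9, 32e9, 70e9):
        for bits in (16, 8, 4):
            gb = weight_memory_gb(params, bits)
            verdict = "fits" if fits(params, bits) else "too big"
            print(f"{params / 1e9:.0f}B @ {bits}-bit: ~{gb:.1f} GB weights, {verdict} in 32 GB")
```

Under those assumptions, a model around 32B parameters fits at 4-bit but not at 16-bit, which is roughly why quantized mid-size models are the sweet spot for this class of hardware.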

Where this gets interesting is in a couple of years, when the A100s and the rest of the enterprise hardware hit eBay.