logoalt Hacker News

mhitzayesterday at 2:32 PM0 repliesview on HN

For sure I was running on autopilot with that reply. Though in Q4 I would expect it to fit, as 24B-A4B Gemma model without CPU offloading got up to 18GB of VRAM usage