There are a bunch of 4-bit quants in the GGUF link, and 0xSero has some smaller stuff too. They might still be too big, in which case you'll need to un-GPU-poor yourself.
Yeah, there's no way to run 4.7 on 32 GB of VRAM. This flash is something I'm also waiting to try later tonight.