Why would you want to? It's like using a hammer for screws.
CPU compute is vastly cheaper and much easier to work with in general.
To maximise the VRAM available for an LLM on the same machine. That's why I asked myself the same question, anyway.