logoalt Hacker News

DrBenCarsontoday at 12:08 AM1 replyview on HN

How are you using that RAM with the GPU?


Replies

canpantoday at 12:12 AM

Llama.cpp with automatic offload to main memory. You can also use Ollama, it is easier, but slower.

show 1 reply