logoalt Hacker News

lostmsutoday at 4:47 PM1 replyview on HN

Why would llama with --mmap crash?


Replies

zozbot234today at 4:58 PM

This doesn't surprise me all that much, mmap support gets little attention in general and interacts poorly with GPU-side inference. (And that's with it being default, you don't even really need to specify it as a CLI option.) OP has raised a discussion with the llama.cpp folks https://github.com/ggml-org/llama.cpp/discussions/20852 but little interest so far