logoalt Hacker News

lxgrtoday at 5:10 PM2 repliesview on HN

Am I missing something or are the Ollama versions of this (https://ollama.com/library/gemma4/tags) text-only for now?


Replies

philipkglasstoday at 5:13 PM

Since ollama has diverged from llama.cpp, it will take a bit of time for ollama to support multi-modality. If you're using plain llama.cpp it looks like a PR has already merged for this model with vision and audio support:

https://github.com/ggml-org/llama.cpp/pull/24077

show 1 reply
satvikpendemtoday at 5:50 PM

Just use llama.cpp or Unsloth Studio which wraps it, I don't know why anyone use Ollama anymore.