u/Frag_De_Muerte

So I'm using Gemma4:26b on my mac studio. It runs. It can see images with uploading it into the ollama interface. Openclaw runs on a vps on a proxmox box right next to it (all in the same local network). I have my heartbeats and my design agent currently configured to the Gemma model. Everything seems to work.

I can't, for the life of me figure out why the agent cannot use the ollama Gemma4 model vision capabilities. It keeps getting aborted via ollama through telegram. What is going on? Has anyone gotten vision capabilities to work via ollama? I really don't want to burn tokens on openrouter or codex for this.

I've configured models.json and openclaw.json for the provider description to include both "text", "image" doesn't seem to help.

Ollama models and Vision capability