u/myworkreddit

Does LM Studio support Gemma 4's variable image resolution (configurable visual token budget)?

Does LM Studio have support for the following feature? If not, can it be incorporated into a near-future release?

Gemma 4 supports variable image resolution through a configurable visual token budget, which controls how many tokens are used to represent an image. A higher token budget preserves more visual detail at the cost of additional compute, while a lower budget enables faster inference for tasks that don't require fine-grained understanding.

The supported token budgets are: 70, 140, 280, 560, and 1120.

- Use lower budgets for classification, captioning, or video understanding, where faster inference and processing many frames outweigh fine-grained detail.
- Use higher budgets for tasks like OCR, document parsing, or reading small text.
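
To make the request concrete, here is a rough sketch of how I'd expect to pick a budget on the caller's side. This is plain Python and not an LM Studio API: the names `TASK_BUDGETS`, `budget_for_task`, `snap_budget`, and `total_visual_tokens` are made up for illustration, and the exact budget I assign to each task is my own guess (the docs only say "lower" vs. "higher"). Only the five supported values come from the description above.

```python
# The five budgets listed in the Gemma 4 description quoted above.
SUPPORTED_BUDGETS = (70, 140, 280, 560, 1120)

# Hypothetical mapping: the per-task values are assumptions, chosen only to
# follow the "lower for coarse tasks, higher for fine detail" guidance.
TASK_BUDGETS = {
    "classification": 70,
    "captioning": 140,
    "video": 70,
    "small_text": 560,
    "ocr": 1120,
    "document_parsing": 1120,
}


def snap_budget(requested: int) -> int:
    """Return the largest supported budget not exceeding `requested`.

    Falls back to the smallest supported budget if `requested` is below it.
    """
    eligible = [b for b in SUPPORTED_BUDGETS if b <= requested]
    return max(eligible) if eligible else SUPPORTED_BUDGETS[0]


def budget_for_task(task: str, default: int = 280) -> int:
    """Look up a budget for a task name, using a mid-range default otherwise."""
    return TASK_BUDGETS.get(task, default)


def total_visual_tokens(num_images: int, budget: int) -> int:
    """Upper bound on visual tokens, assuming each image uses the full budget."""
    return num_images * budget


if __name__ == "__main__":
    print(budget_for_task("ocr"))
    print(snap_budget(300))
    print(total_visual_tokens(32, budget_for_task("video")))
```

The last line is why the low budgets matter for video: 32 frames at 70 tokens each is at most 2240 visual tokens, while the same frames at 1120 would be at most 35840. What I'm asking is whether LM Studio exposes (or could expose) a setting that plays the role of `budget` here.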
u/myworkreddit — 13 hours ago