r/LLMStudio

▲ 5 r/LLMStudio+1 crossposts

Qwen models for coding, using qwen-code - my experience

u/Undici77 — 1 day ago
▲ 1 r/LLMStudio+1 crossposts

I want to build a multilingual philosophical LLM trained on thousands of philosophy books — how insane is this for a beginner?

u/Future_Safe1609 — 1 day ago
▲ 9 r/LLMStudio+1 crossposts

Building a 200B Local AI Agent That Controls My Apps - Where Do I Start?

u/ocottog — 2 days ago
🔥 Hot ▲ 379 r/LLMStudio+3 crossposts

Someone just shipped an open reasoning-distilled Qwen3.6-35B-A3B, fine-tuned to imitate Claude Opus 4.7’s chain-of-thought: - 35B MoE, ~3B active/token → fits on one A100/H100 - Thinks in <think>...</think> like the teacher - Apache 2.0, weights + dataset both public

u/Anony6666 — 5 days ago
▲ 8 r/LLMStudio+4 crossposts

local AI financial terminal Bloomberg-style charting + 5-agent consensus engine, zero cloud, free to use

u/Informal_Corner_1624 — 4 days ago
▲ 3 r/LLMStudio+1 crossposts

Is there a place where I can compare generation of tokens per second of 1 GPU VRAM+RAM vs 2 GPUs for those models that don't fit in 1 GPU?

u/misanthrophiccunt — 5 days ago
▲ 3 r/LLMStudio+1 crossposts

I built a sleek, open-source Android client for local LLMs (Ollama, LocalAI, Lm Studio, etc.)

u/dark_horse_techie — 5 days ago

LMStudio support for Gemma 4 supports variable image resolution through a configurable visual token budget

Does LM Studio have support for the following feature? If not, can it be incorporated into a near-future release?

Gemma 4 supports variable image resolution through a configurable visual token budget, which controls how many tokens are used to represent an image. A higher token budget preserves more visual detail at the cost of additional compute, while a lower budget enables faster inference for tasks that don't require fine-grained understanding.

The supported token budgets are: 70, 140, 280, 560, and 1120.
    Use lower budgets for classification, captioning, or video understanding, where faster inference and processing many frames outweigh fine-grained detail.
    Use higher budgets for tasks like OCR, document parsing, or reading small text.
reddit.com
u/myworkreddit — 11 hours ago