r/LLMStudio

▲ 16 r/LLMStudio+3 crossposts

Built a local LM Studio stats panel that shows what my AI stack is actually doing

u/Sea_Manufacturer6590 — 15 hours ago

▲ 5 r/LLMStudio+1 crossposts

Qwen models for coding, using qwen-code - my experience

u/Undici77 — 1 day ago

▲ 1 r/LLMStudio+1 crossposts

I want to build a multilingual philosophical LLM trained on thousands of philosophy books — how insane is this for a beginner?

u/Future_Safe1609 — 1 day ago

▲ 9 r/LLMStudio+1 crossposts

Building a 200B Local AI Agent That Controls My Apps - Where Do I Start?

u/ocottog — 2 days ago

▲ 1 r/LLMStudio

LM Studio - configuration

u/Unhappy-Laugh6973 — 1 day ago

🔥 Hot ▲ 379 r/LLMStudio+3 crossposts

Someone just shipped an open reasoning-distilled Qwen3.6-35B-A3B, fine-tuned to imitate Claude Opus 4.7’s chain-of-thought: - 35B MoE, ~3B active/token → fits on one A100/H100 - Thinks in <think>...</think> like the teacher - Apache 2.0, weights + dataset both public

u/Anony6666 — 5 days ago

▲ 2 r/LLMStudio+3 crossposts

SDPF — Software Development Prompting Framework

u/Available_Bat_420 — 2 days ago

▲ 8 r/LLMStudio+4 crossposts

local AI financial terminal Bloomberg-style charting + 5-agent consensus engine, zero cloud, free to use

u/Informal_Corner_1624 — 4 days ago

▲ 3 r/LLMStudio+2 crossposts

Claude - Use Cases In Sales

u/Category_Major — 3 days ago

▲ 2 r/LLMStudio+2 crossposts

GEO vs SEO: Why 68% of AI Answers Come From Reddit in 2026

u/IntroductionTop5993 — 4 days ago

▲ 3 r/LLMStudio+1 crossposts

Is there a place where I can compare generation of tokens per second of 1 GPU VRAM+RAM vs 2 GPUs for those models that don't fit in 1 GPU?

u/misanthrophiccunt — 5 days ago

▲ 3 r/LLMStudio+1 crossposts

I built a sleek, open-source Android client for local LLMs (Ollama, LocalAI, Lm Studio, etc.)

u/dark_horse_techie — 5 days ago

▲ 1 r/LLMStudio

Has Lm Studios Wikipedia plugin stopped working for you try this

u/karmakazi_ — 4 days ago

▲ 1 r/LLMStudio

If you had to describe what an LLM is to an ordinary person.

u/CaptnSpalding — 4 days ago

▲ 1 r/LLMStudio+1 crossposts

Most people are using LLMs wrong

u/Open-Ease685 — 3 days ago

▲ 0 r/LLMStudio

LMStudio support for Gemma 4 supports variable image resolution through a configurable visual token budget

Does LM Studio have support for the following feature? If not, can it be incorporated into a near-future release?

Gemma 4 supports variable image resolution through a configurable visual token budget, which controls how many tokens are used to represent an image. A higher token budget preserves more visual detail at the cost of additional compute, while a lower budget enables faster inference for tasks that don't require fine-grained understanding.

The supported token budgets are: 70, 140, 280, 560, and 1120.
    Use lower budgets for classification, captioning, or video understanding, where faster inference and processing many frames outweigh fine-grained detail.
    Use higher budgets for tasks like OCR, document parsing, or reading small text.

reddit.com

u/myworkreddit — 11 hours ago

▲ 1 r/LLMStudio

Laptop Config Settings to Manage Overheating...

u/Peacelake — 3 days ago

r/LLMStudio

Built a local LM Studio stats panel that shows what my AI stack is actually doing

Qwen models for coding, using qwen-code - my experience

I want to build a multilingual philosophical LLM trained on thousands of philosophy books — how insane is this for a beginner?

Building a 200B Local AI Agent That Controls My Apps - Where Do I Start?

LM Studio - configuration

Someone just shipped an open reasoning-distilled Qwen3.6-35B-A3B, fine-tuned to imitate Claude Opus 4.7’s chain-of-thought: - 35B MoE, ~3B active/token → fits on one A100/H100 - Thinks in &lt;think&gt;...&lt;/think&gt; like the teacher - Apache 2.0, weights + dataset both public

SDPF — Software Development Prompting Framework

local AI financial terminal Bloomberg-style charting + 5-agent consensus engine, zero cloud, free to use

Claude - Use Cases In Sales

GEO vs SEO: Why 68% of AI Answers Come From Reddit in 2026

Is there a place where I can compare generation of tokens per second of 1 GPU VRAM+RAM vs 2 GPUs for those models that don't fit in 1 GPU?

I built a sleek, open-source Android client for local LLMs (Ollama, LocalAI, Lm Studio, etc.)

Has Lm Studios Wikipedia plugin stopped working for you try this

If you had to describe what an LLM is to an ordinary person.

Most people are using LLMs wrong

LMStudio support for Gemma 4 supports variable image resolution through a configurable visual token budget

Laptop Config Settings to Manage Overheating...

Someone just shipped an open reasoning-distilled Qwen3.6-35B-A3B, fine-tuned to imitate Claude Opus 4.7’s chain-of-thought: - 35B MoE, ~3B active/token → fits on one A100/H100 - Thinks in <think>...</think> like the teacher - Apache 2.0, weights + dataset both public