u/Sure_Proposal_9207

Why no good providers of Gemma 3.6 35B?

Why no good providers of Gemma 3.6 35B?

Gemma 4 26b A4b https://openrouter.ai/google/gemma-4-26b-a4b-it is a really good model, but when I use it via OpenRouter the max token/s providers is like 30-40 tokens/s. There also seems to be cold starts where some requests take 105 seconds to complete (for short text prompts).

I could save a tremendous amount of money in my service if a proper provider existed, but am now using gemini 3.1 flash lite instead, which has twice the cost.

u/Sure_Proposal_9207 — 6 days ago