u/Head_Leek_880

▲ 8 r/ollama

I’ve been running a side project that uses API inference and have been dropping $50+ a month on OpenRouter. I keep seeing discussions about Ollama Cloud as a cheaper alternative, but whenever I search for posts about it, the feedback tends to be pretty negative. Everyone seems frustrated about something.
Before I make the switch, I’m curious what people’s actual experience has been. What’s working for you? What isn’t? I’m mainly interested in whether the cost savings are real and whether the reliability is decent enough for something I’m running regularly (nothing crazy—just steady inference, not huge volume).
Also interested in hearing from people who tried it and went back to something else, or people who stuck with it. What made you switch back or stay?
I know there’s a lot of skepticism about it around here, so I’m genuinely trying to understand if it’s a “don’t use this” situation or more of a “use it but know the quirks” situation.
Thanks!

reddit.com
u/Head_Leek_880 — 16 days ago