r/XiaomiMiMo_Official

https://preview.redd.it/xdpk85uuguyg1.png?width=2329&format=png&auto=webp&s=fc896318b4dd603d7eaf5a808de55c569de374d2

Adquirí el token plan "Lite", para probar la eficiencia y el poder de Mimo 2.5 pro, hice que arregle un bug pequeño.
y por ese promp me consumió el 9% de la suscripción en algo sencillo.
Necesito saber si es un bug o si en verdad funciona de esa manera.
si funciona así no vale la pena la suscripción.
¡¡Ni Claude se atrevió a tanto!!

reddit.com

u/DMG-Z — 12 days ago

▲ 10 r/XiaomiMiMo_Official

They say on their page it is 1.6 billion credit and mimo v2.5 pro takes 2 credit per token, mimo v2.5 takes 1 credit per token but here is how they get you, cached token is still billed the same credit per round trip, absolutely not suitable for coding cli then, because every single one of them by design would keep going back and forth with toolcalls, that's how they work, normally inference providers charge 1% for the pre existing cached context, but Xiaomi takes the full amount, I did 10 small tasks like not even that deep, small tasks and it is already at 12 or so million credit used, it used probably under a million context tasks were that mini, like saying hello, and mv this folder around, write some sql etc, like 10 total prompts same session, credit cost keeps snow balling, they don't mention nothing of this sort in the token plan docs or anything anywhere, for a big task it would be what 200 million token uncached, so 400million credit if you used mimo v2.5 pro, so with max 100$ plan you can use it for 4 tasks PER MONTH, honestly get anything over mimo token/coding plan

reddit.com

u/FearlessGround3155 — 11 days ago

▲ 9 r/XiaomiMiMo_Official

Xiaomi Mimi v2.5 Pro token plan Cache charges

Hello, you appear to be charging way too much for cache hits on your token plans. In fact I would say it’s really bad.

It’s so bad that I actually went through 2 months of tokens with very little effort, just simple agentic work.

Your press release and blogs talk about how the models are designed for agentic workflow but it doesn’t look like your billing of cache is.

In fact when I did the math it was much cheaper for me to use the API than the token plan! Why is that possible?

Why do you charge the same for cache hits as you do for output on the token plan!?

Also why is this not clearly documented anywhere so it’s obvious to everyone that choosing the token plan for agentic workflow and what Xiaomi says the model was built for would be prohibitively expensive?

reddit.com

u/SnotFunk — 3 days ago