u/Bright_Mood6357

Is Hermes + DeepSeek Flash using an insane amount of tokens for anyone else?

I’ve been using Hermes with DeepSeek Flash recently and the token usage feels way higher than I expected.

Even during pretty normal coding sessions or smaller tasks, the context size seems to grow insanely fast. After a while, it feels like every action sends a massive amount of previous context back to the model.
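To put a rough number on what that would mean, here's a minimal sketch, assuming every request resends the system prompt plus the full history of earlier turns. The function name and all the token counts are made up for illustration; they aren't measured from Hermes or DeepSeek, and this ignores any caching or context compaction that might be going on.

```python
def cumulative_input_tokens(system_tokens: int, tokens_per_turn: int, turns: int) -> int:
    """Total input tokens sent across `turns` requests when each request
    resends the system prompt plus everything from the earlier turns.

    All parameters are hypothetical illustration values, not real settings.
    """
    total = 0
    history = 0  # tokens accumulated from earlier turns
    for _ in range(turns):
        # This request carries the system prompt, the old history, and the new turn.
        total += system_tokens + history + tokens_per_turn
        # The new turn becomes part of the history for the next request.
        history += tokens_per_turn
    return total


if __name__ == "__main__":
    # Assumed numbers: 2,000-token system prompt, 1,500 tokens added per turn.
    for turns in (10, 20, 40):
        print(turns, cumulative_input_tokens(2_000, 1_500, turns))
    # 10 102500
    # 20 355000
    # 40 1310000
```

Under those assumptions, going from 10 to 40 turns multiplies the total input by almost 13x rather than 4x, which would match the "grows insanely fast" feeling if nothing is trimming the history.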

I’m trying to figure out if this is just normal behavior with Hermes, something related to agent/tool usage, or if I messed up some configuration somewhere.

Is anyone else dealing with this?

u/Bright_Mood6357 — 5 days ago