▲ 18 r/hermesagent
Hermes + DeepSeek Flash using an insane amount of tokens for anyone else?
I’ve been using Hermes with DeepSeek Flash recently and the token usage feels way higher than I expected.
Even during pretty normal coding sessions or smaller tasks, the context size seems to grow insanely fast. After a while it feels like every action is sending a massive amount of previous context back to the model.
I’m trying to figure out if this is just normal behavior with Hermes, something related to agent/tool usage, or if I messed up some configuration somewhere.
Is anyone else dealing with this?
u/Bright_Mood6357 — 5 days ago