
i built a open source cli for reducing token waste in claude code / codex workflows
ai coding agents (claude code, codex, cursor) burn tokens on things that don't help you ship. i started digging through local claude code + codex logs after burning way more tokens than i expected and realized a huge amount of the waste was context related: generated artifacts, oversized instruction files, repeated tool output, broad repo exploration, stale session state, etc.
so i built prismodev, a local cli that reads repo files + local claude code/codex logs and surfaces token/context waste. no api keys, no login, nothing leaves your machine.
npx getprismo doctor scans your repo and local session logs, flags missing .claudeignore / .cursorignore, finds oversized CLAUDE.md / AGENTS.md files, detects generated artifacts/logs/build output getting pulled into context, estimates avoidable spend, generates compact .prismo context packs, and shows a before/after score. it went from 79 → 91 on my repo in one run.
npx getprismo watch adds live context-pressure monitoring during sessions and catches repeated file reads, generated artifact leaks, oversized tool output, and possible command/tool loops before they spiral. watch --auto continuously updates a live guardrails file with the current issue and exact instructions for the agent to follow as context pressure changes.
npx getprismo watch --rescue generates a paste-ready recovery prompt when a session starts going sideways and pushes the agent back toward the smallest useful context/workflow.
npx getprismo firewall auth-bug creates a scoped context policy before a task starts so the agent stays inside a smaller context boundary instead of wandering through the whole repo.
npx getprismo cc timeline generates a postmortem timeline showing what leaked into context, which files/commands repeated, and where tool-output spikes happened during expensive claude code sessions.
everything runs locally. reads logs from ~/.codex/sessions/ and ~/.claude/projects/.
github: github.com/shanirsh/prismodev
would genuinely love feedback on false positives, missing waste patterns, or workflows that create the most context bloat.i built a open source cli for reducing token waste in claude code / codex workflows