Token Waste in Claude Code: A User's Self-Audit Shows Behavioral Fixes Beat Model Switching

A Reddit user spent a week measuring where their Claude Code tokens actually went, rather than just complaining about the May price changes. Their conclusion: most burn was self-inflicted, and behavioral changes bought back more headroom than switching models would have.
Biggest Wins
/clearbetween unrelated tasks — a stale 200k-token context riding along for a one-line fix was the single most expensive habit.- Make it plan before it touches files. One planning pass, then execute — cheaper and better than explore-edit-explore in a loop.
- Stop letting it re-read files it just touched. If it just edited a file, it does not need to reopen it to "verify." Say so once in your rules.
- Search with a subagent, not the main thread. Grep-and-read across a repo dumps the whole haystack into your main context permanently. A subagent returns just the answer.
- Kill always-on and
-ploops you are not watching. Background agents burning tokens while you sleep are most of the horror-story bills.
None of these fixes required a new subscription, a wrapper, or an MCP server. It was discipline the user admits being too lazy to apply while limits felt infinite.
The post acknowledges that none of this fixes the actual price hikes — it just stops you burning extra on top of them.
📖 Read the full source: r/ClaudeAI
👀 See Also

Multi-Agent Orchestration in OpenClaw: Centralize Rules, Spawn Sub-Agents
An OpenClaw user describes moving from duplicated workspace instructions to a single main agent that spawns sub-agents, enforcing architectural rules (e.g., persist structured data as .JSON) across all agent workspaces.

Save on Claude Code Bills by Routing Planning Tokens to Cheaper Models
A user cut $40 in overage fees by splitting Claude Code workflows: planning steps go to Haiku 3.5, actual edits and decisions stay on Opus/Sonnet. A 30-line wrapper handles routing; setup took ~2 hours.

Silent Success: One Dev's Approach to Cron Job Alerting
A developer on r/openclaw stops sending success notifications for healthy cron runs, alerting only on auth failures, state corruption, or repeated failures.

Claude Agent Sends Memes via Discord Using Tenor GIFs — Full Protocol
A Claude code agent setup adds context‑aware emoji reactions, one‑sentence status messages, and validated Tenor GIF URLs to every Discord transport message, with explicit web search and fetch steps to avoid dead links.