Claude Code Token Waste Fix: Disable Attribution Header for Better Cache Hits

Claude Code has been wasting tokens on every new session since version 2.1.69 due to a billing attribution header that breaks prompt caching. The issue is documented in multiple GitHub issues (#40652, #34629, #40524) with no official response from Anthropic as of the source publication.
What's Happening
Since v2.1.69, Claude Code inserts a billing attribution string into the first block of your system prompt: x-anthropic-billing-header: cc_version=2.1.88.a3f; cc_entrypoint=cli; cch=00000;
The .a3f part is a 3-character hash computed from your first message in each conversation using this function:
function computeHash(firstUserMessage, version) {
const chars = [4, 7, 20].map(i => firstUserMessage[i] || "0").join("");
return sha256("59cf53e54c78" + chars + version).slice(0, 3);
}Different conversations with different first messages generate different hashes every time.
Why This Breaks Caching
Anthropic's caching requires 100% identical prompt segments. The cache is shared across your entire Organization or Workspace, not per session. The billing header sits at the front of the ~23K token system prompt, and since it changes per conversation, the prefix never matches, causing cache misses on every new chat.
Benchmark Results
A controlled A/B test showed:
- Header ON (default): 48% cache hit rate, ~12K tokens rebuilt per session
- Header OFF: 99.98% cache hit rate, zero cache creation on 3 out of 4 sessions
The result is 7x cheaper per session on system prompt processing.
The Fix
Add this to your shell configuration:
export CLAUDE_CODE_ATTRIBUTION_HEADER=falseFor zsh users:
echo 'export CLAUDE_CODE_ATTRIBUTION_HEADER=false' >> ~/.zshrc
source ~/.zshrcNew sessions pick it up automatically. Existing sessions don't need restarting—the hash doesn't change mid-conversation, and they don't interfere with new sessions.
Safety and Background
This is not a hack—the environment variable exists in the source code as a proper feature toggle. claude-code-router and CLIProxyAPI have been shipping with this disabled in production with no reported issues.
Anthropic likely implemented this to track which version and entrypoint (CLI vs SDK vs GitHub Action) made each API call, placing it in the system prompt because Bedrock/Vertex don't forward custom headers.
📖 Read the full source: r/ClaudeAI
👀 See Also

Practical OpenClaw Usage Insights from Hands-On Experience
A Reddit user shares seven specific lessons from using OpenClaw, covering setup challenges, VM deployment, Skills vs. MCP integration, context organization, credential security, multiple agents, and model selection strategies.

Claude's Research Output Varies by Language: Same Prompt, Different Sources
A Reddit test shows Claude returning different sources and developments across English, Chinese, Russian, Spanish, and Hindi prompts — same model, same structure, diverging results.

Using ntfy for OpenClaw agent notifications
A developer shares their experience using ntfy.sh's self-hosted version for push notifications from OpenClaw agents, avoiding Discord/Telegram bots by running ntfy serve on the same VPS and using HTTP post requests.

Enforcing AI Agent Compliance: Bootstrap Language and Tool-Based Approaches
A developer shares practical methods for improving AI agent compliance, including using negative language in bootstraps and switching from soft rules to hard-coded tools when needed.