Benchmark Shows CLI Tool Reduces Claude Code Token Costs by 32% Through Structural Navigation

A developer has open-sourced a CLI tool called Scope that provides Claude Code agents with structural code navigation capabilities, similar to IDE features like "find references" and "go to definition." The tool was built in Rust using tree-sitter and SQLite.
What the Tool Does
The tool gives agents commands for requests such as:
- "show me a 180-token summary of this 6,000-token class"
- "search by what code does, not what it's named"
It currently supports TypeScript and C#, with the goal of helping agents navigate code more efficiently than their default grep-based approach.
Benchmark Methodology
The developer ran 54 automated runs on Sonnet 4.6 across a 181-file C# codebase with:
- 6 task categories
- 3 conditions: baseline, tool available, architecture preloaded into CLAUDE.md
- 3 repetitions each
Full NDJSON capture was recorded on every run to decompose tokens into fresh input, cache creation, cache reads, and output. The benchmark runner and telemetry capture are included in the repository.
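The run count follows from the design: 6 task categories x 3 conditions x 3 repetitions = 54. A minimal Python sketch of that run matrix is shown here. The category labels and condition identifiers are placeholders, since the article gives only the counts and a short description of each condition, and this is not the benchmark runner shipped in the repository.

```python
from itertools import product

# Placeholder labels: the article states there are 6 task categories
# but this sketch does not assume their actual names.
TASK_CATEGORIES = [f"category_{i}" for i in range(1, 7)]

# Hypothetical identifiers for the three conditions described above.
CONDITIONS = ["baseline", "tool_available", "architecture_preloaded"]

REPETITIONS = range(1, 4)  # 3 repetitions each

# Every (category, condition, repetition) combination is one automated run.
runs = list(product(TASK_CATEGORIES, CONDITIONS, REPETITIONS))

print(len(runs))  # 54
```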
Key Findings
Contrary to expectations, agents with the tool read more files than baseline agents (the average rose from 6.8 to 9.7), yet they made 67% more code edits per session and finished in fewer turns.
The savings came from shorter conversations, which reduced cache accumulation. Approximately 90% of token cost lives in cache accumulation.
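Why shorter conversations matter so much can be illustrated with a toy model. The sketch below assumes, as a simplification, that every turn re-reads the entire prior context from cache and adds a fixed number of new tokens. The function name, the 2,000-token figure, and the turn counts are hypothetical and are not taken from the benchmark.

```python
def cumulative_cache_read_tokens(turns: int, tokens_added_per_turn: int) -> int:
    """Total tokens read from cache over one conversation.

    Toy assumption: each turn re-reads all context produced by earlier
    turns, then appends a fixed number of new tokens to that context.
    """
    total_read = 0
    context_size = 0
    for _ in range(turns):
        total_read += context_size             # re-read everything so far
        context_size += tokens_added_per_turn  # context grows every turn
    return total_read


# Hypothetical conversations that differ only in length.
short = cumulative_cache_read_tokens(turns=20, tokens_added_per_turn=2_000)
long = cumulative_cache_read_tokens(turns=30, tokens_added_per_turn=2_000)

print(short)  # 380000
print(long)   # 870000
```

Under these assumptions, cache reads grow roughly with the square of the turn count, so a conversation with 50% more turns reads more than twice as many cached tokens.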
Overall results:
- 32% lower cost per task
- Roughly 2x navigation efficiency (about half as many navigation actions per edit)
- Navigation-to-edit ratio improved from 25:1 (baseline) to 13:1 (with tool) and 12:1 (with architecture preloaded)
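The roughly 2x figure can be checked against the navigation-to-edit ratios listed above. This is plain arithmetic on the reported ratios; the function name is illustrative only.

```python
def improvement_factor(baseline_ratio: float, new_ratio: float) -> float:
    """How many times fewer navigation actions are needed per edit."""
    return baseline_ratio / new_ratio


# Navigation actions per edit, as reported in the article.
BASELINE = 25
WITH_TOOL = 13
ARCHITECTURE_PRELOADED = 12

print(f"{improvement_factor(BASELINE, WITH_TOOL):.2f}")               # 1.92
print(f"{improvement_factor(BASELINE, ARCHITECTURE_PRELOADED):.2f}")  # 2.08
```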
Results varied by task type:
- Bug fixes: -62% cost
- New features: -49% cost
- Cross-cutting changes: -46% cost
- Discovery and refactoring tasks: no advantage (baseline agents already navigate these fine)
Important Caveats
The developer notes several limitations:
- p-values don't reach 0.05 at n=6 paired observations (direction is consistent but sample is too small for statistical significance)
- Benchmarked on C# only so far (TypeScript support exists but hasn't been benchmarked yet)
- Cost calculation uses current Sonnet 4.6 API rates: fresh input $3/M, cache write $3.75/M, cache read $0.30/M, output $15/M
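The quoted rates translate into a per-run cost as in the sketch below. The rates come from the caveat above; the `TokenUsage` type, the `run_cost_usd` function, and the sample token counts are hypothetical and do not reproduce the repository's telemetry code.

```python
from dataclasses import dataclass

# USD per million tokens, as quoted in the article.
RATES_PER_MILLION = {
    "fresh_input": 3.00,
    "cache_write": 3.75,
    "cache_read": 0.30,
    "output": 15.00,
}


@dataclass
class TokenUsage:
    """Token counts for one run, split into the four billed categories."""
    fresh_input: int = 0
    cache_write: int = 0
    cache_read: int = 0
    output: int = 0


def run_cost_usd(usage: TokenUsage) -> float:
    """Sum each token category multiplied by its per-million rate."""
    return sum(
        getattr(usage, name) * rate / 1_000_000
        for name, rate in RATES_PER_MILLION.items()
    )


# Hypothetical usage: one million tokens in every category.
example = TokenUsage(
    fresh_input=1_000_000,
    cache_write=1_000_000,
    cache_read=1_000_000,
    output=1_000_000,
)
print(f"${run_cost_usd(example):.2f}")  # $22.05
```

Because a cache read costs one tenth of fresh input per token, cached tokens can only dominate the bill when they vastly outnumber every other category.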
The tool is open source and available at github.com/rynhardt-potgieter/scope for developers who want to experiment with improving agent token efficiency.
📖 Read the full source: r/ClaudeAI