Agent frameworks waste 350,000+ tokens per session resending static files

✍️ OpenClawRadar | 📅 Published: April 13, 2026

Token waste benchmark results

Measurements on a local Qwen 3.5 122B setup revealed that agent frameworks waste more than 350,000 tokens per session by repeatedly resending static files. The source describes these numbers as "unreal."

Optimization approach

The source reports a compile-time approach that cuts query context from 1,373 tokens to 73 tokens, a reduction of roughly 95% for this specific context.

The benchmark also found that naive JSON conversion makes the problem 30% worse than the baseline.
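
The arithmetic behind these figures can be checked directly. The sketch below uses only the two token counts reported above. Reading "30% worse" as a 30% increase over the 1,373-token baseline is an assumption, since the summary does not say what the percentage is measured against.

```python
# Token counts reported in the benchmark summary.
BASELINE_TOKENS = 1373
COMPILED_TOKENS = 73


def reduction_pct(before: int, after: int) -> float:
    """Percentage of tokens removed when going from `before` to `after`."""
    return (before - after) / before * 100


# Assumption: "30% worse" means 30% more tokens than the baseline query context.
naive_json_tokens = BASELINE_TOKENS * 1.30

print(f"reduction: {reduction_pct(BASELINE_TOKENS, COMPILED_TOKENS):.1f}%")  # 94.7%
print(f"naive JSON (assumed reading): {naive_json_tokens:.0f} tokens")  # 1785 tokens
```

The exact reduction is about 94.7%, which the article rounds to 95%.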

Technical context

Agent frameworks typically include system prompts, tool definitions, and other configuration data that remain static across multiple interactions within a session. When this data is resent with every query, it consumes tokens without giving the model any new information. This is particularly costly with large models like Qwen 3.5 122B, where token processing directly affects both performance and cost.
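
To see how a resent static prefix adds up over a session, here is a rough accounting model. The function name and the example numbers (a 5,000-token static prefix, 71 queries) are illustrative assumptions chosen only to show the scale, not measurements from the source.

```python
def session_overhead(static_tokens: int, queries: int) -> int:
    """Tokens spent resending an unchanged prefix after the first query.

    The first query has to carry the static content once; every later
    copy adds no new information for the model.
    """
    if queries < 1:
        return 0
    return static_tokens * (queries - 1)


# Hypothetical session: 5,000 static tokens sent with each of 71 queries.
wasted = session_overhead(static_tokens=5000, queries=71)
print(wasted)  # 350000
```

The overhead grows linearly with the number of queries, so long agent sessions are hit hardest.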

The compile-time approach likely involves pre-processing static elements so they're referenced rather than resent, similar to how modern web applications cache static assets. For developers working with AI coding agents, reducing this overhead can significantly improve response times and reduce operational costs.
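
Since the source does not detail the compile-time method, the following is only a guess at its general shape: static files are hashed once ahead of time, and each query carries a short handle instead of the file body. The names `compile_static`, `build_context`, and the whitespace-based `rough_tokens` counter are hypothetical. A real setup would use the model's tokenizer and would need some way for the runtime to resolve handles back to content.

```python
import hashlib


def rough_tokens(text: str) -> int:
    """Crude token proxy: whitespace-separated words (not a real tokenizer)."""
    return len(text.split())


def compile_static(files: dict[str, str]) -> dict[str, str]:
    """Map each static file name to a short content-hash handle, computed once."""
    return {
        name: hashlib.sha256(body.encode("utf-8")).hexdigest()[:8]
        for name, body in files.items()
    }


def build_context(query: str, handles: dict[str, str]) -> str:
    """Build a per-query context that references static files by handle."""
    refs = " ".join(f"@{name}#{h}" for name, h in sorted(handles.items()))
    return f"{refs}\n{query}"


# Hypothetical static content that would otherwise be resent every turn.
static_files = {
    "system_prompt.md": "You are a coding agent. " * 50,
    "tools.json": '{"name": "read_file", "args": ["path"]} ' * 50,
}

handles = compile_static(static_files)  # done once, before the session starts
query = "List the failing tests."

inlined = "\n".join(static_files.values()) + "\n" + query
referenced = build_context(query, handles)

print(rough_tokens(inlined), "->", rough_tokens(referenced))
```

Using a content hash as the handle means a changed file gets a new handle automatically, which is the same idea behind cache-busting file names for static web assets.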

📖 Read the full source: r/LocalLLaMA
