OpenClaw Client Adds Live API Cost Tracking, Spending Caps, and Granular Agent Controls

The OpenClaw Client has received a major update focused on cost control and agent customization. Key additions include live API cost tracking with a circular progress bar displayed next to each agent's name, and strict per-agent spending caps to prevent runaway costs.

Cost Controls & Usage Tracking
- Spending Caps: Define strict budget limits for each agent directly in the UI.
- Live Usage UI: An unobtrusive circular progress bar shows current API usage at a glance, placed right next to the agent's name.
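
As a rough illustration of how a per-agent cap and a usage indicator could fit together, here is a minimal Python sketch. It is not taken from the OpenClaw Client; the class name `AgentBudget`, the `record` method, and the per-million-token prices are all hypothetical. The `progress` value is the fraction a circular progress bar would render.

```python
from dataclasses import dataclass


class BudgetExceeded(Exception):
    """Raised when a call would push an agent past its spending cap."""


@dataclass
class AgentBudget:
    # All names here are illustrative assumptions, not the client's real API.
    name: str
    cap_usd: float           # strict per-agent spending cap
    spent_usd: float = 0.0   # running total of recorded API cost

    def record(self, input_tokens: int, output_tokens: int,
               in_price_per_mtok: float, out_price_per_mtok: float) -> float:
        """Add the cost of one API call, refusing it if the cap would be exceeded."""
        cost = (input_tokens * in_price_per_mtok
                + output_tokens * out_price_per_mtok) / 1_000_000
        if self.spent_usd + cost > self.cap_usd:
            raise BudgetExceeded(f"{self.name}: cap of ${self.cap_usd:.2f} reached")
        self.spent_usd += cost
        return cost

    @property
    def progress(self) -> float:
        """Fraction of the cap used, clamped to [0, 1] for a progress ring."""
        if self.cap_usd <= 0:
            return 1.0
        return min(self.spent_usd / self.cap_usd, 1.0)


# Example with made-up prices: $3 per million input tokens, $15 per million output.
budget = AgentBudget("researcher", cap_usd=1.00)
budget.record(100_000, 20_000, 3.0, 15.0)
print(f"{budget.name}: {budget.progress:.0%} of cap used")
```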

Agent Customization
- Sub-agent Management: Orchestrate, view, and manage child agents from the parent agent's interface.
- Skill Management: Toggle and configure specific skills (tools) per agent on the fly.
- Granular Model Selection: Choose a different model from your provider for each agent (e.g., a cheap model for simple tasks and a more capable model for complex coding).
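
The three controls above can be pictured as one per-agent configuration record. The Python sketch below is an assumption about how such settings might be modeled, not the client's actual data structures; `AgentConfig`, `toggle_skill`, `walk`, and the model names `small-model` and `large-model` are placeholders.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Iterator


@dataclass
class AgentConfig:
    # Hypothetical structure for illustration only.
    name: str
    model: str                                   # model chosen for this agent
    skills: dict[str, bool] = field(default_factory=dict)        # skill -> enabled?
    sub_agents: list[AgentConfig] = field(default_factory=list)  # child agents

    def toggle_skill(self, skill: str) -> bool:
        """Flip a skill on or off and return its new state."""
        self.skills[skill] = not self.skills.get(skill, False)
        return self.skills[skill]

    def enabled_skills(self) -> list[str]:
        """Names of the skills currently switched on, sorted alphabetically."""
        return sorted(name for name, on in self.skills.items() if on)

    def walk(self, depth: int = 0) -> Iterator[tuple[int, AgentConfig]]:
        """Yield this agent and all descendants, parent first, with nesting depth."""
        yield depth, self
        for child in self.sub_agents:
            yield from child.walk(depth + 1)


# A cheap model for the planner, a more capable one for the coding sub-agent.
coder = AgentConfig("coder", model="large-model",
                    skills={"shell": True, "web_search": False})
planner = AgentConfig("planner", model="small-model", sub_agents=[coder])
coder.toggle_skill("web_search")

for depth, agent in planner.walk():
    print("  " * depth + f"{agent.name} [{agent.model}] {agent.enabled_skills()}")
```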

The update is available now in the project's GitHub repository. According to the developer, the goal was a cleaner local UI for agent management.
📖 Read the full source: r/clawdbot