LLMSpend: Open-source cost tracker for Anthropic and OpenAI SDKs

OpenClawRadar | Published: March 12, 2026

What LLMSpend does

LLMSpend is a Python package that tracks the cost of API calls made through the Anthropic and OpenAI SDKs. It was created because the Anthropic dashboard shows only total spend, with no breakdown by feature. The tool records tokens, cost, and latency for each call and groups the data by feature, model, user, or project.
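As a rough illustration of how a per-call cost could be derived from token counts, here is a minimal sketch. It is not LLMSpend's pricing engine: the `CallRecord` type, the `call_cost` helper, the model name `example-model`, and the prices are all hypothetical placeholders, not real rates.

```python
from dataclasses import dataclass

# Hypothetical per-million-token prices in USD. Placeholders, not real rates.
PRICES_PER_MTOK = {
    "example-model": {"input": 3.00, "output": 15.00},
}


@dataclass
class CallRecord:
    """Metrics for a single API call (hypothetical shape)."""
    model: str
    feature: str
    input_tokens: int
    output_tokens: int
    latency_ms: float


def call_cost(record: CallRecord) -> float:
    """Return the USD cost of one call from its token counts."""
    price = PRICES_PER_MTOK[record.model]
    return (
        record.input_tokens * price["input"]
        + record.output_tokens * price["output"]
    ) / 1_000_000


record = CallRecord("example-model", "chatbot", 1_000, 500, 1180.0)
print(f"${call_cost(record):.4f}")
```

Keeping the feature tag on the same record as the token counts is what would make a later breakdown by feature, model, or project a simple grouping step.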

How to use it

Install with pip install llmspend. Once the Anthropic SDK is imported, integration requires two lines of code:

import anthropic
from llmspend import monitor
client = monitor.wrap(anthropic.Anthropic(), project="my-app")

Then add an llmspend parameter to track specific features:

response = client.messages.create(
    model="claude-sonnet-4-6",
    max_tokens=1000,
    messages=[{"role": "user", "content": query}],
    llmspend={"feature": "chatbot"}
)
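The article does not show how the wrapper handles the extra llmspend argument, so the following is only a sketch of one plausible approach: monkey-patch the create method, strip the tag before calling the original, and time the call. `FakeMessages`, `wrap_create`, and the record fields are hypothetical names, and the fake client stands in for a real SDK so the example runs offline.

```python
import time


class FakeMessages:
    """Stand-in for an SDK's messages resource so the sketch runs offline."""

    def create(self, **kwargs):
        # Assumption: a real SDK would reject an unknown `llmspend` argument.
        if "llmspend" in kwargs:
            raise TypeError("unexpected keyword argument 'llmspend'")
        return {"usage": {"input_tokens": 12, "output_tokens": 34}}


def wrap_create(messages, log, project):
    """Monkey-patch `messages.create` to record usage metrics per call."""
    original = messages.create

    def tracked_create(**kwargs):
        tags = kwargs.pop("llmspend", {})  # strip the tag before the real call
        start = time.perf_counter()
        response = original(**kwargs)
        latency_ms = (time.perf_counter() - start) * 1000
        # Only metrics are recorded; the prompt and response text are not.
        log.append({
            "project": project,
            "feature": tags.get("feature", "untagged"),
            "model": kwargs.get("model"),
            "input_tokens": response["usage"]["input_tokens"],
            "output_tokens": response["usage"]["output_tokens"],
            "latency_ms": latency_ms,
        })
        return response

    messages.create = tracked_create
    return messages


log = []
messages = wrap_create(FakeMessages(), log, project="my-app")
messages.create(
    model="example-model",
    max_tokens=1000,
    messages=[{"role": "user", "content": "hi"}],
    llmspend={"feature": "chatbot"},
)
print(log[0]["feature"], log[0]["input_tokens"], log[0]["output_tokens"])
```

Removing the tag with `kwargs.pop` before delegating is what would let existing call sites keep working unchanged when no tag is passed.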

Reporting and dashboard

From the terminal, run llmspend stats --last 7d --by feature to get output like the following, where each row lists the feature, its call count, its cost, and its latency:

Total: $4.2100 across 847 calls
chatbot      512   $2.8900   1180ms
summarizer   335   $1.3200    640ms
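A per-feature report like this can be produced with a single GROUP BY over a table of call records. The sketch below uses Python's standard sqlite3 module with an in-memory database. The `calls` table, its columns, and the sample rows are hypothetical, and it assumes the latency column is a per-feature average, which the article does not state.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE calls (feature TEXT, cost_usd REAL, latency_ms REAL)"
)
# Made-up sample rows for illustration only.
rows = [
    ("chatbot", 0.010, 1200.0),
    ("chatbot", 0.020, 1000.0),
    ("summarizer", 0.005, 600.0),
]
conn.executemany("INSERT INTO calls VALUES (?, ?, ?)", rows)

# One row per feature: call count, total cost, average latency.
summary = conn.execute(
    """
    SELECT feature, COUNT(*), SUM(cost_usd), AVG(latency_ms)
    FROM calls
    GROUP BY feature
    ORDER BY SUM(cost_usd) DESC
    """
).fetchall()

total_cost = sum(cost for _, _, cost, _ in summary)
total_calls = sum(n for _, n, _, _ in summary)
print(f"Total: ${total_cost:.4f} across {total_calls} calls")
for feature, n, cost, latency in summary:
    print(f"{feature:<12}{n:>5}   ${cost:.4f}   {latency:.0f}ms")
```

Grouping by model, user, or project would only change the column named in the GROUP BY clause.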

Run llmspend dashboard to open a local web dashboard at localhost:8888.

Technical details

Local SQLite storage — no account needed, no data leaves your machine
Works with both Anthropic and OpenAI SDKs
Zero dependencies (pure Python standard library)
Never stores prompts or responses — only tracks cost metrics
No prompt logging, tracing, or evaluations — focused solely on cost tracking
MIT licensed, open source on GitHub
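To make the storage points above concrete, here is a minimal sketch of a metrics-only SQLite store built with the standard library alone. The schema, `open_store`, and `record_call` are hypothetical and are not taken from the LLMSpend source. The point is that the table has no column for prompt or response text.

```python
import sqlite3
import time

# Hypothetical schema: metrics only, no prompt or response columns.
SCHEMA = """
CREATE TABLE IF NOT EXISTS calls (
    ts REAL NOT NULL,
    project TEXT,
    feature TEXT,
    model TEXT,
    input_tokens INTEGER,
    output_tokens INTEGER,
    cost_usd REAL,
    latency_ms REAL
)
"""


def open_store(path=":memory:"):
    """Open (or create) the local database and ensure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    return conn


def record_call(conn, *, project, feature, model,
                input_tokens, output_tokens, cost_usd, latency_ms):
    """Persist the metrics for one call."""
    conn.execute(
        "INSERT INTO calls VALUES (?, ?, ?, ?, ?, ?, ?, ?)",
        (time.time(), project, feature, model,
         input_tokens, output_tokens, cost_usd, latency_ms),
    )
    conn.commit()


conn = open_store()
record_call(conn, project="my-app", feature="chatbot", model="example-model",
            input_tokens=12, output_tokens=34, cost_usd=0.0005,
            latency_ms=850.0)
columns = [row[1] for row in conn.execute("PRAGMA table_info(calls)")]
print(columns)
```

Passing a file path instead of `:memory:` would keep the data in a single local file, consistent with the no-account, nothing-leaves-the-machine design described above.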

The tool was built entirely with Claude Code in a single session, with Claude writing the monkey-patching logic, pricing engine, CLI, and web dashboard.

📖 Read the full source: r/ClaudeAI
