Recall: Local Project Memory for Claude Code — No Tokens Spent on Summaries

Recall is a fully-local project memory plugin for Claude Code that solves the cold-start problem without spending any model tokens on summarization. It captures session transcripts into .recall/history.md and condenses them into a compact context.md (~1–2K tokens) using a classical Python summarizer — not an LLM call.
How It Works
- During a session: The
Stop/SessionEndhooks append new activity incrementally to.recall/history.md— only new turns, fully local. - At session start: The
SessionStarthook surfacescontext.mdand prompts Claude to confirm: resume from saved context? and keep logging this session?
Key Advantages
- Zero token spend on memory: Summarization is done locally by a deterministic algorithm, not by an API call. No API key or external model required.
- Privacy: Transcripts (code, paths, secrets) never leave your machine. Most memory tools pipe context to a model endpoint; Recall doesn't.
- Low friction: No
pip install, no local model to run, no key configure — works offline immediately on plugin load.
Output Files
Two files in .recall/:
history.md— append-only log of prompts, replies, files touched, commands run.context.md— overwritten summary containing: goal, summary, next steps/open threads, files touched, where you left off.
Comparison with Built-in Claude Code Memory
| Feature | CLAUDE.md | --continue / --resume | Recall |
|---|---|---|---|
| What | Hand-written notes & rules | Reloads a prior conversation | Auto-captured session log + local summary |
| Upkeep | Manual | None (you pick session) | None — written as you work |
| Holds | Instructions to follow | Full prior transcript | Goal, files, commands, where you left off, next steps |
| Cost to resume | Small | Large (replays full transcript) | ~1–2K tokens (compact digest) |
| Form | Markdown you edit | Local session state | Plaintext in .recall/ — diffable & shareable |
| Claude treats it as | Instructions | The conversation | Fenced untrusted reference data |
In short: CLAUDE.md is how I want you to work; Recall is here’s what we did last time and where we stopped — produced offline with zero model tokens spent.
📖 Read the full source: HN LLM Tools
👀 See Also

OpenClaw Agent Gains Phone Call Capability Through Custom Skill
A developer created a custom skill for self-hosted OpenClaw agents that enables phone call functionality, allowing the agent to initiate calls based on triggers like build completions or server outages. The implementation provides voice interaction with full chat capabilities including web searches and alert setup.

OpenClaw extension routes requests through Claude Code CLI instead of API
An OpenClaw extension spawns the Claude CLI binary as a subprocess, routing requests through Claude Code CLI instead of the Anthropic API. This provides the full Claude Code experience at the flat rate of a max plan.

Claude Code Container Provides Zero-Config Docker Isolation for Claude Code
Claude Code Container (ccc) is a free, open-source tool that automatically creates per-project Docker containers for Claude Code with full isolation and zero configuration. It forwards host environment variables, mounts SSH keys, provides transparent localhost proxy, and includes Chromium with chrome-devtools MCP pre-configured.

User-built PTC for Claude Code shows 40-65% token savings on analysis tasks, not code writing
A developer built a local PTC implementation called Thalamus for Claude Code and analyzed 79 real sessions, finding 40-65% token savings on analysis tasks but near-zero savings on code-writing tasks. The agent used execute() primarily for general Python computation rather than batching tool calls.