Recall: Local Project Memory for Claude Code — No Tokens Spent on Summaries

✍️ OpenClawRadar📅 Published: June 22, 2026🔗 Source
Recall: Local Project Memory for Claude Code — No Tokens Spent on Summaries
Ad

Recall is a fully-local project memory plugin for Claude Code that solves the cold-start problem without spending any model tokens on summarization. It captures session transcripts into .recall/history.md and condenses them into a compact context.md (~1–2K tokens) using a classical Python summarizer — not an LLM call.

How It Works

  • During a session: The Stop / SessionEnd hooks append new activity incrementally to .recall/history.md — only new turns, fully local.
  • At session start: The SessionStart hook surfaces context.md and prompts Claude to confirm: resume from saved context? and keep logging this session?

Key Advantages

  • Zero token spend on memory: Summarization is done locally by a deterministic algorithm, not by an API call. No API key or external model required.
  • Privacy: Transcripts (code, paths, secrets) never leave your machine. Most memory tools pipe context to a model endpoint; Recall doesn't.
  • Low friction: No pip install, no local model to run, no key configure — works offline immediately on plugin load.
Ad

Output Files

Two files in .recall/:

  • history.md — append-only log of prompts, replies, files touched, commands run.
  • context.md — overwritten summary containing: goal, summary, next steps/open threads, files touched, where you left off.

Comparison with Built-in Claude Code Memory

FeatureCLAUDE.md--continue / --resumeRecall
WhatHand-written notes & rulesReloads a prior conversationAuto-captured session log + local summary
UpkeepManualNone (you pick session)None — written as you work
HoldsInstructions to followFull prior transcriptGoal, files, commands, where you left off, next steps
Cost to resumeSmallLarge (replays full transcript)~1–2K tokens (compact digest)
FormMarkdown you editLocal session statePlaintext in .recall/ — diffable & shareable
Claude treats it asInstructionsThe conversationFenced untrusted reference data

In short: CLAUDE.md is how I want you to work; Recall is here’s what we did last time and where we stopped — produced offline with zero model tokens spent.

📖 Read the full source: HN LLM Tools

Ad

👀 See Also

OpenClaw Agent Gains Phone Call Capability Through Custom Skill
Tools

OpenClaw Agent Gains Phone Call Capability Through Custom Skill

A developer created a custom skill for self-hosted OpenClaw agents that enables phone call functionality, allowing the agent to initiate calls based on triggers like build completions or server outages. The implementation provides voice interaction with full chat capabilities including web searches and alert setup.

OpenClawRadar
OpenClaw extension routes requests through Claude Code CLI instead of API
Tools

OpenClaw extension routes requests through Claude Code CLI instead of API

An OpenClaw extension spawns the Claude CLI binary as a subprocess, routing requests through Claude Code CLI instead of the Anthropic API. This provides the full Claude Code experience at the flat rate of a max plan.

OpenClawRadar
Claude Code Container Provides Zero-Config Docker Isolation for Claude Code
Tools

Claude Code Container Provides Zero-Config Docker Isolation for Claude Code

Claude Code Container (ccc) is a free, open-source tool that automatically creates per-project Docker containers for Claude Code with full isolation and zero configuration. It forwards host environment variables, mounts SSH keys, provides transparent localhost proxy, and includes Chromium with chrome-devtools MCP pre-configured.

OpenClawRadar
User-built PTC for Claude Code shows 40-65% token savings on analysis tasks, not code writing
Tools

User-built PTC for Claude Code shows 40-65% token savings on analysis tasks, not code writing

A developer built a local PTC implementation called Thalamus for Claude Code and analyzed 79 real sessions, finding 40-65% token savings on analysis tasks but near-zero savings on code-writing tasks. The agent used execute() primarily for general Python computation rather than batching tool calls.

OpenClawRadar