Recall: A Persistent Memory MCP Server for Claude Code

Recall is an MCP server that provides Claude Code with persistent memory across sessions, addressing the issue where each new session starts from zero. The tool ships as a native Claude Code plugin and is available via /plugin install recall@claude-plugins-official, with setup initiated by /recall:setup to connect an API key.
Core Features
The system implements four lifecycle hooks:
- session-start: Loads recent memories and decisions
- observe: Silently captures important changes after file edits, filtering for high-signal events like git commits, test runs, and architectural decisions to avoid noise
- pre-compact: Saves critical state before context compaction to preserve nuanced reasoning that would otherwise be lost (e.g., "we chose Redis over Postgres because of X, Y, Z" becoming just "uses Redis")
- session-end: Writes a summary for the next session
Technical Implementation
Recall uses semantic search with embeddings, meaning Claude doesn't receive a wall of text but instead gets contextually relevant memories for the current task. The developer notes that auto-capture is tricky—storing every file edit creates noise, so the observe hook filters for meaningful events.
For team usage, the system implements a tenant + workspace model to handle workspace isolation and selective sharing across multiple Claude sessions. The pre-compact hook is identified as the most valuable feature, as compaction often eliminates nuanced decisions.
The project is open source under MIT license at github.com/joseairosa/recall, with a hosted version available at recallmcp.com that includes a free tier.
📖 Read the full source: r/ClaudeAI
👀 See Also

Claude Code Remote Control: Continue Local Sessions from Any Device
Claude Code Remote Control lets you continue local Claude Code sessions from other devices like phones or browsers while keeping everything running on your machine. It's available as a research preview on Pro and Max plans, requiring authentication and workspace trust setup.

TideSurf: DOM compression tool reduces web agent token usage 30x, speeds TTFT 12x
TideSurf v0.3 converts rendered DOM to markdown-like compressed format, reducing token consumption by 32x on GitHub pages versus raw DOM while adding 18 interactive tools for LLM agents.

Phantom: A Persistent AI Agent Built with Claude's Agent SDK
Phantom is an open-source Bun/TypeScript process that wraps Claude's Agent SDK (Opus 4.6) with persistent vector memory, a self-evolution engine, and an MCP server interface. It runs continuously on its own VM or Docker Compose and communicates via Slack.

Toroidal Logit Bias: Simple Inference-Time Trick Reduces Hallucination by 40%
A novel method maps tokens to a torus and boosts nearby logits, reducing factual errors without fine-tuning or RAG.