Memento v1.0: Local Persistent Memory for AI Coding Agents

✍️ OpenClawRadar📅 Published: March 24, 2026🔗 Source

What Memento v1.0 Does

Memento v1.0 provides a local-first persistent memory layer for AI coding agents. Everything runs on your machine — embeddings, storage, and search — with no cloud requirements or API keys needed after setup.

Key Technical Details

Embeddings: Uses all-MiniLM-L6-v2 via @xenova/transformers (384 dimensions) running fully offline. Optional cloud embeddings via environment variables for OpenAI (text-embedding-3-small) or Gemini (embedding-001).

Storage: Local JSON + HNSW index by default. Optional ChromaDB or Neo4j support.

Search: HNSW index for approximate nearest neighbor search (<50ms on 2000+ memories). Full BM25 implementation with k1=1.2, b=0.75 for keyword search. Hybrid mode combining 70% cosine similarity + 30% BM25.

Deduplication: SHA-256 + 0.92 cosine threshold.

Resilience features: Circuit breaker, write-ahead log, LRU cache.

Memory management: 347-day exponential decay on importance scores.

Setup and Usage

Install with: npx memento-memory setup

Migration tool: memory_migrate re-embeds your entire store when switching embedding providers — no data loss.

IDE Support and Tools

Multi-IDE compatibility: Claude Code, Cursor, Windsurf, OpenCode — all share the same local store.

17 MCP tools across save/recall/search/export/import/ingest/compact/graph/session lifecycle.

Privacy and Licensing

Zero telemetry — your architectural decisions and code patterns never leave your machine. Works without internet after setup. AGPL-3.0 licensed and self-hostable in one command.

📖 Read the full source: r/LocalLLaMA

👀 See Also

Tools

OpenClaw Implements Agent History Compression to Reduce Context Usage

OpenClaw now compresses agent history by replacing completed subtask logs with structured summaries, reducing ~1M tokens to ~30K. The system uses a 4-pass scanner to identify task lifecycles and generates masked summaries that maintain agent compatibility.

Mar 10, 2026, 09:45 AM UTC

OpenClawRadar

Tools

SendToAI VS Code Extension Solves Claude's 20-File Limit with Project Bundling

SendToAI is a free VS Code extension that bundles entire projects into a single clipboard paste, bypassing Claude's 20-file upload limit. It includes visual file selection, token counting, cost estimates, and project notes that persist across sessions.

Mar 29, 2026, 08:45 AM UTC

OpenClawRadar

Tools

Paseo: Open-Source Interface for Claude Code, Codex, Copilot, OpenCode, and Pi Agents

Paseo is a self-hosted interface that runs Claude Code, Codex, Copilot, OpenCode, and Pi agents in parallel. It offers voice control, cross-device access, and a CLI. No telemetry, no forced login.

Jun 4, 2026, 12:16 PM UTC

OpenClawRadar

Tools

Open-source MCP suite improves Claude Code generation quality by 15-20%

An open-source MCP suite consisting of three local servers and a prompt skill addresses the 'bad token' problem in AI code generation, with one customer reporting 15-20% quality improvement for Claude Code.

Mar 19, 2026, 02:45 PM UTC

OpenClawRadar