Memento v1.0: Local Persistent Memory for AI Coding Agents

What Memento v1.0 Does
Memento v1.0 provides a local-first persistent memory layer for AI coding agents. Everything runs on your machine — embeddings, storage, and search — with no cloud requirements or API keys needed after setup.
Key Technical Details
Embeddings: Uses all-MiniLM-L6-v2 via @xenova/transformers (384 dimensions) running fully offline. Optional cloud embeddings via environment variables for OpenAI (text-embedding-3-small) or Gemini (embedding-001).
Storage: Local JSON + HNSW index by default. Optional ChromaDB or Neo4j support.
Search: HNSW index for approximate nearest neighbor search (<50ms on 2000+ memories). Full BM25 implementation with k1=1.2, b=0.75 for keyword search. Hybrid mode combining 70% cosine similarity + 30% BM25.
Deduplication: SHA-256 + 0.92 cosine threshold.
Resilience features: Circuit breaker, write-ahead log, LRU cache.
Memory management: 347-day exponential decay on importance scores.
Setup and Usage
Install with: npx memento-memory setup
Migration tool: memory_migrate re-embeds your entire store when switching embedding providers — no data loss.
IDE Support and Tools
Multi-IDE compatibility: Claude Code, Cursor, Windsurf, OpenCode — all share the same local store.
17 MCP tools across save/recall/search/export/import/ingest/compact/graph/session lifecycle.
Privacy and Licensing
Zero telemetry — your architectural decisions and code patterns never leave your machine. Works without internet after setup. AGPL-3.0 licensed and self-hostable in one command.
📖 Read the full source: r/LocalLLaMA
👀 See Also

W2A — an open protocol for agent sensors: giving local agents real-time perception
W2A (World2Agent) is an open protocol standardizing the perception layer for AI agents — self-hostable, TS SDK, Apache 2.0. It lets agents receive real-time signals from sensors without one-off scripts.

MegaClaw: Containerized OpenClaw Setup with Playwright and Homebrew
MegaClaw is a two-image Podman setup for OpenClaw that addresses common installation issues like permission errors and missing dependencies. It uses a multi-stage build with pre-installed Playwright and Homebrew, and bakes user configuration into a runtime image.

Hollow AgentOS: Run Claude-like agents locally on RTX 5070 using Qwen 3.5 9B
A self-modifying agent system running Qwen 3.5 9B on local hardware cuts Claude API costs by 50%. Uses iterative testing and self-improvement loop to develop software without human intervention.

Open-source Claude skill for management consulting frameworks and case studies
A free, MIT-licensed Claude skill provides structured reference material for management consulting work, including frameworks, industry context, and case studies. The project consists of 80+ markdown files organized by domain and seeks contributors to expand coverage.