Open-source memory system for LLM agents achieves high benchmark scores

Memory system for LLM agents with published benchmarks
A developer has built a persistent memory system for Claude Code and OpenClaw that gives LLM agents actual context continuity across sessions. The system achieves benchmark scores of 90.8% on LoCoMo (beating every published system) and 89.1% on LongMemEval.
Architecture and framework compatibility
The architecture is adapter-based, currently hooking into lifecycle events, but the core components (storage, retrieval, intelligence) are framework-agnostic. The retrieval pipeline uses a 4-channel RRF approach with FTS5, Qdrant KNN, recency, and graph walk. The intelligence layer includes intent classification, experience patterns, and RL policy components that could plug into any agent framework.
Setup and tech stack
Quick setup requires:
ollama pull snowflake-arctic-embed2
bun install && bun run build && bun run setup
node dist/angel/index.cjsTech stack includes TypeScript, SQLite (better-sqlite3), Qdrant, Ollama, esbuild, and Vitest.
Key design decisions
- Dual-write system with SQLite as truth source and Qdrant for acceleration, with graceful degradation
- Every operation is non-throwing — individual failures never break the pipeline
- Ephemeral hooks (millisecond lifetime) for capture, persistent Angel for reflection
- RL policy models are pure TypeScript (Float32Array math, no PyTorch)
- Content-length-aware embedding backfill in background
The project contains 29K lines of code, 1,968 tests, and is MIT licensed at https://github.com/grigorijejakisic/Claudex.
📖 Read the full source: r/openclaw
👀 See Also

Clooks: A Persistent Hook Runtime for Claude Code
Clooks is a persistent HTTP daemon that handles Claude Code hook dispatch without process spawning, reducing latency from ~34.6ms to ~0.31ms per invocation. It includes automatic migration, LLM handlers with prompt templates, dependency resolution, and plugin packaging.

OpenClaw developer builds unified memory system for AI agents
A developer has built a 15-tool unified memory system for OpenClaw AI agents that combines structured facts, vector search, entity graphs, episode timelines, hierarchical compression, and event-driven coordination. The system runs locally without cloud dependencies or monthly fees.

TranscriptionSuite v1.1.2 adds WhisperX, NeMo, and VibeVoice models
TranscriptionSuite v1.1.2 now offers three transcription pipelines: WhisperX with PyAnnote diarization, NeMo models (Parakeet & Canary) with PyAnnote diarization, and VibeVoice models with built-in diarization. The update includes a model manager, parallel processing, shortcut controls, and a 24kHz recording pipeline for VibeVoice.

codebase-md: Tool auto-generates CLAUDE.md with git hook maintenance
codebase-md v0.1.0 scans projects to generate CLAUDE.md files with architecture detection, dependency health checks, and git insights. It includes git hooks to keep documentation fresh and supports other AI coding tools with additional config files.