OpenClaw developer builds Kumiho cognitive memory plugin for persistent agent collaboration

A developer has built Kumiho, a cognitive memory system for OpenClaw that addresses the framework's lack of persistent memory across sessions. The system uses a knowledge graph to maintain context from previous conversations, decisions, and collaborative work.
What Kumiho does
Kumiho is an AI cognitive memory system backed by a knowledge graph. The core concept is graph-native memory where facts connect to other facts, decisions have provenance, and outputs have versioned identities for later retrieval.
OpenClaw integration
The developer created an OpenClaw plugin called @kumiho/openclaw-kumiho that:
- Hooks into every conversation to recall relevant context before the agent responds
- Captures structured summaries after conversations
- Runs a nightly Dream State cycle that enriches the graph while idle (deprecating stale facts, creating relationship edges, adding semantic tags)
Creative memory layer
Kumiho includes a creative memory system where:
- Every document, plan, or piece of code the agent produces gets a versioned
kref - Each output has a
DERIVED_FROMedge back to the memory that inspired it - When resuming a project weeks later,
creative_recallbrings the full history back
Developer motivation
The developer appreciated OpenClaw's multi-channel agents, plugin architecture, and seamless Telegram/Discord integration, but wanted memory that persisted beyond individual sessions. They specifically wanted the agent to remember the history of collaboration — ideas discussed, decisions made, and work in progress — rather than just factual recall.
📖 Read the full source: r/openclaw
👀 See Also

Fine-tuned Qwen3-0.6B model outperforms 120B teacher on structured function calling
Distil Labs published an end-to-end pipeline that fine-tunes a Qwen3-0.6B model to achieve 79.5% exact match on IoT smart home function calling, outperforming a 120B teacher model by 29 points. The pipeline uses production traces to generate synthetic training data without manual annotation.

Tredict MCP Server Enables Claude to Create and Push Training Plans to Sports Watches
A developer built a Tredict MCP Server for Claude.ai and Claude Code that creates complex endurance training plans via prompts and automatically uploads structured workouts to Garmin, Coros, Suunto, and Wahoo watches. The server includes an MCP App for visual feedback within Claude chat.

ClawCut Proxy Released on GitHub to Optimize OpenClaw for Small LLMs
ClawCut is an experimental proxy that manipulates, injects JSON calls, and extracts JSON clutter from OpenClaw to reduce cognitive overload on small models (7B-8B) running on limited hardware.

Ollama's Technical Issues and Community Controversy
Ollama, a popular local LLM tool, faces criticism for downplaying its reliance on llama.cpp, license compliance issues, and technical problems with its custom backend including performance regressions and reintroduced bugs.