CogniLayer: An MCP Server for Persistent Memory in Claude Code

CogniLayer is an MCP server built specifically for Claude Code that provides persistent memory across sessions. The tool addresses the problem where Claude Code forgets everything when starting a new session—including project architecture, past decisions, and debugging insights—forcing users to re-explain context repeatedly.
Technical Implementation
The system uses a SQLite database with FTS5 full-text search and vector embeddings (fastembed + sqlite-vec) to store knowledge locally. It implements hybrid search that combines keyword matching with semantic vector similarity.
Key Features
- Stores 14 types of facts: decisions, patterns, gotchas, error fixes, API contracts, and more
- Staleness detection: warns when remembered facts reference changed files
- Session bridges: automatically passes context between sessions
- Heat decay system: facts age over time (hot/warm/cold) with frequently accessed information staying relevant
- Identity Card system: prevents deploying to the wrong server
- 3 hooks: SessionStart, SessionEnd, PostToolUse
- 10 MCP tools and 7 slash commands
Setup and Usage
To install and use CogniLayer:
git clone https://github.com/LakyFx/CogniLayer.git
python install.py
After starting Claude Code in any project, run the /onboard command. The tool runs entirely locally with SQLite and works on Windows, macOS, and Linux.
The project was built using Claude Code sessions, with Claude helping design the database schema, write the MCP server, build the hook system, implement the hybrid search ranker, and iterate on the embedding pipeline. The tool is 100% free, open source under GPL v3, with UI strings currently in Czech but functional with any language.
📖 Read the full source: r/ClaudeAI
👀 See Also

OmniCoder-9B fine-tune shows strong performance for agentic coding on 8GB VRAM systems
A Reddit user tested OmniCoder-9B, a fine-tune of Qwen3.5-9B on Opus traces, with OpenCode and reported 40+ tokens per second speeds using Q4_K_M GGUF quantization at 100k context length on an 8GB VRAM system.

Bullshit Benchmark Tests LLM Resistance to Nonsensical Prompts
The Bullshit Benchmark evaluates whether AI models identify and push back on obvious nonsense prompts instead of confidently generating incorrect answers. Results show Claude models perform significantly better than Gemini models at detecting nonsensical questions.

engram v3.4.0 Adds Anthropic Plugin to Keep Claude Code Running Under New Rate Limits
engram v3.4.0 introduces a dedicated Anthropic plugin for Claude Code, adding three skills to manage costs, query context, and surface errors. Install with `/plugin install engram` or `npm install -g engramx@latest`.

CRMy: Open Source CRM and Customer Context Engine for OpenClaw
CRMy is an open source CRM and Customer Context Engine built specifically for OpenClaw agents. It includes a complete CLI, OpenClaw plugin with 12 CRM tools, PostgreSQL backend, and self-hosted deployment with two commands.