Traversable Skill Graph for Persistent AI Agent Memory in Codebases

A Practical Solution to Stateless AI Assistants
AI coding assistants lack persistent memory between sessions, forcing developers to start from zero each time. The common approach of dumping everything into large rules files like .cursorrules fails due to token limits and dilution of instructions.
The solution presented is progressive disclosure through a traversable skill graph that lives inside the codebase. The AI navigates this graph autonomously across sessions.
Three-Layer Architecture
The system has three distinct layers:
- Layer 1 (Always Loaded): Under 150 lines (300 tokens). Contains stack identity, folder conventions, and non-negotiables. Includes one outbound pointer to
HANDOVER.md. - Layer 2 (Loaded Per Session):
HANDOVER.mdserves as an attention router, not a document. It tells the AI which domain file to load based on current task (payments, auth, database, api-routes). Each domain file ends with instructions pointing to the next relevant file, creating a self-directing system. - Layer 3 (Loaded Per Task): Prompt library with 12 categories. Each entry includes context, build, verify, and debug sections. The AI checks the index, loads the category, and follows the pattern.
Core Insight: Self-Directing Instructions
The key innovation is that instructions carry meaning, not just references. For example: "load security/threat-modeling.md before modifying webhook handlers" tells the AI when and why, not just what.
The developer has built this into a SaaS template that ships with the codebase, available at launchx.page for those wanting to examine the full graph structure.
📖 Read the full source: r/LocalLLaMA
👀 See Also

LM Studio plugins add web image analysis for vision-capable LLMs
A developer created plugins for LM Studio that enable vision-capable LLMs to fetch and analyze images from the web, with automatic image processing and tool chaining. The plugins work with models like Qwen 3.5 9b/27b and include updated Duck-Duck-Go and Visit Website functionality.

Benchmark shows context engine reduces AI coding agent costs by 3x on SWE-bench
A benchmark of 4 coding agents using Claude Opus 4.5 on SWE-bench Verified shows a context engine achieved 73% pass rate at $0.67/task, while other agents cost up to $1.98/task for similar or lower performance.

Claude Code Workflow Visual Details Memory Hierarchy and Skills System
A Reddit user shared a visual diagram showing how Claude Code organizes memory through layered CLAUDE.md files and implements reusable skills via SKILL.md files. The workflow loop suggests using Plan mode with auto-accept and frequent commits.

Hardware widget and Chrome extension monitor Claude API rate limits
A developer built a hardware widget using ESP8266 and OLED display that tracks Claude's rate limits in real time, plus a Chrome extension that intercepts Claude's internal /usage API and shows usage patterns. The total BOM cost is approximately $6.50.