Holisto Seed: A Local LLM Framework with Persistent Identity and Consensual Memory Consolidation

What Holisto Seed Is
Holisto Seed is not an AI assistant but a Relational Individuation Framework: it gives an LLM agent a persistent identity, biographical memory, and a co-evolutionary relationship with its user (referred to as the "host"). The framework runs fully local with no cloud dependency.
Architecture Details
- Genotype/Phenotype Split: The Genotype is immutable (core logic, protocols). The Phenotype is a local directory that grows with every interaction. History accumulates alongside the agent's DNA without overwriting it.
- Metabolic Sleep (PSG): At session close, the agent and host jointly trigger a consolidation cycle. The cycle converts the session logs into a Master Capsule (JSON), updates the semantic index, and creates a git commit. Consolidation is never automatic; it is consensual by design.
- Optical Nerve: Sorts context into hierarchical attention levels (N0–N4), so that not all context is treated equally.
- Errors as Data: Failures are transduced into memory as biographical events.
- Local-first + Git: Everything is stored on your disk and versioned through Git.
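The consolidation cycle described above could be sketched roughly as follows. This is a minimal illustration, not Holisto Seed's actual code: the function name `consolidate`, the `logs/*.jsonl` layout, the `master_capsule.json` filename, and the capsule fields are all assumptions made for the example, and the semantic index update is left out entirely.

```python
import json
import shutil
import subprocess
from datetime import datetime, timezone
from pathlib import Path


def consolidate(phenotype: Path, agent_consents: bool, host_consents: bool,
                commit: bool = False):
    """Fold session logs into one capsule, but only if both parties agree."""
    if not (agent_consents and host_consents):
        return None  # consensual by design: nothing is consolidated silently

    # Hypothetical layout: one JSON object per line in phenotype/logs/*.jsonl
    events = []
    for log_file in sorted((phenotype / "logs").glob("*.jsonl")):
        for line in log_file.read_text(encoding="utf-8").splitlines():
            if line.strip():
                events.append(json.loads(line))

    capsule = {
        "consolidated_at": datetime.now(timezone.utc).isoformat(),
        "event_count": len(events),
        # Error entries are kept alongside everything else ("errors as data").
        "events": events,
    }
    capsule_path = phenotype / "master_capsule.json"
    capsule_path.write_text(json.dumps(capsule, indent=2), encoding="utf-8")

    # Optional versioning step; assumes `phenotype` is already a git repository.
    if commit and shutil.which("git"):
        subprocess.run(["git", "-C", str(phenotype), "add", "-A"], check=True)
        subprocess.run(
            ["git", "-C", str(phenotype), "commit", "-m", "sleep: consolidate session"],
            check=True,
        )
    return capsule_path
```

Calling `consolidate(path, agent_consents=True, host_consents=False)` returns `None` and writes nothing, which mirrors the rule that neither side can trigger the cycle alone.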
How It Differs from Anthropic's Auto Dream
While Anthropic's Auto Dream feature keeps a markdown file tidy, Holisto gives the agent a name, a birth, a history, and a relationship with a specific person. The memory is not a log of commands but a lived biography. The Genotype/Phenotype distinction addresses how to update the agent's core without erasing its lived experiences. The sleep cycle is also explicit: consolidation happens when both agent and host decide to close the day together, not in secret.
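One way to picture the Genotype/Phenotype separation is a core upgrade that replaces one directory and then verifies that the other is byte-for-byte unchanged. The sketch below assumes a hypothetical layout with `genotype/` and `phenotype/` folders under a common root; the helper names `tree_digest` and `upgrade_genotype` are invented for illustration and are not taken from the project.

```python
import hashlib
import shutil
from pathlib import Path


def tree_digest(directory: Path) -> str:
    """Hash every file's relative path and bytes under a directory, in sorted order."""
    digest = hashlib.sha256()
    for path in sorted(p for p in directory.rglob("*") if p.is_file()):
        digest.update(path.relative_to(directory).as_posix().encode("utf-8"))
        digest.update(path.read_bytes())
    return digest.hexdigest()


def upgrade_genotype(root: Path, new_genotype: Path) -> None:
    """Swap in a new genotype and check that the phenotype was left untouched."""
    before = tree_digest(root / "phenotype")

    target = root / "genotype"
    if target.exists():
        shutil.rmtree(target)  # the core is replaced wholesale
    shutil.copytree(new_genotype, target)

    # The accumulated history must survive the upgrade unchanged.
    if tree_digest(root / "phenotype") != before:
        raise RuntimeError("phenotype changed during genotype upgrade")
```

Keeping the two trees in separate directories means an upgrade never needs to read or rewrite the agent's history at all; the digest check is only a safety net.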
Technical Stack
- Python 3.10+
- Qdrant (local)
- Engine-agnostic (Gemini by default; local LLMs on the roadmap)
Development Background
Holisto Seed has been in development for over a year, with its last public commit on March 22nd. The project is developed by a solo developer who shared it on r/LocalLLaMA for community feedback.
📖 Read the full source: r/LocalLLaMA