Galadriel: Open-Source Warm-Cache Harness for Persistent Claude Agents

A Reddit user has open-sourced Galadriel, a harness for persistent Claude agents that achieves 87% cost savings and sub-3s latency on 100K token contexts by optimizing prompt caching. The project, released under MIT license, targets the memory and cost issues often called the "Goldfish Problem" in AI coding agents.
Key Features
- 3-Tier Stacked Caching: Separate cache breakpoints for tool definitions, system prompts (
CLAUDE.md), and trailing conversation history. This avoids cache invalidation across different context segments. - Integrated MemPalace: A vector-based persistent memory system that does not break the prompt cache, enabling permanent recall.
- Privacy-first: Designed for private subnets — no middleman, no message caps, just your API key and rules.
- CLAUDE.md Guidelines (Karpathy-style): Built-in rules to prevent agent bloat (unnecessary context expansion).
Benchmarks
According to the author, tested against OpenClaw/Cursor workflows:
- Cost: $10 for every $100 normally spent (87% reduction).
- Latency: 100K token context drops from 11s to <3s (85% improvement).
Who It's For
Developers running persistent Claude agents for tasks like infrastructure management or codebase maintenance who are paying high API costs due to uncached context.
Setup
The harness is currently customized for Discord (the author's personal setup), but the caching logic is generic. Clone the repo and adapt the transport layer for your needs.
GitHub
github.com/avasol/galadriel-public (MIT License)
📖 Read the full source: r/openclaw
👀 See Also

OpenTrace: Self-Hosted Observability Server with 75+ MCP Tools
OpenTrace is a self-hosted observability server that provides logs, user analytics, and database introspection through 75+ MCP tools, running on a $4 VPS with SQLite storage and read-only Postgres connections.

AgentPeek: Open-source dashboard for monitoring Claude Code agent teams
AgentPeek is a local dashboard that hooks into Claude Code to provide visibility into agent teams, showing orchestration hierarchies, execution traces, token costs, and file operations. Installation requires cloning the GitHub repo and running pipx install.

ClawClone: Cloud Backup Tool for OpenClaw Workspaces
ClawClone is a tool that backs up OpenClaw workspaces to the cloud with one prompt and restores them with another. It was built after a developer lost a month's worth of training data.

TeamOut AI Agent for Company Retreat Planning
TeamOut has launched an AI agent that plans company events through conversation, handling venue sourcing, vendor coordination, flight cost estimation, itinerary building, and project management. The system uses multiple LLMs and specialized tools to manage planning as a stateful coordination problem.