Culpa: Open Source Deterministic Replay Engine for AI Agent Debugging

✍️ OpenClawRadar📅 Published: April 20, 2026🔗 Source

Culpa is an open source deterministic replay engine designed specifically for debugging AI agent sessions. The core problem it addresses is the nondeterministic nature of LLM agents—when they fail, you can't reproduce the exact failure by simply re-running the session.

How It Works

The tool records every LLM call along with the full execution context during an agent session. When you need to debug a failure, it replays the session using the recorded responses as stubs instead of making new API calls. This makes the replay fully deterministic and costs nothing since it doesn't hit the real APIs.

Key Features

Proxy Mode: Works with tools like Claude Code and Cursor without requiring any code changes
Python SDK: Available for developers building their own agents
API Support: Compatible with Anthropic and OpenAI APIs
Forking Capability: You can fork at any recorded decision point, inject a different response, and see what would have happened

Practical Benefits

Since the replay uses recorded responses instead of making actual API calls, debugging sessions incur zero API costs. The deterministic nature of replays makes it possible to reliably reproduce and analyze failures that would otherwise be impossible to recreate due to the inherent randomness in LLM responses.

The project is actively seeking feedback, particularly from developers building agent workflows. The creator notes they're a CS freshman and looking to improve the tool.

📖 Read the full source: r/LocalLLaMA

👀 See Also

Tools

Voxlert: Voice Notifications for Claude Code Sessions with Character Voices

Voxlert is a tool that hooks into Claude Code events and speaks notifications using distinct character voices like StarCraft Adjutant, SHODAN, GLaDOS, and HEV Suit. It uses an LLM via OpenRouter to generate in-character lines and runs locally with npm installation.

Mar 11, 2026, 03:45 AM UTC

OpenClawRadar

Tools

Krasis LLM Runtime Shows 8.9x Prefill and 4.7x Decode Speed Improvements Over Llama.cpp

Krasis LLM runtime now runs both prefill and decode entirely on GPU with different optimization strategies, achieving 8.9x faster prefill and 4.7x faster decode than llama.cpp on Qwen3.5-122B with a single 5090 GPU.

Mar 17, 2026, 06:45 PM UTC

OpenClawRadar

Tools

AI Agent Session Center: 3D Dashboard for Monitoring Claude Code Sessions

AI Agent Session Center is a real-time dashboard that visualizes Claude Code sessions as 3D robots in a cyberdrome, with animations showing agent status and features including live terminal views, approval alerts, and session resume. It installs via npx with lightweight bash hooks.

Feb 26, 2026, 01:45 PM UTC

OpenClawRadar

Tools

Jeeves: TUI for Browsing and Resuming AI Agent Sessions

Jeeves is a terminal user interface that lets you search, preview, and resume AI agent sessions from Claude Code, Codex, and OpenCode in a single view. It's written in Go and available via multiple package managers including Homebrew, Nix, and Go install.

Apr 18, 2026, 05:45 AM UTC

OpenClawRadar