Cold Validation Architecture: Dual-Agent Code Review System Open-Sourced

Cold Validation Architecture is an open-source system that implements dual-agent validation for AI-generated code. One agent writes the code, while a separate agent reviews it independently without access to the builder's reasoning or context.
How It Works
The system addresses single-agent self-review bias by applying separation of duties similar to independent audits. The reviewer operates in complete isolation from the builder's thought process.
Workflow Phases
- Write plan →
/review-plan(Gate A) - Implement →
/review-impl(Gate C) - Ship →
/acceptance-report(Gate D)
Key Implementation Details
- The reviewer runs in an isolated temporary directory
- Reviewer only sees: plan document, code diff, and test output
- Findings persist with fingerprints for tracking
- Builder can disagree with reviewer's rationale
- Maximum 2 rounds per phase
- Built with bash scripts and JSON schemas
- Currently works with Claude Code + Codex CLI
- Pattern is agent-agnostic (can work with different AI agents)
Technical Specifications
The system is implemented as bash scripts that coordinate between two separate AI agents. JSON schemas define the structure for plan documents, code reviews, and acceptance reports. The isolated execution environment ensures the reviewer has no access to the builder's internal reasoning or intermediate steps.
This approach is useful for developers who want to implement more rigorous validation for AI-generated code, particularly when working with complex systems where single-agent self-review might introduce confirmation bias.
📖 Read the full source: r/LocalLLaMA
👀 See Also

llmLibrarian: Local RAG Engine with MCP Integration for File-Based AI Search
llmLibrarian is a local RAG engine that exposes retrieval over MCP, allowing AI agents like Claude to query indexed files. It uses ChromaDB collections for organization, Ollama for synthesis, and keeps everything on-device.

Open Source MCP Server Connects Claude to Mailchimp API
A developer built a Mailchimp MCP server using Claude Code, providing 53 tools for campaigns, audiences, reports, automations, and e-commerce with built-in safety modes and read-only configuration.

Bitcoin MCP Server with 43 Tools for AI Coding Agents
bitcoin-mcp is an MCP server with 43 Bitcoin tools including fee advisors, mempool analysis, and inscription detection. It works with Claude Desktop, Claude Code, Cursor, VS Code, and Windsurf using live data from APIs or local nodes.

bunx ccusage Shows $18,450 in Credits Burned — Flat Plans Absorb the Cost
A user on r/ClaudeAI ran bunx ccusage and discovered $18,450 in credits used in May — 248M input tokens, 42M output tokens, 21.7B with cache reads — while paying only €400/month flat-rate for Claude Code and Codex.