ClawCodex /advisor Mode: Pair Cheap Worker with Expensive Reviewer to Cut Costs Without Losing Quality

✍️ OpenClawRadar📅 Published: May 27, 2026🔗 Source
ClawCodex /advisor Mode: Pair Cheap Worker with Expensive Reviewer to Cut Costs Without Losing Quality
Ad

Most agent CLIs force you to pick a single model: Opus delivers great decisions but burns through your budget, while Haiku is cheap but misses critical architectural calls. ClawCodex, an open-source Python port of Claude Code, solves this with an /advisor mode that pairs a cheap worker model with an expensive reviewer model at key decision points.

How It Works

The cheap worker (e.g., haiku-4-5, deepseek-v4-pro) handles the heavy lifting — file reads, edits, test runs. At decision points — before committing to an interpretation, before declaring a task done, or when stuck — the worker pauses and consults a stronger reviewer (e.g., opus-4-7). The reviewer sees the full conversation (every tool call, every result) and returns a short Gaps / Risks / Do-next advice snippet. The worker then continues.

Two Execution Modes

  • Server-side (Anthropic 1P): Uses an advisor beta header — one roundtrip, prompt-cache friendly. Both worker and advisor run on Anthropic.
  • Client-side (any provider): The worker emits a regular tool_use, the agent intercepts it and makes a separate call to the configured advisor model. Two roundtrips, but you can mix providers — e.g., DeepSeek worker + Claude Opus advisor, or Gemini worker + GLM advisor.
Ad

Configuration & Monitoring

Configuration is a single line in the REPL:

/advisor anthropic:claude-opus-4-7
/advisor deepseek:deepseek-v4-pro

The status bar shows worker tokens, advisor tokens, and USD cost separately, so you can see exactly where your spend is going.

Benchmarks & Compatibility

ClawCodex natively supports Anthropic, OpenAI, Gemini, DeepSeek, GLM, Minimax, and OpenRouter. On SWE-bench Verified the agent scores 58.2% with Gemini 2.5 Pro vs. OpenClaude’s 53% under the same harness.

The Hard Part: Prompts

The trickiest engineering challenge was preventing the advisor from simply echoing the worker’s plan back — early versions wasted context on verbatim repeats. The fix was a strict “no first-person voice, no echoes” rule combined with the Gaps / Risks / Do-next template.

Who It's For

Developers using AI coding agents who want Opus-level architectural judgment without paying Opus rates for every single call.

📖 Read the full source: r/openclaw

Ad

👀 See Also

OpenMontage: Open-Source Agentic Video Production System for AI Coding Assistants
Tools

OpenMontage: Open-Source Agentic Video Production System for AI Coding Assistants

OpenMontage is an open-source video production system that transforms AI coding assistants like Claude Code into full production studios. It handles research, scene planning, script writing, voice narration, music selection, subtitle generation, and validation through automated pipelines.

OpenClawRadar
MCP Server Enables AI Agents to Make Real Purchases with Ephemeral Virtual Cards
Tools

MCP Server Enables AI Agents to Make Real Purchases with Ephemeral Virtual Cards

A developer has built an MCP server that allows AI agents to complete real purchases using ephemeral virtual Visa cards issued just-in-time. The system requires user approval via MFA and issues cards locked to specific merchants with 15-minute TTLs.

OpenClawRadar
Code Decisions: Open Source Claude Plugin Captures Technical Decisions
Tools

Code Decisions: Open Source Claude Plugin Captures Technical Decisions

Code Decisions is an open source Claude Code plugin that captures technical decisions from conversations and surfaces them when affected files are edited. It writes decisions as markdown files to .claude/decisions/ with an affects field pointing to governed files.

OpenClawRadar
Microsoft Teams SDK Adds HTTP Server Adapter for Existing AI Agents
Tools

Microsoft Teams SDK Adds HTTP Server Adapter for Existing AI Agents

The Microsoft Teams SDK now includes an HTTP server adapter that lets developers connect existing AI agents to Teams without rewriting their code. It works with LangChain chains, Slack bots, and Azure Foundry deployments by injecting a POST /api/messages endpoint into existing Express servers.

OpenClawRadar