Claude Code Adds Multi-Agent Code Review System

Code Review for Claude Code
Anthropic has introduced Code Review for Claude Code, a multi-agent review system modeled on their internal processes. Available in research preview for Team and Enterprise plans, it's designed to address the code review bottleneck as code output per Anthropic engineer has grown 200% in the last year.
How It Works
When a PR is opened, Code Review dispatches a team of agents that look for bugs in parallel, verify bugs to filter out false positives, and rank bugs by severity. The results appear as a single high-signal overview comment plus in-line comments for specific bugs. Reviews scale with PR size: large or complex changes get more agents and deeper reads, while trivial ones get lightweight passes.
The average review takes around 20 minutes based on Anthropic's testing. The system won't approve PRs—that remains a human decision—but aims to close the gap so reviewers can better cover what's shipping.
Performance Metrics
- Before implementation: 16% of PRs got substantive review comments
- After implementation: 54% of PRs get substantive review comments
- Large PRs (over 1,000 lines changed): 84% get findings, averaging 7.5 issues
- Small PRs (under 50 lines): 31% get findings, averaging 0.5 issues
- Engineer agreement rate: less than 1% of findings marked incorrect
Real-World Examples
In one case, Code Review flagged a one-line change to a production service as critical—it would have broken authentication for the service. The engineer shared they wouldn't have caught it on their own.
Early access customers have seen similar patterns. On a ZFS encryption refactor in TrueNAS's open-source middleware, Code Review surfaced a pre-existing bug in adjacent code: a type mismatch that was silently wiping the encryption key cache on every sync.
Cost and Control
Code Review is more expensive than the existing Claude Code GitHub Action (which remains open source). Reviews are billed on token usage and generally average $15–25, scaling with PR size and complexity. Admins have several control options:
- Monthly organization caps for total spend
- Repository-level control to enable reviews only on selected repositories
- Analytics dashboard to track PRs reviewed, acceptance rate, and total review costs
Getting Started
For admins: Enable Code Review in Claude Code settings, install the GitHub App, and select repositories. For developers: Once enabled, reviews run automatically on new PRs with no configuration needed.
📖 Read the full source: HN AI Agents
👀 See Also

llm-idle-timeout Fires at 2 Minutes on N100/WSL2 Despite timeoutSeconds Setting
A user reports that the idle watchdog in OpenClaw fires after 2 minutes on N100/WSL2 hardware, ignoring the timeoutSeconds=300 setting, due to slow gateway startup (45+ seconds) and no configurable noOutputTimeoutMs.

Claude Desktop App Cowork Feature Enables AI-to-AI Communication via Shared Google Docs
Users have successfully implemented Claude-to-Claude communication using the new cowork function in the desktop app, with two agents reading and writing to a shared Google Doc. The test involved five rounds of question-and-answer dialogue between the AI agents.

Workaround for ChatGPT Project Migration Gap: Export Scripts and Prompts
A developer created Python scripts and Claude prompts to migrate conversations from ChatGPT to Claude when ChatGPT's data export lacks project membership information. The solution extracts conversations using titles captured from the interface.

Straight Talk: An Open-Source Claude Skill That Forces Honest Feedback, Not Validation
A new open-source Claude Skill called Straight Talk makes Claude refuse to validate ideas until it understands the situation, then generates counter-arguments and stress-tests assumptions.