Codev: AI Agent Workflow Cuts 106 PRs in 14 Days

Codev is an open-source AI agent coordination system that enforces a structured development workflow. The project demonstrates how to move AI from prototyping to production work with specific practices extracted from handling 106 pull requests in 14 days.

Six core practices

Specs and plans are source code: Specifications and plans live in git alongside source code, not in chat history. A new agent reads arch.md for the big picture, then its specific spec. This ensures you always know why something was built.
Three models review every phase: Claude, Gemini, and Codex catch almost entirely different bugs. No single model found more than 55% of issues. In testing, 20 bugs were caught before shipping: Claude Code found 5 bugs, while Gemini and Codex caught another 15, including a severe security issue Claude missed.
Enforce the process, don't suggest it: A state machine forces Spec → Plan → Implement → Review → PR. The AI can't skip steps, and tests must pass before advancing. The system provides rails because AIs don't stick to the plan by themselves.
Annotate, don't edit: Most work involves writing specs and reviews that guide the code, rather than hacking at files in an open-ended chat.
Agents coordinate agents: An architect agent spawns builder agents into isolated git worktrees. You direct the architect; it directs the builders. They message each other asynchronously.
Manage the whole lifecycle: Most AI tools help write code faster (about 30% of the job). The other 70% involves planning, reviewing, integrating, deployment scripts, and managing staging vs production. Codev has AI run the entire pipeline from spec to PR and beyond.

Results and costs

The system enabled one engineer to produce what a team of 3-4 would typically do. Code quality measured 1.2 points better on a 10-point scale compared to using Claude Code alone. The approach takes longer and uses more tokens, but costs remain reasonable at approximately $1.60 per PR.

According to the developer, the protocol enforcement is the game changer: "I would find the AI just wouldn't stick to specs or plans." The agent coordination also proved effective, with the architect agent managing multiple builder agents fixing different bugs simultaneously.

📖 Read the full source: HN AI Agents

Codev: AI agent workflow for 106 PRs in 14 days

Six core practices

Results and costs

👀 See Also

SkyClaw v2.2 Rust AI Agent Runtime Adds OpenAI OAuth and Custom Tool Authoring

ClawControl v1.7.1 fixes daily usage issues in OpenClaw client

DeepSeek Reasonix: Native Coding Agent with High Caching and Low Cost

Six GitHub Repositories for Claude Code Development