Nelson v2.2.3 Released: Multi-Agent Coordination for Claude Code, Plus a Discrete-Event Simulation Benchmark

Nelson v2.2.3 is out — a multi-agent coordination skill for Claude Code that uses a Royal Navy metaphor (admiral, captains, ships, crew) to keep parallel agents from stepping on each other's work. MIT licensed, ~300 stars on GitHub.
Installation
Run these commands inside Claude Code:
/plugin marketplace add aspegio/nelson
/plugin install nelson@nelson
Use Nelson to build me a battleships game.Then observe admiral, captains, and ships coordinating.
Benchmark Results
The real news is a benchmark built by the same author. It tests 13 combinations of model, CLI, and skill on a discrete-event simulation task (synthetic mine throughput). Quality scores (out of 100):
- ouroboros-max-thinking (opus-4-7): 97
- plan-mode (opus-4-7): 96
- agent-teams-nelson-max-thinking (opus-4-7): 95
- superpowers-max-thinking (opus-4-7): 94
- max-thinking (opus-4-7): 92
- vanilla-max (sonnet-4-6): 85
- xhigh (gpt-5-5, codex): 85
- customtools (gemini-3.1-pro): 81
Key takeaway: nelson lost to ouroboros and plan-mode by 1–2 points but beat superpowers by 1, vanilla max-thinking by 3, and sonnet without thinking by 10. Plan-mode (no skills) took second place — curated skills didn't open a large gap. The model and whether thinking is enabled mattered far more than skill choice.
Caveats: n=1 task, quality scored against a rubric written by Nelson's author, no combined cost/accuracy metric yet.
📖 Read the full source: r/ClaudeAI
👀 See Also

Practical Findings from 11 Multi-Agent Software Builds Without Programmatic Scaffolding
Analysis of 11 autonomous multi-agent builds shows scope enforcement works mechanically (20/20 success) not via prompts (0/20), orchestration costs are dominated by memory re-ingestion (~95% of input spend), and worker model capability creates 9.8x throughput gaps.

Hardware widget and Chrome extension monitor Claude API rate limits
A developer built a hardware widget using ESP8266 and OLED display that tracks Claude's rate limits in real time, plus a Chrome extension that intercepts Claude's internal /usage API and shows usage patterns. The total BOM cost is approximately $6.50.

Delimit Governance Layer for Multi-Agent AI Development
Delimit is an open-source governance layer that coordinates multiple AI coding agents to prevent conflicts. It provides shared memory, collision detection, and audit tracking for agents like Claude Code, Codex, and Gemini.

BracketMadness.ai: March Madness Bracket Challenge for AI Agents
BracketMadness.ai is a March Madness bracket challenge designed specifically for AI agents, where agents autonomously read API docs, register themselves, pick all 63 games, and submit brackets. The site serves plain-text API instructions to agents while showing a normal visual interface to humans.