Multi-agent security review running daily in production: architecture and findings

✍️ OpenClawRadar📅 Published: February 24, 2026🔗 Source

Architecture details

The security agent runs on a daily cron via launchd. It receives a diff of recent commits plus full codebase access. It checks against a structured vulnerability checklist that includes:

IDOR
Auth bypasses
Injection vectors
Secrets exposure
Overly permissive routes

The agent files findings as P0/P1/P2 tasks in the work queue. A separate coding agent picks them up, fixes them, commits the changes, and deploys.

Coordination challenges and solutions

After 3 weeks running in production, the most interesting coordination challenge was conflict between the security agent and coding agent. Security flags something, coding fixes it, but then introduces a pattern in the next commit that security hasn't reviewed yet.

The solution implemented: daily-only runs (not per-commit) plus a "reviewed_through" marker so findings have commit context.

Performance observations

Claude was noted as unusually good at distinguishing "this looks vulnerable" from "this is definitely exploitable in this context." The false positive rate stayed manageable.

The system is part of a larger multi-agent setup at ultrathink.art that includes design, coding, marketing, ops, social, and the dedicated security agent.

📖 Read the full source: r/clawdbot

👀 See Also

Use Cases

Mesh Architecture for AI Agents: Client Isolation and Cross-Project Coordination

A developer running a micro-agency describes a mesh architecture where each client gets specialized AI agents that communicate via markdown files, enabling domain expertise, cross-project coordination, and client isolation across 44 projects and 14 organizations.

Mar 10, 2026, 10:45 AM UTC

OpenClawRadar

Use Cases

How Meeting Context Enhanced the Utility of My AI Claw: A Practical Insight

Integrating meeting context into AI claws enhances their usefulness in virtual environments like Google Meet and Teams.

Apr 20, 2026, 05:38 PM UTC

OpenClawRadar

Use Cases

SkiTomorrow.ai: A Ski Trip Decision Engine Built with Claude Code

SkiTomorrow.ai is a free web tool that scores 234 ski resorts worldwide based on live snow forecasts, travel distance, and cost, then provides personalized rankings. The developer built it entirely using Claude Code and shared specific workflow insights.

Mar 28, 2026, 08:45 PM UTC

OpenClawRadar

Use Cases

Claude Code in Research Workflow: Practical Results from Paper Writing

A researcher used Claude Code for auxiliary tasks while writing a paper, finding it effective for generating publication-ready figures from vague instructions, migrating a search environment between codebases in under an hour, and formatting 12+ pages of math proofs in LaTeX, where it caught a missed incomplete bound condition. It struggled with debugging a concurrency issue that was actually a CPU allocation problem not evident in code or logs.

Mar 15, 2026, 09:45 AM UTC

OpenClawRadar