Multi-Agent Video Production Pipeline with Claude: Script Contract Architecture and Research Fanout

A developer built a multi-agent AI pipeline that takes a topic (e.g., "Ada Lovelace") and a persona (channel identity, tone, visual style) and produces a complete chapter-structured educational YouTube video (15–20 min). The pipeline uses Claude as the core LLM for scripting and orchestrates specialized agents across script writing, asset generation, rendering (CUDA on Windows host), and YouTube upload.
Script Writing via Contract Architecture
To keep a 20-minute AI-written script narratively coherent across chapters written in separate LLM calls, the system uses a narrative contract — a validated JSON blueprint generated before any script text is written. The contract encodes four constraint types:
- Threads — story arcs that must open in one chapter and close in another, with a declared payoff type (resolved, tragedy, etc.)
- Entities — named people/places with a forced first-introduction chapter, preventing retroactive mentions
- Facts Required — citations chained with dependencies (fact B can't appear until fact A is established)
- Timeline Anchors — temporal reference points allowing non-linear structure (flashback, in-medias-res) while staying internally consistent
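The four constraint types can be pictured as a small typed schema. The sketch below is an assumption about the shape of such a contract, not the author's actual schema: every class name, field name, and sample value is hypothetical, and it uses stdlib dataclasses rather than Pydantic so it runs without dependencies.

```python
from dataclasses import dataclass, field


@dataclass
class Thread:
    name: str
    opens_in: int   # chapter where the arc opens
    closes_in: int  # chapter where the arc pays off
    payoff: str     # declared payoff type, e.g. "resolved" or "tragedy"


@dataclass
class Entity:
    name: str
    first_chapter: int  # no chapter before this one may mention the entity


@dataclass
class RequiredFact:
    fact_id: str
    chapter: int
    depends_on: list[str] = field(default_factory=list)  # facts that must come first


@dataclass
class TimelineAnchor:
    label: str
    year: int     # position in real-world time
    chapter: int  # position in the video, which may be out of order


@dataclass
class NarrativeContract:
    threads: list[Thread]
    entities: list[Entity]
    facts: list[RequiredFact]
    anchors: list[TimelineAnchor]


# Illustrative values only; none of these come from the original pipeline.
contract = NarrativeContract(
    threads=[Thread("analytical-engine", opens_in=1, closes_in=4, payoff="tragedy")],
    entities=[Entity("Charles Babbage", first_chapter=2)],
    facts=[
        RequiredFact("met-babbage", chapter=2),
        RequiredFact("wrote-notes", chapter=3, depends_on=["met-babbage"]),
    ],
    anchors=[TimelineAnchor("Notes published", year=1843, chapter=1)],
)
```

Keeping the anchor's `year` separate from its `chapter` is what would let a flashback or in-medias-res opening stay checkable: the video order and the historical order are stored independently.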
The contract is generated in a loop of up to 3 rounds: Opus drafts it, a structural validator checks it, and Sonnet reviews it. The structural validator runs a Pydantic parse plus a temporal constraint check; Sonnet checks semantic coherence (no orphan entities, threads actually close). Downstream chapter writers are bound to the contract.
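A minimal sketch of that draft, validate, review loop follows. It assumes the two model calls are passed in as plain callables (`generate` and `review` are hypothetical names), uses plain dicts in place of Pydantic models, and guesses at what the structural check covers: threads close in a later chapter than they open, and no fact appears before a fact it depends on.

```python
from typing import Callable

MAX_ROUNDS = 3  # the source caps the loop at three rounds


def structural_errors(contract: dict) -> list[str]:
    """Stand-in for the Pydantic parse plus temporal check (assumed rules)."""
    errors = []
    chapter_of = {f["id"]: f["chapter"] for f in contract["facts"]}
    for t in contract["threads"]:
        if t["closes_in"] <= t["opens_in"]:
            errors.append(f"thread {t['name']} must close after it opens")
    for f in contract["facts"]:
        for dep in f.get("depends_on", []):
            if dep not in chapter_of:
                errors.append(f"fact {f['id']} depends on unknown fact {dep}")
            elif chapter_of[dep] > f["chapter"]:
                errors.append(f"fact {f['id']} appears before its dependency {dep}")
    return errors


def build_contract(
    generate: Callable[[list[str]], dict],  # drafting model; receives prior feedback
    review: Callable[[dict], list[str]],    # reviewing model; returns semantic issues
    max_rounds: int = MAX_ROUNDS,
) -> dict:
    feedback: list[str] = []
    for _ in range(max_rounds):
        contract = generate(feedback)
        feedback = structural_errors(contract)
        if feedback:
            continue  # skip the semantic review until the structure is valid
        feedback = review(contract)
        if not feedback:
            return contract  # passed both checks
    raise RuntimeError(f"no valid contract after {max_rounds} rounds: {feedback}")
```

Running the cheap deterministic check before the reviewer means a model call is only spent on drafts that already parse, which is one plausible reason for ordering the loop this way.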
Research via Fanout
The research pipeline spins up N parallel OutlineAgent instances, each working from the same research package but on different thesis candidates. Each produces a three-level hierarchy: thesis → chapter arguments → scene beats. A grounding/revision loop runs independently on each branch:
- Grounding reviewer (Sonnet) flags blocking issues vs. polish issues
- Revision agent applies fixes without restructuring
- Quality reviewer checks for structural failures (topical chapter lists, collapsed middles, summary endings)
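The per-branch loop and the parallel fanout can be sketched with `asyncio`. This is a simplified illustration under stated assumptions: the agents are stubbed as async callables with hypothetical names, the grounding and quality reviewers are collapsed into a single `review` step that returns only blocking issues, outlines are plain strings, and the round cap is the three rounds this pipeline allows per branch.

```python
import asyncio
from typing import Awaitable, Callable

MAX_ROUNDS = 3  # revision rounds allowed per branch

Reviewer = Callable[[str], Awaitable[list[str]]]      # returns blocking issues
Reviser = Callable[[str, list[str]], Awaitable[str]]  # returns the fixed outline


async def refine_branch(outline: str, review: Reviewer, revise: Reviser) -> str:
    """Review and revise one outline until it is clean or the rounds run out."""
    for _ in range(MAX_ROUNDS):
        blocking = await review(outline)
        if not blocking:
            break
        outline = await revise(outline, blocking)
    return outline


async def fan_out(outlines: list[str], review: Reviewer, revise: Reviser) -> list[str]:
    """Refine every branch concurrently; results keep the input order."""
    return await asyncio.gather(
        *(refine_branch(o, review, revise) for o in outlines)
    )


# Stub agents for demonstration; real ones would call a model.
async def demo_review(outline: str) -> list[str]:
    return [] if outline.endswith("!") else ["weak ending"]


async def demo_revise(outline: str, issues: list[str]) -> str:
    return outline + "!"


results = asyncio.run(fan_out(["thesis A", "thesis B!"], demo_review, demo_revise))
```

Because each branch owns its own loop, a branch that converges in one round does not wait on a branch that needs all three.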
Each branch gets up to 3 revision rounds, and the branches run in parallel. A single judge agent then scores each refined outline on four axes, two of which are listed here:
| Axis | Weight | What it measures |
|---|---|---|
| Concept Hook | 0.40 | CTR potential; title falsifiability |
| Trap Closure | 0.30 | Narrative payoff completeness |
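One way to combine per-axis scores is a weighted sum. The sketch below uses only the two weights given in the table; any further axes would be added to the same dict. The function names, the 0 to 10 score scale, and the rule that the highest total wins are assumptions for illustration.

```python
# Weights from the table above; only these two axes are listed there.
WEIGHTS = {"concept_hook": 0.40, "trap_closure": 0.30}


def weighted_score(scores: dict[str, float], weights: dict[str, float] = WEIGHTS) -> float:
    """Sum each axis score multiplied by its weight."""
    missing = weights.keys() - scores.keys()
    if missing:
        raise ValueError(f"missing axis scores: {sorted(missing)}")
    return sum(w * scores[axis] for axis, w in weights.items())


def pick_winner(candidates: dict[str, dict[str, float]]) -> str:
    """Return the name of the outline with the highest weighted score."""
    return max(candidates, key=lambda name: weighted_score(candidates[name]))


# Hypothetical judge output on an assumed 0-10 scale.
candidates = {
    "outline_a": {"concept_hook": 8, "trap_closure": 5},
    "outline_b": {"concept_hook": 6, "trap_closure": 9},
}
winner = pick_winner(candidates)
```

With these sample numbers, `outline_a` totals about 4.7 and `outline_b` about 5.1, so the stronger payoff outweighs the weaker hook despite the hook's larger weight.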
Pipeline Architecture
The pipeline is split across two environments: script and asset work runs in a Linux dev container (WSL), while rendering runs on the Windows host to access CUDA and video tooling. Agents communicate over HTTP with a lightweight orchestrator. The system is phase-based — every step (W2.1, W4.3, R3.1, etc.) is independently re-runnable. Each phase reads and writes typed artifact files (JSON manifests, audio files, image directories) so agents are loosely coupled.
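The re-runnable phase idea can be sketched as a function that takes all of its inputs from artifact files and writes its result to another artifact file. This is an assumption about the mechanism, not the author's code: `run_phase`, the manifest layout, and what the `W2.1` and `W4.3` steps actually do are hypothetical.

```python
import json
import tempfile
from pathlib import Path
from typing import Callable


def run_phase(
    phase_id: str,
    inputs: list[Path],
    output: Path,
    step: Callable[[list[dict]], dict],
) -> dict:
    """Load input manifests from disk, run the step, and write the output manifest."""
    loaded = [json.loads(p.read_text()) for p in inputs]
    manifest = {"phase": phase_id, **step(loaded)}
    output.parent.mkdir(parents=True, exist_ok=True)
    output.write_text(json.dumps(manifest, indent=2))
    return manifest


# Demo in a temporary directory: one phase writes a script manifest,
# a later phase reads it and writes an audio manifest.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    script_path = root / "W2.1" / "script.json"
    run_phase("W2.1", [], script_path, lambda _: {"chapters": ["intro", "engine"]})
    audio = run_phase(
        "W4.3",
        [script_path],
        root / "W4.3" / "audio.json",
        lambda ins: {"clips": [f"{c}.wav" for c in ins[0]["chapters"]]},
    )
```

Since a phase never receives in-memory state from its predecessor, any single step can be rerun as long as its input files still exist, and the files could equally be read by an agent running in the other environment.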
Integrated tools include Live2D, Fish Audio, and SadTalker, among others, for asset generation and rendering.
📖 Read the full source: r/ClaudeAI