Markdown as Protocol for Agentic UI with Streaming Execution

A developer built a prototype exploring how to combine generative UI with code execution for AI agents using Markdown as a unified protocol. The system streams text, executable code, and data in a single response, with code executing incrementally as it arrives.
The Protocol: Markdown with Three Block Types
The approach uses standard Markdown syntax that LLMs already understand, avoiding the need to teach new formats. It defines three block types:
- Text blocks: Plain Markdown formatting that streams to the user
- Code fences:
```tsx agent.runexecutes TypeScript/JSX code on the server in a persistent context - Data fences:
```json agent.data => "id"streams JSON data into UI components
These blocks can be interleaved in any order within a single response. The parser handles them incrementally as tokens arrive from the LLM.
Streaming Execution
Code executes statement-by-statement as the LLM generates it, without waiting for the full code fence to close. This allows API calls to start, UI to render, and errors to surface while the LLM is still sending tokens. The developer built bun-streaming-exec to handle this, using vm.Script with custom wrapping since streaming execution isn't a standard runtime primitive.
Agentic UI with mount() Primitive
The system uses React for UI generation since LLMs have extensive exposure to React components and JSX. The core primitive is mount():
mount({
ui: () => <Card>Hello from the agent!</Card>
});When the LLM generates this code and the server executes it, mount() serializes the React component and sends it to the client for rendering within the chat interface.
Data Flow Patterns
The prototype implements four distinct patterns for data movement:
- Client → Server (forms): The agent can wait for user input through forms
- Server → Client (streamed data): Data fences stream JSON directly into mounted UIs
- Server → LLM (console.log):
console.logoutput and exceptions feed back to the LLM as a new turn - LLM → Server → Client (full roundtrip): Complete cycles where the LLM generates code that fetches data and renders UI with that data
Feedback Loop
The system uses console.log as the mechanism for the agent to talk to itself. When the LLM generates Markdown with code blocks, text streams to the user while code executes incrementally. Any console.* output or exceptions feed back to the LLM as a new turn. If there's no output or exceptions, the system waits for a new user query.
This allows the agent to react to its own execution, such as checking message counts or pausing to wait for user input before proceeding.
📖 Read the full source: HN AI Agents
👀 See Also

Nelson v2.2.3 Released: Multi-Agent Coordination for Claude Code, Plus a Discrete-Event Simulation Benchmark
Nelson v2.2.3 ships a multi-agent coordination skill for Claude Code using a naval metaphor. A 13-configuration benchmark shows opus-4-7 with thinking dominates; skill choice is a smaller delta.

Introducing Xrouter: A Smart Hybrid LLM Router to Optimize Cost and Performance
Discover Xrouter, an open-source creation that dynamically integrates local with cloud inference, designed to slash AI costs while boosting efficiency.

Statespace: Build Interactive Web Apps for OpenClaw Agents with Markdown
Statespace is a free, open-source framework for building and sharing AI-friendly web apps that OpenClaw agents can navigate and interact with using pure Markdown. It allows you to define tools, components, and instructions in Markdown files that agents access over HTTP.

Semble: A Local MCP Server for Claude Code with 98% Token Reduction
Semble is an open-source MCP server for Claude Code that replaces grep+read workflows, using embeddings, BM25, and reranking to reduce token usage by ~98% while indexing repos in ~250ms.