Gemma 4 E2B Tested as Multi-Agent Coordinator in TypeScript Framework

✍️ OpenClawRadar📅 Published: April 14, 2026🔗 Source
Gemma 4 E2B Tested as Multi-Agent Coordinator in TypeScript Framework
Ad

Coordinator Capabilities Tested

The test evaluated whether Gemma 4 E2B could handle the coordinator role in a multi-agent system, specifically: taking a natural language goal, breaking it into a task graph, assigning agents, calling tools, and stitching results together.

Technical Implementation

The framework used was open-multi-agent (TypeScript, open-source) with Ollama via an OpenAI-compatible API. The coordinator receives a goal and agent roster, then outputs a JSON task array with title, description, assignee, and dependencies. Agents execute with tool-calling capabilities including bash and file read/write operations.

Model Details

Gemma 4 E2B ("Effective 2B") has 2.3B effective parameters and 5.1B total parameters. The extra ~2.8B parameters are for the embedding layer supporting 140+ languages and multimodal capabilities.

Test Scenario

The goal provided was: "Check this machine's Node.js version, npm version, and OS info, then write a short Markdown summary report to /tmp/report.md"

E2B correctly:

  • Broke it into 2 tasks with a dependency (researcher → summarizer)
  • Assigned each to the right agent
  • Used bash to run system commands
  • Used file_write to save the report
  • Synthesized the final output

Both runTasks() (explicit pipeline) and runTeam() (model plans everything autonomously) worked.

Ad

Performance and Observations

On an M1 with 16GB RAM:

  • Full runTeam() takes ~2 minutes
  • 6–9 sequential LLM calls under the hood (coordinator planning → researcher multi-turn tool use → summarizer → coordinator synthesis)
  • ~10–15 seconds per call on M1
  • E2B uses ~3–4 GB RAM with no memory pressure

What worked well:

  • JSON output: The coordinator produced the correct schema for task decomposition. The framework has tolerant parsing that tries fenced blocks first, then falls back to bare array extraction.
  • Tool-calling: Works through the OpenAI-compatible endpoint, correctly deciding when to call, parsing arguments, and handling multi-turn results.

Limitations noted:

  • Output quality: The prose in final synthesis is noticeably weaker than larger models. Functional but not polished.

Reproduction Steps

ollama pull gemma4:e2b
git clone https://github.com/JackChen-me/open-multi-agent
cd open-multi-agent && npm install
no_proxy=localhost npx tsx examples/08-gemma4-local.ts

The test file is ~190 lines at examples/08-gemma4-local.ts. The no_proxy=localhost setting is only needed if you have an HTTP proxy configured.

📖 Read the full source: r/LocalLLaMA

Ad

👀 See Also

Event Horizon VS Code Extension Adds File Locking and Plan Coordination for Multiple AI Agents
Tools

Event Horizon VS Code Extension Adds File Locking and Plan Coordination for Multiple AI Agents

Event Horizon, a VS Code extension originally created for visualizing Claude Code, now includes file locking and plan coordination features to prevent multiple AI agents from overwriting each other's work on the same codebase. The tool supports Claude Code, OpenCode, and Copilot with one-click setup.

OpenClawRadar
From Replit to Local: How One Developer Used Claude to Build StillHere, an API-Powered AI Companion Chat App
Tools

From Replit to Local: How One Developer Used Claude to Build StillHere, an API-Powered AI Companion Chat App

A developer built StillHere.ink, an AI chat app for companion-style conversations using personal API keys, after migrating from Replit to local development with Claude. The app features memory, diary summaries, RAG, model switching, and cost-control tools.

OpenClawRadar
Collection of 177 OpenClaw SOUL.md Templates Organized into 24 Categories
Tools

Collection of 177 OpenClaw SOUL.md Templates Organized into 24 Categories

A developer has compiled 177 ready-to-use SOUL.md templates for OpenClaw agents across 24 categories including Marketing, Development, Business, DevOps, Finance, Creative, Data, Security, Healthcare, Legal, HR, and Education. All templates are MIT licensed and available on GitHub.

OpenClawRadar
Memtrace: Persistent, Time-Aware Codebase Memory for Claude Code Agents
Tools

Memtrace: Persistent, Time-Aware Codebase Memory for Claude Code Agents

Memtrace provides always-fresh snapshots and bi-temporal replay for Claude Code agents, using Tree-sitter AST parsing and hybrid retrieval (BM25 + Jina-code embeddings) with zero LLM inference cost during indexing.

OpenClawRadar