Local 35B MoE Model Drops Agent OS Code Failure Rate to 0%

✍️ OpenClawRadar📅 Published: May 11, 2026🔗 Source

A Reddit user shared their experience running a local multi-agent OS called hollow-agentOS where agents autonomously write, sandbox, and hot-load their own tools. The key breakthrough: upgrading the default runtime model from a small 9B fallback to Qwen 3.6 35B A3B (Mixture-of-Experts with 3B active parameters) drove the code failure rate to 0%.

What changed with the larger model

Panic vs. re-evaluation: Under stress, the 9B model rushed and hallucinated invalid function calls. The 35B model pauses, re-evaluates previous failures, and runs internal verification loops before submitting changes.
100% success rate: Code passes through a 5-layer validation gate. With the 9B model, tools frequently died in the sandbox. With Qwen 35B, every line of code works as intended.
Autonomous tool creation: When an agent encounters an unknown problem, it builds a new tool, tests it in a sandbox, registers it, and notifies other agents — no human in the loop.

Architecture details

The system is driven by an aversive state (a “suffering system”) that pushes agents to continuously expand their tool library. The repo is available at github.com/ninjahawk/hollow-agentOS.

Future plans

The developer intends to plug Claude and Codex into the architecture, wrapping them in hyper-isolated mini-VM wrappers to prevent the frontier models from overriding the host environment.

📖 Read the full source: r/ClaudeAI

👀 See Also

Tools

Rival-Review: A Cross-Model Review Loop for AI Agent Plans

Rival-review is an MIT-licensed tool that uses a second AI model to audit plans from a primary AI coding agent before execution, catching issues like flawed rollback plans, security holes, and stale-state decisions.

Apr 15, 2026, 09:53 PM UTC

OpenClawRadar

Tools

GlycemicGPT: Self-Hosted AI Diabetes Monitor with BYOAI and Plugin SDK

GlycemicGPT is an open-source, self-hosted platform that connects Dexcom G7 and Tandem pumps to an AI analysis layer. It provides daily briefs, meal analysis, conversational chat, and configurable alerts, all on your own hardware.

May 15, 2026, 10:15 AM UTC

OpenClawRadar

Tools

OnUI: Browser Extension for Precise UI Feedback to Claude Code

OnUI is a browser extension that lets you annotate webpage elements and export structured reports for Claude Code via local MCP, eliminating ambiguous UI descriptions. Built primarily with Claude Code, it's free, open-source, and available for Chrome, Edge, and Firefox.

Mar 10, 2026, 11:45 PM UTC

OpenClawRadar

Tools

MCP Server Directory Lists 1000+ Servers Across 20 Categories

A curated directory provides install commands and config snippets for over 1000 MCP servers across categories including databases, developer tools, browser automation, AI/ML, and cloud/devops. The directory is free to browse and submit to.

Apr 15, 2026, 10:50 AM UTC

OpenClawRadar