Local 35B MoE Model Drops Agent OS Code Failure Rate to 0%

A Reddit user shared their experience running hollow-agentOS, a local multi-agent OS in which agents autonomously write, sandbox, and hot-load their own tools. The key reported breakthrough: upgrading the default runtime model from a small 9B fallback to Qwen 3.6 35B A3B (a Mixture-of-Experts model with 3B active parameters) drove the code failure rate to 0%.
What changed with the larger model
- Panic vs. re-evaluation: Under stress, the 9B model rushed and hallucinated invalid function calls. The 35B model pauses, re-evaluates previous failures, and runs internal verification loops before submitting changes.
- 100% success rate: Code passes through a 5-layer validation gate. With the 9B model, tools frequently died in the sandbox. With Qwen 35B, the developer reports that every line of code works as intended.
- Autonomous tool creation: When an agent encounters an unknown problem, it builds a new tool, tests it in a sandbox, registers it, and notifies other agents — no human in the loop.
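The write, sandbox, register, and notify loop described above can be sketched roughly as follows. This is a minimal illustration, not code from hollow-agentOS: the post does not name the five validation layers, so the three checks here are placeholders, and `REGISTRY`, `SUBSCRIBERS`, and `submit_tool` are hypothetical names. A plain subprocess stands in for the sandbox and provides no real isolation.

```python
import ast
import subprocess
import sys
import tempfile
from pathlib import Path
from typing import Callable

# Hypothetical in-memory registry and notification list (assumptions, not project APIs).
REGISTRY: dict[str, Callable] = {}
SUBSCRIBERS: list[Callable[[str], None]] = []


def check_syntax(source: str) -> bool:
    """Placeholder layer: the generated source must parse as Python."""
    try:
        ast.parse(source)
        return True
    except SyntaxError:
        return False


def check_defines(source: str, name: str) -> bool:
    """Placeholder layer: the source must define a top-level function called `name`."""
    tree = ast.parse(source)
    return any(isinstance(n, ast.FunctionDef) and n.name == name for n in tree.body)


def check_sandbox(source: str, test_code: str, timeout: float = 5.0) -> bool:
    """Placeholder layer: run the tool and its test in a separate interpreter process."""
    with tempfile.TemporaryDirectory() as tmp:
        script = Path(tmp) / "candidate.py"
        script.write_text(source + "\n" + test_code + "\n")
        try:
            result = subprocess.run(
                [sys.executable, str(script)],
                capture_output=True,
                timeout=timeout,
                cwd=tmp,
            )
        except subprocess.TimeoutExpired:
            return False
        return result.returncode == 0


def submit_tool(name: str, source: str, test_code: str) -> bool:
    """Run the gate in order; on success, hot-load, register, and notify."""
    gate = [
        lambda: check_syntax(source),
        lambda: check_defines(source, name),
        lambda: check_sandbox(source, test_code),
    ]
    # all() short-circuits, so later layers never see source that failed earlier ones.
    if not all(check() for check in gate):
        return False
    namespace: dict = {}
    exec(compile(source, f"<tool:{name}>", "exec"), namespace)  # hot-load into this process
    REGISTRY[name] = namespace[name]
    for notify in SUBSCRIBERS:
        notify(name)  # tell other agents a new tool exists
    return True
```

Calling `submit_tool("add", "def add(a, b):\n    return a + b\n", "assert add(2, 3) == 5")` registers the tool only if every check passes; a tool whose test fails in the subprocess is rejected and never reaches the registry.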
Architecture details
The system is driven by an aversive state (a “suffering system”) that pushes agents to continuously expand their tool library. The repo is available at github.com/ninjahawk/hollow-agentOS.
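The post does not describe how the aversive state is computed. One hypothetical way to model such a drive is a scalar that rises with each failure, decays on success, and triggers tool creation once it crosses a threshold. The class name, fields, and constants below are all assumptions made for illustration and are not taken from the repo.

```python
from dataclasses import dataclass


@dataclass
class AversiveState:
    """Hypothetical 'suffering' signal; every default here is an illustrative guess."""

    level: float = 0.0      # current distress
    threshold: float = 1.0  # at or above this, the agent should build a new tool
    penalty: float = 0.4    # added on each failed task
    relief: float = 0.5     # multiplicative decay applied on each success

    def record(self, success: bool) -> None:
        """Update the distress level after a task attempt."""
        if success:
            self.level *= self.relief
        else:
            self.level += self.penalty

    def should_build_tool(self) -> bool:
        """True once accumulated failures push distress past the threshold."""
        return self.level >= self.threshold
```

With these defaults, three consecutive failures cross the threshold, and a single success afterwards drops the level back below it.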
Future plans
The developer intends to plug Claude and Codex into the architecture, running them inside hyper-isolated mini-VMs so that the frontier models cannot override the host environment.
📖 Read the full source: r/ClaudeAI