Qwen3.x models fail silently in OpenClaw due to streaming output format mismatch

The Problem
When running Qwen3.x models locally with OpenClaw in streaming mode, the models output their responses to the reasoning field instead of the expected content field. OpenClaw interprets empty content as a failure and silently falls through to the next model in your fallback chain without throwing an error. This results in the wrong model answering queries.
The Solution
The fix involves implementing a small proxy that sits between OpenClaw and Ollama. This proxy performs two key functions:
- Translates the API format between the two systems
- Injects
think: falseinto the appropriate location
Once correctly configured, Qwen3.x models pass full tool-call evaluation with perfect scores (15/15), including execution, file reading, web search, Sheets integration, Slack integration, and memory operations.
Implementation Details
The source provides a comprehensive write-up covering:
- Proxy setup instructions
- Six specific configuration settings that must all be correct
- Monitoring approaches
- What doesn't work (common pitfalls to avoid)
The full technical details, including code snippets and configuration examples, are available in the linked gist.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Claude Code Skills vs. Custom Agents: A Mental Model Based on Task Consistency
A Reddit user clarifies the distinction between Claude Code skills and custom agents: skills execute the same steps every time, while custom agents require reasoning and adaptation. The post also covers parallel subagents, delegation, hooks, and building blocks.

Running Qwen3.6-35B-A3B with ~190k Context on 8GB VRAM + 32GB RAM – Setup & Benchmarks
A Reddit user shares a working llama.cpp configuration for Qwen3.6-35B-A3B GGUF models on an RTX 4060 (8GB VRAM) + 32GB DDR5, achieving 37-51 tok/s at 192k context using TurboQuant and specific flags.

Setting Up MCP Servers in llama-server Web UI: A Practical Guide
A Reddit user shares specific steps to configure MCP servers in llama-server's web UI, including installing uv, creating a config.json file with server definitions, running mcp-proxy, and modifying URLs for proper integration.

Solving Gemini CLI write_file Not Found in OpenClaw: Two Fixes Required
OpenClaw agents using google-gemini-cli can't write files (write_file / default_api_write_file missing) due to wrong tools.profile and missing --approval-mode auto_edit flag in the subprocess. Fix: set profile to full and inject the flag via cliBackends config.