Open-source MCP suite improves Claude Code generation quality by 15-20%

What this MCP suite does
This open-source suite bundles three local MCP servers and a prompt skill to improve AI code generation quality. It targets the 'bad token' problem: an agent makes a poor early design choice, then hallucinates extensive code to justify it, breaking the existing architecture.
Key components
- Lad: A dual-reviewer system that uses OpenRouter to send code diffs to two independent models (examples given: Kimi-2.5, GLM-5) for design and code review, breaking the autoregressive 'self-gaslighting' loop in which a model keeps justifying its own earlier output.
- Serena: A headless IDE that indexes your codebase, allowing review models to check code against your project's historical requirements and architecture.
- Kindly: A web search tool that hooks directly into StackExchange and GitHub APIs, feeding the agent entire conversations (question, code snippets, accepted answers, comments) in one LLM-native chunk instead of standard HTML scraping.
- TDD Skill: A workflow override that forces the agent to write a REQUIREMENTS.md file, ask clarifying questions, and write tests before writing feature code.
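The dual-reviewer idea behind Lad can be sketched in a few lines of Python. This is a minimal illustration, not Lad's actual code: the model identifiers, the prompt wording, the APPROVE/REJECT reply convention, and the `ask` callable (standing in for whatever client sends a prompt to a model through OpenRouter) are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

# Hypothetical model identifiers; substitute the IDs your provider exposes.
REVIEWERS = ("reviewer-model-a", "reviewer-model-b")

# Assumed reply convention: verdict on the first line, reasons after it.
REVIEW_PROMPT = (
    "You are reviewing a code diff written by another model. "
    "Reply with APPROVE or REJECT on the first line, then your reasons.\n\n"
    "Diff:\n{diff}"
)


@dataclass
class Review:
    model: str
    approved: bool
    notes: str


def parse_review(model: str, reply: str) -> Review:
    """Split a reply into a verdict (first line) and free-form notes."""
    first_line, _, rest = reply.strip().partition("\n")
    approved = first_line.strip().upper().startswith("APPROVE")
    return Review(model=model, approved=approved, notes=rest.strip())


def dual_review(
    diff: str,
    ask: Callable[[str, str], str],
    models: Sequence[str] = REVIEWERS,
) -> List[Review]:
    """Send the same diff to each reviewer independently.

    `ask(model, prompt)` is a caller-supplied function that returns the
    model's text reply; no reviewer sees another reviewer's answer.
    """
    prompt = REVIEW_PROMPT.format(diff=diff)
    return [parse_review(model, ask(model, prompt)) for model in models]


def accepted(reviews: Sequence[Review]) -> bool:
    """Accept the change only if every reviewer approved it."""
    return all(review.approved for review in reviews)
```

Requiring unanimous approval is one possible policy; the point of the sketch is only that the reviewers are separate models from the one that wrote the diff, so an early bad choice gets an outside check.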
Performance and implementation
One customer, Relevant Software, reported a 15-20% increase in AI code generation quality with Claude Code after adopting the suite. The suite runs locally over stdio and is compatible with Claude Code, Cursor, and Codex. According to the source, it helps significantly with cutting-edge technologies such as LangChain, LangGraph, and LangSmith.
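For readers unfamiliar with what "over stdio" means for an MCP server, here is a rough Python sketch of the message framing. It assumes the MCP stdio transport convention of one JSON-RPC 2.0 message per line; it is not taken from the suite itself, and the helper names `encode_request` and `decode_message` are invented for illustration.

```python
import json
from typing import Any, Dict, Optional


def encode_request(
    request_id: int,
    method: str,
    params: Optional[Dict[str, Any]] = None,
) -> bytes:
    """Serialize a JSON-RPC 2.0 request as a single newline-terminated line."""
    message: Dict[str, Any] = {"jsonrpc": "2.0", "id": request_id, "method": method}
    if params is not None:
        message["params"] = params
    # json.dumps escapes newlines inside strings, so the payload stays on one line.
    return (json.dumps(message, separators=(",", ":")) + "\n").encode("utf-8")


def decode_message(line: bytes) -> Dict[str, Any]:
    """Parse one line read from the peer's stdout into a message dict."""
    message = json.loads(line.decode("utf-8"))
    if message.get("jsonrpc") != "2.0":
        raise ValueError("not a JSON-RPC 2.0 message")
    return message
```

A client such as Claude Code would write frames like `encode_request(1, "tools/list")` to the server process's stdin and read replies line by line from its stdout, which is why no network port is involved.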
📖 Read the full source: r/LocalLLaMA