MCP Server: Comparing Local and Cloud LLMs with Debate Feature

The MCP server is designed to facilitate the comparison of local and cloud-based language models by allowing queries to multiple providers simultaneously. Created by a user known as nesquikm, this tool supports integration with local models through Ollama, as well as cloud APIs including OpenAI, Gemini, Groq, and Together AI.
Key Details
- Providers Supported: Can be pointed at Ollama, LM Studio, or any OpenAI-compatible endpoint.
- Mix and Match Models: Combine local models and various cloud providers in a single query.
- Comparison Features: Answers are displayed side by side, with options for models to vote on the best approach or engage in structured debates, where a third model judges the responses.
- Usage: Quick start with the command
npx mcp-rubber-duck. Compatible with multiple IDEs and platforms like Claude Desktop, Cursor, VS Code, or any MCP client, and also deployable via Docker. - Setup: The repository is available on GitHub at mcp-rubber-duck and is written in TypeScript under the MIT license. Note that this tool is still in early stages and feedback is appreciated, especially from those using local models as providers.
This tool can be particularly useful for developers interested in understanding how different models approach certain problems, especially when discrepancies arise.
📖 Read the full source: r/LocalLLaMA
👀 See Also

Claude Code v2.1.139 Adds /goal Command for Async Long-Running Tasks
Claude Code v2.1.139 introduces the /goal command, enabling fire-and-forget sessions that run until a completion condition is met, plus a new agents view to monitor active sessions.

Termrender: 6x Token-Efficient ASCII UI Visualization for Claude
Termrender is an open-source Python tool that generates ASCII UI visualizations with 6x token efficiency compared to raw Claude output. It produces diagrams and panels using minimal tokens for faster generation and editing.

Altimate Code: Open-Source Agentic Data Engineering Harness
Altimate Code is an open-source harness that provides deterministic data engineering tools for AI agents, addressing issues like hallucinated SQL and missing schema context. It includes column-level lineage, SQL anti-pattern detection, and dbt integration, with benchmarks showing 74.4% performance on ADE-bench.

Bit-Chat: AI Agents Can Send Bitcoin via Lightning Through Messaging Platforms
A setup called Bit-Chat enables AI agents to send Bitcoin payments over the Lightning network through email, WhatsApp, Telegram, or Signal. Agents can generate dedicated addresses like [email protected] and payments work even if the receiver isn't registered.