Mercury 2: Diffusion-Based Model for Real-Time AI Coding

✍️ OpenClawRadar📅 Published: February 25, 2026🔗 Source

What Mercury 2 Is

Mercury 2 is a diffusion-based AI model that generates tokens in parallel rather than sequentially, using a process that refines output over multiple steps. This approach differs from traditional autoregressive models that decode tokens one by one.

Technical Specifications

Generation method: Diffusion-based generation instead of sequential token-by-token decoding
Processing approach: Generates tokens in parallel and refines them over a few steps
Performance: Claims 1,009 tokens/sec on NVIDIA Blackwell GPUs
Pricing: $0.25 per 1 million input tokens, $0.75 per 1 million output tokens
Context window: 128K tokens
Reasoning capability: Tunable reasoning
Tool integration: Native tool use with schema-aligned JSON output
API compatibility: OpenAI API compatible

Target Use Cases

The developers are positioning Mercury 2 for:

Coding assistants
Agentic loops (multi-step inference chains)
Real-time voice systems
RAG/search pipelines with multi-hop retrieval

📖 Read the full source: r/LocalLLaMA

👀 See Also

News

Claude Code v2.1.216: Sandbox Filesystem Toggle, Quadratic Slowdown Fix, and 30+ Bug Fixes

Claude Code v2.1.216 adds sandbox.filesystem.disabled to skip filesystem isolation while keeping network egress control, fixes a quadratic message normalization slowdown in long sessions, and resolves 30+ bugs including OAuth token expiry, worktree isolation, and background agent persistence.

Jul 21, 2026, 12:15 AM UTC

OpenClawRadar

News

Claude-Code v2.1.91 adds MCP result persistence, shell execution controls, and multi-line deep links

Claude-Code v2.1.91 introduces MCP tool result persistence override via _meta["anthropic/maxResultSizeChars"] annotation supporting up to 500K characters, adds disableSkillShellExecution setting, and enables multi-line prompts in claude-cli://open?q= deep links with encoded newlines.

Apr 6, 2026, 12:45 AM UTC

OpenClawRadar

News

Claude Code v2.1.121: MCP alwaysLoad, plugin prune, terminal scroll fixes, and memory leak patches

Claude Code v2.1.121 adds alwaysLoad for MCP servers, a plugin prune command, type-to-filter /skills, PostToolUse output replacement, terminal scroll & URL fixes, and several memory leak fixes including multi-GB RSS growth with many images.

Apr 28, 2026, 02:15 AM UTC

OpenClawRadar

News

OpenClaw's Killer Features and the Risks (With Solutions)

Explore OpenClaw's standout features, the potential risks they carry, and innovative solutions to mitigate these challenges.

Feb 8, 2026, 09:45 PM UTC

OpenClawRadar