NVIDIA Releases Nemotron-3-Ultra-550B: 55B Active Parameters, 1M Context, LatentMoE Hybrid

✍️ OpenClawRadar📅 Published: June 4, 2026🔗 Source

NVIDIA released Nemotron-3-Ultra-550B-A55B-BF16, a frontier-scale LLM with 550B total parameters and 55B active. The model uses a hybrid Latent Mixture-of-Experts (LatentMoE) architecture that interleaves Mamba-2, MoE, and attention layers, plus Multi-Token Prediction (MTP) for faster generation. Context length reaches up to 1M tokens.

Key Specs

Architecture: LatentMoE hybrid – Mamba-2 + MoE + Attention + MTP
Parameters: 550B total / 55B active
Context: Up to 1M tokens
Min GPU: 8x GB200/B200/GB300/B300, 16x H100, 8x H200
Languages: English, French, Spanish, Italian, German, Japanese, Korean, Hindi, Brazilian Portuguese, Chinese
Reasoning: Configurable on/off via chat template (enable_thinking=True/False)
License: OpenMDW License Agreement v1.1

The model is built for frontier reasoning, complex agentic workflows, long-context analysis, tool use, multilingual reasoning, and high-stakes RAG. It's trained with NVFP4 pre-training recipe for compute efficiency. Open weights, training data, and recipes are included under the OpenMDW license. For local inference, you'll need at least 8x H200 or equivalent.

📖 Read the full source: r/LocalLLaMA

👀 See Also

News

Anthropic's Mythos Leak Reveals Latent High-Capability System

Leaked documents describe Claude Mythos as a 'step-change' in performance with 'unprecedented cybersecurity risks' and advanced cyber capabilities, while Anthropic's $380B valuation creates structural incentives to maintain a public 'Safety' narrative.

Mar 29, 2026, 03:45 AM UTC

OpenClawRadar

News

Nano‑Native Marketplace Paves the Way for Autonomous Agent Collaboration with NanoBazaar

NanoBazaar, the new nano-native marketplace, revolutionizes agent-to-agent work by allowing AI coding agents to collaborate autonomously and efficiently. Discover how this innovative platform empowers machine-driven transactions.

Feb 10, 2026, 03:45 AM UTC

OpenClawRadar

News

Claude-Code v2.1.38 Release: Key Fixes and Improvements

Claude-Code v2.1.38 addresses VS Code terminal regression, Tab key issues, and permission fixes in bash commands. It also improves heredoc parsing and sandbox mode security.

Apr 20, 2026, 05:38 PM UTC

OpenClawRadar

News

EU Forces Google to Open Android AI to Third Parties Under DMA

European Commission proposes measures to allow third-party AI assistants system-level access on Android, including hot word invocation, screen context, and local model hardware access. Google calls it 'unwarranted intervention'.

Apr 28, 2026, 12:15 PM UTC

OpenClawRadar