NVIDIA Vera CPU Launched for Agentic AI Workloads

NVIDIA has introduced the Vera CPU, a processor built specifically for agentic AI and reinforcement learning workloads. According to NVIDIA, it delivers results 50% faster and with twice the efficiency of traditional rack-scale CPUs.
Technical Specifications
The Vera CPU features 88 custom NVIDIA-designed Olympus cores, each capable of running two tasks using NVIDIA Spatial Multithreading. It includes a high-bandwidth memory subsystem built on LPDDR5X memory and uses the second-generation NVIDIA Scalable Coherency Fabric for faster agentic responses under high utilization conditions.
System Configurations
- New Vera CPU rack integrates 256 liquid-cooled Vera CPUs
- Sustains more than 22,500 concurrent CPU environments running independently at full performance
- Built using NVIDIA MGX modular reference architecture
- Part of NVIDIA Vera Rubin NVL72 platform with NVIDIA GPUs connected via NVIDIA NVLink-C2C interconnect
- NVLink-C2C provides 1.8 TB/s of coherent bandwidth (7x PCIe Gen 6 bandwidth)
- Also serves as host CPU for NVIDIA HGX Rubin NVL8 systems
- Systems integrate NVIDIA ConnectX SuperNIC cards and NVIDIA BlueField-4 DPUs
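The rack-level figures can be cross-checked with simple arithmetic. The sketch below multiplies out the per-CPU core count and the per-rack CPU count quoted above, and derives the PCIe Gen 6 baseline implied by the "7x" comparison. The constant and function names are illustrative, and the reading that each concurrent environment maps to one core is an assumption; the article does not say how environments are allocated.

```python
# Figures quoted in the article.
CORES_PER_CPU = 88             # Olympus cores per Vera CPU
THREADS_PER_CORE = 2           # two tasks per core via Spatial Multithreading
CPUS_PER_RACK = 256            # liquid-cooled Vera CPUs per rack
C2C_BANDWIDTH_GB_PER_S = 1800  # 1.8 TB/s of coherent bandwidth, in GB/s
PCIE_RATIO = 7                 # "7x PCIe Gen 6 bandwidth"


def rack_totals():
    """Return (cores per rack, hardware threads per rack, implied PCIe GB/s)."""
    cores = CORES_PER_CPU * CPUS_PER_RACK
    threads = cores * THREADS_PER_CORE
    # Baseline implied by the quoted ratio; not a figure stated in the article.
    implied_pcie = C2C_BANDWIDTH_GB_PER_S / PCIE_RATIO
    return cores, threads, implied_pcie


cores, threads, implied_pcie = rack_totals()
print(f"Cores per rack:   {cores}")
print(f"Threads per rack: {threads}")
print(f"Implied PCIe Gen 6 baseline: {implied_pcie:.1f} GB/s")
```

Under these assumptions the rack holds 22,528 cores, which lines up with the "more than 22,500 concurrent CPU environments" figure if each environment occupies one core.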
Adoption and Partners
Customers collaborating with NVIDIA to deploy Vera CPU include Alibaba, ByteDance, Meta, Oracle Cloud Infrastructure, CoreWeave, Lambda, Nebius, and Nscale. Manufacturing partners include Dell Technologies, HPE, Lenovo, Supermicro, ASUS, Compal, Foxconn, GIGABYTE, Pegatron, Quanta Cloud Technology (QCT), Wistron, and Wiwynn.
Target Workloads
Vera systems are designed for reinforcement learning, agentic inference, data processing, orchestration, storage management, cloud applications, and high-performance computing. Systems partners offer both dual-socket and single-socket CPU server configurations.
According to Jensen Huang, NVIDIA's CEO, "The CPU is no longer simply supporting the model; it's driving it. With breakthrough performance and energy efficiency, Vera unlocks AI systems that think faster and scale further."