Nemotron 3 Super: NVIDIA's 120B open-weight model built for agentic workloads
NVIDIA's Nemotron 3 Super has 120B total parameters, of which only 12B are active, and combines Mamba-2, Transformers, and a novel LatentMoE. It is open-weight and purpose-built for multi-agent systems.