NVIDIA launches Nemotron 3 Nano, Super and Ultra models

What's new? Nvidia unveiled Nemotron 3 models; Nano is live with 30b parameters and 1m-token context while Super and Ultra debut in 2026 with 4-bit NVFP4 on Blackwell;

· 1 min read
Nemotron
Image: NVIDIA

NVIDIA has unveiled the Nemotron 3 family of open models, which includes the Nano, Super, and Ultra variants, designed to support the growing demand for agentic AI systems across various industries. The family leverages a hybrid mixture-of-experts architecture, enabling high efficiency and accuracy for tasks requiring multiple AI agents.

Nemotron 3 Nano is immediately available, offering a 30-billion-parameter model that activates up to 3 billion parameters per task. It is optimized for low-cost, high-throughput use cases such as debugging, content summarization, and information retrieval. This model achieves up to four times the throughput compared to its predecessor and introduces a 1-million-token context window, significantly improving long-context reasoning.

The Nemotron 3 Super and Ultra models, with 100 billion and 500 billion parameters respectively, are tailored for complex, multi-agent workflows and are scheduled for release in the first half of 2026. These models utilize NVIDIA's 4-bit NVFP4 training format on Blackwell architecture, reducing memory needs and accelerating training, allowing larger models to run efficiently on existing infrastructure.

NVIDIA, a leader in GPU and AI technology, is providing not only the models but also open training datasets and reinforcement learning libraries to foster transparent and specialized AI agent development. The Nemotron 3 suite supports deployment on a range of platforms, including major cloud services and on-premises NVIDIA-accelerated infrastructure. Early integration by prominent enterprises and startups signals strong industry interest, with independent benchmarking organizations highlighting Nemotron 3's efficiency and accuracy in its class.

Source