
Nemotron 3 Ultra (free)

550B

by NVIDIA

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

Input Price: $0.60/1M tokens
Output Price: $2.60/1M tokens
Intelligence: 47.7
Coding: 37.6

Specifications

Technical details and pricing.

Provider: NVIDIA
Context Window: 1,000,000 tokens
Release Date: Jun 4, 2026
Modalities: Text

Benchmarks

7 benchmark scores from Artificial Analysis.

GPQA: 86.7%
HLE: 26.6%
SciCode: 39.9%
LCR: 67.0%
IFBench: 81.4%
Tau2: 83.3%
TerminalBench Hard: 36.4%

[Chart: Composite Indices. Higher is better; speed and price are normalized.]

[Chart: Standard Benchmarks. Only benchmarks with data are shown.]

Frequently Asked Questions

What is Nemotron 3 Ultra (free) good for?

Nemotron 3 Ultra (free) is described as a frontier-reasoning and orchestration model. It also handles everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.

How much does Nemotron 3 Ultra (free) cost?

Pricing is based on usage. Current rates are $0.60/1M tokens for input and $2.60/1M tokens for output.
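
To make the usage-based pricing concrete, here is a minimal Python sketch that estimates the cost of a single request from the two rates quoted above. The helper name estimate_cost_usd and the token counts in the example are hypothetical and not part of any NVIDIA or catalog API; the sketch also assumes the listed rates apply to the request as-is.

```python
from decimal import Decimal

# Rates copied from the listing above, in USD per 1M tokens.
INPUT_USD_PER_MILLION = Decimal("0.60")
OUTPUT_USD_PER_MILLION = Decimal("2.60")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: estimated USD cost of one request at the listed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Each side is billed separately: tokens * (rate per 1M tokens) / 1M.
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Illustrative request: 200,000 input tokens and 10,000 output tokens.
    print(estimate_cost_usd(200_000, 10_000))
```

For the illustrative request, the input side contributes $0.12 and the output side $0.026, for an estimated total of $0.146. Decimal is used instead of float so that small per-request amounts add up without rounding drift.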

Can I try Nemotron 3 Ultra (free) for free?

Yes. You can start a chat instantly and test the model before deciding on a plan.

Does Nemotron 3 Ultra (free) support images or audio?

Nemotron 3 Ultra (free) focuses on text-based tasks. Text is the only modality listed in its specifications.

Benchmarks and pricing come from Artificial Analysis where available. Catalog specs are used as a fallback.