All Models
NVIDIA AI Models
Builds Nemotron models optimized for NVIDIA hardware. Leading GPU maker powering most AI training.
Nemotron 3 Nano Omni 30B A3B Reasoning
NVIDIA
A powerful AI model for general-purpose tasks.
ContextN/A
Speed285 tok/s
InputN/A
OutputN/A
ReasoningYes
Nemotron Cascade 2 30B A3B
NVIDIA
A powerful AI model for general-purpose tasks.
ContextN/A
SpeedN/A
InputN/A
OutputN/A
ReasoningNo
Nemotron 3 Nano 4B
NVIDIA
A powerful AI model for general-purpose tasks.
ContextN/A
SpeedN/A
InputN/A
OutputN/A
ReasoningNo
Nemotron 3 Super 120B A12B (Reasoning)
NVIDIA
A powerful AI model for general-purpose tasks.
ContextN/A
Speed150 tok/s
InputN/A
OutputN/A
ReasoningYes
Nemotron 3 Nano 30B A3B (Reasoning)
NVIDIA
A powerful AI model for general-purpose tasks.
ContextN/A
Speed134 tok/s
InputN/A
OutputN/A
ReasoningYes
Nemotron 3 Nano 30B A3B (Non-reasoning)
NVIDIA
A powerful AI model for general-purpose tasks.
ContextN/A
Speed87 tok/s
InputN/A
OutputN/A
ReasoningYes
Llama 3.3 Nemotron Super 49B V1.5
NVIDIA
Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context.
Context131K
Speed45 tok/s
InputText
OutputText
ReasoningYes