Model Comparison

Llama 3.2 Instruct 90B (Vision)
vs. M3

Comparing 2 AI models · 11 benchmarks · Meta, MiniMax

Recommended Pick

M3 10 metric wins

Strongest on: Value, Input price, Output price

Best Value

100.0 value score

58.1 reasoning / $0.52/1M

Lowest Price

$0.30/1M input price

Best Reasoning

58.1 reasoning score

Blends available reasoning benchmarks

Best for Coding

43.4 coding index

Composite Indices

Higher is better; speed and price are normalized

Only benchmarks with data are shown

Best value

M3 has the strongest quality-to-price mix at 100.0 out of 100 value points.

Price gap

M3 is 4.6x cheaper on input tokens than Llama 3.2 Instruct 90B (Vision).

Speed gap

M3 generates about 1.2x as many tokens per second as Llama 3.2 Instruct 90B (Vision).

Reasoning gap

M3 leads Llama 3.2 Instruct 90B (Vision) by 33.7 points on reasoning.

Top-pick rationale

M3 wins 10 measurable categories, including Value, Input price, Output price, Blended price.

Live compare

Run one prompt through the selected models and compare response quality with live speed and cost context.

Llama 3.2 Instruct 90B (Vision)

Inference API

Metric	Me Llama 3.2 Instruct 90B (Vision) Meta	Top Pick Mi M3 MiniMax
Pricing per 1M tokens
Input Cost	$1.38/1M	$0.30/1M
Output Cost	$1.38/1M	$1.20/1M
Blended (3:1)	$1.38/1M	$0.52/1M
Specifications
Organization	Meta	MiniMax
Release Date	Sep 25, 2024	Jun 1, 2026
Performance & Speed
Throughput	46.4 tok/s	57.2 tok/s
TTFT	569ms	1843ms
Latency	569ms	36819ms
Composite Indices
Value Score	16.0	100.0
Reasoning Score	24.4	58.1
Intelligence	6.2	44.4
Coding	—	43.4
Standard Benchmarks
GPQA	43.2%	92.9%
MMLU Pro	67.1%	—
HLE	4.9%	37.1%
LiveCodeBench	21.4%	—
MATH 500	62.9%	—
AIME (Original)	5.0%	—
SciCode	24.0%	45.4%
LCR	—	74.0%
IFBench	—	82.9%
TAU-bench v2	—	88.9%
TerminalBench Hard	—	42.4%