Model Comparison

gpt-oss-120b (high)
vs. MiMo-V2.5-Pro

Comparing 2 AI models · 10 benchmarks · OpenAI, Xiaomi

Recommended Pick

MiMo-V2.5-Pro 9 metric wins

Strongest on: Intelligence, Coding, GPQA

Best Value

gpt-oss-120b (high)

100.0 value score

61.5 reasoning / $0.26/1M

Lowest Price

gpt-oss-120b (high)

$0.15/1M input price

Best Reasoning

gpt-oss-120b (high)

61.5 reasoning score

Blends available reasoning benchmarks

Best for Coding

MiMo-V2.5-Pro

45.5 coding index

Composite Indices

Higher is better; speed and price are normalized

Standard Benchmarks

Only benchmarks with data are shown

Differences That Matter

Best value

gpt-oss-120b (high) has the strongest quality-to-price mix at 100.0 out of 100 value points.

Price gap

gpt-oss-120b (high) is 2.9x cheaper on input tokens than MiMo-V2.5-Pro.

Speed gap

gpt-oss-120b (high) generates about 8.6x as many tokens per second as MiMo-V2.5-Pro.

Reasoning gap

gpt-oss-120b (high) leads MiMo-V2.5-Pro by 7.3 points on reasoning.

Coding gap

MiMo-V2.5-Pro leads gpt-oss-120b (high) by 16.9 points on coding.

Live compare

Response Face-Off

Run one prompt through the selected models and compare response quality with live speed and cost context.

gpt-oss-120b (high)

OpenAI

Waiting

TTFT

—

Time

—

tok/s

—

Tokens

—

Cost

—

Waiting

MiMo-V2.5-Pro

Xiaomi

Waiting

TTFT

—

Time

—

tok/s

—

Tokens

—

Cost

—

Waiting

Which answer was more useful?

AI Chat

Chat with 80+ models

Chat for free

Inference API

EU-hosted inference

Get API access

Full Comparison

Metric	Op gpt-oss-120b (high) OpenAI	Top Pick Xi MiMo-V2.5-Pro Xiaomi
Pricing per 1M tokens
Input Cost	$0.15/1M	$0.43/1M
Output Cost	$0.60/1M	$0.87/1M
Blended (3:1)	$0.26/1M	$0.54/1M
Specifications
Organization	OpenAI	Xiaomi
Release Date	Aug 5, 2025	Apr 22, 2026
Performance & Speed
Throughput	350.5 tok/s	40.7 tok/s
TTFT	524ms	1981ms
Latency	6230ms	51111ms
Composite Indices
Value Score	100.0	42.6
Reasoning Score	61.5	54.2
Intelligence	23.8	42.2
Coding	28.6	45.5
Math	93.4	—
Standard Benchmarks
GPQA	78.2%	86.6%
MMLU Pro	80.8%	—
HLE	18.5%	33.8%
LiveCodeBench	87.8%	—
AIME 2025	93.4%	—
SciCode	38.9%	50.2%
LCR	50.7%	73.3%
IFBench	69.0%	79.9%
TAU-bench v2	65.8%	94.2%
TerminalBench Hard	23.5%	43.2%