Model Comparison

GPT-5.3 Codex (xhigh)
vs. MiMo-V2.5-Pro

Comparing 2 AI models · 7 benchmarks · OpenAI, Xiaomi

Recommended Pick

GPT-5.3 Codex (xhigh) 9 metric wins

Strongest on: Throughput, Reasoning, Intelligence

Best Value

MiMo-V2.5-Pro

100.0 value score

54.2 reasoning / $0.54/1M

Lowest Price

MiMo-V2.5-Pro

$0.43/1M input price

Best Reasoning

GPT-5.3 Codex (xhigh)

58.6 reasoning score

Blends available reasoning benchmarks

Best for Coding

GPT-5.3 Codex (xhigh)

53.1 coding index

Composite Indices

Higher is better; speed and price are normalized

Standard Benchmarks

Only benchmarks with data are shown

Differences That Matter

Best value

MiMo-V2.5-Pro has the strongest quality-to-price mix at 100.0 out of 100 value points.

Price gap

MiMo-V2.5-Pro is 4.0x cheaper on input tokens than GPT-5.3 Codex (xhigh).

Speed gap

GPT-5.3 Codex (xhigh) generates about 1.9x as many tokens per second as MiMo-V2.5-Pro.

Reasoning gap

GPT-5.3 Codex (xhigh) leads MiMo-V2.5-Pro by 4.4 points on reasoning.

Coding gap

GPT-5.3 Codex (xhigh) leads MiMo-V2.5-Pro by 7.6 points on coding.

Live compare

Response Face-Off

Run one prompt through the selected models and compare response quality with live speed and cost context.

GPT-5.3 Codex (xhigh)

OpenAI

Waiting

TTFT

—

Time

—

tok/s

—

Tokens

—

Cost

—

Waiting

MiMo-V2.5-Pro

Xiaomi

Waiting

TTFT

—

Time

—

tok/s

—

Tokens

—

Cost

—

Waiting

Which answer was more useful?

AI Chat

Chat with 80+ models

Chat for free

Inference API

EU-hosted inference

Get API access

Full Comparison

Metric	Top Pick Op GPT-5.3 Codex (xhigh) OpenAI	Xi MiMo-V2.5-Pro Xiaomi
Pricing per 1M tokens
Input Cost	$1.75/1M	$0.43/1M
Output Cost	$14.00/1M	$0.87/1M
Blended (3:1)	$4.81/1M	$0.54/1M
Specifications
Organization	OpenAI	Xiaomi
Release Date	Feb 5, 2026	Apr 22, 2026
Performance & Speed
Throughput	88.2 tok/s	47.5 tok/s
TTFT	83405ms	1832ms
Latency	83405ms	43974ms
Composite Indices
Value Score	12.2	100.0
Reasoning Score	58.6	54.2
Intelligence	44.3	42.2
Coding	53.1	45.5
Standard Benchmarks
GPQA	91.5%	86.6%
HLE	39.9%	33.8%
SciCode	53.2%	50.2%
LCR	74.0%	73.3%
IFBench	75.4%	79.9%
TAU-bench v2	86.0%	94.2%
TerminalBench Hard	53.0%	43.2%