MiMo-V2.5

by Xiaomi

MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance at roughly half the inference cost, while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding...

Input Price: $1.00/1M tokens
Output Price: $3.00/1M tokens
Intelligence: 53.8
Coding: 45.5

Specifications

Technical details and pricing.

Provider: Xiaomi
Context Window: 1,048,576 tokens
Release Date: Apr 22, 2026
Modalities: Text, Audio, Image, Video → Text
Capabilities: Vision, Audio Input

Benchmarks

7 benchmark scores from Artificial Analysis.

GPQA: 86.6%
HLE: 33.8%
SciCode: 50.2%
LCR: 73.3%
IFBench: 79.9%
Tau2: 94.2%
TerminalBench Hard: 43.2%

Frequently Asked Questions

What is MiMo-V2.5 good for?

MiMo-V2.5 suits agentic workflows and multimodal understanding of images, video, and audio, as well as everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.

How much does MiMo-V2.5 cost?

Pricing is based on usage. Current rates are $1.00/1M tokens for input and $3.00/1M tokens for output.
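
As a rough illustration of the arithmetic behind these rates, the sketch below estimates the cost of a single request. The function name `estimate_cost` and the example token counts are hypothetical; only the two per-million-token prices come from this page, and actual billing may differ (for example through rounding or provider-specific fees).

```python
# Listed rates for MiMo-V2.5, in USD per 1M tokens (taken from this page).
INPUT_PRICE_PER_M = 1.00
OUTPUT_PRICE_PER_M = 3.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 50,000-token reply.
    print(f"${estimate_cost(200_000, 50_000):.2f}")
```

At these rates, output tokens cost three times as much as input tokens, so long generations tend to dominate the bill more than long prompts do.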

Can I try MiMo-V2.5 for free?

Yes. You can start a chat instantly and test the model before deciding on a plan.

Does MiMo-V2.5 support images or audio?

Yes. MiMo-V2.5 accepts image, audio, and video input alongside text, and responds in text.

Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.
