MiMo-V2-Omni

Name: MiMo-V2-Omni
Brand: Xiaomi

vonXiaomi

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step planning, tool use, and code execution - making it well-suited for complex real-world tasks that span modalities, 256K context window.

Chatten mitMiMo-V2-Omni

Eingabepreis$0.00/1M tokens

Ausgabepreis$0.00/1M tokens

Intelligenz43.4

Coding35.5

Spezifikationen

Technische Details und Preise.

AnbieterXiaomi

Kontextfenster262,144 tokens

Veröffentlichungsdatum19. März 2026

ModalitätenText, Audio, Image, Video → Text

FähigkeitenVision, Audio Input

Benchmarks

7 Benchmark-Scores von Artificial Analysis.

GPQA82.8%

HLE19.9%

SciCode36.7%

LCR66.7%

IFBench53.5%

Tau291.2%

TerminalBench Hard34.8%

Composite Indices

Intelligence, Coding, Math

Standard Benchmarks

Academic and industry benchmarks

Häufig gestellte Fragen

Wofür eignet sich MiMo-V2-Omni?

Nutzen Sie MiMo-V2-Omni für alltägliche Aufgaben wie Schreiben, Zusammenfassen, Brainstorming und klare Erklärungen.

Wie viel kostet MiMo-V2-Omni?

Die Abrechnung erfolgt nutzungsbasiert. Aktuell kostet die Eingabe $0.00/1M tokens und die Ausgabe $0.00/1M tokens.

Kann ich MiMo-V2-Omni kostenlos testen?

Ja. Sie können sofort einen Chat starten und das Modell testen, bevor Sie sich für einen Plan entscheiden.

Unterstützt MiMo-V2-Omni Bilder oder Audio?

MiMo-V2-Omni kann Bilder verstehen.

Ähnliche Modelle

Weitere Modelle, die Sie sich ansehen könnten.

MiMo-V2-Pro

Xiaomi

MiMo-V2-Pro is Xiaomi's flagship foundation model, featuring over 1T total parameters and a 1M context length, deeply optimized for agentic scenarios.

Details →

MiMo-V2-Flash

Xiaomi

MiMo-V2-Flash is an open-source foundation language model developed by Xiaomi.

Details →

Riverflow V2 Pro

Sourceful

Riverflow V2 Pro is the most powerful variant of Sourceful's Riverflow 2.0 lineup, best for top-tier control and perfect text rendering.

Details →

Benchmarks und Preise stammen, sofern verfügbar, von Artificial Analysis. OpenRouter-Spezifikationen dienen als Fallback.