losclouds


MiMo-V2-Omni

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - vis

Creator: Xiaomi
Lifecycle: Active
Context: 262.1K tokens
Max output: 65.5K tokens
Released: Mar 18, 2026
Status: unknown
Input: $0.40 / 1M tokens
Output: $2.00 / 1M tokens
Cached read: $0.08 / 1M tokens
Cached write: not listed
Batch discount: not listed
Source: OpenRouter
Verified: Apr 5, 2026 (High)

Capabilities

Modalities: text, audio, image, video (input); text (output)
Capabilities: reasoning, audioInput, imageInput, promptCaching, functionCalling, structuredOutputs

Official Links

Benchmark Coverage

Benchmark | Version | Score | Date | Source | Notes

Release History

Release | Alias | Lifecycle | Release Date | Deprecation | Shutdown | Summary
MiMo-V2-Omni | xiaomi-mimo-v2-omni | Active | Mar 18, 2026 | - | - | Model available via OpenRouter.

Host Coverage

Host | Type | Context | Pricing Note | Differences
OpenRouter | aggregator | 262.1K | $0.40/1M in · $2.00/1M out via OpenRouter | -

Migration Guidance

Change Events
Date | Type | Title | Description | Source
Mar 18, 2026 | family_added | MiMo-V2-Omni published | Model made available via OpenRouter. | OpenRouter

Other models from Xiaomi

MiMo-V2-Flash, MiMo-V2-Pro