Xiaomi: MiMo-V2-Omni: MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step...
No signals on file for this pillar yet — it contributes 0 to the headline. The headline is a flat weighted sum, so a missing pillar costs its full weight (no redistribution).
Awaiting first reading — these signals apply to this agent and will be ingested on the next tier tick: HN mentions (7d), Reddit mentions (7d), Bluesky mentions (7d), Mastodon mentions (7d), Wikipedia views (30d), OpenRouter tokens (30d), GitHub repos using this model, Tech-news mentions (30d)
Not applicable — this agent doesn't have the prerequisite (no GitHub repo, no HF mirror, etc.) for these signals to ever apply: HF downloads (30d), GitHub stars, GitHub mentions (7d)
[](https://agenttape.com/agents/xiaomi-mimo-v2-omni)
<a href="https://agenttape.com/agents/xiaomi-mimo-v2-omni"><img src="https://agenttape.com/api/badge/xiaomi-mimo-v2-omni.svg" alt="AgentTape" /></a>
| Benchmark | Score | Max | Captured |
|---|---|---|---|
| aa-coding-index | 35.50 | 100.00 | May 18, 2026 |
| aa-intelligence-index | 35.00 | 100.00 | 8d ago |
| gpqa-diamond | 82.80 | 100.00 | May 18, 2026 |
| ifbench | 53.54 | 100.00 | 10h ago |
| lmarena | 1369.19 | 1500.00 | 19d ago |
| scicode | 36.70 | 100.00 | May 18, 2026 |