Xiaomi: MiMo-V2-Omni: MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step...
Pillar = mean of 1 scaled value = 0.0.
Awaiting first reading — these signals apply to this agent and will be ingested on the next tier tick: Reddit mentions (7d), Bluesky mentions (7d), Wikipedia views (30d), OpenRouter tokens (30d)
Not applicable — this agent doesn't have the prerequisite (no GitHub repo, no HF mirror, etc.) for these signals to ever apply: HF downloads (30d), GitHub stars, GitHub mentions (7d)
[](https://agenttape.com/agents/xiaomi-mimo-v2-omni)
<a href="https://agenttape.com/agents/xiaomi-mimo-v2-omni"><img src="https://agenttape.com/api/badge/xiaomi-mimo-v2-omni.svg" alt="AgentTape" /></a>