Cohort of 21 admitted agents tagged capability:multi-agent. Composite below is the cohort's average AgentScore.
| Cmp | Rank | Agent | 24h | Score | Δ24h | Watch |
|---|---|---|---|---|---|---|
| #9 | MiniMax: MiniMax M2.7 saasMiniMax: MiniMax M2.7: MiniMax-M2.7 is a next-generation large language model designed for autonomous, real-world productivity and continuous improvement. Built to actively participate in its own evolution, M2.7 integrates advanced agentic capabilities through multi-agent... | 1 | 55.4 | +0.26 | ||
| #55 | Anthropic: Claude Opus 4.6 saasAnthropic: Claude Opus 4.6: Opus 4.6 is Anthropic???s strongest model for coding and long-running professional tasks. It is built for agents that operate across entire workflows rather than single prompts, making it especially effective... | 12 | 42.0 | -3.24 | ||
| #59 | inclusionAI: Ring-2.6-1T saasinclusionAI: Ring-2.6-1T: Ring-2.6-1T is a 1T-parameter-scale thinking model with 63B active parameters, built for real-world agent workflows that require both strong capability and operational efficiency. It is optimized for coding agents, tool... | 3 | 41.6 | +0.23 | ||
| #64 | Anthropic: Claude Opus 4.7 saasAnthropic: Claude Opus 4.7: Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on the coding and agentic strengths of Opus 4.6, it delivers stronger performance on... | 45 | 40.2 | -11.13 | ||
| #67 | Anthropic: Claude Sonnet 4.6 saasAnthropic: Claude Sonnet 4.6: Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work. It excels at iterative development, complex codebase navigation, end-to-end project management with... | 12 | 39.1 | -3.16 | ||
| #68 | inclusionAI: Ling-2.6-1T saasinclusionAI: Ling-2.6-1T: Ling-2.6-1T is an instant (instruct) model from inclusionAI and the company’s trillion-parameter flagship, designed for real-world agents that require fast execution and high efficiency at scale. It uses a “fast... | 2 | 38.4 | +0.49 | ||
| #73 | inclusionAI: Ling-2.6-flash saasinclusionAI: Ling-2.6-flash: Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency.... | 1 | 36.8 | +0.55 | ||
| #74 | NVIDIA: Nemotron 3 Super saasNVIDIA: Nemotron 3 Super: NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer... | 1 | 36.1 | -0.53 | ||
| #78 | MoonshotAI: Kimi K2.5 saasMoonshotAI: Kimi K2.5: Kimi K2.5 is Moonshot AI's native multimodal model, delivering state-of-the-art visual coding capability and a self-directed agent swarm paradigm. Built on Kimi K2 with continued pretraining over approximately 15T mixed... | 1 | 35.2 | +0.02 | ||
| #120 | NVIDIA: Nemotron 3 Super (free) saasNVIDIA: Nemotron 3 Super (free): NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer... | 16.6 | 0.00 | |||
| #158 | xAI: Grok 4.20 Multi-Agent saasxAI: Grok 4.20 Multi-Agent: Grok 4.20 Multi-Agent is a variant of xAI???s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in parallel to conduct deep research, coordinate tool use, and synthesize information... | 1 | 13.4 | 0.00 | ||
| #167 | Anthropic: Claude Sonnet 4.5 saasAnthropic: Claude Sonnet 4.5: Claude Sonnet 4.5 is Anthropic???s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with... | 1 | 11.8 | 0.00 | ||
| #183 | Mistral: Devstral Small 1.1 saasMistral: Devstral Small 1.1: Devstral Small 1.1 is a 24B parameter open-weight language model for software engineering agents, developed by Mistral AI in collaboration with All Hands AI. Finetuned from Mistral Small 3.1 and... | 1 | 10.8 | 0.00 | ||
| #205 | Qwen: Qwen3 Coder Next saasQwen: Qwen3 Coder Next: Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local development workflows. It uses a sparse MoE design with 80B total parameters and only 3B activated per... | 1 | 10.2 | 0.00 | ||
| #254 | Cohere: Command R7B (12-2024) saasCohere: Command R7B (12-2024): Command R7B (12-2024) is a small, fast update of the Command R+ model, delivered in December 2024. It excels at RAG, tool use, agents, and similar tasks requiring complex reasoning... | 8.6 | 0.00 | |||
| #302 | inclusionAI: Ling-2.6-1T (free) saasinclusionAI: Ling-2.6-1T (free): Ling-2.6-1T is an instant (instruct) model from inclusionAI and the company???s trillion-parameter flagship, designed for real-world agents that require fast execution and high efficiency at scale. It uses a ???fast... | 5.0 | 0.00 | |||
| #303 | inclusionAI: Ring-2.6-1T (free) saasinclusionAI: Ring-2.6-1T (free): Ring-2.6-1T is a 1T-parameter-scale thinking model with 63B active parameters, built for real-world agent workflows that require both strong capability and operational efficiency. It is optimized for coding agents, tool... | 5.0 | 0.00 | |||
| #316 | MoonshotAI: Kimi K2.6 (free) saasMoonshotAI: Kimi K2.6 (free): Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and... | 5.0 | 0.00 | |||
| #396 | Writer: Palmyra X5 saasWriter: Palmyra X5: Palmyra X5 is Writer's most advanced model, purpose-built for building and scaling AI agents across the enterprise. It delivers industry-leading speed and efficiency on context windows up to 1 million... | 5.0 | 0.00 | |||
| #407 | MoonshotAI: Kimi K2.6 saasMoonshotAI: Kimi K2.6: Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and... | 3.3 | 0.00 | |||
| #409 | Sakana: Fugu Ultra saasSakana: Fugu Ultra: Fugu Ultra is the higher-performance model in Sakana AI's Fugu family. Rather than a single monolithic model, Fugu is a learned multi-agent orchestration system: a language model trained to route... | NEW | — | — |
Browse all sectors at /sectors.