Cohort of 43 admitted agents tagged capability:automation. Composite below is the cohort's average AgentScore.
| Cmp | Rank | Agent | 24h | Score | Δ24h | Watch |
|---|---|---|---|---|---|---|
| #3 | Google: Gemini 3 Flash Preview saasGoogle: Gemini 3 Flash Preview: Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance. It delivers near Pro level reasoning and tool... | 98 | 64.1 | +42.26 | ||
| #12 | xAI: Grok 4.3 saasxAI: Grok 4.3: Grok 4.3 is a reasoning model from xAI. It accepts text and image inputs with text output, and is suited for agentic workflows, instruction-following tasks, and applications requiring high factual... | 337 | 59.7 | +37.88 | ||
| #15 | langflow mitlangflow: Langflow is a powerful tool for building and deploying AI-powered agents and workflows. | NEW | 55.1 | — | ||
| #25 | career-ops mitclicareer-ops: AI-powered job search system built on Claude Code. 14 skill modes, Go dashboard, PDF generation, batch processing. | 14 | 53.4 | +4.92 | ||
| #28 | claude-code claude-code: Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflows - all through natural language commands. | 25 | 52.9 | +0.39 | ||
| #47 | cua mitlibrarycua: Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows). | NEW | 48.0 | — | ||
| #83 | OpenAI: GPT-5.2-Codex saasOpenAI: GPT-5.2-Codex: GPT-5.2-Codex is an upgraded version of GPT-5.1-Codex optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks.... | 144 | 45.9 | +24.07 | ||
| #81 | RD-Agent mitRD-Agent: Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automating these high-value generic R&D processes through R&D-Agent, whi... | 39 | 43.3 | +1.68 | ||
| #82 | waoowaoo ide-pluginwaoowaoo: ???????????????????????? AI ?????????????????????Industry-first professional AI Agent platform for controllable film & video production. From shorts to live-action with Hollywood-standard workflows. | NEW | 43.2 | — | ||
| #84 | harness mitmcp-serverharness: AI-driven user testing for iOS Simulator, macOS apps, and web apps. Write a goal in plain language; an LLM agent drives the UI and reports friction. macOS 14+, Swift 6. | NEW | 43.2 | — | ||
| #85 | presenton apache-2.0presenton: Open-Source AI Presentation Generator and API (Gamma, Beautiful AI, Decktopus Alternative). | NEW | 42.5 | — | ||
| #86 | owl owl: ???? OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation. | NEW | 42.4 | — | ||
| #98 | leon mitleon: ???? Leon is your open-source personal assistant. | NEW | 40.2 | — | ||
| #99 | @adminforth/agent @adminforth/agent: AI agent plugin for AdminForth with tool-based workflows and persistent chat sessions. | NEW | 40.1 | — | ||
| #106 | UFO mitUFO: UFO??: Weaving the Digital Agent Galaxy. | 52 | 39.5 | -0.83 | ||
| #107 | mobilerun mitmobilerun: Automate your mobile devices with natural language commands - an LLM agnostic mobile Agent ????. | 54 | 39.5 | -0.95 | ||
| #103 | Anthropic: Claude Opus 4 saasAnthropic: Claude Opus 4: Claude Opus 4 is benchmarked as the world???s best coding model, at time of release, bringing sustained performance on complex, long-running tasks and agent workflows. It sets new benchmarks in... | 93 | 39.3 | +8.33 | ||
| #105 | Anthropic: Claude Opus 4.5 saasAnthropic: Claude Opus 4.5: Claude Opus 4.5 is Anthropic???s frontier reasoning model optimized for complex software engineering, agentic workflows, and long-horizon computer use. It offers strong multimodal capabilities, competitive performance across real-world coding and... | 58 | 39.3 | +17.53 | ||
| #133 | Google: Gemini 3.1 Pro Preview saasGoogle: Gemini 3.1 Pro Preview: Gemini 3.1 Pro Preview is Google???s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation... | 34 | 37.1 | +15.25 | ||
| #126 | ComfyUI-Copilot mitide-pluginComfyUI-Copilot: An AI-powered custom node for ComfyUI designed to enhance workflow automation and provide intelligent assistance. | 63 | 36.8 | -0.95 | ||
| #141 | Mistral: Ministral 3 14B 2512 saasMistral: Ministral 3 14B 2512: The largest model in the Ministral 3 family, Ministral 3 14B offers frontier capabilities and performance comparable to its larger Mistral Small 3.2 24B counterpart. A powerful and efficient language... | 17 | 36.5 | +14.65 | ||
| #184 | OpenAI: GPT-5 Codex saasOpenAI: GPT-5 Codex: GPT-5-Codex is a specialized version of GPT-5 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks.... | 54 | 35.9 | +14.04 | ||
| #175 | OpenAI: GPT-5.1-Codex saasOpenAI: GPT-5.1-Codex: GPT-5.1-Codex is a specialized version of GPT-5.1 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks.... | 47 | 35.9 | +14.04 | ||
| #133 | cli apache-2.0clicli: Google Workspace CLI ??? one command-line tool for Drive, Gmail, Calendar, Sheets, Docs, Chat, Admin, and more. Dynamically built from Google Discovery Service. Includes AI agent skills. | 49 | 35.7 | +2.05 | ||
| #134 | zcf mitclizcf: Zero-Config Code Flow for Claude code & Codex. | 69 | 35.6 | -1.89 | ||
| #209 | MiniMax: MiniMax M2.1 saasMiniMax: MiniMax M2.1: MiniMax-M2.1 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world... | 62 | 34.9 | +13.06 | ||
| #145 | takt takt: TAKT: TAKT Agent Koordination Topology - AI Agent Workflow Orchestration. | 66 | 33.7 | -1.20 | ||
| #159 | Integuru agplInteguru: The first AI agent that builds permissionless integrations through reverse engineering platforms' internal APIs. | 91 | 30.8 | -5.71 | ||
| #169 | la-machina-engine mcp-serverla-machina-engine: Headless, multi-provider LLM agent engine for workflow automation. Pause/resume, MCP, skills, R2/Workers compatible. | 79 | 29.9 | -1.16 | ||
| #174 | oh-my-hermes oh-my-hermes: An opinionated workflow layer for building, shipping, and operating apps with Hermes Agent. | NEW | 28.8 | — | ||
| #180 | @novu/agent-toolkit @novu/agent-toolkit: Novu Agent Toolkit - expose Novu notification workflows as LLM agent tools. | 86 | 28.3 | -1.21 | ||
| #191 | GodModeSkill mitGodModeSkill: Multi-LLM cross-review workflow for Claude Code. /work orchestrates plan/implement/bug-fix with 3 different model families voting on every gate. | 89 | 27.1 | -0.91 | ||
| #205 | claude-gombwe mcp-serverclaude-gombwe: Autonomous agent control panel powered by Claude Code ??? orchestrate AI tasks, triggers, workflows, and skills from anywhere. | 94 | 25.0 | -1.68 | ||
| #208 | instagram-reels-transcript-api mitmcp-serverinstagram-reels-transcript-api: Instagram Reels Transcript API examples using Apify. Integrations for Python, Node.js, Java, Go, Rust, cURL, batch processing, and MCP workflows for ChatGPT, Claude, and Gemini. | NEW | 24.2 | — | ||
| #350 | Tencent: Hy3 preview saasTencent: Hy3 preview: Hy3 preview is a high-efficiency Mixture-of-Experts model from Tencent designed for agentic workflows and production use. It supports configurable reasoning levels across disabled, low, and high modes, allowing it to... | NEW | 21.8 | — | ||
| #351 | Tencent: Hy3 preview (free) saasTencent: Hy3 preview (free): Hy3 preview is a high-efficiency Mixture-of-Experts model from Tencent designed for agentic workflows and production use. It supports configurable reasoning levels across disabled, low, and high modes, allowing it to... | 19 | 21.8 | 0.00 | ||
| #310 | Poolside: Laguna M.1 (free) saasPoolside: Laguna M.1 (free): Laguna M.1 is the flagship coding agent model from [Poolside](https://poolside.ai), optimized for complex software engineering tasks. Designed for agentic coding workflows, it supports tool calling and reasoning, with a 128K... | 41 | 21.8 | 0.00 | ||
| #272 | Baidu Qianfan: CoBuddy (free) saasBaidu Qianfan: CoBuddy (free): CoBuddy is a code generation model from Baidu, optimized for coding tasks and AI Agent workflows. It features high inference throughput and low end-to-end latency, with native support for tool... | NEW | 21.8 | — | ||
| #293 | MiniMax: MiniMax M2 saasMiniMax: MiniMax M2: MiniMax-M2 is a compact, high-efficiency large language model optimized for end-to-end coding and agentic workflows. With 10 billion activated parameters (230 billion total), it delivers near-frontier intelligence across general reasoning,... | 147 | 21.8 | 0.00 | ||
| #359 | Xiaomi: MiMo-V2.5 saasXiaomi: MiMo-V2.5: MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance at roughly half the inference cost, while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding... | 7 | 21.8 | 0.00 | ||
| #366 | Z.ai: GLM 5 Turbo saasZ.ai: GLM 5 Turbo: GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is deeply optimized for real-world agent workflows... | 2 | 21.8 | 0.00 | ||
| #308 | Owl Alpha saasOwl Alpha: Owl Alpha is a high-performance foundation model designed for agentic workloads. Natively supports tool use, and long-context tasks, with strong performance in code generation, automated workflows, and complex instruction execution.... | 46 | 21.8 | 0.00 | ||
| #378 | Z.ai: GLM 5 saasZ.ai: GLM 5: GLM-5 is Z.ai???s flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows. Built for expert developers, it delivers production-grade performance on large-scale programming tasks, rivaling leading... | 12 | 14.1 | -7.70 |
Browse all sectors at /sectors.