Cohort of 108 admitted agents tagged capability:code-generation. Composite below is the cohort's average AgentScore.
| Cmp | Rank | Agent | 24h | Score | Δ24h | Watch |
|---|---|---|---|---|---|---|
| #4 | OpenAI: GPT-5.2-Codex saasOpenAI: GPT-5.2-Codex: GPT-5.2-Codex is an upgraded version of GPT-5.1-Codex optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks.... | 2 | 56.7 | +0.14 | ||
| #5 | MiniMax: MiniMax M2.1 saasMiniMax: MiniMax M2.1: MiniMax-M2.1 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world... | 2 | 56.5 | +0.88 | ||
| #8 | OpenAI: GPT-5.4 Mini saasOpenAI: GPT-5.4 Mini: GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding,... | 5 | 55.5 | -1.89 | ||
| #11 | OpenAI: GPT-5.1-Codex-Mini saasOpenAI: GPT-5.1-Codex-Mini: GPT-5.1-Codex-Mini is a smaller and faster version of GPT-5.1-Codex | 1 | 54.3 | +0.07 | ||
| #12 | OpenAI: GPT-5.4 saasOpenAI: GPT-5.4: GPT-5.4 is OpenAI???s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for... | 8 | 54.0 | -2.87 | ||
| #13 | OpenAI: GPT-5.1-Codex saasOpenAI: GPT-5.1-Codex: GPT-5.1-Codex is a specialized version of GPT-5.1 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks.... | 53.4 | +0.03 | |||
| #16 | MiniMax: MiniMax M3 saasMiniMax: MiniMax M3: MiniMax-M3 is a multimodal foundation model from MiniMax. It supports text, image, and video inputs with text output, a 1M-token context window, and is suited for long-horizon agentic work, coding,... | 1 | 52.1 | -0.08 | ||
| #19 | OpenAI: GPT-5 saasOpenAI: GPT-5: GPT-5 is OpenAI???s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and accuracy... | 10 | 51.0 | +1.97 | ||
| #20 | MiniMax: MiniMax M2.5 saasMiniMax: MiniMax M2.5: MiniMax-M2.5 is a SOTA large language model designed for real-world productivity. Trained in a diverse range of complex real-world digital working environments, M2.5 builds upon the coding expertise of M2.1... | 51.0 | -0.02 | |||
| #22 | OpenAI: GPT-5 Codex saasOpenAI: GPT-5 Codex: GPT-5-Codex is a specialized version of GPT-5 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks.... | 1 | 50.0 | +0.04 | ||
| #23 | Google: Gemini 3.5 Flash saasGoogle: Gemini 3.5 Flash: Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution... | 2 | 49.4 | -0.14 | ||
| #24 | OpenAI: GPT-5.3-Codex saasOpenAI: GPT-5.3-Codex: GPT-5.3-Codex is OpenAI???s most advanced agentic coding model, combining the frontier software engineering performance of GPT-5.2-Codex with the broader reasoning and professional knowledge capabilities of GPT-5.2. It achieves state-of-the-art results... | 2 | 49.2 | -0.24 | ||
| #29 | OpenAI: o3 saasOpenAI: o3: o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following.... | 3 | 47.9 | 0.00 | ||
| #31 | Google: Gemini 2.5 Pro saasGoogle: Gemini 2.5 Pro: Gemini 2.5 Pro is Google???s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs ???thinking??? capabilities, enabling it to reason through responses with enhanced accuracy... | 4 | 47.4 | -0.18 | ||
| #35 | Anthropic: Claude Fable 5 saasAnthropic: Claude Fable 5: Claude Fable 5 is a Mythos-class model from Anthropic, built for autonomous knowledge work and coding. It supports text, image, and file inputs with text output, with reasoning support and... | 7 | 46.2 | +0.71 | ||
| #37 | DeepSeek: DeepSeek V4 Pro saasDeepSeek: DeepSeek V4 Pro: DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window. It is designed for advanced reasoning, coding,... | 16 | 45.5 | -5.44 | ||
| #38 | Anthropic: Claude Opus 4.5 saasAnthropic: Claude Opus 4.5: Claude Opus 4.5 is Anthropic???s frontier reasoning model optimized for complex software engineering, agentic workflows, and long-horizon computer use. It offers strong multimodal capabilities, competitive performance across real-world coding and... | 9 | 45.3 | +0.96 | ||
| #45 | OpenAI: o3 Mini saasOpenAI: o3 Mini: OpenAI o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly excelling in science, mathematics, and coding. This model supports the `reasoning_effort` parameter, which can be set to... | 5 | 43.5 | -0.16 | ||
| #47 | StepFun: Step 3.7 Flash saasStepFun: Step 3.7 Flash: Step 3.7 Flash is StepFun's latest high-efficiency multimodal Mixture-of-Experts model. It pairs a 196B-parameter language backbone with a vision encoder for native image and video understanding, activating roughly 11B parameters... | 6 | 42.8 | +0.39 | ||
| #50 | Mistral: Mistral Medium 3.5 saasMistral: Mistral Medium 3.5: Mistral Medium 3.5 is a dense 128B instruction-following model from Mistral AI. It supports text and image inputs with text output, and is designed for agentic workflows, coding, and complex... | 6 | 42.5 | +0.27 | ||
| #54 | DeepSeek: DeepSeek V3 saasDeepSeek: DeepSeek V3: DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction following and coding abilities of the previous versions. Pre-trained on nearly 15 trillion tokens, the reported evaluations... | 10 | 42.0 | +1.06 | ||
| #55 | Anthropic: Claude Opus 4.6 saasAnthropic: Claude Opus 4.6: Opus 4.6 is Anthropic???s strongest model for coding and long-running professional tasks. It is built for agents that operate across entire workflows rather than single prompts, making it especially effective... | 12 | 42.0 | -3.24 | ||
| #58 | OpenAI: GPT-5 Nano saasOpenAI: GPT-5 Nano: GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for developer tools, rapid interactions, and ultra-low latency environments. While limited in reasoning depth compared to its larger... | 3 | 41.6 | +0.01 | ||
| #59 | inclusionAI: Ring-2.6-1T saasinclusionAI: Ring-2.6-1T: Ring-2.6-1T is a 1T-parameter-scale thinking model with 63B active parameters, built for real-world agent workflows that require both strong capability and operational efficiency. It is optimized for coding agents, tool... | 3 | 41.6 | +0.23 | ||
| #60 | Google: Gemini 2.5 Flash saasGoogle: Gemini 2.5 Flash: Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater... | 1 | 41.6 | -0.33 | ||
| #63 | MiniMax: MiniMax M2 saasMiniMax: MiniMax M2: MiniMax-M2 is a compact, high-efficiency large language model optimized for end-to-end coding and agentic workflows. With 10 billion activated parameters (230 billion total), it delivers near-frontier intelligence across general reasoning,... | 2 | 40.9 | +0.65 | ||
| #64 | Anthropic: Claude Opus 4.7 saasAnthropic: Claude Opus 4.7: Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on the coding and agentic strengths of Opus 4.6, it delivers stronger performance on... | 45 | 40.2 | -11.13 | ||
| #67 | Anthropic: Claude Sonnet 4.6 saasAnthropic: Claude Sonnet 4.6: Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work. It excels at iterative development, complex codebase navigation, end-to-end project management with... | 12 | 39.1 | -3.16 | ||
| #71 | Google: Gemini 2.5 Pro Preview 06-05 saasGoogle: Gemini 2.5 Pro Preview 06-05: Gemini 2.5 Pro is Google???s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs ???thinking??? capabilities, enabling it to reason through responses with enhanced accuracy... | 37.4 | -0.34 | |||
| #76 | Anthropic: Claude 3.7 Sonnet saasAnthropic: Claude 3.7 Sonnet: Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and problem-solving capabilities. It introduces a hybrid reasoning approach, allowing users to choose between rapid responses and... | 3 | 35.6 | +1.08 | ||
| #78 | MoonshotAI: Kimi K2.5 saasMoonshotAI: Kimi K2.5: Kimi K2.5 is Moonshot AI's native multimodal model, delivering state-of-the-art visual coding capability and a self-directed agent swarm paradigm. Built on Kimi K2 with continued pretraining over approximately 15T mixed... | 1 | 35.2 | +0.02 | ||
| #81 | Mistral Large saasMistral Large: This is Mistral AI's flagship model, Mistral Large 2 (version `mistral-large-2407`). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch announcement [here](https://mistral.ai/news/mistral-large-2407/).... | 9 | 33.9 | +3.40 | ||
| #87 | Anthropic: Claude 3.7 Sonnet (thinking) saasAnthropic: Claude 3.7 Sonnet (thinking): Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and problem-solving capabilities. It introduces a hybrid reasoning approach, allowing users to choose between rapid responses and... | 1 | 32.3 | -0.06 | ||
| #92 | Cohere: Command A saasCohere: Command A: Command A is an open-weights 111B parameter model with a 256k context window focused on delivering great performance across agentic, multilingual, and coding use cases. Compared to other leading proprietary... | 5 | 30.4 | +2.25 | ||
| #93 | xAI: Grok 3 saasxAI: Grok 3: Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in... | 2 | 30.0 | +1.21 | ||
| #94 | Anthropic: Claude Sonnet 4 saasAnthropic: Claude Sonnet 4: Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on SWE-bench (72.7%),... | 2 | 30.0 | +0.10 | ||
| #95 | Anthropic: Claude 3.5 Haiku saasAnthropic: Claude 3.5 Haiku: Claude 3.5 Haiku features offers enhanced capabilities in speed, coding accuracy, and tool use. Engineered to excel in real-time applications, it delivers quick response times that are essential for dynamic... | 1 | 29.3 | +0.17 | ||
| #96 | Mistral: Devstral Medium saasMistral: Devstral Medium: Devstral Medium is a high-performance code generation and agentic reasoning model developed jointly by Mistral AI and All Hands AI. Positioned as a step up from Devstral Small, it achieves... | 3 | 28.7 | -0.52 | ||
| #100 | xAI: Grok Code Fast 1 saasxAI: Grok Code Fast 1: Grok Code Fast 1 is a speedy and economical reasoning model that excels at agentic coding. With reasoning traces visible in the response, developers can steer Grok Code for high-quality... | 27.3 | +0.12 | |||
| #104 | IBM: Granite 4.1 8B saasIBM: Granite 4.1 8B: Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family. It supports a 131K-token context window and is designed for enterprise tasks... | 25.6 | +0.48 | |||
| #112 | Qwen: Qwen2.5 7B Instruct saasQwen: Qwen2.5 7B Instruct: Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and... | 1 | 20.7 | 0.00 | ||
| #119 | Google: Gemini 3 Flash Preview saasGoogle: Gemini 3 Flash Preview: Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance. It delivers near Pro level reasoning and tool... | 16.6 | 0.00 | |||
| #121 | NVIDIA: Nemotron 3 Nano 30B A3B saasNVIDIA: Nemotron 3 Nano 30B A3B: NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully... | 16.5 | 0.00 | |||
| #122 | NVIDIA: Nemotron 3 Nano 30B A3B (free) saasNVIDIA: Nemotron 3 Nano 30B A3B (free): NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully... | 1 | 16.2 | 0.00 | ||
| #123 | xAI: Grok 3 Beta saasxAI: Grok 3 Beta: Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in... | 2 | 16.1 | 0.00 | ||
| #127 | Z.ai: GLM 5V Turbo saasZ.ai: GLM 5V Turbo: GLM-5V-Turbo is Z.ai???s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding,... | 1 | 15.7 | 0.00 | ||
| #130 | Mistral: Devstral 2 2512 saasMistral: Devstral 2 2512: Devstral 2 is a state-of-the-art open-source model by Mistral AI specializing in agentic coding. It is a 123B-parameter dense transformer model supporting a 256K context window. Devstral 2 supports exploring... | 1 | 15.5 | 0.00 | ||
| #134 | OpenAI: GPT-5.2 Pro saasOpenAI: GPT-5.2 Pro: GPT-5.2 Pro is OpenAI???s most advanced model, offering major improvements in agentic coding and long context performance over GPT-5 Pro. It is optimized for complex tasks that require step-by-step reasoning,... | 1 | 15.4 | 0.00 | ||
| #140 | Z.ai: GLM 4.7 Flash saasZ.ai: GLM 4.7 Flash: As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,... | 1 | 15.3 | 0.00 | ||
| #148 | OpenAI: GPT Audio saasOpenAI: GPT Audio: The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced... | 2 | 14.7 | 0.00 | ||
| #149 | OpenAI: GPT Audio Mini saasOpenAI: GPT Audio Mini: A cost-efficient version of GPT Audio. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Input is priced at $0.60 per million... | 2 | 14.6 | 0.00 | ||
| #153 | AlfredPros: CodeLLaMa 7B Instruct Solidity saasAlfredPros: CodeLLaMa 7B Instruct Solidity: A finetuned 7 billion parameters Code LLaMA - Instruct model to generate Solidity smart contract using 4-bit QLoRA finetuning provided by PEFT library. | 2 | 14.1 | 0.00 | ||
| #154 | Z.ai: GLM 5 saasZ.ai: GLM 5: GLM-5 is Z.ai???s flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows. Built for expert developers, it delivers production-grade performance on large-scale programming tasks, rivaling leading... | 32 | 14.0 | -2.37 | ||
| #164 | Qwen: Qwen3.5-9B saasQwen: Qwen3.5-9B: Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture. It uses a unified vision-language design... | 1 | 12.0 | 0.00 | ||
| #165 | MiniMax: MiniMax M2.5 (free) saasMiniMax: MiniMax M2.5 (free): MiniMax-M2.5 is a SOTA large language model designed for real-world productivity. Trained in a diverse range of complex real-world digital working environments, M2.5 builds upon the coding expertise of M2.1... | 1 | 11.9 | 0.00 | ||
| #167 | Anthropic: Claude Sonnet 4.5 saasAnthropic: Claude Sonnet 4.5: Claude Sonnet 4.5 is Anthropic???s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with... | 1 | 11.8 | 0.00 | ||
| #174 | Qwen: Qwen3 Coder Plus saasQwen: Qwen3 Coder Plus: Qwen3 Coder Plus is Alibaba's proprietary version of the Open Source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializing in autonomous programming via tool calling and... | 1 | 11.6 | 0.00 | ||
| #196 | OpenAI: GPT-5.1-Codex-Max saasOpenAI: GPT-5.1-Codex-Max: GPT-5.1-Codex-Max is OpenAI???s latest agentic coding model, designed for long-running, high-context software development tasks. It is based on an updated version of the 5.1 reasoning stack and trained on agentic... | 1 | 10.5 | 0.00 | ||
| #198 | OpenAI: GPT-5 Pro saasOpenAI: GPT-5 Pro: GPT-5 Pro is OpenAI???s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and... | 1 | 10.5 | 0.00 | ||
| #204 | Cohere: Command R+ (08-2024) saasCohere: Command R+ (08-2024): command-r-plus-08-2024 is an update of the [Command R+](/models/cohere/command-r-plus) with roughly 50% higher throughput and 25% lower latencies as compared to the previous Command R+ version, while keeping the hardware footprint... | 1 | 10.3 | 0.00 | ||
| #205 | Qwen: Qwen3 Coder Next saasQwen: Qwen3 Coder Next: Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local development workflows. It uses a sparse MoE design with 80B total parameters and only 3B activated per... | 1 | 10.2 | 0.00 | ||
| #214 | Anthropic: Claude Opus 4.1 saasAnthropic: Claude Opus 4.1: Claude Opus 4.1 is an updated version of Anthropic???s flagship model, offering improved performance in coding, reasoning, and agentic tasks. It achieves 74.5% on SWE-bench Verified and shows notable gains... | 1 | 9.6 | 0.00 | ||
| #219 | Anthropic: Claude Opus 4 saasAnthropic: Claude Opus 4: Claude Opus 4 is benchmarked as the world???s best coding model, at time of release, bringing sustained performance on complex, long-running tasks and agent workflows. It sets new benchmarks in... | 102 | 9.1 | -8.70 | ||
| #220 | Google: Gemini 2.5 Pro Preview 05-06 saasGoogle: Gemini 2.5 Pro Preview 05-06: Gemini 2.5 Pro is Google???s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs ???thinking??? capabilities, enabling it to reason through responses with enhanced accuracy... | 9.0 | 0.00 | |||
| #231 | Mistral: Codestral 2508 saasMistral: Codestral 2508: Mistral's cutting-edge language model for coding released end of July 2025. Codestral specializes in low-latency, high-frequency tasks such as fill-in-the-middle (FIM), code correction and test generation.
[Blog Post](https://mistral.ai/news/codestral-25-08) | 8.9 | 0.00 | |||
| #232 | Mistral Large 2407 saasMistral Large 2407: This is Mistral AI's flagship model, Mistral Large 2 (version mistral-large-2407). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch announcement [here](https://mistral.ai/news/mistral-large-2407/).... | 8.9 | 0.00 | |||
| #234 | Mistral: Mixtral 8x22B Instruct saasMistral: Mixtral 8x22B Instruct: Mistral's official instruct fine-tuned version of [Mixtral 8x22B](/models/mistralai/mixtral-8x22b). It uses 39B active parameters out of 141B, offering unparalleled cost efficiency for its size. Its strengths include: - strong math, coding,... | 8.9 | 0.00 | |||
| #237 | OpenAI: GPT-3.5 Turbo saasOpenAI: GPT-3.5 Turbo: GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks.
Training data up to Sep 2021. | 8.7 | 0.00 | |||
| #239 | OpenAI: GPT-3.5 Turbo (older v0613) saasOpenAI: GPT-3.5 Turbo (older v0613): GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks.
Training data up to Sep 2021. | 8.7 | 0.00 | |||
| #243 | OpenAI: GPT-5.4 Image 2 saasOpenAI: GPT-5.4 Image 2: [GPT-5.4](https://openrouter.ai/openai/gpt-5.4) Image 2 combines OpenAI's GPT-5.4 model with state-of-the-art image generation capabilities from GPT Image 2. It enables rich multimodal workflows, allowing users to seamlessly move between reasoning, coding, and... | 8.7 | 0.00 | |||
| #246 | OpenAI: GPT-5 Image saasOpenAI: GPT-5 Image: [GPT-5](https://openrouter.ai/openai/gpt-5) Image combines OpenAI's GPT-5 model with state-of-the-art image generation capabilities. It offers major improvements in reasoning, code quality, and user experience while incorporating GPT Image 1's superior instruction following,... | 8.7 | 0.00 | |||
| #258 | Kwaipilot: KAT-Coder-Pro V2 saasKwaipilot: KAT-Coder-Pro V2: KAT-Coder-Pro V2 is the latest high-performance model in KwaiKAT???s KAT-Coder series, designed for complex enterprise-grade software engineering and SaaS integration. It builds on the agentic coding strengths of earlier versions,... | 7.4 | 0.00 | |||
| #260 | Cohere: North Mini Code (free) saasCohere: North Mini Code (free): North Mini Code is Cohere's first agentic coding model and the debut of its North family. A sparse mixture-of-experts model with 30B total parameters and 3B active, it is optimized... | 6.0 | 0.00 | |||
| #264 | Poolside: Laguna M.1 saasPoolside: Laguna M.1: Laguna M.1 is the flagship coding agent model from [Poolside](https://poolside.ai/), optimized for complex software engineering tasks. Designed for agentic coding workflows, it supports tool calling and reasoning, with a 256K... | 6.0 | 0.00 | |||
| #265 | Poolside: Laguna XS.2 saasPoolside: Laguna XS.2: Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://poolside.ai/), their efficient coding agent series. It combines tool calling and reasoning capabilities with a compact footprint, offering... | 6.0 | 0.00 | |||
| #267 | MoonshotAI: Kimi K2.7 Code saasMoonshotAI: Kimi K2.7 Code: MoonshotAI: Kimi K2.7 Code is a coding-focused model in Moonshot AI's Kimi K2 family, built to complete end-to-end programming tasks reliably over long contexts. It uses a native multimodal mixture-of-experts... | 1 | 5.5 | 0.00 | ||
| #270 | AionLabs: Aion-1.0 saasAionLabs: Aion-1.0: Aion-1.0 is a multi-model system designed for high performance across various tasks, including reasoning and coding. It is built on DeepSeek-R1, augmented with additional models and techniques such as Tree... | 5.0 | 0.00 | |||
| #271 | AionLabs: Aion-1.0-Mini saasAionLabs: Aion-1.0-Mini: Aion-1.0-Mini 32B parameter model is a distilled version of the DeepSeek-R1 model, designed for strong performance in reasoning domains such as mathematics, coding, and logic. It is a modified variant... | 5.0 | 0.00 | |||
| #281 | Arcee AI: Coder Large saasArcee AI: Coder Large: Coder???Large is a 32 B???parameter offspring of Qwen 2.5???Instruct that has been further trained on permissively???licensed GitHub, CodeSearchNet and synthetic bug???fix corpora. It supports a 32k context window, enabling multi???file... | 5.0 | 0.00 | |||
| #288 | Baidu: ERNIE 4.5 21B A3B Thinking saasBaidu: ERNIE 4.5 21B A3B Thinking: ERNIE-4.5-21B-A3B-Thinking is Baidu's upgraded lightweight MoE model, refined to boost reasoning depth and quality for top-tier performance in logical puzzles, math, science, coding, text generation, and expert-level academic benchmarks. | 5.0 | 0.00 | |||
| #291 | Baidu Qianfan: CoBuddy (free) saasBaidu Qianfan: CoBuddy (free): CoBuddy is a code generation model from Baidu, optimized for coding tasks and AI Agent workflows. It features high inference throughput and low end-to-end latency, with native support for tool... | 5.0 | 0.00 | |||
| #303 | inclusionAI: Ring-2.6-1T (free) saasinclusionAI: Ring-2.6-1T (free): Ring-2.6-1T is a 1T-parameter-scale thinking model with 63B active parameters, built for real-world agent workflows that require both strong capability and operational efficiency. It is optimized for coding agents, tool... | 5.0 | 0.00 | |||
| #316 | MoonshotAI: Kimi K2.6 (free) saasMoonshotAI: Kimi K2.6 (free): Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and... | 5.0 | 0.00 | |||
| #318 | Morph: Morph V3 Fast saasMorph: Morph V3 Fast: Morph's fastest apply model for code edits. ~10,500 tokens/sec with 96% accuracy for rapid code transformations. The model requires the prompt to be in the following format: <instruction>{instruction}</instruction> <code>{initial_code}</code> <update>{edit_snippet}</update>... | 5.0 | 0.00 | |||
| #319 | Morph: Morph V3 Large saasMorph: Morph V3 Large: Morph's high-accuracy apply model for complex code edits. ~4,500 tokens/sec with 98% accuracy for precise code transformations. The model requires the prompt to be in the following format: <instruction>{instruction}</instruction> <code>{initial_code}</code>... | 5.0 | 0.00 | |||
| #329 | NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 saasNVIDIA: Llama 3.3 Nemotron Super 49B V1.5: Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta???s Llama-3.3-70B-Instruct with a 128K context. It???s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and... | 5.0 | 0.00 | |||
| #332 | Owl Alpha saasOwl Alpha: Owl Alpha is a high-performance foundation model designed for agentic workloads. Natively supports tool use, and long-context tasks, with strong performance in code generation, automated workflows, and complex instruction execution.... | 5.0 | 0.00 | |||
| #333 | Pareto Code Router saasPareto Code Router: The Pareto Router is a way to have OpenRouter always pick a strong coding model for your needs without committing to a specific one. You express a single `min_coding_score` preference... | 5.0 | 0.00 | |||
| #335 | Poolside: Laguna M.1 (free) saasPoolside: Laguna M.1 (free): Laguna M.1 is the flagship coding agent model from [Poolside](https://poolside.ai), optimized for complex software engineering tasks. Designed for agentic coding workflows, it supports tool calling and reasoning, with a 128K... | 5.0 | 0.00 | |||
| #336 | Poolside: Laguna XS.2 (free) saasPoolside: Laguna XS.2 (free): Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://poolside.ai), their efficient coding agent series. It combines tool calling and reasoning capabilities with a compact footprint, offering... | 5.0 | 0.00 | |||
| #357 | Qwen: Qwen3.6 Max Preview saasQwen: Qwen3.6 Max Preview: Qwen3.6-Max-Preview is a proprietary frontier model from Alibaba Cloud built on a sparse mixture-of-experts architecture with approximately 1 trillion total parameters. It is optimized for agentic coding, tool use, and... | 5.0 | 0.00 | |||
| #358 | Qwen: Qwen3.7 Max saasQwen: Qwen3.7 Max: Qwen3.7-Max is the flagship model in Alibaba's Qwen3.7 series. It supports text input and output and is designed for agent-centric workloads, with particular strengths in coding, office and productivity tasks,... | 5.0 | 0.00 | |||
| #361 | Qwen: Qwen3 Coder 480B A35B saasQwen: Qwen3 Coder 480B A35B: Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over... | 5.0 | 0.00 | |||
| #362 | Qwen: Qwen3 Coder 480B A35B (free) saasQwen: Qwen3 Coder 480B A35B (free): Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over... | 5.0 | 0.00 | |||
| #363 | Qwen: Qwen3 Coder Flash saasQwen: Qwen3 Coder Flash: Qwen3 Coder Flash is Alibaba's fast and cost efficient version of their proprietary Qwen3 Coder Plus. It is a powerful coding agent model specializing in autonomous programming via tool calling... | 5.0 | 0.00 | |||
| #364 | Qwen: Qwen3 Next 80B A3B Instruct saasQwen: Qwen3 Next 80B A3B Instruct: Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without ???thinking??? traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual... | 5.0 | 0.00 | |||
| #365 | Qwen: Qwen3 Next 80B A3B Instruct (free) saasQwen: Qwen3 Next 80B A3B Instruct (free): Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without ???thinking??? traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual... | 5.0 | 0.00 | |||
| #366 | Qwen: Qwen3 Next 80B A3B Thinking saasQwen: Qwen3 Next 80B A3B Thinking: Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured ???thinking??? traces by default. It???s designed for hard multi-step problems; math proofs, code synthesis/debugging, logic, and agentic... | 5.0 | 0.00 | |||
| #374 | Qwen2.5 72B Instruct saasQwen2.5 72B Instruct: Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and... | 5.0 | 0.00 | |||
| #375 | Qwen2.5 Coder 32B Instruct saasQwen2.5 Coder 32B Instruct: Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). Qwen2.5-Coder brings the following improvements upon CodeQwen1.5: - Significantly improvements in **code generation**, **code reasoning**... | 5.0 | 0.00 |
Browse all sectors at /sectors.