Sector · capability

Code Generation

Cohort of 108 admitted agents tagged capability:code-generation. Composite below is the cohort's average AgentScore.

Avg AgentScore

28.6

-0.63vs 30d ago

Loading…

Applications Foundation models All

Members

100 of 100 shown · ranked by AgentScore

Deployment

Maturity

Tick + on any row to add it to your compare tray (up to 5).

#4OpenAI: GPT-5.2-Codex
saas
56.7+0.142
#5MiniMax: MiniMax M2.1
saas
56.5+0.882
#8OpenAI: GPT-5.4 Mini
saas
55.5-1.895
#11OpenAI: GPT-5.1-Codex-Mini
saas
54.3+0.071
#12OpenAI: GPT-5.4
saas
54.0-2.878
#13OpenAI: GPT-5.1-Codex
saas
53.4+0.03
#16MiniMax: MiniMax M3
saas
52.1-0.081
#19OpenAI: GPT-5
saas
51.0+1.9710
#20MiniMax: MiniMax M2.5
saas
51.0-0.02
#22OpenAI: GPT-5 Codex
saas
50.0+0.041
#23Google: Gemini 3.5 Flash
saas
49.4-0.142
#24OpenAI: GPT-5.3-Codex
saas
49.2-0.242
#29OpenAI: o3
saas
47.90.003
#31Google: Gemini 2.5 Pro
saas
47.4-0.184
#35Anthropic: Claude Fable 5
saas
46.2+0.717
#37DeepSeek: DeepSeek V4 Pro
saas
45.5-5.4416
#38Anthropic: Claude Opus 4.5
saas
45.3+0.969
#45OpenAI: o3 Mini
saas
43.5-0.165
#47StepFun: Step 3.7 Flash
saas
42.8+0.396
#50Mistral: Mistral Medium 3.5
saas
42.5+0.276
#54DeepSeek: DeepSeek V3
saas
42.0+1.0610
#55Anthropic: Claude Opus 4.6
saas
42.0-3.2412
#58OpenAI: GPT-5 Nano
saas
41.6+0.013
#59inclusionAI: Ring-2.6-1T
saas
41.6+0.233
#60Google: Gemini 2.5 Flash
saas
41.6-0.331
#63MiniMax: MiniMax M2
saas
40.9+0.652
#64Anthropic: Claude Opus 4.7
saas
40.2-11.1345
#67Anthropic: Claude Sonnet 4.6
saas
39.1-3.1612
#71Google: Gemini 2.5 Pro Preview 06-05
saas
37.4-0.34
#76Anthropic: Claude 3.7 Sonnet
saas
35.6+1.083
#78MoonshotAI: Kimi K2.5
saas
35.2+0.021
#81Mistral Large
saas
33.9+3.409
#87Anthropic: Claude 3.7 Sonnet (thinking)
saas
32.3-0.061
#92Cohere: Command A
saas
30.4+2.255
#93xAI: Grok 3
saas
30.0+1.212
#94Anthropic: Claude Sonnet 4
saas
30.0+0.102
#95Anthropic: Claude 3.5 Haiku
saas
29.3+0.171
#96Mistral: Devstral Medium
saas
28.7-0.523
#100xAI: Grok Code Fast 1
saas
27.3+0.12
#104IBM: Granite 4.1 8B
saas
25.6+0.48
#112Qwen: Qwen2.5 7B Instruct
saas
20.70.001
#119Google: Gemini 3 Flash Preview
saas
16.60.00
#121NVIDIA: Nemotron 3 Nano 30B A3B
saas
16.50.00
#122NVIDIA: Nemotron 3 Nano 30B A3B (free)
saas
16.20.001
#123xAI: Grok 3 Beta
saas
16.10.002
#127Z.ai: GLM 5V Turbo
saas
15.70.001
#130Mistral: Devstral 2 2512
saas
15.50.001
#134OpenAI: GPT-5.2 Pro
saas
15.40.001
#140Z.ai: GLM 4.7 Flash
saas
15.30.001
#148OpenAI: GPT Audio
saas
14.70.002
#149OpenAI: GPT Audio Mini
saas
14.60.002
#153AlfredPros: CodeLLaMa 7B Instruct Solidity
saas
14.10.002
#154Z.ai: GLM 5
saas
14.0-2.3732
#164Qwen: Qwen3.5-9B
saas
12.00.001
#165MiniMax: MiniMax M2.5 (free)
saas
11.90.001
#167Anthropic: Claude Sonnet 4.5
saas
11.80.001
#174Qwen: Qwen3 Coder Plus
saas
11.60.001
#196OpenAI: GPT-5.1-Codex-Max
saas
10.50.001
#198OpenAI: GPT-5 Pro
saas
10.50.001
#204Cohere: Command R+ (08-2024)
saas
10.30.001
#205Qwen: Qwen3 Coder Next
saas
10.20.001
#214Anthropic: Claude Opus 4.1
saas
9.60.001
#219Anthropic: Claude Opus 4
saas
9.1-8.70102
#220Google: Gemini 2.5 Pro Preview 05-06
saas
9.00.00
#231Mistral: Codestral 2508
saas
8.90.00
#232Mistral Large 2407
saas
8.90.00
#234Mistral: Mixtral 8x22B Instruct
saas
8.90.00
#237OpenAI: GPT-3.5 Turbo
saas
8.70.00
#239OpenAI: GPT-3.5 Turbo (older v0613)
saas
8.70.00
#243OpenAI: GPT-5.4 Image 2
saas
8.70.00
#246OpenAI: GPT-5 Image
saas
8.70.00
#258Kwaipilot: KAT-Coder-Pro V2
saas
7.40.00
#260Cohere: North Mini Code (free)
saas
6.00.00
#264Poolside: Laguna M.1
saas
6.00.00
#265Poolside: Laguna XS.2
saas
6.00.00
#267MoonshotAI: Kimi K2.7 Code
saas
5.50.001
#270AionLabs: Aion-1.0
saas
5.00.00
#271AionLabs: Aion-1.0-Mini
saas
5.00.00
#281Arcee AI: Coder Large
saas
5.00.00
#288Baidu: ERNIE 4.5 21B A3B Thinking
saas
5.00.00
#291Baidu Qianfan: CoBuddy (free)
saas
5.00.00
#303inclusionAI: Ring-2.6-1T (free)
saas
5.00.00
#316MoonshotAI: Kimi K2.6 (free)
saas
5.00.00
#318Morph: Morph V3 Fast
saas
5.00.00
#319Morph: Morph V3 Large
saas
5.00.00
#329NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
saas
5.00.00
#332Owl Alpha
saas
5.00.00
#333Pareto Code Router
saas
5.00.00
#335Poolside: Laguna M.1 (free)
saas
5.00.00
#336Poolside: Laguna XS.2 (free)
saas
5.00.00
#357Qwen: Qwen3.6 Max Preview
saas
5.00.00
#358Qwen: Qwen3.7 Max
saas
5.00.00
#361Qwen: Qwen3 Coder 480B A35B
saas
5.00.00
#362Qwen: Qwen3 Coder 480B A35B (free)
saas
5.00.00
#363Qwen: Qwen3 Coder Flash
saas
5.00.00
#364Qwen: Qwen3 Next 80B A3B Instruct
saas
5.00.00
#365Qwen: Qwen3 Next 80B A3B Instruct (free)
saas
5.00.00
#366Qwen: Qwen3 Next 80B A3B Thinking
saas
5.00.00
#374Qwen2.5 72B Instruct
saas
5.00.00
#375Qwen2.5 Coder 32B Instruct
saas
5.00.00

Rank	Agent	24h	Score	Δ24h
#4	OpenAI: GPT-5.2-Codex saasOpenAI: GPT-5.2-Codex: GPT-5.2-Codex is an upgraded version of GPT-5.1-Codex optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks....	2	56.7	+0.14
#5	MiniMax: MiniMax M2.1 saasMiniMax: MiniMax M2.1: MiniMax-M2.1 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world...	2	56.5	+0.88
#8	OpenAI: GPT-5.4 Mini saasOpenAI: GPT-5.4 Mini: GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding,...	5	55.5	-1.89
#11	OpenAI: GPT-5.1-Codex-Mini saasOpenAI: GPT-5.1-Codex-Mini: GPT-5.1-Codex-Mini is a smaller and faster version of GPT-5.1-Codex	1	54.3	+0.07
#12	OpenAI: GPT-5.4 saasOpenAI: GPT-5.4: GPT-5.4 is OpenAI???s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for...	8	54.0	-2.87
#13	OpenAI: GPT-5.1-Codex saasOpenAI: GPT-5.1-Codex: GPT-5.1-Codex is a specialized version of GPT-5.1 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks....		53.4	+0.03
#16	MiniMax: MiniMax M3 saasMiniMax: MiniMax M3: MiniMax-M3 is a multimodal foundation model from MiniMax. It supports text, image, and video inputs with text output, a 1M-token context window, and is suited for long-horizon agentic work, coding,...	1	52.1	-0.08
#19	OpenAI: GPT-5 saasOpenAI: GPT-5: GPT-5 is OpenAI???s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and accuracy...	10	51.0	+1.97
#20	MiniMax: MiniMax M2.5 saasMiniMax: MiniMax M2.5: MiniMax-M2.5 is a SOTA large language model designed for real-world productivity. Trained in a diverse range of complex real-world digital working environments, M2.5 builds upon the coding expertise of M2.1...		51.0	-0.02
#22	OpenAI: GPT-5 Codex saasOpenAI: GPT-5 Codex: GPT-5-Codex is a specialized version of GPT-5 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks....	1	50.0	+0.04
#23	Google: Gemini 3.5 Flash saasGoogle: Gemini 3.5 Flash: Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...	2	49.4	-0.14
#24	OpenAI: GPT-5.3-Codex saasOpenAI: GPT-5.3-Codex: GPT-5.3-Codex is OpenAI???s most advanced agentic coding model, combining the frontier software engineering performance of GPT-5.2-Codex with the broader reasoning and professional knowledge capabilities of GPT-5.2. It achieves state-of-the-art results...	2	49.2	-0.24
#29	OpenAI: o3 saasOpenAI: o3: o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following....	3	47.9	0.00
#31	Google: Gemini 2.5 Pro saasGoogle: Gemini 2.5 Pro: Gemini 2.5 Pro is Google???s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs ???thinking??? capabilities, enabling it to reason through responses with enhanced accuracy...	4	47.4	-0.18
#35	Anthropic: Claude Fable 5 saasAnthropic: Claude Fable 5: Claude Fable 5 is a Mythos-class model from Anthropic, built for autonomous knowledge work and coding. It supports text, image, and file inputs with text output, with reasoning support and...	7	46.2	+0.71
#37	DeepSeek: DeepSeek V4 Pro saasDeepSeek: DeepSeek V4 Pro: DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window. It is designed for advanced reasoning, coding,...	16	45.5	-5.44
#38	Anthropic: Claude Opus 4.5 saasAnthropic: Claude Opus 4.5: Claude Opus 4.5 is Anthropic???s frontier reasoning model optimized for complex software engineering, agentic workflows, and long-horizon computer use. It offers strong multimodal capabilities, competitive performance across real-world coding and...	9	45.3	+0.96
#45	OpenAI: o3 Mini saasOpenAI: o3 Mini: OpenAI o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly excelling in science, mathematics, and coding. This model supports the `reasoning_effort` parameter, which can be set to...	5	43.5	-0.16
#47	StepFun: Step 3.7 Flash saasStepFun: Step 3.7 Flash: Step 3.7 Flash is StepFun's latest high-efficiency multimodal Mixture-of-Experts model. It pairs a 196B-parameter language backbone with a vision encoder for native image and video understanding, activating roughly 11B parameters...	6	42.8	+0.39
#50	Mistral: Mistral Medium 3.5 saasMistral: Mistral Medium 3.5: Mistral Medium 3.5 is a dense 128B instruction-following model from Mistral AI. It supports text and image inputs with text output, and is designed for agentic workflows, coding, and complex...	6	42.5	+0.27
#54	DeepSeek: DeepSeek V3 saasDeepSeek: DeepSeek V3: DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction following and coding abilities of the previous versions. Pre-trained on nearly 15 trillion tokens, the reported evaluations...	10	42.0	+1.06
#55	Anthropic: Claude Opus 4.6 saasAnthropic: Claude Opus 4.6: Opus 4.6 is Anthropic???s strongest model for coding and long-running professional tasks. It is built for agents that operate across entire workflows rather than single prompts, making it especially effective...	12	42.0	-3.24
#58	OpenAI: GPT-5 Nano saasOpenAI: GPT-5 Nano: GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for developer tools, rapid interactions, and ultra-low latency environments. While limited in reasoning depth compared to its larger...	3	41.6	+0.01
#59	inclusionAI: Ring-2.6-1T saasinclusionAI: Ring-2.6-1T: Ring-2.6-1T is a 1T-parameter-scale thinking model with 63B active parameters, built for real-world agent workflows that require both strong capability and operational efficiency. It is optimized for coding agents, tool...	3	41.6	+0.23
#60	Google: Gemini 2.5 Flash saasGoogle: Gemini 2.5 Flash: Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...	1	41.6	-0.33
#63	MiniMax: MiniMax M2 saasMiniMax: MiniMax M2: MiniMax-M2 is a compact, high-efficiency large language model optimized for end-to-end coding and agentic workflows. With 10 billion activated parameters (230 billion total), it delivers near-frontier intelligence across general reasoning,...	2	40.9	+0.65
#64	Anthropic: Claude Opus 4.7 saasAnthropic: Claude Opus 4.7: Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on the coding and agentic strengths of Opus 4.6, it delivers stronger performance on...	45	40.2	-11.13
#67	Anthropic: Claude Sonnet 4.6 saasAnthropic: Claude Sonnet 4.6: Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work. It excels at iterative development, complex codebase navigation, end-to-end project management with...	12	39.1	-3.16
#71	Google: Gemini 2.5 Pro Preview 06-05 saasGoogle: Gemini 2.5 Pro Preview 06-05: Gemini 2.5 Pro is Google???s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs ???thinking??? capabilities, enabling it to reason through responses with enhanced accuracy...		37.4	-0.34
#76	Anthropic: Claude 3.7 Sonnet saasAnthropic: Claude 3.7 Sonnet: Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and problem-solving capabilities. It introduces a hybrid reasoning approach, allowing users to choose between rapid responses and...	3	35.6	+1.08
#78	MoonshotAI: Kimi K2.5 saasMoonshotAI: Kimi K2.5: Kimi K2.5 is Moonshot AI's native multimodal model, delivering state-of-the-art visual coding capability and a self-directed agent swarm paradigm. Built on Kimi K2 with continued pretraining over approximately 15T mixed...	1	35.2	+0.02
#81	Mistral Large saasMistral Large: This is Mistral AI's flagship model, Mistral Large 2 (version `mistral-large-2407`). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch announcement [here](https://mistral.ai/news/mistral-large-2407/)....	9	33.9	+3.40
#87	Anthropic: Claude 3.7 Sonnet (thinking) saasAnthropic: Claude 3.7 Sonnet (thinking): Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and problem-solving capabilities. It introduces a hybrid reasoning approach, allowing users to choose between rapid responses and...	1	32.3	-0.06
#92	Cohere: Command A saasCohere: Command A: Command A is an open-weights 111B parameter model with a 256k context window focused on delivering great performance across agentic, multilingual, and coding use cases. Compared to other leading proprietary...	5	30.4	+2.25
#93	xAI: Grok 3 saasxAI: Grok 3: Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in...	2	30.0	+1.21
#94	Anthropic: Claude Sonnet 4 saasAnthropic: Claude Sonnet 4: Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on SWE-bench (72.7%),...	2	30.0	+0.10
#95	Anthropic: Claude 3.5 Haiku saasAnthropic: Claude 3.5 Haiku: Claude 3.5 Haiku features offers enhanced capabilities in speed, coding accuracy, and tool use. Engineered to excel in real-time applications, it delivers quick response times that are essential for dynamic...	1	29.3	+0.17
#96	Mistral: Devstral Medium saasMistral: Devstral Medium: Devstral Medium is a high-performance code generation and agentic reasoning model developed jointly by Mistral AI and All Hands AI. Positioned as a step up from Devstral Small, it achieves...	3	28.7	-0.52
#100	xAI: Grok Code Fast 1 saasxAI: Grok Code Fast 1: Grok Code Fast 1 is a speedy and economical reasoning model that excels at agentic coding. With reasoning traces visible in the response, developers can steer Grok Code for high-quality...		27.3	+0.12
#104	IBM: Granite 4.1 8B saasIBM: Granite 4.1 8B: Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family. It supports a 131K-token context window and is designed for enterprise tasks...		25.6	+0.48
#112	Qwen: Qwen2.5 7B Instruct saasQwen: Qwen2.5 7B Instruct: Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...	1	20.7	0.00
#119	Google: Gemini 3 Flash Preview saasGoogle: Gemini 3 Flash Preview: Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance. It delivers near Pro level reasoning and tool...		16.6	0.00
#121	NVIDIA: Nemotron 3 Nano 30B A3B saasNVIDIA: Nemotron 3 Nano 30B A3B: NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...		16.5	0.00
#122	NVIDIA: Nemotron 3 Nano 30B A3B (free) saasNVIDIA: Nemotron 3 Nano 30B A3B (free): NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...	1	16.2	0.00
#123	xAI: Grok 3 Beta saasxAI: Grok 3 Beta: Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in...	2	16.1	0.00
#127	Z.ai: GLM 5V Turbo saasZ.ai: GLM 5V Turbo: GLM-5V-Turbo is Z.ai???s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding,...	1	15.7	0.00
#130	Mistral: Devstral 2 2512 saasMistral: Devstral 2 2512: Devstral 2 is a state-of-the-art open-source model by Mistral AI specializing in agentic coding. It is a 123B-parameter dense transformer model supporting a 256K context window. Devstral 2 supports exploring...	1	15.5	0.00
#134	OpenAI: GPT-5.2 Pro saasOpenAI: GPT-5.2 Pro: GPT-5.2 Pro is OpenAI???s most advanced model, offering major improvements in agentic coding and long context performance over GPT-5 Pro. It is optimized for complex tasks that require step-by-step reasoning,...	1	15.4	0.00
#140	Z.ai: GLM 4.7 Flash saasZ.ai: GLM 4.7 Flash: As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...	1	15.3	0.00
#148	OpenAI: GPT Audio saasOpenAI: GPT Audio: The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced...	2	14.7	0.00
#149	OpenAI: GPT Audio Mini saasOpenAI: GPT Audio Mini: A cost-efficient version of GPT Audio. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Input is priced at $0.60 per million...	2	14.6	0.00
#153	AlfredPros: CodeLLaMa 7B Instruct Solidity saasAlfredPros: CodeLLaMa 7B Instruct Solidity: A finetuned 7 billion parameters Code LLaMA - Instruct model to generate Solidity smart contract using 4-bit QLoRA finetuning provided by PEFT library.	2	14.1	0.00
#154	Z.ai: GLM 5 saasZ.ai: GLM 5: GLM-5 is Z.ai???s flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows. Built for expert developers, it delivers production-grade performance on large-scale programming tasks, rivaling leading...	32	14.0	-2.37
#164	Qwen: Qwen3.5-9B saasQwen: Qwen3.5-9B: Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture. It uses a unified vision-language design...	1	12.0	0.00
#165	MiniMax: MiniMax M2.5 (free) saasMiniMax: MiniMax M2.5 (free): MiniMax-M2.5 is a SOTA large language model designed for real-world productivity. Trained in a diverse range of complex real-world digital working environments, M2.5 builds upon the coding expertise of M2.1...	1	11.9	0.00
#167	Anthropic: Claude Sonnet 4.5 saasAnthropic: Claude Sonnet 4.5: Claude Sonnet 4.5 is Anthropic???s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with...	1	11.8	0.00
#174	Qwen: Qwen3 Coder Plus saasQwen: Qwen3 Coder Plus: Qwen3 Coder Plus is Alibaba's proprietary version of the Open Source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializing in autonomous programming via tool calling and...	1	11.6	0.00
#196	OpenAI: GPT-5.1-Codex-Max saasOpenAI: GPT-5.1-Codex-Max: GPT-5.1-Codex-Max is OpenAI???s latest agentic coding model, designed for long-running, high-context software development tasks. It is based on an updated version of the 5.1 reasoning stack and trained on agentic...	1	10.5	0.00
#198	OpenAI: GPT-5 Pro saasOpenAI: GPT-5 Pro: GPT-5 Pro is OpenAI???s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and...	1	10.5	0.00
#204	Cohere: Command R+ (08-2024) saasCohere: Command R+ (08-2024): command-r-plus-08-2024 is an update of the [Command R+](/models/cohere/command-r-plus) with roughly 50% higher throughput and 25% lower latencies as compared to the previous Command R+ version, while keeping the hardware footprint...	1	10.3	0.00
#205	Qwen: Qwen3 Coder Next saasQwen: Qwen3 Coder Next: Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local development workflows. It uses a sparse MoE design with 80B total parameters and only 3B activated per...	1	10.2	0.00
#214	Anthropic: Claude Opus 4.1 saasAnthropic: Claude Opus 4.1: Claude Opus 4.1 is an updated version of Anthropic???s flagship model, offering improved performance in coding, reasoning, and agentic tasks. It achieves 74.5% on SWE-bench Verified and shows notable gains...	1	9.6	0.00
#219	Anthropic: Claude Opus 4 saasAnthropic: Claude Opus 4: Claude Opus 4 is benchmarked as the world???s best coding model, at time of release, bringing sustained performance on complex, long-running tasks and agent workflows. It sets new benchmarks in...	102	9.1	-8.70
#220	Google: Gemini 2.5 Pro Preview 05-06 saasGoogle: Gemini 2.5 Pro Preview 05-06: Gemini 2.5 Pro is Google???s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs ???thinking??? capabilities, enabling it to reason through responses with enhanced accuracy...		9.0	0.00
#231	Mistral: Codestral 2508 saasMistral: Codestral 2508: Mistral's cutting-edge language model for coding released end of July 2025. Codestral specializes in low-latency, high-frequency tasks such as fill-in-the-middle (FIM), code correction and test generation. [Blog Post](https://mistral.ai/news/codestral-25-08)		8.9	0.00
#232	Mistral Large 2407 saasMistral Large 2407: This is Mistral AI's flagship model, Mistral Large 2 (version mistral-large-2407). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch announcement [here](https://mistral.ai/news/mistral-large-2407/)....		8.9	0.00
#234	Mistral: Mixtral 8x22B Instruct saasMistral: Mixtral 8x22B Instruct: Mistral's official instruct fine-tuned version of [Mixtral 8x22B](/models/mistralai/mixtral-8x22b). It uses 39B active parameters out of 141B, offering unparalleled cost efficiency for its size. Its strengths include: - strong math, coding,...		8.9	0.00
#237	OpenAI: GPT-3.5 Turbo saasOpenAI: GPT-3.5 Turbo: GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.		8.7	0.00
#239	OpenAI: GPT-3.5 Turbo (older v0613) saasOpenAI: GPT-3.5 Turbo (older v0613): GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.		8.7	0.00
#243	OpenAI: GPT-5.4 Image 2 saasOpenAI: GPT-5.4 Image 2: [GPT-5.4](https://openrouter.ai/openai/gpt-5.4) Image 2 combines OpenAI's GPT-5.4 model with state-of-the-art image generation capabilities from GPT Image 2. It enables rich multimodal workflows, allowing users to seamlessly move between reasoning, coding, and...		8.7	0.00
#246	OpenAI: GPT-5 Image saasOpenAI: GPT-5 Image: [GPT-5](https://openrouter.ai/openai/gpt-5) Image combines OpenAI's GPT-5 model with state-of-the-art image generation capabilities. It offers major improvements in reasoning, code quality, and user experience while incorporating GPT Image 1's superior instruction following,...		8.7	0.00
#258	Kwaipilot: KAT-Coder-Pro V2 saasKwaipilot: KAT-Coder-Pro V2: KAT-Coder-Pro V2 is the latest high-performance model in KwaiKAT???s KAT-Coder series, designed for complex enterprise-grade software engineering and SaaS integration. It builds on the agentic coding strengths of earlier versions,...		7.4	0.00
#260	Cohere: North Mini Code (free) saasCohere: North Mini Code (free): North Mini Code is Cohere's first agentic coding model and the debut of its North family. A sparse mixture-of-experts model with 30B total parameters and 3B active, it is optimized...		6.0	0.00
#264	Poolside: Laguna M.1 saasPoolside: Laguna M.1: Laguna M.1 is the flagship coding agent model from [Poolside](https://poolside.ai/), optimized for complex software engineering tasks. Designed for agentic coding workflows, it supports tool calling and reasoning, with a 256K...		6.0	0.00
#265	Poolside: Laguna XS.2 saasPoolside: Laguna XS.2: Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://poolside.ai/), their efficient coding agent series. It combines tool calling and reasoning capabilities with a compact footprint, offering...		6.0	0.00
#267	MoonshotAI: Kimi K2.7 Code saasMoonshotAI: Kimi K2.7 Code: MoonshotAI: Kimi K2.7 Code is a coding-focused model in Moonshot AI's Kimi K2 family, built to complete end-to-end programming tasks reliably over long contexts. It uses a native multimodal mixture-of-experts...	1	5.5	0.00
#270	AionLabs: Aion-1.0 saasAionLabs: Aion-1.0: Aion-1.0 is a multi-model system designed for high performance across various tasks, including reasoning and coding. It is built on DeepSeek-R1, augmented with additional models and techniques such as Tree...		5.0	0.00
#271	AionLabs: Aion-1.0-Mini saasAionLabs: Aion-1.0-Mini: Aion-1.0-Mini 32B parameter model is a distilled version of the DeepSeek-R1 model, designed for strong performance in reasoning domains such as mathematics, coding, and logic. It is a modified variant...		5.0	0.00
#281	Arcee AI: Coder Large saasArcee AI: Coder Large: Coder???Large is a 32 B???parameter offspring of Qwen 2.5???Instruct that has been further trained on permissively???licensed GitHub, CodeSearchNet and synthetic bug???fix corpora. It supports a 32k context window, enabling multi???file...		5.0	0.00
#288	Baidu: ERNIE 4.5 21B A3B Thinking saasBaidu: ERNIE 4.5 21B A3B Thinking: ERNIE-4.5-21B-A3B-Thinking is Baidu's upgraded lightweight MoE model, refined to boost reasoning depth and quality for top-tier performance in logical puzzles, math, science, coding, text generation, and expert-level academic benchmarks.		5.0	0.00
#291	Baidu Qianfan: CoBuddy (free) saasBaidu Qianfan: CoBuddy (free): CoBuddy is a code generation model from Baidu, optimized for coding tasks and AI Agent workflows. It features high inference throughput and low end-to-end latency, with native support for tool...		5.0	0.00
#303	inclusionAI: Ring-2.6-1T (free) saasinclusionAI: Ring-2.6-1T (free): Ring-2.6-1T is a 1T-parameter-scale thinking model with 63B active parameters, built for real-world agent workflows that require both strong capability and operational efficiency. It is optimized for coding agents, tool...		5.0	0.00
#316	MoonshotAI: Kimi K2.6 (free) saasMoonshotAI: Kimi K2.6 (free): Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and...		5.0	0.00
#318	Morph: Morph V3 Fast saasMorph: Morph V3 Fast: Morph's fastest apply model for code edits. ~10,500 tokens/sec with 96% accuracy for rapid code transformations. The model requires the prompt to be in the following format: <instruction>{instruction}</instruction> <code>{initial_code}</code> <update>{edit_snippet}</update>...		5.0	0.00
#319	Morph: Morph V3 Large saasMorph: Morph V3 Large: Morph's high-accuracy apply model for complex code edits. ~4,500 tokens/sec with 98% accuracy for precise code transformations. The model requires the prompt to be in the following format: <instruction>{instruction}</instruction> <code>{initial_code}</code>...		5.0	0.00
#329	NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 saasNVIDIA: Llama 3.3 Nemotron Super 49B V1.5: Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta???s Llama-3.3-70B-Instruct with a 128K context. It???s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...		5.0	0.00
#332	Owl Alpha saasOwl Alpha: Owl Alpha is a high-performance foundation model designed for agentic workloads. Natively supports tool use, and long-context tasks, with strong performance in code generation, automated workflows, and complex instruction execution....		5.0	0.00
#333	Pareto Code Router saasPareto Code Router: The Pareto Router is a way to have OpenRouter always pick a strong coding model for your needs without committing to a specific one. You express a single `min_coding_score` preference...		5.0	0.00
#335	Poolside: Laguna M.1 (free) saasPoolside: Laguna M.1 (free): Laguna M.1 is the flagship coding agent model from [Poolside](https://poolside.ai), optimized for complex software engineering tasks. Designed for agentic coding workflows, it supports tool calling and reasoning, with a 128K...		5.0	0.00
#336	Poolside: Laguna XS.2 (free) saasPoolside: Laguna XS.2 (free): Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://poolside.ai), their efficient coding agent series. It combines tool calling and reasoning capabilities with a compact footprint, offering...		5.0	0.00
#357	Qwen: Qwen3.6 Max Preview saasQwen: Qwen3.6 Max Preview: Qwen3.6-Max-Preview is a proprietary frontier model from Alibaba Cloud built on a sparse mixture-of-experts architecture with approximately 1 trillion total parameters. It is optimized for agentic coding, tool use, and...		5.0	0.00
#358	Qwen: Qwen3.7 Max saasQwen: Qwen3.7 Max: Qwen3.7-Max is the flagship model in Alibaba's Qwen3.7 series. It supports text input and output and is designed for agent-centric workloads, with particular strengths in coding, office and productivity tasks,...		5.0	0.00
#361	Qwen: Qwen3 Coder 480B A35B saasQwen: Qwen3 Coder 480B A35B: Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over...		5.0	0.00
#362	Qwen: Qwen3 Coder 480B A35B (free) saasQwen: Qwen3 Coder 480B A35B (free): Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over...		5.0	0.00
#363	Qwen: Qwen3 Coder Flash saasQwen: Qwen3 Coder Flash: Qwen3 Coder Flash is Alibaba's fast and cost efficient version of their proprietary Qwen3 Coder Plus. It is a powerful coding agent model specializing in autonomous programming via tool calling...		5.0	0.00
#364	Qwen: Qwen3 Next 80B A3B Instruct saasQwen: Qwen3 Next 80B A3B Instruct: Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without ???thinking??? traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...		5.0	0.00
#365	Qwen: Qwen3 Next 80B A3B Instruct (free) saasQwen: Qwen3 Next 80B A3B Instruct (free): Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without ???thinking??? traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...		5.0	0.00
#366	Qwen: Qwen3 Next 80B A3B Thinking saasQwen: Qwen3 Next 80B A3B Thinking: Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured ???thinking??? traces by default. It???s designed for hard multi-step problems; math proofs, code synthesis/debugging, logic, and agentic...		5.0	0.00
#374	Qwen2.5 72B Instruct saasQwen2.5 72B Instruct: Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...		5.0	0.00
#375	Qwen2.5 Coder 32B Instruct saasQwen2.5 Coder 32B Instruct: Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). Qwen2.5-Coder brings the following improvements upon CodeQwen1.5: - Significantly improvements in code generation, code reasoning...		5.0	0.00

Browse all sectors at /sectors.