Cohort of 206 admitted agents tagged capability:code-generation. Composite below is the cohort's average AgentScore.
| Rank | Agent | 24h | Score | Δ24h | Cmp | Watch |
|---|---|---|---|---|---|---|
| #1 | scholar-search-mcp mcp-serverscholar-search-mcp: An MCP server for academic paper search that integrates with AI assistants (e.g., Claude Code, Cursor), enabling them to search and retrieve academic paper metadata. | NEW | 75.0 | — | ||
| #1 | NVIDIA: Nemotron 3 Nano 30B A3B saasNVIDIA: Nemotron 3 Nano 30B A3B: NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully... | 10 | 66.4 | +35.37 | ||
| #3 | Google: Gemini 3 Flash Preview saasGoogle: Gemini 3 Flash Preview: Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance. It delivers near Pro level reasoning and tool... | 98 | 64.1 | +42.26 | ||
| #4 | DeepSeek: DeepSeek V4 Pro saasDeepSeek: DeepSeek V4 Pro: DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window. It is designed for advanced reasoning, coding,... | 2 | 62.6 | +26.18 | ||
| #3 | open-design apache-2.0ide-pluginopen-design: Local-first, open-source alternative to Anthropic's Claude Design. 19 Skills, 71 brand-grade Design Systems; generate web, desktop, mobile prototypes, slides, images, videos, HyperFrames; sandboxed preview, HTML/PDF/PPTX/MP4 export; runs on Claude Code / Codex... | 7 | 62.3 | +13.63 | ||
| #8 | Anthropic: Claude Opus 4.7 saasAnthropic: Claude Opus 4.7: Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on the coding and agentic strengths of Opus 4.6, it delivers stronger performance on... | 42 | 60.8 | +38.94 | ||
| #9 | Anthropic: Claude Sonnet 4.6 saasAnthropic: Claude Sonnet 4.6: Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work. It excels at iterative development, complex codebase navigation, end-to-end project management with... | 45 | 60.8 | +38.94 | ||
| #4 | claude-mem apache-2.0claude-mem: A Claude Code plugin that automatically captures everything Claude does during your coding sessions, compresses it with AI (using Claude's agent-sdk), and injects relevant context back into future sessions. | NEW | 60.1 | — | ||
| #17 | Google: Gemini 2.5 Pro saasGoogle: Gemini 2.5 Pro: Gemini 2.5 Pro is Google's state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs "thinking" capabilities, enabling it to reason through responses with enhanced accuracy... | 78 | 59.3 | +37.46 | ||
| #28 | Mistral Large 2407 saasMistral Large 2407: This is Mistral AI's flagship model, Mistral Large 2 (version mistral-large-2407). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch announcement [here](https://mistral.ai/news/mistral-large-2407/).... | 128 | 58.9 | +37.07 | ||
| #27 | Mistral Large saasMistral Large: This is Mistral AI's flagship model, Mistral Large 2 (version `mistral-large-2407`). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch announcement [here](https://mistral.ai/news/mistral-large-2407/).... | 12 | 58.9 | +28.35 | ||
| #38 | OpenAI: o3 Mini saasOpenAI: o3 Mini: OpenAI o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly excelling in science, mathematics, and coding. This model supports the `reasoning_effort` parameter, which can be set to... | 218 | 58.5 | +36.68 | ||
| #40 | Cohere: Command R+ (08-2024) saasCohere: Command R+ (08-2024): command-r-plus-08-2024 is an update of the [Command R+](/models/cohere/command-r-plus) with roughly 50% higher throughput and 25% lower latencies as compared to the previous Command R+ version, while keeping the hardware footprint... | 36 | 58.2 | +36.38 | ||
| #6 | agent apache-2.0agent: Ship your code, on autopilot. An open source agent that lives on your machines 24/7 and keeps your apps running. | 71 | 58.0 | +23.04 | ||
| #8 | deer-flow mitlibrarydeer-flow: An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of tasks that could take minutes to hours. | 16 | 56.6 | +9.75 | ||
| #10 | oh-my-openagent ide-pluginoh-my-openagent: omo; the best agent harness - previously oh-my-opencode. | 21 | 56.3 | +10.21 | ||
| #11 | praisonai mitlibrarypraisonai: PraisonAI is an AI Agents Framework with Self Reflection. PraisonAI application combines PraisonAI Agents, AutoGen, and CrewAI into a low-code solution for building and managing multi-agent LLM systems, focusing on simplicity, customisation, and efficient human-agent collabora... | 18 | 56.2 | +9.92 | ||
| #13 | opencode mitopencode: The open source coding agent. | NEW | 55.9 | — | ||
| #16 | dify mcp-serverdify: Production-ready platform for agentic workflow development. | 7 | 54.7 | +5.97 | ||
| #17 | nanobot mitnanobot: "nanobot: The Ultra-Lightweight Personal AI Agent". | 13 | 54.7 | +8.51 | ||
| #21 | shannon agplide-pluginshannon: Shannon Lite is an autonomous, white-box AI pentester for web applications and APIs. It analyzes your source code, identifies attack vectors, and executes real exploits to prove vulnerabilities before they reach production. | NEW | 54.1 | — | ||
| #24 | nocobase saasnocobase: NocoBase is an open-source AI + no-code platform for building business systems fast. Instead of generating everything from scratch, AI works on top of production-proven infrastructure and a WYSIWYG no-code interface, so you get both speed and reliability. | NEW | 53.7 | — | ||
| #25 | career-ops mitclicareer-ops: AI-powered job search system built on Claude Code. 14 skill modes, Go dashboard, PDF generation, batch processing. | 14 | 53.4 | +4.92 | ||
| #26 | daytona agpldaytona: Daytona is a Secure and Elastic Infrastructure for Running AI-Generated Code. | NEW | 53.3 | — | ||
| #27 | cline apache-2.0ide-plugincline: Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way. | NEW | 53.1 | — | ||
| #28 | claude-code claude-code: Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflows - all through natural language commands. | 25 | 52.9 | +0.39 | ||
| #32 | openclaude cliopenclaude: runs anywhere. uses anything | NEW | 51.9 | — | ||
| #38 | hermes-agent mithermes-agent: The agent that grows with you. | 25 | 49.7 | +1.22 | ||
| #39 | Understand-Anything mitcliUnderstand-Anything: Graphs that teach > graphs that impress. Turn any code, or knowledge base (Karpathy LLM wiki), into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini CLI, and more. | 8 | 49.6 | +8.55 | ||
| #40 | Auto-claude-code-research-in-sleep mitmcp-serverAuto-claude-code-research-in-sleep: ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works with Claude Code, Codex, OpenClaw, or any LLM agent. | NEW | 49.6 | — | ||
| #64 | Qwen: Qwen3.6 Max Preview saasQwen: Qwen3.6 Max Preview: Qwen3.6-Max-Preview is a proprietary frontier model from Alibaba Cloud built on a sparse mixture-of-experts architecture with approximately 1 trillion total parameters. It is optimized for agentic coding, tool use, and... | 234 | 49.4 | +27.59 | ||
| #65 | Qwen: Qwen3 Next 80B A3B Instruct (free) saasQwen: Qwen3 Next 80B A3B Instruct (free): Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without "thinking" traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual... | 245 | 49.4 | +27.59 | ||
| #70 | Qwen2.5 Coder 32B Instruct saasQwen2.5 Coder 32B Instruct: Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). Qwen2.5-Coder brings the following improvements upon CodeQwen1.5: - Significant improvements in **code generation**, **code reasoning**... | 53 | 49.4 | +21.22 | ||
| #54 | MoonshotAI: Kimi K2.6 saasMoonshotAI: Kimi K2.6: Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and... | 124 | 49.4 | +27.59 | ||
| #66 | Qwen: Qwen3 Next 80B A3B Thinking saasQwen: Qwen3 Next 80B A3B Thinking: Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured "thinking" traces by default. It's designed for hard multi-step problems; math proofs, code synthesis/debugging, logic, and agentic... | 245 | 49.4 | +27.59 | ||
| #69 | Qwen2.5 72B Instruct saasQwen2.5 72B Instruct: Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and... | 53 | 49.4 | +18.94 | ||
| #57 | NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 saasNVIDIA: Llama 3.3 Nemotron Super 49B V1.5: Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta's Llama-3.3-70B-Instruct with a 128K context. It's post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and... | 134 | 49.4 | +27.59 | ||
| #42 | awesome-design-md mitawesome-design-md: A collection of DESIGN.md files inspired by popular brand design systems. Drop one into your project and let coding agents generate a matching UI. | NEW | 49.2 | — | ||
| #45 | learn-claude-code mitlearn-claude-code: Bash is all you need - A nano claude code-like "agent harness", built from 0 to 1. | 9 | 48.6 | +4.08 | ||
| #79 | NVIDIA: Nemotron 3 Nano 30B A3B (free) saasNVIDIA: Nemotron 3 Nano 30B A3B (free): NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully... | 113 | 47.7 | +25.85 | ||
| #50 | caveman mitcaveman: why use many token when few token do trick - Claude Code skill that cuts 65% of tokens by talking like caveman. | 36 | 47.4 | -0.51 | ||
| #51 | AionUi apache-2.0cliAionUi: Free, local, open-source 24/7 Cowork app and OpenClaw for Gemini CLI, Claude Code, Codex, OpenCode, Qwen Code, Goose CLI, Auggie, and more. Star if you like it! | 12 | 47.2 | +3.10 | ||
| #58 | notebooklm-py mitclinotebooklm-py: Unofficial Python API and agentic skill for Google NotebookLM. Full programmatic access to NotebookLM's features—including capabilities the web UI doesn't expose—via Python, CLI, and AI agents like Claude Code, Codex, and OpenClaw. | NEW | 46.7 | — | ||
| #82 | OpenAI: GPT-5 saasOpenAI: GPT-5: GPT-5 is OpenAI's most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and accuracy... | 81 | 46.5 | -2.87 | ||
| #60 | activepieces mcp-serveractivepieces: AI Agents & MCPs & AI Workflow Automation (~400 MCP servers for AI agents); AI Automation / AI Agent with MCPs; AI Workflows & AI Agents; MCPs for AI Agents. | NEW | 46.4 | — | ||
| #61 | system-prompts-and-models-of-ai-tools gplide-pluginsystem-prompts-and-models-of-ai-tools: FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI, VSCode Agent, Warp.dev, Windsurf, Xcode, Z.ai Code, Dia & v0. (And other Open ... | 42 | 46.3 | -0.95 | ||
| #62 | E2B apache-2.0E2B: Open-source, secure environment with real-world tools for enterprise-grade agents. | 16 | 46.2 | +5.12 | ||
| #83 | OpenAI: GPT-5.2-Codex saasOpenAI: GPT-5.2-Codex: GPT-5.2-Codex is an upgraded version of GPT-5.1-Codex optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks.... | 144 | 45.9 | +24.07 | ||
| #84 | Mistral: Devstral 2 2512 saasMistral: Devstral 2 2512: Devstral 2 is a state-of-the-art open-source model by Mistral AI specializing in agentic coding. It is a 123B-parameter dense transformer model supporting a 256K context window. Devstral 2 supports exploring... | 69 | 45.7 | +23.92 | ||
| #86 | OpenAI: GPT-5.2 Pro saasOpenAI: GPT-5.2 Pro: GPT-5.2 Pro is OpenAI's most advanced model, offering major improvements in agentic coding and long context performance over GPT-5 Pro. It is optimized for complex tasks that require step-by-step reasoning,... | 142 | 45.5 | +23.67 | ||
| #69 | cherry-studio agplclicherry-studio: AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs | 37 | 45.0 | -0.71 | ||
| #70 | pullmd agplmcp-serverpullmd: Self-hosted URL-to-Markdown service for humans and AI agents. PWA + REST + MCP + Claude Code skill, with Reddit support and refreshable share links. | 39 | 44.6 | +17.64 | ||
| #74 | everything-claude-code mitmcp-servereverything-claude-code: The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond. | 67 | 43.9 | -5.33 | ||
| #77 | OpenMonoAgent.ai OpenMonoAgent.ai: AI shouldn't have a meter. Unlimited tokens. Forever. Your machine. Your agent. Use it from anywhere. Terminal-native coding agent powered by local LLMs - 100% open source, free forever, and installed with a single command. Proudly built on C#/.NET, because AI tooling should b... | 11 | 43.7 | +12.03 | ||
| #80 | awesome-claude-skills mcp-serverawesome-claude-skills: A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows. | 43 | 43.4 | -0.93 | ||
| #84 | harness mitmcp-serverharness: AI-driven user testing for iOS Simulator, macOS apps, and web apps. Write a goal in plain language; an LLM agent drives the UI and reports friction. macOS 14+, Swift 6. | NEW | 43.2 | — | ||
| #90 | OpenAI: GPT-5.4 saasOpenAI: GPT-5.4: GPT-5.4 is OpenAI's latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for... | 141 | 43.1 | +21.33 | ||
| #92 | Anthropic: Claude Opus 4.6 saasAnthropic: Claude Opus 4.6: Opus 4.6 is Anthropic's strongest model for coding and long-running professional tasks. It is built for agents that operate across entire workflows rather than single prompts, making it especially effective... | 44 | 43.1 | +21.29 | ||
| #94 | DeepSeek: DeepSeek V3 saasDeepSeek: DeepSeek V3: DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction following and coding abilities of the previous versions. Pre-trained on nearly 15 trillion tokens, the reported evaluations... | 15 | 42.2 | +20.34 | ||
| #89 | model:XiaomiMiMo/MiMo-V2.5-Pro model:XiaomiMiMo/MiMo-V2.5-Pro: discovered AI agent. | NEW | 41.2 | — | ||
| #94 | model:Jackrong/Qwopus-GLM-18B-Merged-GGUF model:Jackrong/Qwopus-GLM-18B-Merged-GGUF: discovered AI agent. | NEW | 40.8 | — | ||
| #96 | Agent-Reach mitmcp-serverAgent-Reach: Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu - one CLI, zero API fees. | 52 | 40.5 | -0.94 | ||
| #97 | model:XiaomiMiMo/MiMo-V2.5 ide-pluginmodel:XiaomiMiMo/MiMo-V2.5: discovered AI agent. | NEW | 40.3 | — | ||
| #100 | mirage apache-2.0mirage: A Unified Virtual Filesystem For AI Agents. | NEW | 40.0 | — | ||
| #101 | codex-mcp-server mcp-servercodex-mcp-server: MCP server wrapper for OpenAI Codex CLI. | NEW | 39.8 | — | ||
| #105 | PocketFlow-Tutorial-Codebase-Knowledge mitlibraryPocketFlow-Tutorial-Codebase-Knowledge: Pocket Flow: Codebase to Tutorial. | 54 | 39.7 | -0.95 | ||
| #109 | Anthropic: Claude Sonnet 4.5 saasAnthropic: Claude Sonnet 4.5: Claude Sonnet 4.5 is Anthropic's most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with... | 56 | 39.3 | +17.53 | ||
| #105 | Anthropic: Claude Opus 4.5 saasAnthropic: Claude Opus 4.5: Claude Opus 4.5 is Anthropic's frontier reasoning model optimized for complex software engineering, agentic workflows, and long-horizon computer use. It offers strong multimodal capabilities, competitive performance across real-world coding and... | 58 | 39.3 | +17.53 | ||
| #99 | Anthropic: Claude 3.7 Sonnet saasAnthropic: Claude 3.7 Sonnet: Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and problem-solving capabilities. It introduces a hybrid reasoning approach, allowing users to choose between rapid responses and... | 58 | 39.3 | +17.53 | ||
| #103 | Anthropic: Claude Opus 4 saasAnthropic: Claude Opus 4: Claude Opus 4 is benchmarked as the world's best coding model, at time of release, bringing sustained performance on complex, long-running tasks and agent workflows. It sets new benchmarks in... | 93 | 39.3 | +8.33 | ||
| #100 | Anthropic: Claude 3.7 Sonnet (thinking) saasAnthropic: Claude 3.7 Sonnet (thinking): Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and problem-solving capabilities. It introduces a hybrid reasoning approach, allowing users to choose between rapid responses and... | 58 | 39.3 | +17.53 | ||
| #104 | Anthropic: Claude Opus 4.1 saasAnthropic: Claude Opus 4.1: Claude Opus 4.1 is an updated version of Anthropic's flagship model, offering improved performance in coding, reasoning, and agentic tasks. It achieves 74.5% on SWE-bench Verified and shows notable gains... | 58 | 39.3 | +17.53 | ||
| #98 | Anthropic: Claude 3.5 Haiku saasAnthropic: Claude 3.5 Haiku: Claude 3.5 Haiku offers enhanced capabilities in speed, coding accuracy, and tool use. Engineered to excel in real-time applications, it delivers quick response times that are essential for dynamic... | 58 | 39.3 | +17.53 | ||
| #108 | Anthropic: Claude Sonnet 4 saasAnthropic: Claude Sonnet 4: Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on SWE-bench (72.7%),... | 56 | 39.3 | +17.53 | ||
| #111 | Z.ai: GLM 4.7 Flash saasZ.ai: GLM 4.7 Flash: As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,... | 254 | 39.3 | +17.48 | ||
| #110 | DeepCode mitDeepCode: "DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)". | 55 | 39.0 | -0.94 | ||
| #112 | aelfrice mitmcp-serveraelfrice: Bayesian memory that learns from feedback for LLM agents. | 6 | 38.7 | +17.77 | ||
| #113 | deepclaude mitdeepclaude: Use Claude Code's autonomous agent loop with DeepSeek V4 Pro, OpenRouter, or any Anthropic-compatible backend. Same UX, 17x cheaper. | NEW | 38.7 | — | ||
| #114 | ralph-claude-code mitcliralph-claude-code: Autonomous AI development loop for Claude Code with intelligent exit detection. | 57 | 38.7 | -0.94 | ||
| #115 | model:HauhauCS/Qwen3.6-27B-Uncensored-HauhauCS-Balanced model:HauhauCS/Qwen3.6-27B-Uncensored-HauhauCS-Balanced: discovered AI agent. | NEW | 38.6 | — | ||
| #116 | code-act mitlibrarycode-act: Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji. | 30 | 38.5 | +5.27 | ||
| #117 | agentic-seo cliagentic-seo: Audit your documentation and website for AI agent readiness (Agentic Engine Optimization). | NEW | 38.2 | — | ||
| #120 | create-agentic-pdlc librarycreate-agentic-pdlc: Agentic PDLC Framework - Conversational setup for your AI coding assistants. | NEW | 38.0 | — | ||
| #122 | xAI: Grok 3 Beta saasxAI: Grok 3 Beta: Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in... | 220 | 37.7 | +15.89 | ||
| #121 | xAI: Grok 3 saasxAI: Grok 3: Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in... | 220 | 37.7 | +15.89 | ||
| #124 | xAI: Grok Code Fast 1 saasxAI: Grok Code Fast 1: Grok Code Fast 1 is a speedy and economical reasoning model that excels at agentic coding. With reasoning traces visible in the response, developers can steer Grok Code for high-quality... | 227 | 37.7 | +15.89 | ||
| #122 | composio mitmcp-servercomposio: typescript python sdk ai-agents anthropic openapi langchain openai-agents llamaindex mastra vercel-ai mcp oauth saas llm integrations agent-tools automation cloudflare google-gemini tooling rag multi-provider developer-sdk composable-actions webhook-triggers. | 47 | 37.5 | +2.13 | ||
| #123 | gptme mitcligptme: Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top! | 28 | 37.5 | +8.78 | ||
| #131 | Google: Gemini 2.5 Pro Preview 06-05 saasGoogle: Gemini 2.5 Pro Preview 06-05: Gemini 2.5 Pro is Google's state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs "thinking" capabilities, enabling it to reason through responses with enhanced accuracy... | 34 | 37.1 | +15.25 | ||
| #128 | Google: Gemini 2.5 Flash saasGoogle: Gemini 2.5 Flash: Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater... | 36 | 37.1 | +15.25 | ||
| #130 | Google: Gemini 2.5 Pro Preview 05-06 saasGoogle: Gemini 2.5 Pro Preview 05-06: Gemini 2.5 Pro is Google's state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs "thinking" capabilities, enabling it to reason through responses with enhanced accuracy... | 34 | 37.1 | +15.25 | ||
| #124 | Agently apache-2.0libraryAgently: [GenAI Application Development Framework] Build GenAI application quick and easy; Easy to interact with GenAI agent in code using structure data and chained-calls syntax; Use Event-Driven Flow *TriggerFlow* to manage complex GenAI working logic; Switch to any model witho... | 54 | 37.0 | +0.63 | ||
| #151 | Mistral: Mixtral 8x22B Instruct saasMistral: Mixtral 8x22B Instruct: Mistral's official instruct fine-tuned version of [Mixtral 8x22B](/models/mistralai/mixtral-8x22b). It uses 39B active parameters out of 141B, offering unparalleled cost efficiency for its size. Its strengths include: - strong math, coding,... | 19 | 36.5 | +14.65 | ||
| #145 | Mistral: Mistral Medium 3.5 saasMistral: Mistral Medium 3.5: Mistral Medium 3.5 is a dense 128B instruction-following model from Mistral AI. It supports text and image inputs with text output, and is designed for agentic workflows, coding, and complex... | NEW | 36.5 | — | ||
| #127 | reversa mitreversa: Transform legacy systems into executable specifications for AI coding agents. | NEW | 36.4 | — | ||
| #129 | mirascope mitlibrarymirascope: The LLM Anti-Framework. | 63 | 36.2 | -0.95 | ||
| #130 | codeinterpreter-api mitcodeinterpreter-api: Open source implementation of the ChatGPT Code Interpreter. | 68 | 36.1 | -1.90 | ||
| #159 | OpenAI: GPT-3.5 Turbo (older v0613) saasOpenAI: GPT-3.5 Turbo (older v0613): GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021. | 44 | 35.9 | +14.04 | ||
| #188 | OpenAI: GPT-5 Pro saasOpenAI: GPT-5 Pro: GPT-5 Pro is OpenAI's most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and... | 55 | 35.9 | +14.04 | ||
| #181 | OpenAI: GPT-5.4 Mini saasOpenAI: GPT-5.4 Mini: GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding,... | 52 | 35.9 | +14.04 |