Cohort of 25 admitted agents tagged capability:research. Composite below is the cohort's average AgentScore.
| Cmp | Rank | Agent | 24h | Score | Δ24h | Watch |
|---|---|---|---|---|---|---|
| #1 | scholar-search-mcp mcp-serverscholar-search-mcp: An MCP server for academic paper search that integrates with AI assistants (e.g., Claude Code, Cursor), enabling them to search and retrieve academic paper metadata. | NEW | 75.0 | — | ||
| #7 | xAI: Grok 4.1 Fast saasxAI: Grok 4.1 Fast: Grok 4.1 Fast is xAI's best agentic tool calling model that shines in real-world use cases like customer support and deep research. 2M context window. Reasoning can be enabled/disabled using... | 339 | 61.9 | +40.07 | ||
| #8 | deer-flow mitlibrarydeer-flow: An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of tasks that could take minutes to hours. | 16 | 56.6 | +9.75 | ||
| #38 | hermes-agent mithermes-agent: The agent that grows with you. | 25 | 49.7 | +1.22 | ||
| #40 | Auto-claude-code-research-in-sleep mitmcp-serverAuto-claude-code-research-in-sleep: ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works with Claude Code, Codex, OpenClaw, or any LLM agent. | NEW | 49.6 | — | ||
| #41 | awesome-generative-ai-guide mitide-pluginawesome-generative-ai-guide: A one stop repository for generative AI research updates, interview resources, notebooks and much more!. | NEW | 49.6 | — | ||
| #89 | xAI: Grok 4.20 Multi-Agent saasxAI: Grok 4.20 Multi-Agent: Grok 4.20 Multi-Agent is a variant of xAI???s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in parallel to conduct deep research, coordinate tool use, and synthesize information... | 259 | 44.4 | +22.60 | ||
| #74 | everything-claude-code mitmcp-servereverything-claude-code: The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond. | 67 | 43.9 | -5.33 | ||
| #81 | RD-Agent mitRD-Agent: Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automating these high-value generic R&D processes through R&D-Agent, whi... | 39 | 43.3 | +1.68 | ||
| #111 | tq-trading-agent tq-trading-agent: 🌮 Traidng agent, AI-powered multi-agent stock research & trading strategy orchestration, trading agent - TypeScript, LangGraph, OpenAI-compatible APIs. | NEW | 38.9 | — | ||
| #117 | Perplexity: Sonar Deep Research saasPerplexity: Sonar Deep Research: Sonar Deep Research is a research-focused model designed for multi-step retrieval, synthesis, and reasoning across complex topics. It autonomously searches, reads, and evaluates sources, refining its approach as it gathers... | 148 | 37.9 | +16.07 | ||
| #196 | OpenAI: o3 Deep Research saasOpenAI: o3 Deep Research: o3-deep-research is OpenAI's advanced model for deep research, designed to tackle complex, multi-step research tasks.
Note: This model always uses the 'web_search' tool which adds additional cost. | 59 | 35.9 | +14.04 | ||
| #199 | OpenAI: o4 Mini Deep Research saasOpenAI: o4 Mini Deep Research: o4-mini-deep-research is OpenAI's faster, more affordable deep research model???ideal for tackling complex, multi-step research tasks.
Note: This model always uses the 'web_search' tool which adds additional cost. | 61 | 35.9 | +14.04 | ||
| #201 | Google: Gemma 2 27B saasGoogle: Gemma 2 27B: Gemma 2 27B by Google is an open model built from the same research and technology used to create the [Gemini models](/models?q=gemini). Gemma models are well-suited for a variety of... | 181 | 35.6 | +8.72 | ||
| #149 | trading-agents apache-2.0trading-agents: TradingAgents LLM multi-agent finance trading stocks crypto fintech quantitative algo trading sentiment analysis OpenAI JavaScript Node.js research OSS | 67 | 33.0 | -1.03 | ||
| #243 | Microsoft: Phi 4 saasMicrosoft: Phi 4: [Microsoft Research](/microsoft) Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion... | 221 | 31.1 | +6.22 | ||
| #168 | OpenSearch-VL apache-2.0ide-pluginOpenSearch-VL: 🔍 OpenSearch-VL provides a fully open recipe for training strong multimodal deep search agents through high-quality data curation, diverse visual/search tools, and fatal-aware agentic reinforcement learning. | NEW | 30.0 | — | ||
| #195 | Evolutionary-Alpha-Miner mitide-pluginEvolutionary-Alpha-Miner: Family-aware evolutionary alpha mining with LLM-guided symbolic hybridization. | NEW | 26.7 | — | ||
| #306 | Nous: Hermes 4 70B saasNous: Hermes 4 70B: Hermes 4 70B is a hybrid reasoning model from Nous Research, built on Meta-Llama-3.1-70B. It introduces the same hybrid mode as the larger 405B release, allowing the model to either... | 117 | 21.8 | 0.00 | ||
| #305 | Nous: Hermes 4 405B saasNous: Hermes 4 405B: Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Research. It introduces a hybrid reasoning mode, where the model can choose to deliberate internally with... | 117 | 21.8 | 0.00 | ||
| #304 | Nous: Hermes 3 70B Instruct saasNous: Hermes 3 70B Instruct: Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nousresearch/nous-hermes-2-mistral-7b-dpo), including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements acr... | 117 | 21.8 | 0.00 | ||
| #307 | NousResearch: Hermes 2 Pro - Llama-3 8B saasNousResearch: Hermes 2 Pro - Llama-3 8B: Hermes 2 Pro is an upgraded, retrained version of Nous Hermes 2, consisting of an updated and cleaned version of the OpenHermes 2.5 Dataset, as well as a newly introduced... | 117 | 21.8 | 0.00 | ||
| #356 | Tongyi DeepResearch 30B A3B saasTongyi DeepResearch 30B A3B: Tongyi DeepResearch is an agentic large language model developed by Tongyi Lab, with 30 billion total parameters activating only 3 billion per token. It's optimized for long-horizon, deep information-seeking tasks... | 19 | 21.8 | 0.00 | ||
| #232 | awesome-llm-agent-skills-papers awesome-llm-agent-skills-papers: A curated list of papers, blog posts, and systems on skills for LLM agents — reusable, named capability units that an agent can store, retrieve, compose, and improve over time — together with closely adjacent research on tool use, function calling, procedural memory, and skill... | NEW | 20.2 | — | ||
| #248 | PaperQuay agplPaperQuay: A desktop-first literature manager for PDF reading, translation, paper overviews, and AI agent workflows. | 135 | 17.9 | -7.67 |
Browse all sectors at /sectors.