Cohort of 34 admitted agents tagged capability:rag. Composite below is the cohort's average AgentScore.
| Cmp | Rank | Agent | 24h | Score | Δ24h | Watch |
|---|---|---|---|---|---|---|
| #4 | claude-mem apache-2.0claude-mem: A Claude Code plugin that automatically captures everything Claude does during your coding sessions, compresses it with AI (using Claude's agent-sdk), and injects relevant context back into future sessions. | NEW | 60.1 | — | ||
| #5 | MinerU MinerU: Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows. | 23 | 58.8 | +12.48 | ||
| #41 | Cohere: Command R7B (12-2024) saasCohere: Command R7B (12-2024): Command R7B (12-2024) is a small, fast update of the Command R+ model, delivered in December 2024. It excels at RAG, tool use, agents, and similar tasks requiring complex reasoning... | 36 | 58.2 | +36.38 | ||
| #40 | Cohere: Command R+ (08-2024) saasCohere: Command R+ (08-2024): command-r-plus-08-2024 is an update of the [Command R+](/models/cohere/command-r-plus) with roughly 50% higher throughput and 25% lower latencies as compared to the previous Command R+ version, while keeping the hardware footprint... | 36 | 58.2 | +36.38 | ||
| #16 | dify mcp-serverdify: Production-ready platform for agentic workflow development. | 7 | 54.7 | +5.97 | ||
| #18 | open-webui mcp-serveropen-webui: User-friendly AI Interface (Supports Ollama, OpenAI API, ...) | 6 | 54.7 | +6.18 | ||
| #19 | langchain mitlibrarylangchain: The agent engineering platform. | 17 | 54.5 | -0.39 | ||
| #33 | ragflow apache-2.0ragflow: RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs. | 12 | 51.8 | +4.60 | ||
| #52 | MiniMax: MiniMax M1 saasMiniMax: MiniMax M1: MiniMax-M1 is a large-scale, open-weight reasoning model designed for extended context and high-efficiency inference. It leverages a hybrid Mixture-of-Experts (MoE) architecture paired with a custom "lightning attention" mechanism, allowing it... | 93 | 49.4 | +27.59 | ||
| #56 | NVIDIA: Llama 3.1 Nemotron 70B Instruct saasNVIDIA: Llama 3.1 Nemotron 70B Instruct: NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels... | 38 | 49.4 | +22.27 | ||
| #57 | NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 saasNVIDIA: Llama 3.3 Nemotron Super 49B V1.5: Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta???s Llama-3.3-70B-Instruct with a 128K context. It???s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and... | 134 | 49.4 | +27.59 | ||
| #55 | awesome-llm-apps apache-2.0awesome-llm-apps: 100+ AI Agent & RAG apps you can actually run ??? clone, customize, ship. | 39 | 46.9 | -0.95 | ||
| #88 | langroid mitlibrarylangroid: Harness LLMs with Multi-Agent Programming. | 28 | 41.3 | +2.87 | ||
| #100 | mirage apache-2.0mirage: A Unified Virtual Filesystem For AI Agents. | NEW | 40.0 | — | ||
| #102 | PocketFlow mitlibraryPocketFlow: Pocket Flow: 100-line LLM framework. Let Agents build Agents! | 52 | 39.7 | -0.95 | ||
| #112 | aelfrice mitmcp-serveraelfrice: Bayesian memory that learns from feedback for LLM agents. | 6 | 38.7 | +17.77 | ||
| #117 | Perplexity: Sonar Deep Research saasPerplexity: Sonar Deep Research: Sonar Deep Research is a research-focused model designed for multi-step retrieval, synthesis, and reasoning across complex topics. It autonomously searches, reads, and evaluates sources, refining its approach as it gathers... | 148 | 37.9 | +16.07 | ||
| #121 | awesome-LLM-resources apache-2.0mcp-serverawesome-LLM-resources: ??????????? ??????????????????LLM?????????????????????????????????Agent??????????????????AI??????????????????????????????????????????????????????o1 ?????????MCP?????????????????????????????????????????? | Summary of the world's best LLM resources. | 65 | 37.8 | -1.87 | ||
| #122 | composio mitmcp-servercomposio: typescript python sdk ai-agents anthropic openapi langchain openai-agents llamaindex mastra vercel-ai mcp oauth saas llm integrations agent-tools automation cloudflare google-gemini tooling rag multi-provider developer-sdk composable-actions webhook-triggers. | 47 | 37.5 | +2.13 | ||
| #126 | ComfyUI-Copilot mitide-pluginComfyUI-Copilot: An AI-powered custom node for ComfyUI designed to enhance workflow automation and provide intelligent assistance. | 63 | 36.8 | -0.95 | ||
| #137 | generative-ai mitmcp-servergenerative-ai: Comprehensive resources on Generative AI, including a detailed roadmap, projects, use cases, interview preparation, and coding preparation. | 66 | 35.0 | -1.31 | ||
| #212 | Qwen: Qwen3 Max saasQwen: Qwen3 Max: Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It... | 95 | 34.2 | +12.37 | ||
| #151 | @loongsuite/opentelemetry-util-genai @loongsuite/opentelemetry-util-genai: OpenTelemetry GenAI utility for standardized telemetry collection across LLM, Agent, Embedding, Tool, Retrieval, Rerank, Memory and more. | NEW | 32.7 | — | ||
| #152 | AgenticRAG-Survey libraryAgenticRAG-Survey: Agentic-RAG explores advanced Retrieval-Augmented Generation systems enhanced with AI LLM agents. | 67 | 32.5 | -0.95 | ||
| #155 | Agent_Memory_Techniques apache-2.0Agent_Memory_Techniques: Agent memory for LLMs: 30 runnable Jupyter notebooks covering conversation buffers, vector stores, knowledge graphs, episodic and semantic memory, MemGPT, Mem0, Letta, Zep, Graphiti, LoCoMo benchmarks, and production patterns. | NEW | 31.9 | — | ||
| #245 | Google: Gemma 3n 2B (free) saasGoogle: Gemma 3n 2B (free): Gemma 3n E2B IT is a multimodal, instruction-tuned model developed by Google DeepMind, designed to operate efficiently at an effective parameter size of 2B while leveraging a 6B architecture. Based... | 135 | 30.8 | +8.98 | ||
| #181 | @mcp-abap-adt/ollama-embedder mcp-server@mcp-abap-adt/ollama-embedder: Ollama embedding provider and OllamaRag convenience class for @mcp-abap-adt/llm-agent. | 74 | 28.2 | +0.89 | ||
| #187 | krusch-context-mcp mitmcp-serverkrusch-context-mcp: A unified Zero-Trust MCP server that gives IDE agents local semantic codebase search, isolated episodic project memory, and hallucination-free framework RAG. | NEW | 27.5 | — | ||
| #362 | Z.ai: GLM 4.5 saasZ.ai: GLM 4.5: GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly... | 4 | 21.8 | 0.00 | ||
| #273 | Baidu: Qianfan-OCR-Fast (free) saasBaidu: Qianfan-OCR-Fast (free): Qianfan-OCR-Fast is a domain-specific multimodal large model purpose-built for OCR. By leveraging specialized OCR training data while preserving versatile multimodal intelligence, it provides a powerful performance upgrade over Qianfan-OCR. | 205 | 21.8 | 0.00 | ||
| #341 | Relace: Relace Search saasRelace: Relace Search: The relace-search model uses 4-12 `view_file` and `grep` tools in parallel to explore a codebase and return relevant files to the user request. In contrast to RAG, relace-search performs agentic... | 19 | 21.8 | 0.00 | ||
| #229 | emotional-memory-agent mitemotional-memory-agent: A multi-tenant emotional memory system powered by an LLM agent, featuring user-level privacy isolation, long-term memory persistence, RAG retrieval, diary writing, and support for personalized interactions in QQ private and group chats. | NEW | 20.7 | — | ||
| #372 | LiquidAI: LFM2.5-1.2B-Thinking (free) saasLiquidAI: LFM2.5-1.2B-Thinking (free): LFM2.5-1.2B-Thinking is a lightweight reasoning-focused model optimized for agentic tasks, data extraction, and RAG???while still running comfortably on edge devices. It supports long context (up to 32K tokens) and is... | 242 | 14.1 | -7.70 | ||
| #249 | Modular RAG MCP Server mcp-serverModular RAG MCP Server: A pluggable, observable modular RAG service framework that exposes tool interfaces via the MCP protocol, enabling AI assistants like Copilot and Claude to directly query knowledge bases. | NEW | 0.0 | — |
Browse all sectors at /sectors.