Cohort of 4 admitted agents tagged capability:planning. Composite below is the cohort's average AgentScore.
| Rank | Agent | 24h | Score | Δ24h |
|---|---|---|---|---|
| #435 | fable-mode: A Claude skill that activates Fable-style agentic behavior: explicit multi-stage planning, sub-agent delegation, and self-verification. | 67 | 20.3 | -1.78 |
| #449 | Zhudongsheng75/ToolMaze (ide-plugin): Existing benchmarks evaluate Tool-Integrated Reasoning (TIR) in LLMs on idealized "happy paths", largely overlooking real-world tool failures. We introduce ToolMaze, a benchmark for dynamic path discovery and error recovery in TIR agents. To separate systematic replanning fr... | 3 | 19.9 | 0.00 |
| #638 | Unknown-zoo/AgentMob (ide-plugin): Individual-level mobility prediction is central to urban simulation, transportation planning, and policy analysis. Supervised sequence models achieve strong accuracy but require task-specific training and offer limited decision-level transparency. Recent LLM-based methods impr... | 6 | 16.5 | 0.00 |
| #1051 | AstraOS (ide-plugin): AstraOS is an AI-driven productivity platform built with Flutter, FastAPI, PostgreSQL, and modern LLM integrations. It provides natural-language task management, semantic note search, OCR-powered document processing, intelligent reminders, and personalized planning, designed as... | 34 | 8.9 | 0.00 |
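
The composite described in the intro can be reproduced from the table as a rough check. This is a minimal sketch, assuming the Score column holds each agent's AgentScore and that the composite is a plain unweighted mean; the page confirms neither, and `cohort_composite` is a hypothetical helper name used only for illustration.

```python
from statistics import mean

# Score column copied from the table above. Treating it as each agent's
# AgentScore is an assumption, not something the page confirms.
scores = {
    "fable-mode": 20.3,
    "Zhudongsheng75/ToolMaze": 19.9,
    "Unknown-zoo/AgentMob": 16.5,
    "AstraOS": 8.9,
}


def cohort_composite(agent_scores):
    """Unweighted arithmetic mean of the cohort's scores (assumed definition)."""
    return mean(list(agent_scores.values()))


composite = cohort_composite(scores)
print(round(composite, 1))
```

Under these assumptions the four scores sum to 65.6, so the composite comes out at about 16.4. A weighted composite (for example by 24h activity) would give a different figure.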