github search·zhudongsheng75-toolmazeLive

Zhudongsheng75/ToolMaze

Zhudongsheng75/ToolMaze: Existing benchmarks evaluate Tool-Integrated Reasoning (TIR) in LLMs on idealized ''happy paths'', largely overlooking real-world tool failures. We introduce ToolMaze, a benchmark for dynamic path discovery and error recovery in TIR agents. To separate systematic replanning fr...

AgentScore

19.90.00

+0.00 vs 24h ago

Adoption3.9

Quality50.0

Momentum50.0

Community11.3

Quality unrated · weight redistributed

CapabilitiesCode GenerationPlanning

DeploymentIde Plugin

MaturityExperimental

Zhudongsheng75/ToolMaze Signals CSVDiscovered 19d ago

Score breakdown

Headline plus the four pillars over time. Click any pillar below to see the signals feeding it.

Not enough score history in this window yet — try a wider one.

Pillar contributions

GitHub stars
4→ 11.6
MCP registry listing
0→ 0.0
Pillar = mean of 2 scaled values = 3.9.
Awaiting first reading — these signals apply to this agent and will be ingested on the next tier tick: SO questions (7d), Product Hunt upvotes, Docker Hub pulls, Crates.io downloads (90d), Tech-news mentions (30d)
Not applicable — this agent doesn't have the prerequisite (no GitHub repo, no HF mirror, etc.) for these signals to ever apply: HF downloads (30d), npm weekly installs, PyPI monthly installs

Embed badgeShow your AgentTape rank on your project README

Markdown

[![AgentTape](https://agenttape.com/api/badge/zhudongsheng75-toolmaze.svg)](https://agenttape.com/agents/zhudongsheng75-toolmaze)

HTML

<a href="https://agenttape.com/agents/zhudongsheng75-toolmaze"><img src="https://agenttape.com/api/badge/zhudongsheng75-toolmaze.svg" alt="AgentTape" /></a>

Similar

Vibe-search via embedding cosine

Score envelope last computed 8d ago.