
AgentBench

AgentBench: A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24).

AgentScore

30.5 (+0.00 vs 24h ago)

Adoption: 29.5

Quality: unrated

Momentum: 50.0

Community: 45.5

Quality unrated · weight redistributed
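As a rough illustration of what "weight redistributed" could mean, the minimal sketch below drops an unrated pillar and rescales the remaining weights proportionally so they still sum to 1. The page does not state the actual pillar weights or the redistribution rule, so the equal weights and the function names `redistribute_weights` and `headline` are hypothetical, and the result is not expected to reproduce the 30.5 headline shown above.

```python
from typing import Dict, Optional


def redistribute_weights(
    weights: Dict[str, float], scores: Dict[str, Optional[float]]
) -> Dict[str, float]:
    """Drop unrated pillars (score is None) and rescale the rest to sum to 1."""
    rated = {name: w for name, w in weights.items() if scores.get(name) is not None}
    total = sum(rated.values())
    if total == 0:
        raise ValueError("no rated pillars to redistribute weight onto")
    return {name: w / total for name, w in rated.items()}


def headline(weights: Dict[str, float], scores: Dict[str, Optional[float]]) -> float:
    """Weighted mean of the rated pillars using the redistributed weights."""
    effective = redistribute_weights(weights, scores)
    return sum(effective[name] * scores[name] for name in effective)


# Hypothetical equal weights; the real weights are not given on the page.
weights = {"adoption": 0.25, "quality": 0.25, "momentum": 0.25, "community": 0.25}
# Pillar values as listed above, with Quality unrated.
scores = {"adoption": 29.5, "quality": None, "momentum": 50.0, "community": 45.5}

effective = redistribute_weights(weights, scores)
print(effective)                  # quality is absent; remaining weights sum to 1
print(headline(weights, scores))  # weighted mean over the three rated pillars
```

Proportional rescaling keeps the relative importance of the rated pillars unchanged, which is one common way to handle a missing component; other schemes are possible.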

Capabilities: Multi Agent

License: Apache 2.0

Maturity: Beta

Score breakdown

Headline score plus the four pillars over time. (Not enough score history is available in this window yet.)

Pillar contributions

GitHub stars (+0.1%): 3.5k → 59.1
Pillar = mean of 1 scaled value = 29.5.
Awaiting first reading — these signals apply to this agent and will be ingested on the next tier tick: SO questions (7d), Product Hunt upvotes, Docker Hub pulls, Crates.io downloads (90d), Tech-news mentions (30d)
Not applicable — this agent doesn't have the prerequisite (no GitHub repo, no HF mirror, etc.) for these signals to ever apply: HF downloads (30d), npm weekly installs, PyPI monthly installs, MCP registry listing

Embed badge

Show your AgentTape rank on your project README.

Markdown

[![AgentTape](https://agenttape.com/api/badge/agentbench.svg)](https://agenttape.com/agents/agentbench)

HTML

<a href="https://agenttape.com/agents/agentbench"><img src="https://agenttape.com/api/badge/agentbench.svg" alt="AgentTape" /></a>

Score envelope last computed 6h ago. Quality is Unrated — agent has no benchmark results yet.