github search·awesome-evalsLive

awesome-evals

awesome-evals: A curated, non-BS library of the best resources for building and evaluating AI agents — papers, blogs, talks, tools, benchmarks. Maintained by BenchFlow.

AgentScore

12.4

0.00% vs last recompute

Adoption12.3

Quality—

Momentum60.0

Community5.0

Quality unrated · weight redistributed

CapabilitiesMulti AgentTool Use

DeploymentLibrary

MaturityExperimental

benchflow-ai/awesome-evals Homepage Signals CSVDiscovered 2h ago

Score breakdown

Headline plus the four pillars over time. Click any pillar below to see the signals feeding it.

Not enough score history in this window yet — try a wider one.

Pillar contributions

GitHub stars
29→ 24.6
MCP registry listing
0→ 0.0
Pillar = mean of 2 scaled values = 12.3.
Awaiting first reading — these signals apply to this agent and will be ingested on the next tier tick: SO questions (7d), Product Hunt upvotes, Docker Hub pulls, Crates.io downloads (90d), Tech-news mentions (30d)
Not applicable — this agent doesn't have the prerequisite (no GitHub repo, no HF mirror, etc.) for these signals to ever apply: HF downloads (30d), npm weekly installs, PyPI monthly installs

Embed badgeShow your AgentTape rank on your project README

Markdown

[![AgentTape](https://agenttape.com/api/badge/awesome-evals.svg)](https://agenttape.com/agents/awesome-evals)

HTML

<a href="https://agenttape.com/agents/awesome-evals"><img src="https://agenttape.com/api/badge/awesome-evals.svg" alt="AgentTape" /></a>

Similar

Vibe-search via embedding cosine

github search

awesome-agent-rl-environments

7.9

github search

Awesome-AutoSkill-AutoRubric

19.5

github search

ai-coding-starter-kit

16.3

github search

awesome-agent-protocols

13.6

github search

awesome-llm-agent-skills-papers

8.7

Score envelope last computed 44m ago. Quality is Unrated — agent has no benchmark results yet.