AgentTape
Anthropic: Claude 3.7 Sonnet (thinking): AgentScore, benchmarks and signals | AgentTape