AgentTape
OpenAI: o1: AgentScore, benchmarks and signals | AgentTape