AgentTape
OpenAI: o4 Mini: AgentScore, benchmarks and signals | AgentTape