sharminsrishty/osct: Large Language Models (LLMs) perform well on many language tasks, but their Theory of Mind (ToM) reasoning is still uneven in complex social settings. Existing benchmarks, including ExploreToM, do not always test the recursive beliefs and information asymmetries that make thes...
Pillar = mean of 2 scaled values = 3.9.
Awaiting first reading — these signals apply to this agent and will be ingested on the next tier tick: SO questions (7d), Product Hunt upvotes, Docker Hub pulls, Crates.io downloads (90d), Tech-news mentions (30d)
Not applicable — this agent doesn't have the prerequisite (no GitHub repo, no HF mirror, etc.) for these signals to ever apply: HF downloads (30d), npm weekly installs, PyPI monthly installs
[](https://agenttape.com/agents/sharminsrishty-osct)
<a href="https://agenttape.com/agents/sharminsrishty-osct"><img src="https://agenttape.com/api/badge/sharminsrishty-osct.svg" alt="AgentTape" /></a>