xAI: Grok 4: Grok 4 is xAI's latest reasoning model with a 256k context window. It supports parallel tool calling, structured outputs, and both image and text inputs. Note that reasoning is not...
Pillar = mean of 1 scaled value = 24.9.
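The pillar arithmetic above can be sketched as a plain mean over whatever scaled signal values are currently available. This is a minimal illustration, not the site's actual scoring code: the function name `pillar_score` and the rounding to one decimal place are assumptions, and how each raw signal is scaled is not described here.

```python
from statistics import fmean


def pillar_score(scaled_values):
    """Return the mean of the scaled signal values, rounded to one decimal.

    Hypothetical helper: the name and the rounding are assumptions made
    for illustration, not the page's documented method.
    """
    if not scaled_values:
        raise ValueError("no scaled values available for this pillar")
    return round(fmean(scaled_values), 1)


# With a single scaled value, the mean is that value itself.
print(pillar_score([24.9]))  # 24.9
```

With only one signal ingested so far, the pillar equals that signal's scaled value. It would shift once the pending signals receive their first readings.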
Awaiting first reading. These signals apply to this agent and will be ingested on the next tier tick: Reddit mentions (7d), Bluesky mentions (7d), Mastodon mentions (7d), Wikipedia views (30d), OpenRouter tokens (30d), GitHub repos using this model, Tech-news mentions (30d)
Not applicable. This agent lacks the prerequisites (no GitHub repo, no HF mirror, etc.) for these signals to ever apply: HF downloads (30d), GitHub stars, GitHub mentions (7d)
<a href="https://agenttape.com/agents/xai-grok-4"><img src="https://agenttape.com/api/badge/xai-grok-4.svg" alt="AgentTape" /></a>
| Benchmark | Score | Max | Captured |
|---|---|---|---|
| aa-coding-index | 40.50 | 100.00 | May 18, 2026 |
| aa-intelligence-index | 33.30 | 100.00 | 9d ago |
| aa-math-index | 92.70 | 100.00 | May 18, 2026 |
| aime-2025 | 100.00 | 100.00 | 12h ago |
| gpqa-diamond | 88.40 | 100.00 | 12h ago |
| ifbench | 53.67 | 100.00 | 12h ago |
| livecodebench | 79.40 | 100.00 | 12h ago |
| math-500 | 99.00 | 100.00 | May 18, 2026 |
| mmlu-pro | 86.60 | 100.00 | May 18, 2026 |
| scicode | 45.70 | 100.00 | May 18, 2026 |
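For readers who want to work with these scores programmatically, here is a minimal sketch that parses rows in the table's pipe-delimited layout into records. The function name `parse_benchmark_table` and the record keys are hypothetical, and the sample holds only three rows copied from the table above.

```python
def parse_benchmark_table(markdown):
    """Parse a four-column pipe table into a list of dict records.

    Hypothetical helper for illustration. It skips the header row and
    the |---| separator row, and keeps the 'Captured' cell as raw text
    because the table mixes absolute and relative dates.
    """
    records = []
    for line in markdown.strip().splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        if len(cells) != 4:
            continue  # not a complete data row
        if cells[0] == "Benchmark" or set(cells[0]) <= {"-"}:
            continue  # header or separator row
        name, score, max_score, captured = cells
        records.append({
            "benchmark": name,
            "score": float(score),
            "max": float(max_score),
            "captured": captured,
        })
    return records


sample = """
| Benchmark | Score | Max | Captured |
|---|---|---|---|
| aime-2025 | 100.00 | 100.00 | 12h ago |
| gpqa-diamond | 88.40 | 100.00 | 12h ago |
| scicode | 45.70 | 100.00 | May 18, 2026 |
"""

records = parse_benchmark_table(sample)
best = max(records, key=lambda r: r["score"])
print(best["benchmark"])  # aime-2025
```

Keeping the capture time as text avoids guessing what "12h ago" or "9d ago" is relative to.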