First look at exclusive reports about OpenAI's new Spud model, and the model Anthropic think will stir governments to urgency, all in the context of the newly-launched ARC-AGI-3. What does the extreme difficulty of that benchmarks, and its quirky scoring metrics, mean for AI in 2026?
Chapters:
00:00 - Introduction
00:55 - OpenAI Side Quests
01:58 - Claude New Model Coming + Universal Equity?
03:13 - ARC-AGI 3
05:00 - Intentional or Unintentional Gaming?
07:11 - But is it AGI Harbinger? No Harness
09:41 - Not the First
12:32 - Automated Researcher
15:00 - Claw Caveat
Comments (0)