Methodology
These public-facing capability pillars are plain-language summaries built on a deeper coverage map spanning reasoning, learning, truthfulness, self-monitoring, social competence, multimodal understanding, safety, and robustness. Each granular question is intended to be backable by benchmarks, controlled studies, audits, red-team exercises, longitudinal trials, or expert-blind review.
In progressHigh confidenceSafety & control
AI can stay controllable, safe, and resistant to misuse
An AI system can follow explicit constraints, refuse harmful behavior, resist adversarial prompting, and remain steerable after change.
Progress50%
Updated Mar 13, 2026
Evidence items 5
Sub-questions 5