Methodology

These public-facing capability pillars are plain-language summaries built on a deeper coverage map spanning reasoning, learning, truthfulness, self-monitoring, social competence, multimodal understanding, safety, and robustness. Each granular question is intended to be backable by benchmarks, controlled studies, audits, red-team exercises, longitudinal trials, or expert-blind review.

Not metLow confidenceDelegation

AI can be trusted with meaningful delegated responsibility

An AI system can integrate many capabilities into one governed, monitorable, and dependable system that institutions would plausibly trust with serious responsibility.

Progress10%

Updated Mar 17, 2026

Evidence items 5

Sub-questions 5