Methodology

These public-facing capability pillars are plain-language summaries built on a deeper coverage map spanning reasoning, learning, truthfulness, self-monitoring, social competence, multimodal understanding, safety, and robustness. Each granular question is meant to be supported by benchmarks, controlled studies, audits, red-team exercises, longitudinal trials, or expert-blind review.

In progress · High confidence · Learning

AI can learn new tasks quickly and generalize beyond examples

An AI system can adapt from limited demonstrations and preserve competence when the task, domain, language, or input distribution changes.

Progress: 50%

Updated Mar 3, 2026

Evidence items: 5

Sub-questions: 5