Methodology
These public-facing capability pillars are plain-language summaries built on a deeper coverage map spanning reasoning, learning, truthfulness, self-monitoring, social competence, multimodal understanding, safety, and robustness. Each granular question is meant to be supported by benchmarks, controlled studies, audits, red-team exercises, longitudinal trials, or expert-blind review.
In progress | High confidence | Evidence & truth
AI can stay grounded in facts and evidence
An AI system can retrieve the right evidence, cite it correctly, avoid unsupported claims, and stay stable when context is misleading.
Progress: 60%
Updated Mar 9, 2026
Evidence items: 5
Sub-questions: 5