Methodology

These public-facing capability pillars are plain-language summaries built on a deeper coverage map spanning reasoning, learning, truthfulness, self-monitoring, social competence, multimodal understanding, safety, and robustness. Each granular question is meant to be supported by benchmarks, controlled studies, audits, red-team exercises, longitudinal trials, or expert-blind review.

Met · High confidence · Multimodal understanding

AI can understand the world across text, images, audio, video, documents, and space

An AI system can combine language with visual, document, temporal, and spatial evidence into one coherent understanding of the task and world state.

Progress: 90%

Updated Mar 13, 2026

Evidence items: 5

Sub-questions: 5