Methodology

These public-facing capability pillars are plain-language summaries built on a deeper coverage map spanning reasoning, learning, truthfulness, self-monitoring, social competence, multimodal understanding, safety, and robustness. Each granular question is meant to be supported by evidence from benchmarks, controlled studies, audits, red-team exercises, longitudinal trials, or expert-blind review.

In progress · High confidence · Novelty

AI can generate new, useful ideas and solutions

An AI system can produce novel hypotheses, designs, and solution paths that are genuinely useful rather than generic rearrangements of familiar patterns.

Progress: 60%

Updated Mar 11, 2026

Evidence items: 5

Sub-questions: 5