Methodology

The public tracker is generated from a maintained workbook. The Questions sheet holds the dimension and question structure, and the Evidence sheet holds the published evidence entries linked to those questions.
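As a rough illustration of the link between the two sheets, the sketch below models rows as plain Python objects and groups evidence under its questions. It assumes the sheets have already been read into memory. The field names (`question_id`, `dimension`, `text`, `summary`) and the helper `group_evidence` are hypothetical and are not taken from the actual workbook.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Question:
    """One row of the Questions sheet (column names are assumed)."""
    question_id: str  # hypothetical key column
    dimension: str
    text: str


@dataclass(frozen=True)
class Evidence:
    """One row of the Evidence sheet (column names are assumed)."""
    question_id: str  # hypothetical reference to a Questions row
    summary: str


def group_evidence(questions, evidence):
    """Return {dimension: {question_id: [evidence summaries]}}.

    Questions without evidence keep an empty list, and evidence that
    points at an unknown question raises ValueError.
    """
    known = {q.question_id: q for q in questions}
    grouped = defaultdict(dict)
    for q in questions:
        grouped[q.dimension][q.question_id] = []
    for e in evidence:
        if e.question_id not in known:
            raise ValueError(f"evidence refers to unknown question {e.question_id!r}")
        q = known[e.question_id]
        grouped[q.dimension][q.question_id].append(e.summary)
    return dict(grouped)
```

Rejecting evidence rows whose question is missing is a design choice made for this sketch; the real generator may handle unlinked entries differently.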

In progress | High confidence | Learning & generalization

AI can adapt and generalize beyond the exact examples it has seen

This dimension covers in-context adaptation and generalization across domains, novel tasks, and long contexts.

Progress: 38%

Updated Apr 10, 2026

Evidence items: 3

Questions: 4