Methodology
The public tracker is generated from a maintained workbook. The Questions sheet holds the dimension and question structure, and the Evidence sheet holds the published evidence entries linked to those questions.
MetHigh confidenceCognitive reasoning
AI can correctly understand, reason through, and plan difficult tasks
This dimension covers task interpretation, multi-step reasoning, planning quality, and recognizing hidden constraints.
Progress25%
Updated Apr 10, 2026
Evidence items 2
Questions 4