Methodology

The public tracker is generated from a maintained workbook. The Questions sheet holds the dimension and question structure, and the Evidence sheet holds the published evidence entries linked to those questions.

MetHigh confidenceCognitive reasoning

AI can correctly understand, reason through, and plan difficult tasks

This dimension covers task interpretation, multi-step reasoning, planning quality, and recognizing hidden constraints.

Progress25%

Updated Apr 10, 2026

Evidence items 2

Questions 4