Leaderboard
Per-field breakdown
The headline ranking aggregates over five broad scientific fields. Opus 4.7 leads in four of five; GPT-5.5 leads in Physics & Astronomy by the widest single-field margin.
Reasoning effort: Opus 4.6 / Opus 4.7 / Sonnet 4.6 = max · Gemini 3.1 Pro = high · GPT-5.5 = xhigh.