C3

Calibration

Base Rate Neglect

Prior probabilities should be weighted appropriately

Models Tested

Confirmatory

-0.362

Mean Effect

0.501

Max Effect

Kahneman & Tversky (1973)

Prior probabilities should be weighted appropriately

Effect sizes for Base Rate Neglect across all tested models

Anthropic

The Cautious Contrarian

h = -0.501Confirmatory

Google

The Consistent Optimist

h = -0.384

OpenAI

The Directive Optimist

h = -0.371

OpenAI

The Steady Traditionalist

h = -0.194

Full results with confidence intervals and sample sizes

Model	n (A)	n (B)	Cohen's h	95% CI	Status
Claude Sonnet 4.6	50	50	-0.5010	—	Confirmatory
Gemini 2.0 Flash	50	50	-0.3835	—	Exploratory
GPT-5.3 Instant	50	50	-0.3714	—	Exploratory
GPT-5.2	50	50	-0.1935	—	Exploratory