About Model Madness

Model Madness is an experiment by Alephic to see how well AI language models can predict the NCAA March Madness tournament.

The Experiment

We asked 45 different AI models from providers like Anthropic, OpenAI, Google, Meta, Mistral, and others to fill out March Madness brackets. Each model runs two strategies:

Scoring

Points increase with each round to reward correctly predicting later, harder games:

About Alephic

Alephic is an AI consultancy that helps companies build with large language models. Model Madness is one of our explorations into how different models reason about uncertain, real-world predictions.