BookieBane
GamesTrack RecordMarch MadnessAboutDashboard
RecordMadness 🏆
← Research

We Backtested Our Upset Model Against 40 Years of March Madness

2,428 games. 695 upsets. Full transparency.

February 15, 2026·8 min
2,428
Tournament games
40
Seasons (1985-2024)
49.5%
R64 hit rate at BANE ≥65
1.9×
Lift over base rate

Most upset prediction models never show you proof. They claim accuracy rates, sell you picks, and hope you don’t check the math. We did the opposite: we ran BANE Score against every NCAA Tournament game from 1985 to 2024 - and we’re publishing every number, including what we get wrong.

The Setup

We took the complete NCAA tournament historical dataset: 2,428 tournament games across 39 played seasons (2020 was cancelled). For each game, we reconstructed what the BANE Score would have been using only pre-tournament regular season data - the same data available before tipoff.

For games from 2003-2024, we used the full 7-factor model with detailed box score statistics (3PT rate, turnovers, tempo, defensive efficiency). For 1985-2002, we used a simplified model based on win margin, conference strength, and Pythagorean expectation - the same approach we’d use if we only had basic stats.

Overall Results

Across all 2,428 games, 695 were upsets (lower seed won) - a 28.6% base upset rate. Here’s how BANE Score performed at different thresholds:

Threshold
Flagged
Upsets
Hit Rate
Lift
BANE ≥ 50
1,674
557
33.3%
1.2×
BANE ≥ 55
1,265
440
34.8%
1.2×
BANE ≥ 60
856
324
37.9%
1.3×
BANE ≥ 65
446
200
44.8%
1.6×
BANE ≥ 70
217
95
43.8%
1.5×
BANE ≥ 75
86
40
46.5%
1.6×
BANE ≥ 80
32
17
53.1%
1.9×

The pattern is clear: as BANE Score increases, upset probability increases. At BANE ≥65, nearly half of all flagged games resulted in upsets - 1.6× the base rate.

Where It Really Shines: Round of 64

The first round is where upsets matter most for brackets - and where BANE performs best.

R64 Threshold
Flagged
Upsets
Hit Rate
Lift
BANE ≥ 55
606
212
35.0%
1.3×
BANE ≥ 60
416
163
39.2%
1.5×
BANE ≥ 65
200
99
49.5%
1.9×
BANE ≥ 70
82
40
48.8%
1.9×

49.5% hit rate at BANE ≥65 in Round of 64. Nearly 1 in 2 flagged games produced an upset - nearly double the 26.1% base rate.

The Sweet Spot: Seed Matchups

BANE Score excels at identifying competitive underdogs in the 5-12 through 8-9 range. These are the matchups where underlying fundamentals (defense, tempo, turnovers) provide genuine predictive signal:

Matchup
Base Rate
BANE≥60 Rate
Improvement
5 vs 12
35.7%
40.8%
+5.1pp
6 vs 11
39.0%
47.9%
+8.9pp
7 vs 10
38.3%
44.1%
+5.8pp
8 vs 9
52.6%
54.9%
+2.3pp

The 6-vs-11 matchup is BANE’s best: 47.9% upset rate when BANE flags ≥60, versus the 39% base rate. That’s because 11-seeds often include power conference teams that got underseeded - exactly the kind of mismatch BANE’s conference strength and efficiency factors detect.

The Modern Era Advantage

BANE performs significantly better on 2003-2024 data, where we have full box score statistics:

1985-2002 (Basic stats only)
39.3%
BANE ≥65 hit rate
2003-2024 (Full box scores)
53.4%
BANE ≥65 hit rate

For 2026, we have full box score data and adjusted efficiency ratings for every team. The model will operate at its strongest with complete data on every tournament team.

What We Get Wrong

Transparency means showing failures too. Here are the famous upsets BANE missed:

Upset
BANE
Result
2018 #16 UMBC over #1 Virginia
26
❌ Missed
2023 #16 FDU over #1 Purdue
32
❌ Missed
2022 #15 St. Peter's over #2 Kentucky
37
❌ Missed
2021 #15 Oral Roberts over #2 Ohio St
35
❌ Missed
2016 #15 MTSU over #2 Michigan St
39
❌ Missed

The pattern is obvious: BANE cannot reliably predict 15-over-2 and 16-over-1 upsets. These happen at rates of 1.3% and 7.1% historically - they’re driven by single-game variance (hot 3PT shooting, foul trouble, one player going nuclear) that no pre-game model can capture.

We could inflate these scores to “catch” them retroactively, but that would mean flagging hundreds of false positives. We’d rather be honest: these are black swan events, and any model claiming to predict them reliably is overfitting to history.

What BANE does catch:

Upset
BANE
Result
2018 #11 Loyola-Chicago over #6 Miami FL
66
⚠️ Caught
2011 #11 VCU over #6 Georgetown
57
⚠️ Caught
1997 #14 Chattanooga over #3 Georgia
65
⚠️ Caught
2001 #15 Hampton over #2 Iowa St
64
⚠️ Caught
1989 #14 Siena over #3 Stanford
68
⚠️ Caught

How To Use BANE Score

BANE is a filter, not a betting system. Here’s our framework:

🚨
UPSET ALERT (BANE ≥ 70)
High-confidence upset candidate. 49% R64 hit rate. ~2 per tournament on average. Worth serious consideration for brackets and moneyline value.
⚠️
UPSET WATCH (BANE 55-69)
Competitive underdog with multiple upset factors. 35% hit rate. Good for bracket hedging - pick 1-2 of these to bust chalk.
📊
ON RADAR (BANE 40-54)
Some upset factors present but not enough for a strong call. Use for bracket strategy and awareness, not betting.
✅
CHALK (BANE < 40)
Favorite should win. Only 16% upset rate in this range. Safe bracket pick.

The Honest Bottom Line

BANE Score identifies upset candidates at nearly 2× the base rate in first-round matchups. It excels at the 5-12 through 8-9 seed matchups where defensive efficiency, tempo control, and turnover vulnerability provide genuine predictive signal.

It does not - and cannot - reliably predict once-a-decade Cinderella runs. No model can. But it can tell you which 11-seed has the defensive profile to grind out a 6-seed, which 12-seed plays at a tempo that compresses possessions against a fast 5-seed, and which favorites are running hot on luck.

That’s not a crystal ball. It’s decision support - and it’s what separates informed bracket strategy from coin flips.

Methodology

Data source: NCAA tournament historical dataset. 2,428 games from 1985-2024 (excluding 2020). BANE Score computed using only pre-tournament regular season statistics available before each game. 2003+ games use the full 7-factor model with box score data. 1985-2002 games use a simplified model with win margin, conference strength, and Pythagorean expectation. No lookahead bias - each game scored using only data available before tipoff.

Full backtest code and raw results are available for review. We believe in showing our work.

RELATED RESEARCH
MARCH MADNESS
5 Upset Signals to Watch in 2026
METHODOLOGY
How the BANE Score Works
LIVE DATA
Our Full Track Record
GET STARTED
Member Dashboard

See these models in action

Sharp analytics. Human judgment. Every pick reviewed before it goes live. Edge access $24.99/month.

Get Edge Access →View Track Record

For informational purposes only. Not gambling advice. Past performance does not guarantee future results. Must be 21+. If you or someone you know has a gambling problem, call 1-800-GAMBLER.