Rating Accuracy & Calibration

How well do our ELO ratings predict match outcomes between established athletes?

65.7%
Prediction Accuracy
3,912
Established Matches
10+
Min Career Matches

We measure accuracy on established players only — both athletes must have 10+ career matches. This excludes first-time competitors whose ratings haven't stabilized yet, giving an honest picture of prediction quality.

Predicted vs Actual Win Rate

For each ELO gap range, we show how often the higher-rated player actually wins. "Expected" is what ELO predicts; "Actual" is what happened.

ELO GapMatchesExpectedActualGap
094153.7%50.6%-3.1%
5089260.5%61.3%+0.8%
10070667.1%66.3%-0.8%
15046973.0%74.8%+1.8%
20037978.4%76.0%-2.4%
25023382.8%79.8%-3.0%
30015186.5%84.8%-1.8%
3507889.5%91.0%+1.5%
4004191.9%90.2%-1.6%
4502293.9%90.9%-3.0%

Full Outcome Distribution

Every match falls into one of 7 categories. From the favored player's perspective. The "Score" column factors in partial credit — submission=1.0, points=0.9, decision=0.8, draw=0.5 — and tracks closely with ELO's expected win rate, showing the ratings are well calibrated across all match outcomes, not just wins and losses.

ELO GapNFav SubFav PtsFav DecDrawUnd DecUnd PtsUnd SubScoreExpectedGap
0941214155107628514717153.3%53.7%-0.4%
50892239192116538010310962.5%60.5%+2.0%
100706241146814952667167.8%67.1%+0.7%
150469180108633725253175.4%73.0%+2.4%
20037915896342722212176.9%78.4%-1.5%
2502331085424141761080.2%82.8%-2.6%
3001518035131342485.7%86.5%-0.8%
35078451610302288.6%89.5%-0.9%
400412773400092.0%91.9%+0.1%
450221631200093.2%93.9%-0.7%

How to read: "Score" is the implied rating using our margin scoring (submission=1.0, points=0.9, decision=0.8, draw=0.5). This is what actually updates ratings. When Score ≈ Expected, the rating system is well-calibrated.

Data from 30,054 matches across 885 events (1998-2026). Ratings use standard ELO with margin-of-victory scoring. Full methodology