What HoFW Actually Measures

HoFW (Hall of Fame Worthiness) answers a single question for every player we track: how good is their Hall of Fame case, on merit, compared to the average real Hall of Famer at their position? A score of 100 means "as strong a case as the typical enshrined player at that position." There's no ceiling — a handful of inner-circle greats clear 200 — and there's no floor below zero, since plenty of good NFL careers never approach Hall of Fame territory.

That's a deliberately different question from "will the committee actually vote this player in, and when." We track both, separately and on purpose: HoFW and Tier measure merit; predicted_class_year and induction_chance measure the real-world prediction. A player can have a strong merit case (high HoFW) while we still predict them several years away from induction, simply because the queue of equally deserving peers ahead of them hasn't cleared yet.

What Goes Into the Score

The model weighs five categories of evidence: career production (counting stats relevant to the player's position), peak value (how dominant they were at their best, not just how long they played), postseason and All-Pro/Pro Bowl honors, championships, and draft pedigree. Production and peak value carry the most weight; honors and rings are real signal but are deliberately capped so they can't single-handedly inflate a score — more on that below.

Everything is era- and position-adjusted. A 1,500-yard rushing season means something different in a run-heavy 1978 offense than in a pass-heavy 2018 one, and a cornerback's case is built from different stats entirely than a guard's. The model fits a separate baseline for each position group rather than scoring everyone against one undifferentiated stat line.

The Honors-Without-Production Problem

One real failure mode we had to design around: Pro Bowl and All-Pro rosters reserve dedicated slots for special-teamers (long snappers, return specialists, gunners), so a player can rack up honors without the on-field production that should accompany a real Hall of Fame case. Left unchecked, a regression model can let a special-teams honor-collector outscore an actual Hall lock on raw inputs. We mitigate this with a value-floor discount that scales honors credit down when it isn't backed by real production — if a player's HoFW looks suspiciously high relative to their stat line, this is usually why, and it's a guardrail, not a bug.

Championships get a related but separate treatment: rather than feeding "won a ring" into the regression as a raw stat (which reopens the same loophole for bench players on dynastic rosters), we apply it as a fixed post-hoc bonus gated by its own steeper discount curve. A player has to show real production to get meaningful credit for having been on a winning roster.

Why Pre-1999 Careers Are Excluded, Not Corrected

Our underlying stats window starts in 1999. Players whose careers fall partly or fully before that — a number of our tracked real Hall of Famers — would have understated production if we used the visible portion of their career as-is. Rather than trying to patch that with a correction feature, we exclude every truncated-career row entirely when fitting the model and computing each position's 100-point anchor. A known-incomplete data point doesn't get to shape the formula for everyone else.

For display, already-enshrined Hall of Famers in this bucket get no fabricated score at all — they're shown as Enshrined with no number attached, because giving them a number we know is built on partial data would be worse than giving them none. Not-yet-inducted players with a partially observed career still get scored by the cleanly fit model, marked with a small asterisk to flag that their own input data is incomplete. We're backfilling real pre-1999 career totals year by year, sourced manually from Pro-Football-Reference, to shrink this group over time — never by scraping the site programmatically, only by hand-checking and pasting specific numbers, since automated scraping of PFR is against their terms.

How We Validate It

A model like this is only useful if it actually predicts what real committees do. We check this two ways. First, cross-validation: the model is trained and tested on held-out real induction outcomes, currently across thousands of tracked player-seasons including dozens of confirmed Hall of Famers. Second, and more concretely, we run the model against every real Hall of Fame induction class since 2005 and check whether the players who actually got in were ranked near the top of our predictions for that class at the time. That per-class validation is what tells us whether the model's predictions would have looked reasonable in real time, not just in aggregate statistics.

Both validation numbers shift slightly as we backfill more pre-1999 data and re-tune the model — we render them live from the underlying dataset rather than writing a fixed number into this page, since a hardcoded figure here would drift out of date the next time the model gets refit.

What This Model Is Still Working Through

This is an alpha product, and we're upfront about where it's incomplete. The biggest open item is the pre-1999 backfill described above — a meaningful chunk of real Hall of Famers still have partially observed careers in our data, and closing that gap is ongoing, manual work. Offensive line and special-teams positions (kickers, punters) also don't have a dedicated scoring pipeline yet, since the stat categories that matter for those positions don't map cleanly onto the skill-position framework the model is built around. We track the full, current list of known limitations directly from the dataset on the Hall of Fame methodology section, rather than maintaining a separate static list here that could fall out of sync.

See It in Action

The full database — every player we track, sortable and filterable by position, tier, and eligibility — lives on the Hall of Fame page. Free access shows the top 250 players by HoFW; a Pro membership unlocks the full database plus a Deep Dive breakdown on every player.