BOOG and the Hall of Fame, Part 1 (Introduction)

In 1889, Hall of Famer John Clarkson had one of the best pitching seasons ever by WAR. He won the Triple Crown with a 2.73 ERA, 284 strikeouts, and a win-loss record of 49-19. He also threw 620 innings, the most in the league and a total that represented more than half the combined workload of his team’s entire pitching staff. Although no one knew it at the time, his retroactively calculated ERA- was 64 and his FIP- was 82; both of those rates led the league among qualified pitchers. If the Cy Young award had existed in the National League at that time, he almost certainly would have been the recipient.
In 2025, Tarik Skubal had a dominant season. He didn’t repeat his Triple Crown achievement from 2024, but his ERA and strikeout numbers both improved from the prior year. His 195 1/3 innings were tied for fourth in the majors, and his 54 ERA- and 58 FIP- not only led among all qualified AL pitchers, but also significantly exceeded Clarkson’s 1889 performance. Skubal’s performance was more than good enough for him to win his second consecutive Cy Young award.
Clarkson’s pitching WAR was 10.9; Skubal’s was 6.6. Whose season did more to bolster his Hall of Fame case?
Baseball has a rich tradition of attempting to quantify as many aspects of player performance as possible, and the Hall of Fame is no exception. Not only are inductees’ plaques chock full of statistical facts and figures, but the process of determining Hall-worthiness has been analyzed by scores of sabermetric luminaries (as well as in countless bar room debates). To this day, a number of Bill James creations are listed on Baseball Reference pages, including the Black Ink Test, which compares the number of times a player led the league in various statistical categories to the average Hall of Fame career, and the Hall of Fame Monitor, which uses a point system to measure the likelihood that a player’s career will warrant a place in Cooperstown. It assigns points to players for reaching statistical benchmarks or achieving other notable criteria — such as winning MVP awards, making All-Star teams, being a regular starter on pennant-winning clubs — and then adds up the points. A player with a score of at least 100 is considered “a good possibility” to make the Hall of Fame, while a score of 130 or better suggests that player’s Hall of Fame case is “a virtual cinch.”
In more recent times, Jay Jaffe’s development of JAWS has become the gold standard for measuring a player’s statistical case for induction because it accounts for both longevity and peak performance. Specifically, it takes a player’s career WAR and averages it with the combined WAR of his best seven seasons. JAWS is widely cited in sabermetric circles and was likely a significant factor in the elections of Tim Raines and Larry Walker by the BBWAA on their 10th and final ballots (in 2017 and 2020, respectively) after years of Jaffe and others beating the drum for them.
In this article I present an alternative perspective on measuring Hall of Fame worthiness, one that takes the philosophical underpinnings of WAR as a concept and arrives at a unique and statistically valid approach to balancing career longevity and peak value, distinctly different from JAWS.
Let’s start with the obvious point: JAWS is a better Hall of Fame evaluator than career WAR. John Olerud has more career WAR than Todd Helton, but Helton has a higher JAWS because he had a superior peak, and thus, he’s in Cooperstown while Olerud is not. That said, despite all that JAWS has done to advance how voters measure Hall-worthiness, it still papers over what I believe to be a fundamental flaw in using WAR for this process: Replacement level is the wrong baseline.
In the context that WAR was developed to measure — a players’ value to his club — WAR makes perfect sense. You’re comparing to the hypothetical freely available replacement player who can be slotted in at virtually no marginal cost to the club’s resources. But when we talk about Hall of Fame worthiness… what is a “replacement level” player in that context? Do we declare players Hall-worthy because they are vastly superior to Quad-A cup-of-coffee guys? No, we declare them Hall-worthy because they are head and shoulders above the rest of the players we see every day. When I was sitting in the Camden Yards bleachers punching out All-Star ballots by hand with a pencil in the 1990s, I was seeing Ken Griffey Jr.’s name next to the other regular AL outfielders and comparing his play to theirs. (I was voting for Brady Anderson and B.J. Surhoff anyway, but I hope you’ll excuse some youthful homerism in this example.) We look at wRC+ and ERA-, stats that compare to major league average, and those play a significant role in determining who gets accolades and statues and numbers retired and plaques in upstate New York. I posit that for end-of-season awards and career retrospectives, Wins Above Average (WAA) is a superior metric for measuring how dominant a player performs relative to his league.
JAWS attempts to correct for this inherent problem with WAR in the context of total career evaluation by adding a peak component, and there are also versions of JAWS that adjust for changing pitcher workloads, measure relievers on a scale that’s closer to starting pitchers or position players, and include framing stats for catchers. But at the end of the day, all of those are essentially ad hoc adjustments to a metric that is still based on an underlying number that has limited applicability to this context.
Dan Szymborski wrote about using WAA in assessing Cooperstown candidacies earlier this year. He plugged in WAA to a JAWS-style calculator and called it altJAWS. I contend that a simpler adjustment (described below) is more appropriate for this purpose. Because WAA is already inherently measuring peak performance by only giving a player credit for being above average, adding an extra bonus for seasons that are among a player’s best is putting a hat on a hat. Raw unadjusted WAA is more flexible to the variety of shapes that an MLB playing career can take. A player whose peak output is concentrated into four or five years, as well as a player who maintains a high level of performance for a decade, is served imperfectly by the seven-year peak component of a JAWS-style system. A simple WAA accumulation treats each season individually and needs no adjustments to determine what counts as “peak” and what counts as “compiling”.
So, can we just calculate a player’s career WAA total and call it a day? Not quite. We don’t want to unduly penalize a player for hanging around long after his peak production has ended, even if that results in some seasons of play where he is below average. Hanging on for a few seasons in post-peak decline in order to hit some counting stat milestones that may be personally meaningful to a player, even as his overall rate of production dips below average, should not result in a ding to his WAA-based statistical case. Therefore, I make an adjustment to zero out any negative WAA seasons. You can never have banked value taken away from you, just like a fan’s memory of seeing their favorite player in his prime can never be dulled by the modern reminder of his lessened capacity to play the game. This approach is favored by no less a sabermetric thought leader than Tom Tango, who said on social media, “The worst thing we do with HOF evaluations is using WAR (Wins Above Replacement) instead of positive-WAA” and has blogged about a quick and dirty Hall of Fame estimator based on similar concepts.
In the long tradition of semi-tortured backronyms for sabermetric inventions, and with love and respect to Tango, I suggest we refer to this metric as BOOG Score (Better than Other Ordinary Guys) instead of positive-WAA.
Perhaps readers take issue with “major league average” as the baseline level for Hall of Fame analysis. Maybe you think All-Star performance is table stakes for Cooperstown hopefuls, and thus, there should be a higher level of production needed to boost a Hall of Fame résumé. Maybe you think there’s no ability like availability, and we need to give more credit to guys who were able to stay on the field for a long time even if they weren’t elite players. Choosing a different threshold as the baseline for this type of metric is absolutely a valid approach. The choice of zero point is somewhat arbitrary, within reasonable bounds; it simply reflects your opinion of “how good is good enough” for various baseball honorifics. As Tango muses in the blog post linked above, “Maybe you would use WAR, and zero-out below that level? Doing so helps pitchers with longevity (like Don Sutton). Maybe you would use a higher standard and zero-out that level? Doing so helps pitchers with shorter careers (like Koufax). There’s no right or wrong answer here.”
For the purpose of introducing this metric, I chose to use a baseline that is relatively easy to calculate from public FanGraphs data, which gave me a high degree of confidence that my implementation was internally consistent and properly calculated. Other baseball minds may come to different conclusions and suggest alternative “zero points.” BOOG score is a conversation starter, not a conversation ender.
For those interested in the nitty gritty details of calculating BOOG score, the next paragraph is for you. Others may feel free to skip down to the tables below.
Dan Szymborski’s article noted that FanGraphs does not provide an easily referenced WAA number on its leaderboards, so he approximated with a value of 2.0 WAR per 200 innings pitched or 650 plate appearances. However, there is publicly accessible FanGraphs data to allow a more accurate calculation with some intermediate steps. Batter replacement level is calculated at 20 runs per 600 PA, which can be subtracted from a player’s RAR (Runs Above Replacement) total and then converted to wins based on season-specific run environment data. For pitchers, I calculated the league total of WAR per IP for every league-season that’s publicly available at FanGraphs, then multiplied each pitcher-season’s innings total by that ratio to find how much a pitcher’s single-season WAR total should be debited to convert to WAA. By definition, this results in a league-wide total of 0 WAA per season as intended. I also reproduced Dan’s JAWS calculations with FanGraphs data, but unlike his results, I did not exclude 19th century pitching. This makes the difference in JAWS vs. BOOG Score for pitchers even more apparent. Like Dan, I chose to use fJAWS for this comparison instead of the more commonly cited “standard” version of JAWS (which uses Baseball Reference data), so that the differences between the lists are only due to the WAA vs. WAR methodology, sidestepping any underlying “disagreement” between Baseball Reference’s WAR model and FanGraphs’.
Below are the top 50 hitters and pitchers by both fJAWS and BOOG Score, and how they compare. Active players are evaluated through the end of the 2025 season.
| # | Name | fWAR | fWAR7 | fJAWS | BOOG | BOOG rk | Rank Delta |
|---|---|---|---|---|---|---|---|
| 1 | Babe Ruth | 179.4 | 89.1 | 134.3 | 135.3 | 1 | 0 |
| 2 | Barry Bonds | 164.4 | 77.2 | 120.8 | 126.2 | 2 | 0 |
| 3 | Willie Mays | 149.8 | 70.6 | 110.2 | 112.0 | 3 | 0 |
| 4 | Ty Cobb | 149.0 | 69.2 | 109.1 | 108.5 | 4 | 0 |
| 5 | Honus Wagner | 138.1 | 68.6 | 103.3 | 101.8 | 5 | 0 |
| 6 | Rogers Hornsby | 129.1 | 75.5 | 102.3 | 100.8 | 7 | 1 |
| 7 | Ted Williams | 129.8 | 72.4 | 101.1 | 100.9 | 6 | -1 |
| 8 | Henry Aaron | 136.3 | 56.4 | 96.4 | 96.3 | 8 | 0 |
| 9 | Tris Speaker | 130.2 | 61.4 | 95.8 | 94.0 | 9 | 0 |
| 10 | Stan Musial | 126.4 | 64.3 | 95.3 | 88.5 | 10 | 0 |
| 11 | Lou Gehrig | 115.9 | 69.7 | 92.8 | 86.8 | 11 | 0 |
| 12 | Eddie Collins | 120.1 | 62.7 | 91.4 | 83.0 | 12 | 0 |
| 13 | Alex Rodriguez | 113.6 | 64.5 | 89.0 | 79.6 | 14 | 1 |
| 14 | Mickey Mantle | 112.3 | 65.5 | 88.9 | 81.6 | 13 | -1 |
| 15 | Mike Schmidt | 106.5 | 59.1 | 82.8 | 76.9 | 16 | 1 |
| 16 | Mel Ott | 110.1 | 54.4 | 82.3 | 77.2 | 15 | -1 |
| 17 | Jimmie Foxx | 101.6 | 61.9 | 81.7 | 73.1 | 18 | 1 |
| 18 | Rickey Henderson | 106.3 | 56.2 | 81.3 | 70.4 | 19 | 1 |
| 19 | Nap Lajoie | 102.2 | 58.0 | 80.1 | 73.8 | 17 | -2 |
| 20 | Joe Morgan | 98.8 | 58.5 | 78.7 | 64.4 | 23 | 3 |
| 21 | Frank Robinson | 104.0 | 50.8 | 77.4 | 67.8 | 20 | -1 |
| 22 | Eddie Mathews | 96.1 | 55.3 | 75.7 | 65.3 | 21 | -1 |
| 23 | Mike Trout | 87.2 | 63.5 | 75.3 | 64.9 | 22 | -1 |
| 24 | Albert Pujols | 89.8 | 57.9 | 73.8 | 61.2 | 24 | 0 |
| 25 | Carl Yastrzemski | 94.8 | 51.7 | 73.3 | 56.2 | 31 | 6 |
| 26 | Cal Ripken Jr. | 92.5 | 53.2 | 72.8 | 56.8 | 30 | 4 |
| 27 | Wade Boggs | 88.4 | 56.1 | 72.3 | 57.3 | 27 | 0 |
| 28 | Joe DiMaggio | 82.7 | 54.4 | 68.5 | 59.2 | 26 | -2 |
| 29 | Roger Connor | 86.2 | 49.5 | 67.9 | 59.4 | 25 | -4 |
| 30 | Al Kaline | 88.9 | 46.7 | 67.8 | 55.4 | 33 | 3 |
| 31 | George Brett | 84.6 | 50.8 | 67.7 | 55.4 | 32 | 1 |
| 32 | Cap Anson | 91.0 | 40.6 | 65.8 | 57.2 | 28 | -4 |
| 33 | Chipper Jones | 84.6 | 46.8 | 65.7 | 52.5 | 36 | 3 |
| 34 | Ken Griffey Jr. | 77.7 | 52.9 | 65.3 | 54.3 | 34 | 0 |
| 35 | Jeff Bagwell | 80.2 | 49.5 | 64.9 | 52.0 | 38 | 3 |
| 36 | George Davis | 84.6 | 44.3 | 64.5 | 54.2 | 35 | -1 |
| 37 | Charlie Gehringer | 78.6 | 49.4 | 64.0 | 49.6 | 41 | 4 |
| 38 | Roberto Clemente | 80.6 | 47.1 | 63.9 | 52.3 | 37 | -1 |
| 39 | Adrian Beltré | 83.5 | 43.9 | 63.7 | 47.9 | 44 | 5 |
| 40 | Dan Brouthers | 79.5 | 47.8 | 63.6 | 57.0 | 29 | -11 |
| 41 | Ed Delahanty | 73.7 | 51.7 | 62.7 | 51.6 | 39 | -2 |
| 42 | Brooks Robinson | 80.2 | 45.1 | 62.6 | 48.5 | 43 | 1 |
| 43 | Ron Santo | 70.9 | 52.7 | 61.8 | 46.9 | 46 | 3 |
| 44 | Pete Rose | 80.2 | 43.4 | 61.8 | 42.8 | 69 | 25 |
| 45 | Arky Vaughan | 72.4 | 50.2 | 61.3 | 48.9 | 42 | -3 |
| 46 | Johnny Bench | 74.8 | 47.4 | 61.1 | 50.5 | 40 | -6 |
| 47 | Frankie Frisch | 74.8 | 46.9 | 60.8 | 45.7 | 50 | 3 |
| 48 | Paul Waner | 74.8 | 44.4 | 59.6 | 43.8 | 60 | 12 |
| 49 | Frank Thomas | 72.1 | 46.8 | 59.4 | 43.7 | 61 | 12 |
| 50 | Bill Dahlen | 77.6 | 40.9 | 59.2 | 46.5 | 47 | -3 |
| # | Name | BOOG | fJAWS | fJAWS Rk | Delta |
|---|---|---|---|---|---|
| 1 | Babe Ruth | 135.3 | 134.3 | 1 | 0 |
| 2 | Barry Bonds | 126.2 | 120.8 | 2 | 0 |
| 3 | Willie Mays | 112.0 | 110.2 | 3 | 0 |
| 4 | Ty Cobb | 108.5 | 109.1 | 4 | 0 |
| 5 | Honus Wagner | 101.8 | 103.3 | 5 | 0 |
| 6 | Ted Williams | 100.9 | 101.1 | 7 | 1 |
| 7 | Rogers Hornsby | 100.8 | 102.3 | 6 | -1 |
| 8 | Henry Aaron | 96.3 | 96.4 | 8 | 0 |
| 9 | Tris Speaker | 94.0 | 95.8 | 9 | 0 |
| 10 | Stan Musial | 88.5 | 95.3 | 10 | 0 |
| 11 | Lou Gehrig | 86.8 | 92.8 | 11 | 0 |
| 12 | Eddie Collins | 83.0 | 91.4 | 12 | 0 |
| 13 | Mickey Mantle | 81.6 | 88.9 | 14 | 1 |
| 14 | Alex Rodriguez | 79.6 | 89.0 | 13 | -1 |
| 15 | Mel Ott | 77.2 | 82.3 | 16 | 1 |
| 16 | Mike Schmidt | 76.9 | 82.8 | 15 | -1 |
| 17 | Nap Lajoie | 73.8 | 80.1 | 19 | 2 |
| 18 | Jimmie Foxx | 73.1 | 81.7 | 17 | -1 |
| 19 | Rickey Henderson | 70.4 | 81.3 | 18 | -1 |
| 20 | Frank Robinson | 67.8 | 77.4 | 21 | 1 |
| 21 | Eddie Mathews | 65.3 | 75.7 | 22 | 1 |
| 22 | Mike Trout | 64.9 | 75.3 | 23 | 1 |
| 23 | Joe Morgan | 64.4 | 78.7 | 20 | -3 |
| 24 | Albert Pujols | 61.2 | 73.8 | 24 | 0 |
| 25 | Roger Connor | 59.4 | 67.9 | 29 | 4 |
| 26 | Joe DiMaggio | 59.2 | 68.5 | 28 | 2 |
| 27 | Wade Boggs | 57.3 | 72.3 | 27 | 0 |
| 28 | Cap Anson | 57.2 | 65.8 | 32 | 4 |
| 29 | Dan Brouthers | 57.0 | 63.6 | 40 | 11 |
| 30 | Cal Ripken Jr. | 56.8 | 72.8 | 26 | -4 |
| 31 | Carl Yastrzemski | 56.2 | 73.3 | 25 | -6 |
| 32 | George Brett | 55.4 | 67.7 | 31 | -1 |
| 33 | Al Kaline | 55.4 | 67.8 | 30 | -3 |
| 34 | Ken Griffey Jr. | 54.3 | 65.3 | 34 | 0 |
| 35 | George Davis | 54.2 | 64.5 | 36 | 1 |
| 36 | Chipper Jones | 52.5 | 65.7 | 33 | -3 |
| 37 | Roberto Clemente | 52.3 | 63.9 | 38 | 1 |
| 38 | Jeff Bagwell | 52.0 | 64.9 | 35 | -3 |
| 39 | Ed Delahanty | 51.6 | 62.7 | 41 | 2 |
| 40 | Johnny Bench | 50.5 | 61.1 | 46 | 6 |
| 41 | Charlie Gehringer | 49.6 | 64.0 | 37 | -4 |
| 42 | Arky Vaughan | 48.9 | 61.3 | 45 | 3 |
| 43 | Brooks Robinson | 48.5 | 62.6 | 42 | -1 |
| 44 | Adrian Beltré | 47.9 | 63.7 | 39 | -5 |
| 45 | Billy Hamilton | 47.4 | 58.0 | 55 | 10 |
| 46 | Ron Santo | 46.9 | 61.8 | 43 | -3 |
| 47 | Bill Dahlen | 46.5 | 59.2 | 50 | 3 |
| 48 | Aaron Judge | 46.3 | 59.0 | 51 | 3 |
| 49 | Johnny Mize | 45.9 | 57.3 | 58 | 9 |
| 50 | Frankie Frisch | 45.7 | 60.8 | 47 | -3 |
| # | Name | fWAR | fWAR7 | fJAWS | BOOG | BOOG rk | Rank Delta |
|---|---|---|---|---|---|---|---|
| 1 | Roger Clemens | 134.3 | 60.8 | 97.6 | 84.5 | 1 | 0 |
| 2 | Walter Johnson | 125.9 | 63.7 | 94.8 | 60.7 | 4 | 2 |
| 3 | Cy Young | 132.2 | 55.9 | 94.1 | 50.8 | 7 | 4 |
| 4 | Greg Maddux | 118.4 | 54.2 | 86.3 | 64.2 | 3 | -1 |
| 5 | Randy Johnson | 108.5 | 62.7 | 85.6 | 66.3 | 2 | -3 |
| 6 | Steve Carlton | 102.7 | 54.0 | 78.4 | 49.5 | 11 | 5 |
| 7 | Christy Mathewson | 96.5 | 57.9 | 77.2 | 44.6 | 17 | 10 |
| 8 | Bert Blyleven | 101.0 | 51.6 | 76.3 | 50.5 | 8 | 0 |
| 9 | Pete Alexander | 97.3 | 53.6 | 75.4 | 41.0 | 22 | 13 |
| 10 | Nolan Ryan | 104.6 | 45.2 | 74.9 | 50.8 | 6 | -4 |
| 11 | Tom Seaver | 96.2 | 53.5 | 74.9 | 46.2 | 14 | 3 |
| 12 | Bob Gibson | 90.1 | 56.8 | 73.4 | 49.5 | 10 | -2 |
| 13 | Gaylord Perry | 97.4 | 45.4 | 71.4 | 41.6 | 20 | 7 |
| 14 | Pedro Martínez | 82.5 | 55.3 | 68.9 | 54.0 | 5 | -9 |
| 15 | Fergie Jenkins | 81.7 | 52.6 | 67.2 | 35.5 | 27 | 12 |
| 16 | Lefty Grove | 82.9 | 48.6 | 65.7 | 41.6 | 21 | 5 |
| 17 | Justin Verlander | 84.0 | 47.3 | 65.6 | 47.7 | 12 | -5 |
| 18 | Clayton Kershaw | 80.7 | 49.9 | 65.3 | 50.4 | 9 | -9 |
| 19 | Curt Schilling | 79.0 | 49.0 | 64.0 | 44.9 | 16 | -3 |
| 20 | Kid Nichols | 78.4 | 49.3 | 63.8 | 22.4 | 72 | 52 |
| 21 | John Smoltz | 82.8 | 44.5 | 63.6 | 46.3 | 13 | -8 |
| 22 | Kevin Brown | 76.1 | 48.5 | 62.3 | 42.5 | 19 | -3 |
| 23 | Don Sutton | 84.3 | 38.9 | 61.6 | 29.7 | 36 | 13 |
| 24 | Robin Roberts | 77.3 | 45.6 | 61.5 | 28.1 | 43 | 19 |
| 25 | Mike Mussina | 81.3 | 41.6 | 61.4 | 45.3 | 15 | -10 |
| 26 | John Clarkson | 67.2 | 54.7 | 61.0 | 16.0 | 148 | 122 |
| 27 | Warren Spahn | 81.4 | 40.4 | 60.9 | 27.4 | 48 | 21 |
| 28 | Tim Keefe | 70.5 | 49.8 | 60.1 | 14.2 | 178 | 150 |
| 29 | Phil Niekro | 77.1 | 43.0 | 60.1 | 25.5 | 60 | 31 |
| 30 | Max Scherzer | 74.5 | 45.6 | 60.0 | 43.7 | 18 | -12 |
| 31 | Jim Kaat | 75.7 | 40.5 | 58.1 | 31.4 | 32 | 1 |
| 32 | Zack Greinke | 72.3 | 41.4 | 56.9 | 37.6 | 24 | -8 |
| 33 | Tommy John | 79.1 | 33.7 | 56.4 | 30.3 | 34 | 1 |
| 34 | Roy Halladay | 64.3 | 46.7 | 55.5 | 39.5 | 23 | -11 |
| 35 | Hal Newhouser | 61.8 | 49.0 | 55.4 | 29.5 | 37 | 2 |
| 36 | Eddie Plank | 71.5 | 39.3 | 55.4 | 22.9 | 69 | 33 |
| 37 | Jim Bunning | 66.2 | 44.0 | 55.1 | 27.4 | 49 | 12 |
| 38 | Tom Glavine | 73.1 | 36.8 | 54.9 | 28.0 | 45 | 7 |
| 39 | Don Drysdale | 65.7 | 43.8 | 54.7 | 28.3 | 41 | 2 |
| 40 | Pud Galvin | 62.9 | 45.5 | 54.2 | 5.8 | 617 | 577 |
| 41 | Bullet Rogan | 59.7 | 48.3 | 54.0 | 37.4 | 25 | -16 |
| 42 | Jim Whitney | 55.4 | 52.3 | 53.9 | 17.0 | 131 | 89 |
| 43 | CC Sabathia | 67.2 | 40.5 | 53.9 | 32.2 | 31 | -12 |
| 44 | Rick Reuschel | 69.4 | 38.0 | 53.7 | 31.0 | 33 | -11 |
| 45 | Juan Marichal | 61.7 | 44.6 | 53.1 | 24.3 | 63 | 18 |
| 46 | Dwight Gooden | 60.6 | 45.6 | 53.1 | 33.5 | 29 | -17 |
| 47 | Rube Waddell | 58.3 | 46.6 | 52.4 | 26.2 | 54 | 7 |
| 48 | Mickey Lolich | 64.0 | 40.8 | 52.4 | 26.0 | 55 | 7 |
| 49 | Andy Pettitte | 67.9 | 35.8 | 51.9 | 33.7 | 28 | -21 |
| 50 | Red Ruffing | 68.3 | 34.5 | 51.4 | 22.1 | 73 | 23 |
| # | Name | BOOG | fJAWS | fJAWS Rk | Rank Delta |
|---|---|---|---|---|---|
| 1 | Roger Clemens | 84.5 | 97.6 | 1 | 0 |
| 2 | Randy Johnson | 66.3 | 85.6 | 5 | 3 |
| 3 | Greg Maddux | 64.2 | 86.3 | 4 | 1 |
| 4 | Walter Johnson | 60.7 | 94.8 | 2 | -2 |
| 5 | Pedro Martínez | 54.0 | 68.9 | 14 | 9 |
| 6 | Nolan Ryan | 50.8 | 74.9 | 10 | 4 |
| 7 | Cy Young | 50.8 | 94.1 | 3 | -4 |
| 8 | Bert Blyleven | 50.5 | 76.3 | 8 | 0 |
| 9 | Clayton Kershaw | 50.4 | 65.3 | 18 | 9 |
| 10 | Bob Gibson | 49.5 | 73.4 | 12 | 2 |
| 11 | Steve Carlton | 49.5 | 78.4 | 6 | -5 |
| 12 | Justin Verlander | 47.7 | 65.6 | 17 | 5 |
| 13 | John Smoltz | 46.3 | 63.6 | 21 | 8 |
| 14 | Tom Seaver | 46.2 | 74.9 | 11 | -3 |
| 15 | Mike Mussina | 45.3 | 61.4 | 25 | 10 |
| 16 | Curt Schilling | 44.9 | 64.0 | 19 | 3 |
| 17 | Christy Mathewson | 44.6 | 77.2 | 7 | -10 |
| 18 | Max Scherzer | 43.7 | 60.0 | 30 | 12 |
| 19 | Kevin Brown | 42.5 | 62.3 | 22 | 3 |
| 20 | Gaylord Perry | 41.6 | 71.4 | 13 | -7 |
| 21 | Lefty Grove | 41.6 | 65.7 | 16 | -5 |
| 22 | Pete Alexander | 41.0 | 75.4 | 9 | -13 |
| 23 | Roy Halladay | 39.5 | 55.5 | 34 | 11 |
| 24 | Zack Greinke | 37.6 | 56.9 | 32 | 8 |
| 25 | Bullet Rogan | 37.4 | 54.0 | 41 | 16 |
| 26 | Chris Sale | 36.2 | 49.2 | 55 | 29 |
| 27 | Fergie Jenkins | 35.5 | 67.2 | 15 | -12 |
| 28 | Andy Pettitte | 33.7 | 51.9 | 49 | 21 |
| 29 | Dwight Gooden | 33.5 | 53.1 | 46 | 17 |
| 30 | Jacob deGrom | 32.7 | 44.3 | 84 | 54 |
| 31 | CC Sabathia | 32.2 | 53.9 | 43 | 12 |
| 32 | Jim Kaat | 31.4 | 58.1 | 31 | -1 |
| 33 | Rick Reuschel | 31.0 | 53.7 | 44 | 11 |
| 34 | Tommy John | 30.3 | 56.4 | 33 | -1 |
| 35 | Dennis Eckersley | 30.1 | 45.6 | 75 | 40 |
| 36 | Don Sutton | 29.7 | 61.6 | 23 | -13 |
| 37 | Hal Newhouser | 29.5 | 55.4 | 35 | -2 |
| 38 | Bret Saberhagen | 29.4 | 46.3 | 68 | 30 |
| 39 | Félix Hernández | 29.3 | 47.5 | 62 | 23 |
| 40 | Javier Vazquez | 28.7 | 47.4 | 63 | 23 |
| 41 | Don Drysdale | 28.3 | 54.7 | 39 | -2 |
| 42 | David Cone | 28.2 | 46.7 | 66 | 24 |
| 43 | Robin Roberts | 28.1 | 61.5 | 24 | -19 |
| 44 | Cliff Lee | 28.1 | 45.7 | 74 | 30 |
| 45 | Tom Glavine | 28.0 | 54.9 | 38 | -7 |
| 46 | Roy Oswalt | 28.0 | 45.2 | 76 | 30 |
| 47 | Camilo Pascual | 27.7 | 48.1 | 58 | 11 |
| 48 | Warren Spahn | 27.4 | 60.9 | 27 | -21 |
| 49 | Jim Bunning | 27.4 | 55.1 | 37 | -12 |
| 50 | Sandy Koufax | 26.7 | 48.8 | 56 | 6 |
Let’s look at an illustrative example from this list to demonstrate the impact of zeroing out negative WAA seasons: Albert Pujols. He’s listed as the 24th-best position player in MLB history by both BOOG score and fJAWS; if you buy what fJAWS is selling, then the BOOG score ranking must also feel sensible. However, if we instead counted up his career WAA total without excluding negative seasons, his career total would be 18% lower and he’d be ranked 42nd. On the other hand, if you chose to exclude negative WAR seasons from his fJAWS total, it would change by less than 2% — not enough to affect his placement on the list. It’s simply a lot harder for an MLB player to be significantly below replacement level than to be significantly below average. The zeroing-out step is relatively inconsequential for JAWS, but a critical part of the process for BOOG.
Taking a step back, two things stand out about the hitter and pitcher comparisons. First, the batting lists are much more similar to each other than the pitching lists are. This reflects the fact that pitcher usage patterns have changed much more dramatically across MLB history than batter usage patterns have, and BOOG is less influenced by differences in volume than JAWS. It’s a natural consequence of the fact that BOOG scores are, at first pass, all about peak – at least when it comes to individual-season totals. You can’t rack up a high career BOOG total without sticking around in the league for a long time, but you have to do that while continuing to be a notably above-average player. High quality major league batters don’t see as much variability in their playing time from year to year as pitchers do, especially when measured on the large scale of decades and centuries. The fact that JAWS and BOOG agree in broad strokes about the top position players in MLB history is a helpful sanity check that the underlying WAA methodology arrives at roughly the same conclusions as JAWS, at least when presented with player-seasons that have more consistency in terms of total time on the field.
Second, and relatedly, the JAWS pitcher list is much more dominated by players from the first 70 years of MLB history (1871-1940) than the BOOG list is. Again, this is due to the different choice of baseline in the underlying WAR vs. WAA numbers. By less heavily rewarding high-WAR seasons from old-school pitchers whose value to their team was more dependent on volume of innings thrown, BOOG scores consider the “peak vs. longevity” divide on an individual-season level. JAWS rates Christy Mathewson (~320 innings pitched per year) as the seventh-greatest pitcher of all time and Pedro Martínez (~200 innings pitched per year) as the 14th best; BOOG rates them as 17th and fifth, respectively. Koufax is not among the top 50 pitchers of all time according to JAWS; by BOOG score, he is. We will dive further into the implications of this “anti-era effect” as it relates to pitchers specifically in a later article.
Using WAA allows us to more naturally contextualize changing pitcher workloads across different eras of baseball, and creates a level playing field where performances from 100 years apart can be stacked up against each other with fewer adjustments needed to account for structural changes in the game. When evaluating batters, a WAA-based metric spits out rankings that do not drastically differ from JAWS, which serves as an informal proof of concept that the underlying methodology is more or less sound. These two facts combine to make BOOG scores a powerful method of analysis for providing context to discussions about the Hall of Fame.
Returning to the question from the beginning of this article: John Clarkson’s 1889 season was worth 4.7 WAA as a pitcher. Tarik Skubal’s 2025 season was worth 4.7 WAA as a pitcher. So as pitchers, they both added the same amount to their BOOG totals in those respective campaigns. I think that feels about right!
I’ve set up a simple web page where you can query for the BOOG score of any player in MLB history who reached at least 10.0 BOOG in his career (5.0 minimum for relievers), as well as the BOOG of all active players who are above 0. It displays their career accumulation on a year-by-year basis, and you can also compare their totals to the median Hall of Fame player (medians defined for the groups of all starting pitchers, all relievers, or all position players).
In later articles, I will dig deeper into Hall of Fame BOOG thresholds at specific positions, highlight specific cases of players considered by BOOG to be overlooked, and introduce a BOOG-based Hall of Fame induction estimator.
Michael is a software engineer and lifelong Orioles fan living in Maryland. He has been an occasional guest and StatBlast correspondent for Effectively Wild dating back to 2018.
Thanks for doing this! Awesome work.
I especially love BOOG graphs. I hope you do a follow up on a selection of the borderline guys. Interesting view of guys like Lofton, Edmonds.
So far my favorite has been the batch of CF guys who get talked about: Lofton, Edmonds, Minoso, Andruw, Murphy, Dawson, Beltran, Cutch. Those Hof probability graphs tell the story so well.
Thanks! Yes, that is the plan for future articles.