Archive for Research

When Do Stars Become Scrubs?

Baseball is a game driven by stars. They create the most exciting highlight reels that captivate audiences and leave us all in awe. However, eventually every star player loses their battle with Father Time. The purpose of this research was to try and determine when a star player’s production declines to the point where they can become easily replaceable. I decided to use a process called survival analysis to determine when this event occurs.

Methodology

Survival analysis attempts to determine the probability of when an event will occur. In any survival analysis problem, you need to determine three things. You need to determine the requirements for your population, the variables to predict the time of event, and the event.

For this problem, I decided that I would include any player that had their first season of 4 WAR or higher between 1920 and 1999 in my population. I decided to use for my variables: the age when they recorded their first star season, body mass index, offensive runs above average per 150 games, and defensive runs above average per 150 games as my variables. The event I chose to predict was when the player would have his first season below 1 WAR following their star season. The cutoffs for determining stars and scrubs were fairly arbitrary, but I chose these cutoffs because the FanGraphs glossary loosely defines an All-Star season as 4-5 WAR and a scrub season as 0-1 WAR.

Determining the variables was much more difficult. I wanted to pick variables that would represent a player’s performance, age, and overall health. The age was simple enough to find, but it was difficult to find any injury history for players so I decided to calculate a player’s BMI from their listed height and weight. Obviously this isn’t a perfect representation, because a player’s weight is constantly changing throughout his career, but it’s the best that I could do given my limited resources. In order to limit my performance variables, I thought it was best to settle for the offensive runs and defensive runs component of WAR. However, since these are accumulating statistics, I had to recreate them as rate statistics in order to avoid creating correlation issues with the age variable in the model. I would have liked to use more offensive variables, but I feared that adding more inputs would make the model too convoluted and affect the accuracy of the player predictions. Alright, that’s enough preparation; let’s dive into the actual data.

Survival Rate Data

As a jumping off point, I’ll start by presenting a table of the survival rates for my population. Each season indicates the percentage of players from the original population that had not yet recorded a scrub season.

 

Season 1 2 3 4 5 6 7 8 9 10
Survival Function 87.62% 74.28% 65.20% 54.88% 45.80% 39.06% 32.32% 26.96% 22.15% 17.19%
Season 11 12 13 14 15 16 17 18 19 20
Survival Function 13.76% 10.73% 7.57% 5.50% 3.44% 2.06% 1.24% 0.69% 0.28% 0.00%

Let’s make some quick observations. The data shows that no star player has gone more than 20 seasons without recording a season below 1 WAR. It also appears that the survival function decays exponentially.  I also found it interesting that over 50% of stars turn into scrubs by their fifth season and that only 17% of star players survive 10 years in the majors before they register a scrub season. Looking at this data really helps to appreciate how rare it is when players like Derek Jeter and Adrian Beltre perform at a consistent level on a year to year basis.

Hazard Rate Data

Next, we will look at the hazard rate of the players in the population. One of the purposes of examining the hazard rate is to see how the rate of failure changes in a population over time. To find the hazard rate for each time period, you divide the amount of events recorded during a time period by the amount of players that have not yet registered a scrub season. Below is the following calculation for each time period in table format.

Season 1 2 3 4 5 6 7 8 9 10
Hazard Function 12.38% 15.23% 12.22% 15.82% 16.54% 14.71% 17.25% 16.60% 17.86% 22.36%
Season 11 12 13 14 15 16 17 18 19 20
Hazard Function 20.00% 22.00% 29.49% 27.27% 37.50% 40.00% 40.00% 44.44% 60.00% 100.00%

As you can see by the table above, the hazard rate generally increases with each passing season. This makes sense, because as players age, their skill level decreases and their odds of registering a scrub season will increase. However, the hazard rates are fairly constant for the first ten years and then rapidly increase from then on. I’m rather surprised that the hazard rates stayed so consistent for the first ten or so years. I would have guessed that the hazard function would have increased much more rapidly with each passing season.

Determining the Model

It is important to identify the trend of the hazard function, because it helps determine which distribution to use when creating a parametric model. If the hazard rate increases exponentially, you are supposed to use a Weibull distribution. If the hazard rate is constant, you are supposed to use an exponential distribution. Since the hazard function was increasing, I originally attempted to the use the Weibull distribution for the model but I found that the model was predicting too many players to fail in the first few seasons, so I decided to try an exponential distribution instead.

I found that the exponential distribution model was more accurate at predicting survival rates in the first ten years, but severely under predicted the amount of players that would record a scrub season after ten years. I decided to use the exponential distribution, because I believe that it would be far more useful to accurately predict the first ten years instead of the last ten years, since only 17% of players survive ten years. I also believe that any franchise would be thrilled to obtain ten years of stardom from a player and anymore production is just an added bonus.

Survival Rate Estimates

Below is a table of each star player from 2000 to 2014 with the year they entered the population, the time until they became a scrub, every variable included in the model and their predicted survival rate for each of their first ten seasons since becoming a star.

Year Entered Name Time of Event Age BMI Off Def Season 1 Season 2 Season 3 Season 4 Season 5 Season 6 Season 7 Season 8 Season 9 Season 10
2000 Bobby Higginson 2 29 25.1 14.7 -11.0 77.83% 60.58% 47.15% 36.70% 28.56% 22.23% 17.30% 13.47% 10.48% 8.16%
2000 Darin Erstad 6 26 25.0 8.2 8.0 83.94% 70.46% 59.14% 49.64% 41.67% 34.98% 29.36% 24.64% 20.68% 17.36%
2000 Jorge Posada 8 28 27.6 11.2 7.1 80.94% 65.51% 53.02% 42.91% 34.73% 28.11% 22.75% 18.41% 14.90% 12.06%
2000 Jose Vidro 4 25 24.4 3.1 -4.9 83.43% 69.61% 58.07% 48.45% 40.42% 33.72% 28.14% 23.47% 19.58% 16.34%
2000 Phil Nevin 2 29 23.1 5.1 -2.8 76.52% 58.55% 44.80% 34.28% 26.23% 20.07% 15.36% 11.75% 8.99% 6.88%
2000 Richard Hidalgo 2 25 27.5 19.6 9.5 87.32% 76.24% 66.57% 58.13% 50.75% 44.32% 38.69% 33.79% 29.50% 25.76%
2000 Shannon Stewart 5 26 23.7 9.9 -3.2 83.26% 69.33% 57.73% 48.07% 40.02% 33.32% 27.75% 23.10% 19.24% 16.02%
2000 Todd Helton 8 26 28.2 25.4 -1.4 86.03% 74.01% 63.67% 54.77% 47.12% 40.53% 34.87% 30.00% 25.81% 22.20%
2000 Troy Glaus 3 23 26.1 12.2 9.0 88.69% 78.66% 69.76% 61.87% 54.87% 48.66% 43.16% 38.28% 33.95% 30.11%
2001 Albert Pujols 12 21 28.7 47.2 0.8 93.62% 87.66% 82.07% 76.84% 71.94% 67.35% 63.06% 59.04% 55.27% 51.75%
2001 Aramis Ramirez 1 23 27.0 -3.7 -2.3 85.43% 72.99% 62.35% 53.27% 45.51% 38.88% 33.22% 28.38% 24.24% 20.71%
2001 Bret Boone 3 32 25.8 -4.0 -0.5 66.26% 43.91% 29.09% 19.28% 12.77% 8.46% 5.61% 3.72% 2.46% 1.63%
2001 Cliff Floyd 5 28 26.1 13.4 -6.7 79.99% 63.98% 51.18% 40.94% 32.75% 26.19% 20.95% 16.76% 13.41% 10.72%
2001 Corey Koskie 6 28 26.9 11.2 7.7 81.02% 65.64% 53.18% 43.08% 34.90% 28.28% 22.91% 18.56% 15.04% 12.18%
2001 Eric Chavez 6 23 28.4 8.8 3.4 87.79% 77.06% 67.65% 59.39% 52.13% 45.77% 40.18% 35.27% 30.96% 27.18%
2001 Ichiro Suzuki 10 27 23.7 26.6 7.5 85.65% 73.36% 62.83% 53.82% 46.10% 39.48% 33.82% 28.96% 24.81% 21.25%
2001 J.D. Drew 10 25 26.4 25.6 10.8 88.29% 77.95% 68.82% 60.76% 53.65% 47.37% 41.82% 36.92% 32.60% 28.78%
2001 Lance Berkman 11 25 29.0 35.8 -6.9 88.44% 78.21% 69.17% 61.17% 54.10% 47.84% 42.31% 37.42% 33.09% 29.27%
2001 Mike Sweeney 3 27 25.7 12.9 -7.4 81.67% 66.70% 54.48% 44.49% 36.34% 29.68% 24.24% 19.80% 16.17% 13.21%
2001 Paul Lo Duca 6 29 27.7 11.2 14.7 79.86% 63.78% 50.94% 40.68% 32.49% 25.95% 20.72% 16.55% 13.22% 10.56%
2001 Placido Polanco 5 25 28.1 -9.1 12.1 82.55% 68.14% 56.25% 46.43% 38.33% 31.64% 26.12% 21.56% 17.80% 14.69%
2001 Rich Aurilia 6 29 23.1 4.1 10.1 77.86% 60.63% 47.21% 36.76% 28.62% 22.28% 17.35% 13.51% 10.52% 8.19%
2001 Ryan Klesko 5 30 27.5 21.8 -10.8 77.39% 59.89% 46.35% 35.87% 27.76% 21.48% 16.63% 12.87% 9.96% 7.71%
2001 Torii Hunter 13 25 28.9 -11.5 7.3 81.57% 66.53% 54.27% 44.26% 36.10% 29.45% 24.02% 19.59% 15.98% 13.03%
2002 Adam Dunn 6 22 32.9 25.5 -4.2 90.45% 81.82% 74.01% 66.94% 60.55% 54.77% 49.54% 44.81% 40.54% 36.67%
2002 Adrian Beltre N/A 23 30.7 -0.8 10.8 86.84% 75.41% 65.49% 56.87% 49.39% 42.89% 37.24% 32.34% 28.09% 24.39%
2002 Alfonso Soriano 7 26 25.7 11.8 -11.3 82.81% 68.57% 56.78% 47.02% 38.93% 32.24% 26.70% 22.11% 18.31% 15.16%
2002 Austin Kearns 2 22 30.0 31.7 17.8 92.26% 85.12% 78.53% 72.45% 66.84% 61.67% 56.89% 52.49% 48.43% 44.68%
2002 David Eckstein 5 27 27.4 4.8 5.3 81.22% 65.97% 53.58% 43.52% 35.34% 28.71% 23.31% 18.94% 15.38% 12.49%
2002 Edgar Renteria 7 25 26.4 -6.1 8.4 82.83% 68.61% 56.83% 47.08% 38.99% 32.30% 26.76% 22.16% 18.36% 15.21%
2002 Eric Hinske 2 24 30.2 21.6 4.6 88.41% 78.16% 69.10% 61.09% 54.00% 47.74% 42.21% 37.31% 32.99% 29.16%
2002 Jacque Jones 2 27 25.1 0.5 4.9 80.34% 64.54% 51.85% 41.66% 33.47% 26.89% 21.60% 17.35% 13.94% 11.20%
2002 Jose Hernandez 1 32 23.7 -8.5 8.1 66.31% 43.97% 29.16% 19.34% 12.82% 8.50% 5.64% 3.74% 2.48% 1.64%
2002 Junior Spivey 1 27 25.1 15.0 0.7 82.92% 68.76% 57.01% 47.28% 39.20% 32.50% 26.95% 22.35% 18.53% 15.37%
2002 Mark Kotsay 4 26 29.8 -0.1 11.5 82.51% 68.08% 56.18% 46.35% 38.25% 31.56% 26.04% 21.49% 17.73% 14.63%
2002 Miguel Tejada 8 28 32.5 3.5 1.8 78.40% 61.46% 48.19% 37.78% 29.62% 23.22% 18.20% 14.27% 11.19% 8.77%
2002 Pat Burrell 1 25 28.6 16.1 -15.1 84.76% 71.84% 60.90% 51.62% 43.75% 37.08% 31.43% 26.64% 22.58% 19.14%
2002 Randy Winn 8 28 22.5 -3.9 -0.8 76.68% 58.80% 45.09% 34.57% 26.51% 20.33% 15.59% 11.95% 9.17% 7.03%
2003 Bill Mueller 3 32 24.4 9.6 2.9 71.27% 50.80% 36.20% 25.80% 18.39% 13.11% 9.34% 6.66% 4.75% 3.38%
2003 Garret Anderson 1 31 23.7 2.7 0.7 71.50% 51.13% 36.56% 26.14% 18.69% 13.37% 9.56% 6.83% 4.89% 3.49%
2003 Hank Blalock 2 22 25.3 2.4 12.3 88.77% 78.80% 69.94% 62.09% 55.11% 48.92% 43.43% 38.55% 34.22% 30.37%
2003 Javy Lopez 3 32 23.1 8.8 7.8 71.83% 51.59% 37.06% 26.62% 19.12% 13.73% 9.87% 7.09% 5.09% 3.66%
2003 Jeff DaVanon 2 29 25.1 3.7 9.6 77.61% 60.23% 46.75% 36.28% 28.16% 21.85% 16.96% 13.16% 10.21% 7.93%
2003 Juan Pierre 5 25 25.8 -9.8 10.9 82.36% 67.84% 55.87% 46.02% 37.90% 31.22% 25.71% 21.18% 17.44% 14.37%
2003 Luis Castillo 5 27 20.2 0.5 3.3 80.34% 64.55% 51.86% 41.67% 33.48% 26.89% 21.61% 17.36% 13.95% 11.21%
2003 Marcus Giles 4 25 27.4 17.2 9.8 86.98% 75.66% 65.81% 57.24% 49.79% 43.31% 37.67% 32.77% 28.50% 24.79%
2003 Mark Loretta 2 31 23.7 -1.0 -1.9 69.97% 48.96% 34.25% 23.97% 16.77% 11.73% 8.21% 5.74% 4.02% 2.81%
2003 Melvin Mora 6 31 27.9 4.1 5.8 72.44% 52.48% 38.01% 27.54% 19.95% 14.45% 10.47% 7.58% 5.49% 3.98%
2003 Mike Lowell 2 29 23.7 7.4 2.3 77.70% 60.37% 46.90% 36.44% 28.31% 22.00% 17.09% 13.28% 10.32% 8.02%
2003 Milton Bradley 6 25 29.2 -2.7 7.1 83.29% 69.36% 57.77% 48.11% 40.07% 33.37% 27.80% 23.15% 19.28% 16.06%
2003 Morgan Ensberg 1 27 27.0 14.4 10.3 83.64% 69.96% 58.52% 48.95% 40.94% 34.25% 28.64% 23.96% 20.04% 16.76%
2003 Orlando Cabrera 1 28 28.0 -10.3 10.2 76.18% 58.04% 44.21% 33.68% 25.66% 19.55% 14.89% 11.34% 8.64% 6.58%
2003 Rafael Furcal 8 25 29.6 2.7 6.5 84.23% 70.94% 59.75% 50.33% 42.39% 35.70% 30.07% 25.33% 21.33% 17.97%
2003 Trot Nixon 4 29 25.7 16.8 -0.3 79.54% 63.27% 50.33% 40.04% 31.85% 25.33% 20.15% 16.03% 12.75% 10.14%
2004 Aaron Rowand 4 26 28.5 8.6 10.6 84.15% 70.81% 59.58% 50.14% 42.19% 35.50% 29.87% 25.14% 21.15% 17.80%
2004 Aubrey Huff 1 27 27.4 12.5 -11.3 81.13% 65.82% 53.40% 43.32% 35.15% 28.52% 23.14% 18.77% 15.23% 12.35%
2004 Brad Wilkerson 2 27 27.1 13.8 -3.1 82.23% 67.62% 55.61% 45.73% 37.61% 30.93% 25.43% 20.91% 17.20% 14.14%
2004 Carl Crawford 7 22 28.9 -3.4 13.0 87.93% 77.31% 67.98% 59.77% 52.56% 46.21% 40.63% 35.73% 31.41% 27.62%
2004 Carlos Guillen 5 28 28.4 4.7 5.7 79.30% 62.89% 49.87% 39.55% 31.36% 24.87% 19.72% 15.64% 12.40% 9.84%
2004 Carlos Lee 5 28 34.7 9.9 -3.6 79.19% 62.71% 49.66% 39.32% 31.14% 24.66% 19.53% 15.46% 12.25% 9.70%
2004 Coco Crisp 2 24 26.5 -4.6 9.9 84.85% 72.00% 61.09% 51.83% 43.98% 37.32% 31.67% 26.87% 22.80% 19.34%
2004 Corey Patterson 1 24 25.8 -5.1 8.8 84.68% 71.71% 60.73% 51.43% 43.55% 36.88% 31.23% 26.45% 22.40% 18.97%
2004 David Ortiz 5 28 28.0 14.6 -14.8 79.28% 62.85% 49.82% 39.50% 31.31% 24.82% 19.68% 15.60% 12.37% 9.80%
2004 Jack Wilson 2 26 27.1 -18.0 11.7 78.73% 61.99% 48.80% 38.42% 30.25% 23.82% 18.75% 14.76% 11.62% 9.15%
2004 Jason Varitek 2 32 29.5 1.4 8.6 69.34% 48.08% 33.34% 23.12% 16.03% 11.11% 7.71% 5.34% 3.70% 2.57%
2004 Jimmy Rollins N/A 25 27.4 -3.1 6.4 83.21% 69.24% 57.61% 47.94% 39.89% 33.19% 27.62% 22.98% 19.12% 15.91%
2004 Mark Teixeira 9 24 26.9 11.9 -1.9 86.61% 75.01% 64.96% 56.26% 48.72% 42.20% 36.55% 31.65% 27.41% 23.74%
2004 Travis Hafner 4 27 30.0 26.7 -17.1 83.33% 69.44% 57.86% 48.22% 40.18% 33.48% 27.90% 23.25% 19.37% 16.14%
2004 Vernon Wells 5 25 30.3 9.2 -2.8 84.56% 71.50% 60.46% 51.12% 43.23% 36.56% 30.91% 26.14% 22.10% 18.69%
2005 Brian Roberts 6 27 25.8 4.8 6.0 81.34% 66.16% 53.82% 43.78% 35.61% 28.96% 23.56% 19.16% 15.59% 12.68%
2005 Chase Utley N/A 26 26.4 15.7 13.2 85.66% 73.37% 62.85% 53.83% 46.11% 39.50% 33.83% 28.98% 24.82% 21.26%
2005 David DeJesus 9 25 26.5 6.8 2.8 84.73% 71.79% 60.82% 51.53% 43.66% 36.99% 31.34% 26.56% 22.50% 19.06%
2005 David Wright N/A 22 27.8 31.4 1.5 91.48% 83.69% 76.57% 70.04% 64.08% 58.62% 53.63% 49.06% 44.88% 41.06%
2005 Derrek Lee 1 29 28.5 18.4 -11.5 78.52% 61.66% 48.42% 38.02% 29.85% 23.44% 18.41% 14.45% 11.35% 8.91%
2005 Felipe Lopez 2 25 27.8 -3.9 1.5 82.57% 68.17% 56.29% 46.47% 38.37% 31.68% 26.16% 21.60% 17.83% 14.72%
2005 Grady Sizemore 5 22 25.7 16.2 11.2 90.39% 81.71% 73.86% 66.76% 60.35% 54.55% 49.31% 44.57% 40.29% 36.42%
2005 Jason Bay 2 26 27.0 37.1 -15.6 86.82% 75.37% 65.44% 56.81% 49.32% 42.82% 37.17% 32.27% 28.02% 24.32%
2005 Jhonny Peralta 1 23 27.6 6.0 2.9 87.36% 76.33% 66.68% 58.26% 50.90% 44.46% 38.85% 33.94% 29.65% 25.90%
2005 Julio Lugo 2 29 23.1 -3.3 6.7 75.53% 57.05% 43.09% 32.55% 24.58% 18.57% 14.03% 10.59% 8.00% 6.04%
2005 Mark Ellis 9 28 27.3 6.3 8.1 79.97% 63.95% 51.14% 40.89% 32.70% 26.15% 20.91% 16.72% 13.37% 10.69%
2005 Michael Young 7 28 26.4 3.9 -4.8 77.96% 60.77% 47.37% 36.93% 28.79% 22.44% 17.50% 13.64% 10.63% 8.29%
2005 Miguel Cabrera N/A 22 29.2 23.8 -13.8 89.77% 80.58% 72.33% 64.93% 58.28% 52.32% 46.96% 42.16% 37.84% 33.97%
2005 Nick Johnson 3 26 29.4 12.9 -7.3 83.28% 69.35% 57.75% 48.09% 40.05% 33.35% 27.77% 23.13% 19.26% 16.04%
2005 Richie Sexson 2 30 23.7 18.2 -12.9 76.37% 58.32% 44.54% 34.01% 25.98% 19.84% 15.15% 11.57% 8.84% 6.75%
2005 Victor Martinez 3 26 27.0 7.3 7.4 83.66% 70.00% 58.56% 49.00% 40.99% 34.30% 28.69% 24.01% 20.08% 16.80%
2006 Bill Hall 2 26 28.5 2.4 5.5 82.47% 68.01% 56.08% 46.25% 38.14% 31.45% 25.94% 21.39% 17.64% 14.55%
2006 Brandon Inge 2 29 26.5 -11.3 12.0 73.90% 54.62% 40.37% 29.83% 22.05% 16.29% 12.04% 8.90% 6.58% 4.86%
2006 Brian McCann N/A 22 28.7 12.3 8.4 89.71% 80.47% 72.19% 64.76% 58.09% 52.11% 46.75% 41.94% 37.62% 33.75%
2006 Curtis Granderson N/A 25 26.4 3.3 12.5 84.96% 72.18% 61.33% 52.11% 44.27% 37.61% 31.96% 27.15% 23.07% 19.60%
2006 Dan Uggla 7 26 29.3 13.1 7.2 84.62% 71.60% 60.58% 51.26% 43.38% 36.70% 31.06% 26.28% 22.24% 18.81%
2006 Freddy Sanchez 2 28 27.1 3.1 11.9 79.68% 63.49% 50.58% 40.31% 32.11% 25.59% 20.39% 16.25% 12.94% 10.31%
2006 Garrett Atkins 2 26 24.4 7.3 1.9 83.22% 69.26% 57.64% 47.97% 39.93% 33.23% 27.65% 23.02% 19.15% 15.94%
2006 Hanley Ramirez 5 22 28.9 22.4 -2.3 90.28% 81.50% 73.58% 66.43% 59.97% 54.14% 48.88% 44.12% 39.84% 35.96%
2006 Joe Mauer N/A 23 27.3 23.2 7.6 89.97% 80.94% 72.82% 65.52% 58.94% 53.03% 47.71% 42.92% 38.62% 34.74%
2006 Jose Reyes 3 23 26.4 3.8 9.7 87.55% 76.66% 67.12% 58.76% 51.45% 45.05% 39.44% 34.53% 30.24% 26.47%
2006 Ramon Hernandez 2 30 29.8 -2.7 14.1 74.04% 54.81% 40.58% 30.05% 22.24% 16.47% 12.19% 9.03% 6.68% 4.95%
2006 Reed Johnson 1 29 27.3 1.8 -0.3 75.78% 57.43% 43.52% 32.98% 25.00% 18.94% 14.36% 10.88% 8.24% 6.25%
2006 Ryan Howard 6 26 30.4 39.3 -11.0 87.40% 76.38% 66.76% 58.34% 50.99% 44.56% 38.95% 34.04% 29.75% 26.00%
2007 Alex Rios 2 26 24.9 6.4 5.2 83.34% 69.46% 57.89% 48.25% 40.21% 33.51% 27.93% 23.28% 19.40% 16.17%
2007 B.J. Upton 6 22 23.1 14.7 -5.7 89.26% 79.67% 71.11% 63.47% 56.65% 50.57% 45.14% 40.29% 35.96% 32.10%
2007 Brandon Phillips N/A 26 27.1 -11.3 7.9 79.86% 63.78% 50.93% 40.67% 32.48% 25.94% 20.72% 16.54% 13.21% 10.55%
2007 Carlos Pena 5 29 28.9 18.1 -16.3 77.86% 60.61% 47.19% 36.74% 28.61% 22.27% 17.34% 13.50% 10.51% 8.18%
2007 Chone Figgins 4 29 27.4 9.7 -3.0 77.46% 59.99% 46.47% 35.99% 27.88% 21.59% 16.73% 12.95% 10.03% 7.77%
2007 Corey Hart 1 25 26.6 10.8 -2.5 84.98% 72.21% 61.36% 52.14% 44.31% 37.65% 32.00% 27.19% 23.10% 19.63%
2007 Kevin Youkilis 6 28 29.0 12.3 0.3 80.40% 64.65% 51.98% 41.79% 33.60% 27.02% 21.72% 17.47% 14.04% 11.29%
2007 Matt Holliday N/A 27 30.4 26.0 -7.6 84.05% 70.65% 59.38% 49.91% 41.95% 35.26% 29.64% 24.91% 20.94% 17.60%
2007 Nick Markakis 6 23 25.1 11.1 -2.0 87.81% 77.11% 67.71% 59.46% 52.21% 45.85% 40.26% 35.35% 31.04% 27.26%
2007 Nick Swisher 7 26 27.1 16.7 -4.8 84.28% 71.02% 59.86% 50.44% 42.51% 35.83% 30.19% 25.45% 21.44% 18.07%
2007 Prince Fielder 7 23 38.4 22.1 -17.8 87.95% 77.35% 68.03% 59.83% 52.62% 46.28% 40.71% 35.80% 31.49% 27.69%
2007 Robinson Cano 1 24 28.5 11.7 -6.1 86.21% 74.31% 64.06% 55.22% 47.61% 41.04% 35.38% 30.50% 26.29% 22.66%
2007 Russell Martin N/A 24 30.8 10.5 14.4 87.50% 76.57% 67.00% 58.62% 51.30% 44.89% 39.28% 34.37% 30.07% 26.31%
2007 Ryan Zimmerman N/A 22 27.5 9.1 10.4 89.46% 80.03% 71.60% 64.05% 57.30% 51.27% 45.86% 41.03% 36.71% 32.84%
2007 Troy Tulowitzki 1 22 26.9 2.2 15.8 88.92% 79.07% 70.32% 62.53% 55.60% 49.44% 43.97% 39.10% 34.77% 30.92%
2008 Carlos Quentin 1 25 31.0 14.5 -2.6 85.47% 73.05% 62.43% 53.36% 45.60% 38.98% 33.31% 28.47% 24.33% 20.80%
2008 Dustin Pedroia N/A 24 25.1 13.5 6.3 87.51% 76.58% 67.01% 58.64% 51.32% 44.91% 39.30% 34.39% 30.09% 26.33%
2008 Evan Longoria N/A 22 27.0 25.9 21.9 91.95% 84.55% 77.75% 71.49% 65.74% 60.45% 55.59% 51.11% 47.00% 43.22%
2008 Ian Kinsler N/A 26 27.1 18.2 -6.5 84.40% 71.24% 60.13% 50.75% 42.84% 36.16% 30.52% 25.76% 21.74% 18.35%
2008 J.J. Hardy N/A 25 25.1 -2.1 15.9 84.34% 71.13% 59.98% 50.59% 42.66% 35.98% 30.35% 25.59% 21.58% 18.20%
2008 Jacoby Ellsbury 2 24 25.7 7.4 16.9 87.36% 76.32% 66.67% 58.24% 50.88% 44.45% 38.83% 33.92% 29.63% 25.89%
2008 Jayson Werth 4 29 28.5 12.8 10.6 79.75% 63.60% 50.72% 40.45% 32.26% 25.73% 20.52% 16.36% 13.05% 10.41%
2008 Josh Hamilton N/A 27 29.2 28.3 -9.6 84.33% 71.12% 59.97% 50.58% 42.65% 35.97% 30.33% 25.58% 21.57% 18.19%
2008 Mark DeRosa 2 33 28.4 -1.6 0.9 64.22% 41.24% 26.48% 17.01% 10.92% 7.01% 4.50% 2.89% 1.86% 1.19%
2008 Mike Aviles 1 27 29.4 20.3 21.5 85.59% 73.26% 62.70% 53.66% 45.93% 39.31% 33.65% 28.80% 24.65% 21.10%
2008 Ryan Braun 6 24 25.7 36.6 -17.5 89.07% 79.33% 70.66% 62.94% 56.06% 49.93% 44.47% 39.61% 35.28% 31.43%
2008 Ryan Ludwick 3 29 27.6 16.6 0.2 79.47% 63.15% 50.19% 39.88% 31.69% 25.19% 20.02% 15.91% 12.64% 10.04%
2008 Shane Victorino 6 27 28.1 4.0 9.1 81.43% 66.31% 53.99% 43.96% 35.80% 29.15% 23.74% 19.33% 15.74% 12.82%
2009 Aaron Hill 2 27 28.6 3.3 4.0 80.72% 65.16% 52.60% 42.46% 34.27% 27.67% 22.33% 18.03% 14.55% 11.75%
2009 Adrian Gonzalez N/A 27 28.9 19.8 -10.1 82.68% 68.37% 56.53% 46.74% 38.65% 31.96% 26.42% 21.85% 18.07% 14.94%
2009 Ben Zobrist N/A 28 26.2 11.9 9.9 81.42% 66.29% 53.98% 43.95% 35.78% 29.14% 23.72% 19.32% 15.73% 12.81%
2009 Casey Blake 3 35 26.3 5.8 0.1 60.57% 36.69% 22.23% 13.46% 8.15% 4.94% 2.99% 1.81% 1.10% 0.66%
2009 Denard Span N/A 25 28.5 23.8 -1.6 87.10% 75.87% 66.08% 57.56% 50.13% 43.67% 38.03% 33.13% 28.86% 25.13%
2009 Franklin Gutierrez 3 26 25.0 -1.8 18.8 83.05% 68.97% 57.28% 47.57% 39.50% 32.81% 27.25% 22.63% 18.79% 15.61%
2009 Jason Bartlett 3 29 25.8 5.9 13.7 78.61% 61.79% 48.58% 38.19% 30.02% 23.60% 18.55% 14.58% 11.46% 9.01%
2009 Joey Votto N/A 25 28.2 28.7 -8.1 87.36% 76.32% 66.67% 58.24% 50.88% 44.45% 38.83% 33.92% 29.64% 25.89%
2009 Justin Upton N/A 21 26.3 13.3 -6.9 90.00% 81.01% 72.91% 65.62% 59.06% 53.16% 47.84% 43.06% 38.76% 34.88%
2009 Marco Scutaro 5 33 26.5 -5.2 3.3 63.45% 40.26% 25.55% 16.21% 10.29% 6.53% 4.14% 2.63% 1.67% 1.06%
2009 Matt Kemp 1 24 26.2 16.9 -4.8 87.18% 76.00% 66.26% 57.76% 50.36% 43.90% 38.27% 33.37% 29.09% 25.36%
2009 Michael Bourn 5 26 25.8 -2.5 7.8 81.80% 66.92% 54.74% 44.78% 36.63% 29.96% 24.51% 20.05% 16.40% 13.42%
2009 Nyjer Morgan 1 28 25.8 3.4 27.3 81.46% 66.35% 54.05% 44.03% 35.86% 29.21% 23.80% 19.38% 15.79% 12.86%
2009 Pablo Sandoval N/A 22 34.2 29.1 -1.6 90.97% 82.76% 75.29% 68.49% 62.31% 56.68% 51.56% 46.91% 42.67% 38.82%
2009 Shin-Soo Choo 5 26 28.6 28.4 -5.3 86.20% 74.30% 64.05% 55.21% 47.59% 41.02% 35.36% 30.48% 26.28% 22.65%
2010 Alexei Ramirez N/A 28 23.1 -3.3 6.6 77.71% 60.39% 46.93% 36.47% 28.34% 22.03% 17.12% 13.30% 10.34% 8.03%
2010 Andres Torres 3 32 28.0 6.1 14.2 71.73% 51.45% 36.90% 26.47% 18.99% 13.62% 9.77% 7.01% 5.03% 3.61%
2010 Angel Pagan 1 28 25.7 6.4 8.6 80.12% 64.20% 51.43% 41.21% 33.02% 26.46% 21.20% 16.98% 13.61% 10.90%
2010 Austin Jackson 4 23 24.4 8.2 7.5 88.08% 77.59% 68.34% 60.20% 53.02% 46.71% 41.14% 36.24% 31.92% 28.12%
2010 Brett Gardner 2 26 26.5 8.2 21.3 85.05% 72.34% 61.52% 52.33% 44.51% 37.85% 32.19% 27.38% 23.29% 19.81%
2010 Buster Posey N/A 23 28.4 18.0 10.6 89.49% 80.08% 71.67% 64.13% 57.39% 51.36% 45.96% 41.13% 36.81% 32.94%
2010 Carlos Gonzalez 4 24 29.0 17.4 3.5 87.78% 77.06% 67.64% 59.38% 52.12% 45.75% 40.16% 35.26% 30.95% 27.17%
2010 Carlos Ruiz N/A 31 29.4 -5.0 14.6 70.92% 50.30% 35.68% 25.30% 17.95% 12.73% 9.03% 6.40% 4.54% 3.22%
2010 Chase Headley N/A 26 28.2 1.9 -2.1 81.63% 66.64% 54.40% 44.40% 36.25% 29.59% 24.15% 19.72% 16.09% 13.14%
2010 Chris Young 3 26 25.7 -1.1 0.5 81.34% 66.17% 53.82% 43.78% 35.61% 28.97% 23.56% 19.17% 15.59% 12.68%
2010 Colby Rasmus 1 23 25.0 12.8 3.8 88.46% 78.25% 69.21% 61.23% 54.16% 47.91% 42.38% 37.49% 33.16% 29.33%
2010 Daric Barton 1 24 27.8 11.8 -2.8 86.51% 74.84% 64.74% 56.00% 48.45% 41.91% 36.25% 31.36% 27.13% 23.47%
2010 Jason Heyward N/A 20 29.0 28.5 -1.1 92.70% 85.94% 79.67% 73.86% 68.47% 63.47% 58.84% 54.55% 50.57% 46.88%
2010 Jay Bruce 4 23 26.9 7.5 5.6 87.79% 77.07% 67.66% 59.40% 52.15% 45.78% 40.19% 35.28% 30.98% 27.19%
2010 Jose Bautista N/A 29 27.8 3.5 -9.0 75.07% 56.35% 42.30% 31.76% 23.84% 17.90% 13.43% 10.08% 7.57% 5.68%
2010 Justin Morneau 1 29 26.8 17.0 -7.5 78.72% 61.96% 48.78% 38.39% 30.22% 23.79% 18.73% 14.74% 11.60% 9.13%
2010 Kelly Johnson 2 28 26.4 9.5 2.2 80.07% 64.12% 51.34% 41.11% 32.92% 26.36% 21.11% 16.90% 13.53% 10.84%
2010 Marlon Byrd 2 32 33.2 0.9 1.7 67.87% 46.07% 31.27% 21.22% 14.41% 9.78% 6.64% 4.50% 3.06% 2.08%
2010 Nelson Cruz N/A 29 29.5 10.2 4.0 78.35% 61.38% 48.09% 37.68% 29.52% 23.13% 18.12% 14.20% 11.12% 8.71%
2010 Rickie Weeks 3 27 31.6 12.0 -3.6 81.66% 66.68% 54.45% 44.47% 36.31% 29.65% 24.21% 19.77% 16.15% 13.18%
2010 Stephen Drew 2 27 25.8 -0.7 1.5 79.66% 63.46% 50.55% 40.27% 32.08% 25.56% 20.36% 16.22% 12.92% 10.29%
2011 Alex Avila 2 24 29.3 9.9 1.4 86.49% 74.80% 64.69% 55.95% 48.39% 41.85% 36.20% 31.31% 27.08% 23.42%
2011 Alex Gordon N/A 27 29.0 7.0 1.0 81.18% 65.90% 53.49% 43.43% 35.25% 28.62% 23.23% 18.86% 15.31% 12.43%
2011 Andrew McCutchen N/A 24 27.3 24.1 -1.9 88.39% 78.12% 69.05% 61.03% 53.95% 47.68% 42.14% 37.25% 32.92% 29.10%
2011 Cameron Maybin 2 24 25.6 4.7 6.9 86.19% 74.29% 64.03% 55.19% 47.57% 41.00% 35.34% 30.46% 26.26% 22.63%
2011 Elvis Andrus N/A 22 27.1 -4.6 13.7 87.84% 77.16% 67.78% 59.53% 52.30% 45.94% 40.35% 35.44% 31.13% 27.35%
2011 Giancarlo Stanton N/A 21 27.7 20.6 0.6 91.21% 83.19% 75.87% 69.20% 63.12% 57.57% 52.51% 47.89% 43.68% 39.84%
2011 Howie Kendrick N/A 27 30.1 4.5 6.1 81.14% 65.84% 53.42% 43.34% 35.17% 28.54% 23.15% 18.79% 15.24% 12.37%
2011 Hunter Pence N/A 28 26.8 15.2 -1.6 80.92% 65.47% 52.98% 42.87% 34.69% 28.07% 22.71% 18.38% 14.87% 12.03%
2011 Matt Wieters 3 25 28.5 -7.6 18.4 83.43% 69.60% 58.07% 48.45% 40.42% 33.72% 28.13% 23.47% 19.58% 16.34%
2011 Mike Napoli N/A 29 29.8 20.5 2.3 80.50% 64.81% 52.17% 42.00% 33.81% 27.22% 21.91% 17.64% 14.20% 11.43%
2011 Peter Bourjos 2 24 24.4 4.6 20.5 87.24% 76.11% 66.40% 57.92% 50.53% 44.09% 38.46% 33.55% 29.27% 25.54%
2011 Yadier Molina N/A 28 30.7 -14.6 20.1 76.20% 58.06% 44.24% 33.71% 25.69% 19.58% 14.92% 11.37% 8.66% 6.60%
2012 Adam Jones N/A 26 28.1 4.2 -1.8 82.13% 67.46% 55.41% 45.51% 37.38% 30.70% 25.22% 20.71% 17.01% 13.97%
2012 Bryce Harper N/A 19 28.1 18.0 9.0 92.98% 86.45% 80.38% 74.73% 69.48% 64.60% 60.07% 55.85% 51.93% 48.28%
2012 Edwin Encarnacion N/A 29 30.3 10.1 -11.4 76.39% 58.36% 44.58% 34.06% 26.02% 19.88% 15.19% 11.60% 8.86% 6.77%
2012 Ian Desmond N/A 26 26.9 0.3 2.6 81.81% 66.93% 54.75% 44.79% 36.65% 29.98% 24.53% 20.06% 16.41% 13.43%
2012 Josh Reddick N/A 25 23.1 2.2 10.1 84.65% 71.66% 60.66% 51.35% 43.47% 36.80% 31.15% 26.37% 22.32% 18.90%
2012 Martin Prado N/A 28 25.1 7.8 1.7 79.70% 63.52% 50.63% 40.35% 32.16% 25.63% 20.43% 16.28% 12.98% 10.34%
2012 Melky Cabrera 1 27 30.1 0.9 -5.4 79.08% 62.54% 49.46% 39.11% 30.93% 24.46% 19.35% 15.30% 12.10% 9.57%
2012 Miguel Montero 1 28 29.3 1.7 8.2 78.85% 62.17% 49.02% 38.65% 30.48% 24.03% 18.95% 14.94% 11.78% 9.29%
2012 Mike Trout N/A 20 29.5 53.6 13.0 95.05% 90.35% 85.89% 81.64% 77.60% 73.76% 70.12% 66.65% 63.35% 60.22%
2013 Andrelton Simmons N/A 23 25.0 -5.9 32.5 87.81% 77.10% 67.71% 59.45% 52.20% 45.84% 40.25% 35.35% 31.04% 27.25%
2013 Brandon Belt 1 25 26.1 16.7 -6.5 85.66% 73.37% 62.85% 53.83% 46.11% 39.50% 33.83% 28.98% 24.82% 21.26%
2013 Carlos Gomez N/A 27 27.5 -1.4 15.1 80.92% 65.48% 52.98% 42.87% 34.69% 28.07% 22.72% 18.38% 14.87% 12.04%
2013 Chris Davis 1 27 28.7 13.6 -13.9 81.04% 65.67% 53.22% 43.13% 34.95% 28.33% 22.96% 18.60% 15.08% 12.22%
2013 Freddie Freeman N/A 23 26.7 17.3 -14.6 87.77% 77.04% 67.62% 59.36% 52.10% 45.73% 40.14% 35.23% 30.92% 27.14%
2013 Gerardo Parra 1 26 27.9 -6.2 9.2 81.11% 65.78% 53.35% 43.27% 35.09% 28.46% 23.09% 18.72% 15.19% 12.32%
2013 Jason Castro N/A 26 26.9 2.9 4.5 82.54% 68.12% 56.23% 46.41% 38.30% 31.61% 26.09% 21.54% 17.78% 14.67%
2013 Jason Kipnis 1 26 26.5 17.6 -2.3 84.69% 71.72% 60.74% 51.44% 43.56% 36.89% 31.24% 26.46% 22.41% 18.97%
2013 Josh Donaldson N/A 27 29.8 19.0 10.9 84.45% 71.32% 60.23% 50.87% 42.96% 36.28% 30.64% 25.87% 21.85% 18.45%
2013 Juan Uribe N/A 34 31.9 -12.1 12.1 58.89% 34.68% 20.42% 12.03% 7.08% 4.17% 2.46% 1.45% 0.85% 0.50%
2013 Kyle Seager N/A 25 28.5 8.3 2.2 84.87% 72.03% 61.14% 51.89% 44.04% 37.38% 31.72% 26.92% 22.85% 19.39%
2013 Manny Machado N/A 20 23.1 0.2 28.8 91.50% 83.73% 76.61% 70.10% 64.15% 58.69% 53.71% 49.14% 44.97% 41.15%
2013 Matt Carpenter N/A 27 26.9 27.7 -3.7 84.82% 71.95% 61.03% 51.77% 43.91% 37.25% 31.60% 26.80% 22.73% 19.28%
2013 Paul Goldschmidt N/A 25 30.6 30.0 -9.6 87.39% 76.38% 66.75% 58.34% 50.98% 44.56% 38.94% 34.03% 29.74% 25.99%
2013 Starling Marte N/A 24 24.4 17.9 7.8 88.26% 77.90% 68.75% 60.68% 53.55% 47.26% 41.71% 36.82% 32.49% 28.68%
2013 Yasiel Puig N/A 22 29.4 37.6 -0.9 91.96% 84.58% 77.78% 71.53% 65.78% 60.50% 55.64% 51.17% 47.05% 43.27%
2014 Anthony Rendon N/A 24 26.4 18.4 6.2 88.17% 77.74% 68.55% 60.44% 53.29% 46.99% 41.43% 36.53% 32.21% 28.40%
2014 Anthony Rizzo N/A 24 30.0 11.5 -3.0 86.37% 74.60% 64.44% 55.65% 48.07% 41.52% 35.86% 30.97% 26.75% 23.11%
2014 Brian Dozier N/A 27 26.5 3.4 -0.5 80.33% 64.53% 51.84% 41.64% 33.45% 26.87% 21.59% 17.34% 13.93% 11.19%
2014 Christian Yelich N/A 22 25.0 17.7 -0.7 89.89% 80.81% 72.64% 65.30% 58.70% 52.77% 47.44% 42.65% 38.34% 34.46%
2014 Devin Mesoraco N/A 26 29.0 -2.0 7.8 81.78% 66.89% 54.70% 44.74% 36.59% 29.93% 24.47% 20.02% 16.37% 13.39%
2014 Erick Aybar N/A 30 25.8 -1.6 7.6 73.64% 54.22% 39.93% 29.40% 21.65% 15.94% 11.74% 8.64% 6.37% 4.69%
2014 J.D. Martinez N/A 26 27.5 1.8 -9.8 80.83% 65.33% 52.81% 42.68% 34.50% 27.89% 22.54% 18.22% 14.73% 11.90%
2014 Jonathan Lucroy N/A 28 26.4 6.4 11.2 80.37% 64.60% 51.92% 41.73% 33.54% 26.96% 21.66% 17.41% 13.99% 11.25%
2014 Jose Abreu N/A 27 31.9 42.9 -14.9 86.30% 74.48% 64.28% 55.48% 47.88% 41.32% 35.66% 30.77% 26.56% 22.92%
2014 Jose Altuve N/A 24 28.2 5.0 -6.4 85.08% 72.39% 61.59% 52.40% 44.58% 37.93% 32.27% 27.46% 23.36% 19.87%
2014 Josh Harrison N/A 26 30.4 5.4 3.4 82.79% 68.54% 56.75% 46.98% 38.90% 32.20% 26.66% 22.07% 18.27% 15.13%
2014 Juan Lagares N/A 25 28.4 -5.1 28.9 84.82% 71.95% 61.03% 51.77% 43.91% 37.25% 31.60% 26.80% 22.74% 19.28%
2014 Kevin Kiermaier N/A 24 25.7 13.1 21.3 88.48% 78.28% 69.26% 61.28% 54.22% 47.97% 42.44% 37.55% 33.22% 29.39%
2014 Lorenzo Cain N/A 28 26.3 2.8 19.4 80.49% 64.78% 52.14% 41.97% 33.78% 27.19% 21.88% 17.61% 14.18% 11.41%
2014 Michael Brantley N/A 27 25.7 10.0 -8.4 80.96% 65.55% 53.07% 42.97% 34.79% 28.17% 22.80% 18.46% 14.95% 12.10%
2014 Steve Pearce N/A 31 29.3 6.6 -3.1 71.82% 51.58% 37.04% 26.60% 19.11% 13.72% 9.85% 7.08% 5.08% 3.65%
2014 Todd Frazier N/A 28 27.5 11.0 4.4 80.61% 64.98% 52.38% 42.22% 34.03% 27.43% 22.11% 17.82% 14.37% 11.58%
2014 Yan Gomes N/A 26 27.6 9.4 13.6 84.58% 71.54% 60.51% 51.18% 43.29% 36.62% 30.97% 26.20% 22.16% 18.74%

Conclusions

After looking at this table, we can draw several conclusions. First, this Mike Trout guy is really good at baseball. Secondly, age is the main variable in determining the time until failure. The players with the highest survival rates are all under twenty-five and all the lowest survival rates are over thirty. This makes sense, because it is much easier for a twenty-year-old star to remain effective until he is thirty compared to a thirty-year-old star attempting to remain effective until he is forty. This is because older players face more challenges such as eroding skills, an increased chance of sustaining injuries and having their playing time reduced to prevent injuries.

It also appears that offensive stars survive longer than defensive stars. This is probably due to the fact that defensive skills usually deteriorate faster than offensive skills. I also believe that since defensive statistics are more volatile than offensive statistics, that players that derive much of their value from their defense are more likely to have their WAR fluctuate from year to year. This makes it more likely that a defensive star could register a scrub season one year and then become a star again the next year. And this brings me to my next point.

Things to Keep in Mind

If a player records a scrub season that does not necessarily mean that he is finished.  If this were the case, players like Aramis Ramirez, Robinson Cano and Troy Tulowitzki would have had much less productive careers. It is also important to remember that a player enters the population as soon as they record their first star season, so it is quite possible that a player could improve after their first star season and make it more likely that they can outlast their projected survival rate. The main thing to remember is that no model is perfect and no model is meant to replace the human decision-making process. Models are only meant to improve the decision-making process and it is my hope that this model has accomplished that goal.


Different Aging Curves For Different Strikeout Profiles

What follows will look at aging curves as they relate to players with specific strikeout profiles. Specifically, we will look at how wOBA ages for players that strikeout more than the league-average strikeout rate and less than the league-average strikeout rate.

Through the research that is presented in this post, two points will be proven:

  1. Players of different strikeout profiles age—their wOBAs change—at different rates.
  2. The aging curve for players of different strikeout profiles has changed over time.

Before I present the methodology, the research that was conducted, and their conclusions, I want to give a big thank you to Jeff Zimmerman, who has not only done a lot of research around aging curves, but has also helped me throughout this process and pushed me in the right direction several times when I was stuck. Thank you.

Population

In order to give a non insignificant amount of time for a player’s wOBA to stabilize, but not place the playing time threshold for plate appearances so high that we artificially limit the population even more than it naturally is at the ends of the age spectrum, I looked at all player season from 1950 to 2014 where a player had a minimum of 600 plate appearances for the first aging curve in this post. The second aging curve in this post looks at all player seasons from 1990 to 2014 with a minimum of 600 plate appearances.

Now that we have our population, we need to split our population into two groups: players that strikeout more than league average and players that strikeout less than league average.

Because the league average strikeout rate of today is very different than it was 65 years ago, we can’t look at a player’s strikeout rate from 1950 and compare it to the league average strikeout rate of today.

In order to divide the population into two groups, I created a stat that weighs a player’s strikeout rate against the league average strikeout rate for the years that they played. For example, if a player played from 1970 to 1975, their adjusted strikeout rate would reflect how their strikeout rate compares to the league average strikeout rate from 1970 to 1975.

Players were then placed into two buckets based on their adjusted strikeout rate: players that struck out more than league average and players that struck out less than league average.

Methodology

There has been a lot of discussion over the years about the correct methodology to use for aging curves. This conversation has had altruistic intentions in the sense that it’s aim has been to minimize the survivorship bias that is inherent in the process, and, through the progress that has been made over the years, this study uses what the author has found to his knowledge to be the best technique to date. This article by Mitchell Lichtman summarizes a lot of the opinions.

While there is a survivorship bias inherent in any aging curve, the purpose of the different techniques used to create aging curves is to minimize the survivorship bias wherever possible.

What We Don’t Want In an Aging Curve 

An aging curve is not the average of all performances by players of specific ages. For example, say you have a group of 30-year-old players that have an average of a .320 wOBA and group of 29-year-old players that have an average of a .300 wOBA.

The point of an aging curve is to see how a player aged, not how they played. The group of 30-year-old players has a high wOBA because they are a talented group of players; they lasted long enough to play until they are 30. As they aged from the previous year, when they were 29 to their current age 30 season, they lost the bottom portion of players from their player pool. These are the players that couldn’t hang on any longer, whether it be because of a decline in defense, offense, or a combination of both. This bottom portion of players lower the wOBA of the current 29-year-old population through their presence and raise the wOBA of the 30-year-old population through their absence.

At the same time, the current 30-year-olds aged from their age-29 season to their age-30 season. Sure, there may be players who had a better age-30 season than age-29 season, but the current group of 30-year-olds, as a whole, still played worse at 30 than they did at 29.

When you look at the average of a particular age group, in this case 30-year-olds, you only see the players that survived, and, because they no longer play, you leave behind the players that are hidden from you sample. The method that follows resolves this issue to an extent.

What We Do Want In an Aging Curve

This study uses the delta method which looks at the differences of player seasons (i.e. a players age 29 wOBA minus their age 28 wOBA) and weighs those differences by the harmonic mean of the plate appearances for each pair seasons in question.

I would explain this further, but Jeff Zimmerman does an excellent job of this in a post on hitter aging curves that he did several years ago. While Jeff Zimmerman looked at RAA, which is a counting state, the methodology is basically the same for our purposes and wOBA, which is a rate stat:

In a nutshell, to do accurate work on this, I needed to go through all the hitters who ever played two consecutive seasons. If a player played back-to-back seasons, the RAA values were compared. The RAA values were adjusted to the harmonic mean of that player’s plate appearances.

Consider this fictional player:

Year1: RAA = 40 in 600 PA age 25
Year2: RAA = 30 in 300 PA age 26

Adjusting to harmonic mean: 2/((1/PA_y1)+(1/PA_y2)) = PA_hm
/((1/600)+(1/300)) = 400

Adjust RAA to PA_hm: (PA_hm/PA_y1)*RAA_y1 = RAA_y1_hm
(400/600)*40 = 26.7 RAA for Year1
(400/300)*30 = 40 RAA for Year2

This player would have gained 13.3 RAR (40 RAA – 26.7 RAA) in 400 PA from ages 25 to 26. From then, I then would add all the changes in RAA and PA together and adjust the values to 600 PA to see how much a player improved as he aged.

Findings

Below is an aging curve by strikeout profile for all player seasons with over 600 plate appearances in a season from 1950 until 2015.

Screen Shot 2015-04-18 at 1.23.52 PM

We can see several findings immediately:

  1. Players do age differently based on their strikeout profile.
  2. Players that strikeout more than league average peak at 23.
  3. Players that strikeout less than league average take longer to hit their peak—their age 26 season.
  4. Players that strikeout more than league average age better than players that strikeout less than league average.

From a historical perspective, this graph is fun to look at, but the way the game was played over half a century ago is eclipsed by societal evolutions that today’s players benefit from.

To give us a more realistic idea of how today’s players age relative to their strikeout rate, I made another graph the at looks at player seasons from 1990 to 2014.

Screen Shot 2015-04-18 at 1.40.36 PM

What we find in this graph, which is more current with today’s style of play, is that players still age differently dependent on their strikeout profile, but not in the same way that they did in the previous sample.

Players that strikeout more than league average still peak earlier than players that strike out less than league average, but in this more current population of players, players that strikeout more than league average peak very early—their age 21 season. This information would reciprocate the sentiment that has been conveyed through recent work that suggests that the aging curve has changed to the point that players peak almost as soon as when they enter the league.

The peak age for players that strikeout at below league average rates is still 26, but whereas this group aged more poorly than the strikeout heavy group in our previous population, players that strikeout at below league average rates now age better than their counterparts.

Conclusions

This information can make material differences for our overall expectations and outlooks on players.

Previous knowledge would suggest that players like George Springer and Kris Bryant—players who have exorbitant strikeout rates—are still on the climb as far as their talent goes, but this information shows that these players may already be at/close to their peaks or on the decline as far a their wOBA is concerned.

This information also shows that we should be patient with prospects that have a penchant to put balls is play; while they peak more quickly than they did in the previous population, they take longer to develop than players with more swing and miss in their game, and when they do start to decline, there isn’t much need to worry, because their climb from their peaks will be gradual.

Like many other studies that have looked at new aging curves, this study confirms that players/prospects peak earlier now than at any other point throughout history, but it also shows that a player’s trajectory upward and downward is dependent on characteristics specific to their approaches at the plate.

Devon Jordan is obsessed with statistical analysis, non-fiction literature, and electronic music. If you enjoyed reading him, follow him on Twitter @devonjjordan.


Austin Jackson’s Bothersome Batted-Ball Bind

On July 30, 2014, the Seattle Mariners found themselves in the interesting position of being in playoff contention. The Mariners sported a 32-23 record, only 2.5 games back of the AL West-leading Los Angeles Angels, and owned the third-best Pythagorean record in the American League. Seattle’s newfound position as postseason hopefuls meant that they were suddenly buyers at the trade deadline – not drastically so, but in the sense that the Mariners were only a couple of upgrades away from assembling themselves a nicely well-rounded playoffs roster. Chief among these desired upgrades was a serviceable everyday center fielder, one who could replace a revolving door of below-average outfielders that included Abraham Almonte, James Jones, Stefen Romero, and Endy Chavez.

Jack Zduriencik sought to remedy the Mariners’ outfield issues with a pair of trade deadline deals. The first involved packaging Almonte and minor-league pitcher Stephen Kohlscheen to the Padres in return for Chris Denorfia, a rather unsexy deal to be sure, but one that was a success at the time in that the acquired player was not Almonte. The second deal, a three-way transaction between the Rays, Tigers, and Mariners, was collectively more sexy, but a large share of the sexy went to the Tigers, who landed Rays ace David Price. The other major components of the deal were the Rays’ acquisition of young Mariners middle infielder Nick Franklin and Tigers pitcher Drew Smyly, as well as Seattle’s prospective answer to its outfield problem: Detroit center fielder Austin Jackson.

Since his move to the Mariners, Jackson has racked up 277 plate appearances for Seattle, and the results have been fantastically underwhelming. Of the center fielders who amassed more than 100 plate appearances for the Mariners in 2014 – an uninspiring triumvirate of Jackson, Abraham Almonte, and James Jones – Jackson produced the worst offensive performance by wRC+. Jackson’s 2014 performance also disappointed even by more conventional measures:

  • Jackson totaled 34 extra-base hits in 416 plate appearances for the Tigers in 2014. For the rest of the year, in 240 plate appearances for the Mariners, Jackson managed 6.
  • Jackson’s ISO dropped from .127 to a paltry .031 with the move from Detroit to Seattle.
  • Jackson’s 2014 OBP/SLG/wOBA with Detroit: .330/.397/.321. With Seattle: .271/.264/.243.

Not great for a player only two seasons removed from a 5-win campaign.

Ostensibly, something was fundamentally different with Jackson in 2014, something that can hopefully be determined by closely examining his recent performance. Looking first to Jackson’s approach, it seems that there hasn’t been too much change over the course of his career. His K/BB ratio has generally hovered around league average and his contact rates haven’t fluctuated all that much from year-to-year. If anything, Jackson’s approach metrics look like they’re trending positively – he actually posted career bests in Z-contact% and SwStr% in 2014. If we examine Jackson’s batted ball data, however, we begin to get a little closer to the root of Jackson’s troubles of late. The most easily identifiable aspect of Jackson’s game can be somewhat distilled in the following graphic:

Over the course of his career, Jackson’s BABIP has been way above league average. He managed an absolutely ridiculous .396 BABIP in his 2010 rookie season over 675 PA, and his career-best 2012 season, in which he posted a 134 wRC+, was bolstered by a BABIP of .371. That figure would predictably fall after 2012, but between 2013 & 2014, Jackson’s BABIP only declined by .008, whereas in the same period, his wOBA fell from a very good .332 to a mediocre .292. This might suggest that in 2014 specifically, it may not have been the frequency with which Jackson was able to put balls in play so much as the quality of those batted balls that limited Jackson’s production.

Unfortunately, batted ball data is out of the scope of my access. The closest I can get to Jackson’s batted-ball profile is  by pulling data from this pre-season piece on Jackson by Jake Mailhot over at Lookout Landing, and indirectly from Jeff Zimmerman’s work on hitter analytics at RotoGraphs (the relevant batted-ball spreadsheet now seems to be unavailable for some reason).

To quickly explain – this table charts batted-ball rates expressed as a percentage of league average. Batted balls are separated into three categories (line drive, groundball, fly ball) which are then further divided into subcategories of contact quality (Well-Hit, Medium, and Weakly-hit). These categories are ordered left-to-right from highest to lowest based on xBABIP.

Mailhot astutely notes an alarming drop in well-hit groundball rate – from 64% above league average in 2012 to 11% below league average in 2014. This is accompanied by a commensurate rise in weakly-hit groundballs. Jackson’s well-hit line-drive rate also drops by a sizable amount, hovering around league average in 2014, while his rate of medium-hit line drives balloons to 198% of league average in 2014. Mailhot also points out possibly the most substantial shift: an immense drop-off in well-hit fly-ball rate in 2014 to 56% of league average, a trend corroborated by data pulled from Baseball Heat Maps on Jackson’s average fly ball distance over his career:

Jackson’s high rate of well-hit line drives and ground balls prior to 2014 puts into perspective the aspects of his game that brought him success earlier on in his career, and his sharp decline in those metrics in 2014 even more so. To put it in exceedingly simple terms, Austin Jackson just didn’t really hit balls hard in 2014, something he was quite good at doing before that season. Judging by the splits, most of the not-hitting-balls-hard occurred after the move to Seattle.

Jackson’s 2013 was much better than his 2014, but it is the beginning of a short trend of BABIP decline. From examining batted-ball data, we can infer that quality of contact has a significant bearing on BABIP, and this makes sense using conventional logic as well. Hard-hit line ground balls are more likely to find gaps between defenders, hard-hit line drives are more likely to drop in for hits, and hard-hit fly balls are more likely to turn into extra-base hits (although BABIP ignores home runs). The easy explanation is that Jackson lost some power in 2014. I don’t have enough film on Jackson to know for sure if there’s a visually concrete reason for this (if, for example, there’s something off in his swing mechanics), but data from 2014 indicates that Jackson just hasn’t been making good contact.

Jackson’s issues are probably best explained by his batted-ball troubles, but park factors likely play some part as well, with Comerica Park being relatively more hitter-friendly than Safeco Field. Safeco’s pitcher-friendly park factor and the ‘dead ball effect’ of Seattle’s marine air probably have something to do with Jackson’s decline in fly-ball distance, although Jackson is himself contributing to that same decline in some measure.

At the time of his acquisition, a merely average performance from Jackson would have been a significant upgrade over the convoluted mishmash that had previously taken the field for the Mariners. Unfortunately, he was unable to even provide replacement-value production after coming to Seattle, totaling -0.4 wins above replacement in 2014. The Mariners traded for an above-average player and received the production level of a player who theoretically wouldn’t cut it in the big leagues altogether.

The prospect of 2015 being a bounceback year for Jackson has not gone over too well in these first few weeks. ZiPS (R) and Steamer (R) still think Jackson could manage 1.7-2.1 WAR on the season, which is a bit below his peak, but I think the Mariners would take that statline in a heartbeat. I’ve gone this far without mentioning Jackson’s other tools, but as a 28-year-old without a concerning injury history, there’s not as much reason to worry about his defense and baserunning as there is to worry about his offensive output. Jackson was a below-average defender by UZR in 2013/2014 and has been worth approximately 4 baserunning runs above replacement in each of the past couple of years, neither of which have dictated his value nearly as much as his offense, or lack thereof. Using those numbers as a serious predictive measure from year-to-year is simply not very useful at this point.

Lloyd McClendon was Jackson’s hitting coach back in Detroit, and suffice it to say he probably has a better grasp on Jackson’s habits as a batter than most. If anyone’s able to get Jackson back on track this season, it’s probably McClendon. At time of writing (the 18th of April), Jackson managed a slightly encouraging 2-hit, 1-walk performance against the Rangers. It’s early yet in the season, and there’s time for Jackson to hopefully figure things out. Alternatively, if Jackson can’t find some of his pre-2014 form this season, the Mariners might once again find themselves in the same trade deadline predicament from last year – only this time, there’s not an obvious trade chip à la Nick Franklin. Then again, 2015 is Jackson’s last year under team control, so the Mariners may simply choose to let him walk after the year is over if they’re not satisfied with his performance. If that’s the case, it’s hard to imagine looking back on the 2014 Jackson trade with anything but the same tinge of regret and frustration that has colored so many other Mariners transactions of the last decade.


Hardball Retrospective – The “Original” 1924 Washington Senators

In “Hardball Retrospective: Evaluating Scouting and Development Outcomes for the Modern-Era Franchises”, I placed every ballplayer in the modern era (from 1901-present) on their original team. Therefore, Frank Robinson is listed on the Reds roster for the duration of his career while the Rangers claim Ivan Rodriguez and the Red Sox declare Jeff Bagwell. I calculated revised standings for every season based entirely on the performance of each team’s “original” players. I discuss every team’s “original” players and seasons at length along with organizational performance with respect to the Amateur Draft (or First-Year Player Draft), amateur free agent signings and other methods of player acquisition.  Season standings, WAR and Win Shares totals for the “original” teams are compared against the “actual” team results to assess each franchise’s scouting, development and general management skills.

Expanding on my research for the book, the following series of articles will reveal the finest single-season rosters for every Major League organization based on overall rankings in OWAR and OWS along with the general managers and scouting directors that constructed the teams. “Hardball Retrospective” is available in digital format on Amazon, Barnes and Noble, GooglePlay, iTunes and KoboBooks. The print edition is coming soon. Additional information and a discussion forum are available at TuataraSoftware.com.

Terminology

OWAR – Wins Above Replacement for players on “original” teams

OWS – Win Shares for players on “original” teams

OPW% – Pythagorean Won-Loss record for the “original” teams

Assessment

The 1924 Washington Senators   OWAR: 43.1     OWS: 287     OPW%: .615

Based on the revised standings the “Original” 1924 Senators obliterated the competition with the Tigers finishing a distant 13 games in arrears. Walter “Big Train” Johnson, approaching the final stop in his 21-year career, continued to blow smoke past American League batsmen. He whiffed the most batters in the Junior Circuit for the twelfth time and furnished a 23-7 mark with the best ERA (2.72) and WHIP (1.116) in the League. Johnson received the MVP Award for his efforts in ’24 and the future Hall-of-Famer retired three years later with 417 victories, a 2.17 ERA and a 1.061 WHIP along with 3,509 strikeouts and the most shutouts in Major League history (110). Johnson ranks first among pitchers in “The New Bill James Historical Baseball Abstract”.

Jack Bentley bolstered the Washington pitching corps, delivering 16 victories against 5 losses. Firpo Marberry split time between the rotation and bullpen, notching 11 wins and saving 15 contests (although saves were not officially tabulated until 1969).

ROTATION POS WAR WS
Walter Johnson SP 7.02 28.65
Jack Bentley SP 1.96 11.78
Firpo Marberry SP 1.48 17.72
Joe Martina SP 0.35 5.74
BULLPEN POS WAR WS
Ted Wingfield RP 0.67 2.54
By Speece RP -0.25 3.52
Slim McGrew SP -0.27 0.32
Paul Zahniser SP -0.28 3.72

Goose Goslin (.344/12/129) topped the American League leader boards in RBI while recording 199 hits and 100 runs. The future Hall of Famer surpassed the century mark in ribbies 11 times and recorded a .316 lifetime batting average. Sam Rice batted .334 with 106 runs scored and 39 two-baggers while producing a League-best 216 base hits. A .322 career hitter, Rice concluded his career only 13 hits shy of 3,000.

Charlie Jamieson rapped 213 safeties and posted a personal-best .359 BA after leading the Junior Circuit in the previous campaign with 222 knocks. First-sacker Joe Judge clubbed 38 two-base hits and delivered a .324 BA. Goslin rated 16th among left fielders in the “NBJHBA”. Rice (33rd-RF), Judge (44th-1B) and Bucky Harris (70th-2B) also placed in the top 100 at their respective positions.

LINEUP POS WAR WS
Charlie Jamieson CF/LF 3.09 19.11
Sam Rice RF 3.65 23.99
Goose Goslin LF 5.69 28.91
Joe Judge 1B 2.12 19.08
Ossie Bluege 3B 0.72 10.42
Bucky Harris 2B 0.32 13.31
Eddie Ainsmith C 0.11 0.45
Howie Shanks SS -0.02 5.21
BENCH POS WAR WS
Frank Brower 1B 1.05 5.27
Irish Meusel LF 0.98 16.78
Doc Prothro 3B 0.9 5.89
Bing Miller RF 0.83 13.65
Earl McNeely CF 0.3 5.84
Carl East RF 0.09 0.36
Ike Davis SS 0.02 0.35
Bennie Tate C -0.02 0.64
Carr Smith RF -0.13 0.04
Tommy Taylor 3B -0.13 0.85
Showboat Fisher RF -0.14 0.4
Pinky Hargrave C -0.35 0.21
Mule Shirley 1B -0.5 0.34
Frank Ellerbe 3B -0.9 2.19

The “Original” 1924 Washington Senators roster

NAME POS WAR WS General Manager
Walter Johnson SP 7.02 28.65 Thomas Noyes
Goose Goslin LF 5.69 28.91 Clark Griffith
Sam Rice RF 3.65 23.99 Clark Griffith
Charlie Jamieson LF 3.09 19.11 Clark Griffith
Joe Judge 1B 2.12 19.08 Clark Griffith
Jack Bentley SP 1.96 11.78 Clark Griffith
Firpo Marberry SP 1.48 17.72 Clark Griffith
Frank Brower 1B 1.05 5.27 Clark Griffith
Irish Meusel LF 0.98 16.78 Clark Griffith
Doc Prothro 3B 0.9 5.89 Clark Griffith
Bing Miller RF 0.83 13.65 Clark Griffith
Ossie Bluege 3B 0.72 10.42 Clark Griffith
Ted Wingfield RP 0.67 2.54 Clark Griffith
Joe Martina SP 0.35 5.74 Clark Griffith
Bucky Harris 2B 0.32 13.31 Clark Griffith
Earl McNeely CF 0.3 5.84 Clark Griffith
Eddie Ainsmith C 0.11 0.45 Thomas Noyes
Carl East RF 0.09 0.36 Clark Griffith
Ike Davis SS 0.02 0.35 Clark Griffith
Howie Shanks SS -0.02 5.21 Thomas Noyes
Bennie Tate C -0.02 0.64 Clark Griffith
Carr Smith RF -0.13 0.04 Clark Griffith
Tommy Taylor 3B -0.13 0.85 Clark Griffith
Showboat Fisher RF -0.14 0.4 Clark Griffith
By Speece RP -0.25 3.52 Clark Griffith
Slim McGrew SP -0.27 0.32 Clark Griffith
Paul Zahniser SP -0.28 3.72 Clark Griffith
Pinky Hargrave C -0.35 0.21 Clark Griffith
Mule Shirley 1B -0.5 0.34 Clark Griffith
Frank Ellerbe 3B -0.9 2.19 Clark Griffith

Honorable Mention

The “Original” 1915 Senators             OWAR: 49.1     OWS: 272     OPW%: .565

“Big Train” Johnson (27-13, 1.55) completed 35 of 39 starts while leading the American League in wins, WHIP (0.933), innings pitched, shutouts and strikeouts. The rotation was supplemented by Doc Ayers (14-9, 2.21) and Bert Gallia (17-11, 2.29). Clyde “Deerfoot” Milan swiped 40 bags and Tom Long legged out 25 triples at the top of the lineup.

The “Original” 1965 Twins                 OWAR: 46.0     OWS: 280     OPW%: .644

Zoilo Versalles topped the leader boards with 126 tallies, 45 doubles, 12 triples and 308 total bases to capture the 1965 A.L. MVP Award. Teammate Tony Oliva (.321/16/98) finished runner-up in the MVP race and collected his second batting title. Bob Allison, Jimmie Hall and Harmon Killebrew slammed at least 20 circuit clouts apiece. Jim Kaat (18-11, 2.83) anchored the starting staff and Ted Abernathy led the League with 31 saves and 84 relief appearances.

On Deck

The “Original” 1992 White Sox

References and Resources

Baseball America – Executive Database

Baseball-Reference

James, Bill. The New Bill James Historical Baseball Abstract. New York, NY.: The Free Press, 2001. Print.

James, Bill, with Jim Henzler. Win Shares. Morton Grove, Ill.: STATS, 2002. Print.

Retrosheet – Transactions Database

Seamheads – Baseball Gauge

Sean Lahman Baseball Archive


Testing the Eye Test: Part 2

Sorry for the relatively long delay – sometimes life gets in the way of our best laid plans. In case you want a refresher, here is part 1: http://www.fangraphs.com/community/testing-the-eye-test-part-1/

In part 1, I found that, counter to my expectations, range correlated most strongly with FSR data of all the UZR components (UZR itself had a stronger correlation with FSR). I expected the strongest-correlated component to be errors, which was actually one of the least-correlated components. However, I wanted to go a little bit farther and look at the difference between correlations between the UZR components and FSR and the correlations between the UZR components and UZR itself to get a sense of what the fans weight more than UZR does. As a reminder, here is the data set I compiled for this analysis:

“I pulled the defensive stats of every player who qualified (minimum of 900 innings) at a position from 2009-2014 (FSR data is only available for those six seasons on FanGraphs). I then disregarded catchers, as UZR does not cover the position. Likewise, pitchers are left out because they are not covered by UZR or FSR. That left me with 761 player seasons across the other seven positions.”

Without further ado, here are the correlations between UZR and its components:

Position |# |ARM |DPR |RngR |ErrR
1B |118 |N/A |0.207 |0.930 |0.326
2B |117 |N/A |0.275 |0.907 |0.465
3B |107 |N/A |0.166 |0.948 |0.386
SS |130|N/A |0.459 |0.866 |0.384
LF | 71 |0.584 |N/A |0.895 |0.196
CF |115 |0.357 |N/A |0.935 |0.069
RF |103|0.310 |N/A |0.906 |0.061

I always had a suspicion that range was the most important component of UZR but these results are insane. It turns out range is far and away the most important component of UZR. Interestingly, the weakest correlation for range is at SS, perhaps because shortstops without proper range are moved to another position. ARM, although only calculated for outfielders (a real shame as Andrelton Simmons deserves credit for being able to make this throw), has the second-strongest correlation but lags range by a large amount. Like the FSR correlation, it is surprising that LF has a stronger ARM correlation than CF or RF. DPR narrowly edges out errors, although the correlation for errors is far stronger when you only consider infielders. Now, to get a sense of the difference, here’s the two sets of correlation subtracted from each other (positive numbers mean the correlation with UZR is higher and negative numbers mean the correlation with FSR is higher):

Position | # | ARM | DPR | RngR | ErrR
1B | 118 | N/A | -0.005 | 0.644 | 0.006
2B | 117 | N/A | 0.116 | 0.437 | -0.082
3B | 107 | N/A | 0.011 | 0.315 | 0.125
SS | 130 | N/A | 0.095 | 0.437 | 0.041
LF | 71 | 0.074 | N/A | 0.369 | 0.010
CF | 115 | 0.120 | N/A | 0.441 | -0.002
RF | 103 | 0.096 | N/A | 0.365 | -0.006

There are two different ways to look at this: one is that FSR has nearly the same correlation as UZR in most categories. That’s good! It lends a lot of credibility to FSR to know that you can predict FSR nearly as well as UZR with ErrR or DPR. On the other hand, look at the huge difference in the range column. It appears that the fans are severely underestimating the importance of having great range (or have different ideas of how to evaluate range). That’s a problem! As we just saw, range is the most important component of UZR for every position. It is also not terribly surprising as I hypothesized at the beginning of this series that the fans are underestimating the importance of range in favor of flashier tools. This also explains a lot of the discussion about Derek Jeter’s defensive ability (or lack thereof).

This sums up the research portion of this series. I think all of this does lend a lot of credence to FSR: it does reflect that range is the most important component of defense and it does a good job of properly ranking the importance of the other components. In addition, the correlation between FSR and UZR is fairly strong but not so strong that the two systems are interchangeable. However, when considering FSR, be sure to mentally adjust when a player has particularly good (or bad) range.

In part 3, I will examine some of the player seasons that produced the most disparity in the two rankings.


A Decade of DH: The Mariners Post-Edgar Martínez

Edgar Martínez is kind of a demigod in Seattle. If you drive west past Safeco Field, parallel to the first-base line, you’re doing so on Edgar Martínez Drive (hang a right at home plate and you’re on Dave Niehaus Way).

He’s the only one of the franchise’s most celebrated players, besides hopefully Felix, to have spent his entire career with the Mariners organization, something Ken Griffey Jr. and Ichiro can’t lay claim to. His game-winning double in the 1995 ALDS is the Seattle Sports Moment to many, and it was just a quarterfinal, Super Bowl/’79 NBA Finals/1917 Stanley Cup be damned.

He’s also the greatest designated hitter of all time, and if not the greatest (which he is, in a completely objective manner of speaking untainted by my personal preferences) the player who perfectly typifies the designated hitter position. For years he has been the barometer of the DH, the mark by which all who came before and all who shall come after will be evaluated – Frank Thomas and Jim Thome weren’t purebred designated hitters the way Edgar was. Although Edgar won his first batting title playing third base, a hamstring injury relegated him to the DH role, so he was the only one of the three to primarily play designated hitter for most of his career, unlike Thomas and Thome, who spent much more time in the field. Also, Frank Thomas was an absolutely filthy pitcher in Backyard Baseball 2003, as well as complete garbage as a hitter in that very same game, so don’t you tell me that designated “hitter” is his primary position.

seriously this makes no sense

But whatever Edgar Martínez meant to the DH position went tenfold for the Mariners. For me Edgar’s retirement was in a way the turning of a page for the franchise, but his loss as a Mariners icon could ostensibly be counteracted somewhat by roster stalwarts Ichiro, Dan Wilson, Bret Boone, and Jamie Moyer (duh), among others.

What would change almost irreparably was the Seattle Mariners’ designated hitter slot that Martínez vacated when he retired.

2015 marks the Mariners’ eleventh season since Martínez’s retirement, meaning that a full decade has passed in the majors without Edgar on the Mariners. The first step in our journey into an Edgar-less world begins with a ranking of cumulative fWAR at the DH position by team from 2005 (the Mariners’ first season without Martínez) to 2014.

 

The first thing you’ll notice seems pretty intuitive, which is that the top 15 teams are all American League teams – this obviously makes sense because the DH rule is only applicable in the American League, so it would logically follow that the fifteen American League teams would have accrued the most Wins Above Replacement in the major leag-

wait, hold up – philadelphia????????

where are the mariners

um

ok

lemme just scroll down to find them, one sec

..

….

…………

uhhhhhhhhhhhh

So Mariners designated hitters rank 27th, which means that between the Mariners and the Astros, twelve – TWELVE National League teams, who only employ a designated hitter for a handful of games per season (interleague away games to be exact), have produced more fWAR at that position over the last decade than the Mariners, who use a designated hitter on pretty much a daily basis.

In fairness, the explanation for this seems logical: the defensive fWAR penalty for the designated hitter position (the highest of any defensive position) is cumulative – it increases with the amount of innings played at any given position. The Mariners have accrued ungodly of amounts value above below replacement by trotting out consistently bad DH production, whereas the magnitude of damage a National League DH can do to his team’s aggregate replacement value is limited by sample size. Something that I don’t completely understand is that FanGraphs’ data on National League players DHing seems to be incomplete, but maybe it just has something to do with eligibility not lining up with FanGraphs data or the fact the sample size of National League DHs is inadequte. Perhaps having all National League DHs accounted for just wouldn’t be worth the effort or be statistically significant.

Even if we remove the National League from the equation, the Mariners are still dead last in their own league by quite a bit. The next-worst team, Houston, has been in the American League for all of two full seasons and has managed to comfortably outpace Seattle (upon further examination this is made more impressive by the fact that one of those seasons, Chris Carter’s 37-dinger campaign in 2014, doubles Seattle’s cumulative fWAR over the entire decade in magnitude). But then again as stated before, Seattle have incurred a penalty for having adhered to the DH rule since its inception, whereas Houston have only had two seasons to let the DH penalty pile up.

In order to ascertain exactly what shenanigans could have gone down with the M’s DH position such that all Seattle DHs from 2005-2014 collectively managed to produce fewer wins than Edgar’s farewell 2004 season (his worst by fWAR, totaling -0.5), here’s a fond look back at some of the Mariners’ highlights at DH over the past decade (min. 100 PA), ordered by total fWAR produced for the Mariners.

1. Russell Branyan, 2009 & 2010, 125 wRC+, 3.4 fWAR

FanGraphs lists Branyan first in highest total accumulated fWAR post-Martínez for any Seattle DH, which makes sense, because his 2009 116-game stint with the Mariners was quite good-until you realize that he played all of those games for the Mariners at first base, meaning that 2.7 wins of these 3.4 weren’t even put up from the designated hitter slot. Fun fact: Branyan’s 2009 season, in which he swatted 31 homers and posted 126 wRC+, is the only season since 2006 in which a Mariners player has hit 30 homers (2013 Ibañez missed this mark by just one home run).

On the bright side, Branyan returned to the Mariners via trade in June of 2010, and this time he actually put in some time at DH. He managed to do quite well for himself, with 121 wRC+ in 238 plate appearances, and 25 of his 44 hits went for extra bases. After a couple 2011 stints with Arizona and Los Angeles, he then decided that playing for Seattle had ruined the major leagues for him and went on to play only in minor-league and Mexican league games for the rest of his career.

2. John Jaso, 2012, 143 wRC+, 2.6 fWAR

Jaso shared catching and DH duties with Jesus Montero in 2012, and in only 108 games became the second-most valuable position player on the team by fWAR behind Kyle Seager. Jaso also had the third-highest walk of rate of any position player in baseball (min. 350 PA) in 2012 and was Felix Hérnandez’s batterymate for his perfect game against the Tampa Bay Rays.

The Mariners then infamously dealt Jaso to division rival Oakland in a three-team deal that yielded a return of Michael Morse, former Mariner just coming off a career year with Washington in 2011. Morse played half of the 2013 season before injury caught up to him and he was shipped off to Baltimore.

Jack Z could lead the Mariners to 4 consecutive titles and be the executive of the year each of those years and Mariners fans (myself included) would probably still find time to complain about the Jaso-Morse Trade for some reason. Jaso is currently back with Tampa Bay and is currently injured, but his 2013 and 2014 seasons were still much better collectively than what the Mariners fielded during that same time period.

3. Mike Sweeney, 2009-10, 111 wRC+, 0.7 fWAR 

At the same time Russell Branyan was busy dirtying himself in the field and hitting big hits, the Mariners extended 2 non-roster invites in 2 consecutive years to great Royals player, great hitter, and great all-around guy Mike Sweeney (who doesn’t have a picture on his Wikipedia profile, but Yuniesky Betancourt does….?). Making the team both times, Sweeney was a productive hitter for the Mariners in 2009 and 2010 and was traded to the Phillies, they of the 15th-ranked DH fWAR, in time for the NLDS, where he was able to collect a hit in his first and only postseason at-bat (against Aroldis Chapman, no less). He then signed a one-day contract to retire a Royal. It’s nice to feel good about something tangentially Mariners-related every once in a while, especially because after we move past Sweeney on this list things start to get a little dicey.

4. Kendrys Morales, 2013 & 2014, 108 wRC+, 0.2 fWAR

Morales was one of the better hitters on a largely uninspiring 2013 Mariners team, boasting a .342 wOBA and a 119+ wRC, both of which would have led the team if not for an incredibly strange Raul Ibañez season. The Twins signed Morales as a free agent the following offseason for $7.6 million, which was kind of strange but made sense if they could turn that value into something. It turns out that something was the Mariners’ Stephen Pryor, and Morales ended up back in Seattle, where he had effectively iced his chances of cashing in on a qualifying offer from the Mariners a year ago. In 2014, Morales put up -0.8 fWAR for Minnesota and -1.0 fWAR for Seattle, by far the worst year of his career.

5. Jack Cust, 2011,  97 wRC+, -0.1 fWAR

A profile of Jack Cust came up on the Jumbotron at Safeco Field once while I was in attendance at a Mariners home game, in which he stated that his favorite quote was “play hard”, a nugget of wisdom Cust attributed to himself. The combination of that Jumbotron quote and Cust’s .116 ISO for the Mariners in 2011 continues to be one of the more perplexing relationships I’ve observed to date, as is the 97 wRC+ (Cust’s wRC on the season was 20).

6. Jose Vidro, 2007-08, 94 wRC+, -0.2 fWAR

Vidro, who leads the players on this list in plate appearances with 955, spent almost two seasons with the Mariners (2007 & 2008). His 2007 season with Seattle wasn’t bad – his .775 OPS was just above league average, his 10.1 BB% was just above league average, and his .345 wOBA was just above league average. Vidro decided to use 2008 to erase most of the solid work he had done in the previous year, cutting his walks in half, getting on base 70% as often, and found himself designated for assignment and later released that summer.

7. Greg Dobbs, 2004-06, 74 wRC+, -0.3 WAR

Dobbs didn’t really play enough to be remembered as outright terrible, usually taking backup duty at third base and then being relegated to a pinch-hitting/DH role in 2005/2006. Before we move on, there’s a couple interesting notes about Dobbs.

First, Greg Dobbs hit a home run in his first at-bat with the Mariners (and his first major league at-bat), which would turn out to be the only home run he would hit in 2004. This isn’t particularly notable except for the fact that it always reminds me that Miguel Olivo also homered in his first major league at-bat, which I will never forget for some incredibly frustrating reason. “Miguel Olivo homered in his first major league at-bat. He was with the Chicago White Sox and the home run was off Andy Pettitte”. This useless piece of information has been wasting my neural capacity ever since I read it on some Mariners gameday program, which I think had Richie Sexson on the cover, so I’m jointly blaming Sexson and Olivo both for forcing me to remember that information and also for being pretty underwhelming with the Mariners.

The second thing is Dobbs’ 2006 season, in which he was only around for 28 PA in 23 games, 18 of which he came in as a pinch-hitter. In those games he managed 150 wRC+ on a cool .435 BABIP. He also walked 0 times. Ultimately it was only 28 PA, an absurdly silly sample size, and clearly the Mariners felt similarly, because they waived out of him in 2007, whereupon Pat Gillick, now with the Phillies, decided to take a second chance on Dobbs long enough for him to earn a World Series ring in 2008.

8. Jeff Clement, 2006-09, 90 wRC+, -0.3 fWAR

Yeah, let’s just not go there.

9. Milton Bradley, 2010-11, 83 wRC+, -0.5 fWAR

Bradley contributed -0.5 fWAR in his time with the Mariners and interestingly enough, being bad at baseball may have been one of the better things he did in Seattle.

10. Carl Everett, 2006, 72 wRC+, -0.8 WAR

Carl Everett was a two-time All-Star in Boston, put up a six-win season with the Astros, and found himself signed by the Mariners for the 2006 season, where he had his worst offensive season by far. Everett was league-average in B% and K% and that was about it. Everett was released in July of 2006 at the age of 35, having posted -0.8 fWAR in 92 games. You could say he was getting to be a bit of a dinosaur, but don’t tell Carl Everett anyone said that about him.

11. Ben Broussard, 2006-07, 88 wRC+, -0.8 fWAR

Broussard was acquired by the Mariners from Cleveland in the second half of the 2006 season, filling in mostly at DH after Carl Everett’s dismissal; in that time-frame Broussard’s figure of 78 wRC+ is not particularly inspiring, and only slightly bests Everett’s. Broussard then spent most of his first and only full season (2007) for the Mariners switching between first base and the corner outfield spots, His overall 88 wRC+ was a slight improvement but still not great, especially while the man Bill Bavasi gave up to acquire Broussard, Shin-Soo Choo, has produced 24.3 fWAR since the 2007 season.

Back to Broussard, though:

12. Eduardo Perez (2006), 48 wRC+, -0.8 fWAR

I don’t think I really have to say anything here.

13. Jesus Montero (2012-), 83 wRC+, -0.9 fWAR

Jesus Montero is now somewhat of a tragic figure among Marinerds. In the 2012-2013 offseason, Montero claimed that he had a coach who had helped him ‘learn to run’, which I guess if you haven’t yet is probably a good idea. He then cut his 2013 season short by getting suspended for his involvement in the Biogenesis snafu. Last August, he got into a bizarre altercation. Montero’s role in the organization has strayed far from the top hitting prospect the Mariners traded Michael Pineda(who’s now the #3 in New York) for. Montero continues to toil away in the Mariners’ farm system (reports out of Tacoma yesterday were that he legged out an infield single), in the hopes he can top 2013’s terrible 64 wRC+. Montero is still only 25, so it’s not as if all hope is lost, but the fact remains that his on-field production at the major league level has been nothing short of disappointing.

14. Ken Griffey, Jr. [1989-99(omitted), 2009-10], 84 wRC+, -0.9 fWAR

Nobody was expecting 90s Griffey Jr. the ballplayer, which was convenient because he didn’t show up. Griffey was only mediocre in 2009, with a wRC+ of 97 and a wOBA of .324, rendering him merely replacement-level (0.0 fWAR). 2010 was a different story. Griffey Jr. posted a depressingly bad 32 wRC+ in 108 plate appearances. Amid issues with Don Wakamatsu restricting his playing time and the bizarre rumor of Griffey napping during a potential pinch-hit opportunity, things came to a head in June of 2010, when Griffey abruptly left the club, drove home, and announced his retirement before the next days game. The Kid’s return to Seattle was a welcome dip into the nostalgia-drenched coffers of yesteryear for a struggling ballclub, and before anyone had time to process it, the sweetest swing in baseball was silenced in a flash.

15. Corey Hart (2014), 70 wRC+, -1.1 fWAR

Hart has the lowest single-season WAR of any player on this list. The Mariners paid $6 million with $7 million in undisclosed incentives for a year of Hart coming off a knee surgery that caused him to miss all of 2013, which at the time seemed to be a reasonable gamble for a high-risk commodity that could potentially have paid great dividends. Unfortunately, Hart struggled to stay healthy and perform. playing only 68 games in the season; the midseason acquisition of Kendrys Morales certainly didn’t help. Ultimately the Mariners’ front office decided to take a gamble with Hart and lost.

 

BONUS ROUND (fun with small sample sizes): Scott Spiezio in 2005

Scott Spiezio in 2004 was simply a bad infielder and player, posting a miserable .288 OBP, 67 wRC+, among other poor statistics on his way to putting up a below replacement-level season of -0.1 fWAR.

Big deal though – it’s one bad season. Besides, if you look up on that same list, Carl Everett put up -0.8 fWAR in 92 games. Eduardo Perez did it in 43!

This is nothing to Scott Spiezio.

Again, the sample size here is ridiculously small, but the absurdity of the numbers he managed to log in 29 games is honestly kind of impressive. The Mariners released him from here and he then went on to win the World Series with St. Louis, even hitting a game-tying triple in Game 2 of the 2006 NLCS, so fortunately he was of some use to a team after that 2005 season.

 

In fairness, the designated hitter is not an incredibly stable position in today’s game. It doesn’t make sense to pay a premium for a skill that can be replicated by other hitters who can also play competent defense. There are only a handful of “conventional” designated hitters in the league, and even Victor Martinez and his -31.1 UZR/150 in 2014 are called upon to play defense every once in a while. For whatever reason – be it sentimentality with Griffey or having the odds stacked against them with Hart or just because Bavasi (Clement, Broussard), the Mariners have gotten an extraordinarily poor level of performance out of their designated hitters in an Edgar-less world, with some bright spots (Jaso, Branyan, 1 season of Kendrys Morales).

This season, Seattle will call on 2014 home-run king Nelson Cruz to fill in most nights at DH (at least as long as Seth Smith and Justin Ruggiano are healthy enough to man right so Cruz doesn’t have to). In 10 years of trying to fill Edgar’s place, the Mariners haven’t quite succeeded and probably won’t ever do so, but wouldn’t it be something to see them come close?


Gausmanian Distribution

At the end of spring training, Buck Showalter banished Kevin Gausman from the rotation in favor of Ubaldo Jimenez, a pitcher with a much higher salary and much less talent.  Many assumed that Jimenez’ salary largely dictated the move. Yes, he outpitched Gausman in spring training (4.44 ERA to 7.04), but it’s hard to believe that Showalter invests very much in spring training stats, and in any case if you put “4.44” into Google Translator, “success” is unlikely to be one of the resulting character strings.

One Orioles fan of my acquaintance heard that Showalter’s decision had more forethought: Buck’s intent may be to use Gausman much as the fireman reliever of old, and bring him in to critical situations in ballgames regardless of today’s ossified reliever usage patterns. Bill James long ago established that this is the most effective way to use a top-flight reliever, but it is less clear that this is the best way to use a potential #1 starter. Gausman is the only pitcher on the Orioles 25-man roster who has even  a prayer of turning into an ace, and it seems unlikely he’ll do it from the pen.

Gausman’s had a somewhat unusual start to his career. In his first two years as a major leaguer, he started 25 games and made 15 relief appearances. There are a total of 15 active pitchers who had at least 25 starts and 15 relief appearances in their first two years:

 

(Table courtesy of the invaluable Baseball Reference Play Index)

It’s certainly an eclectic mix. Only Buehrle established himself as an ace, though Arroyo has had a good career as a mid-rotation workhorse, and Masterson and (to a lesser extent) I-Can’t-Believe-It’s-Not-Fausto-Carmona have made useful contributions. For other starters on this list (Wood, Kelly) it’s too soon to tell. Affeldt and Stammen wisely gave up starting and have become bullpen mainstays. More sobering, many of the names on this list have had their careers derailed by injuries. It’s hard to know whether the mixed usage contributed to injury problems for guys like Ogando, Billingsley, and Holland; it is equally possible that conserving these young arms early may have averted even more serious or earlier arm trouble.

Gausman sits uneasily here; he is by far the highest drafted pitcher on this list (fourth overall in 2012). It is unsurprising to see a club experiment with a 38th-round pick who struggles to break a pane of glass, like Buehrle. Such tinkering is less common with a player drafted to be a rotation anchor. Indeed, there are only two other first-rounders on this list, Billingsley and Lynn.

In his first season (2006), Billingsley started 16 games and came in from the bullpen twice. He put up a respectable 3.80 ERA, but with atrocious peripherals (5.8 BB/9, 5,9 K/9). The Dodgers understandably exiled him to the bullpen to start the 2007 season, but Dresden-like pyrotechnics from Proven Veterans Mark Hendrickson, Brett Tomko, and Jason Schmidt forced the Dodgers to put Billingsley back in the rotation in June, and he acquitted himself reasonably the well the rest of the way. He would go on to have uneven success over the next four seasons until diagnosed with a torn UCL in September 2012. He has pitched in two major league games since.

Lance Lynn offers a happier comp for Gausman. He appeared largely in relief (2 starts in 18 games) in 2011. Despite Kyle McClellan’s runtastic performance as the Cardinals’ fifth starter, LaRussa elected not to insert Lynn into the rotation; the Cardinals instead traded for Edwin Jackson, who stabilized the fifth spot.  This seems similar to Showalter’s choice: go with the established if not necessarily dominant veteran in lieu of the risky young flamethrower. Lynn had put good numbers in 2011 at AAA, but not in 2010. The Cards’ reluctance to turn over a rotation spot to him in the midst of a playoff run was understandable. Lynn has been in the rotation since 2012, and has consistently produced very close to his career marks of 3.32 FIP and 2.71 K/BB, despite some jumpiness in his ERA.

Both these examples tend to suggest Showalter is making a mistake. The Dodgers finally ran out of Jason Schmidts, while the Cards went with the good-enough E-Jax (and, to  be fair, won the World Series). But in each case the young replacement would quickly prove himself superior to the older and supposedly safer option when finally given the chance. There are very few who would predict that, over the course of 30 starts, Jimenez will outperform Gausman in any significant statistical category.

But Showalter has other things on his mind. Specifically, this:

#27 Orioles


Name IP ERA FIP WAR
Chris Tillman 184.0 4.10 4.40 1.4
Wei-Yin Chen 169.0 4.04 4.17 1.5
Miguel Gonzalez 157.0 4.42 4.84 0.5
Bud Norris 154.0 4.15 4.30 1.1
Ubaldo Jimenez 146.0 4.28 4.38 0.9
Kevin Gausman 91.0 3.97 4.00 0.9
Dylan Bundy 18.0 4.40 4.56 0.1
Total 919.0 4.17 4.38 6.4

 

Yep, this is the FanGraphs Depth Chart projection for the Orioles starting rotation, with the O’s ranked 27th out of 30. Not a single starter checks in with a FIP under 4.00. This is a shaky rotation, and the Orioles have no quick way of making it better. Eventually, perhaps as early as next year Gausman, Bundy, and Hunter Harvey will form an enviable top 3, but there’s another problem on Buck’s plate. Next year, much of the current roster may be lost to free agency, including Chris Davis, Matt Wieters, Chen, and Norris. The Orioles are under enormous pressure to win now.

And Gausman can help! Because at this stage of his career, he is a much better reliever than starter. The big difference is in strikeouts:

AL average starter K/9: 7.1

AL average reliever K/9: 8.3

Gausman as starter K/9: 7.0

Gausman as reliever K/9: 11.7

That there is some major whiffage for a staff in dire need of it. Put Gausman together with Zach Britton, Darren O’Day, and Tommy “Big Game” Hunter, and the Orioles have a fully weaponized bullpen.  Buck’s plan is to hold on for the first five or six innings, and them shut down the opponent’s offense while the Orioles bats bludgeon their way to victory. And with Gausman acting as a mobile reserve, Showalter can shrink the innings for which the starters are responsible, but do so on a game-to-game basis. On those days when the starters happen to be effective they can go longer, and on those days (more often than not, one suspects) that they get into trouble, Showalter will be able to address some of that trouble with the best arm on the roster.

This isn’t the way I would ordinarily do it, but then again, this isn’t the roster I would have assembled. Showalter has repeatedly shown an ability to work with the tools he has rather than impose some prefabricated tactical rule set that disregards the strengths and weaknesses of his players. Baltimore’s road to the playoffs is neither straight nor sure, but at least it’s Showalter behind the wheel.


Insurance in Baseball is Like a Black Hole

How much gravity does insurance have in Major League Baseball front office decisions?

Puns aside, let me tell you the funny thing about a black hole. You see astronomers cannot really see one, instead they are detected through their effects on the universe around them. Although less extreme, insurance is similar in this regard on its impact on baseball teams. Most teams insure some of their larger contracts in case their players cannot play due to an exterior factor such as injury. Perhaps the impacts of this major facet of the game does not cross our mind often because it is not eminently visible. However, make no mistake that insurance is a major factor when teams make major decisions regarding the DL, contract extensions, playing time and so on.

First, consider the history of baseball insurance to better understand why it impacts baseball. It can be said that by the late 1990s it had become common place for teams to insure their larger contracts. The first time baseball and insurance first truly started getting media attention was with Albert Belle in 2001 due to confusion over his insurance contract. Albert Belle had suffered a career-ending hip injury with the Orioles. Fans grew excited however despite the disheartening news when in 2002 the all-star slugger was added back onto the Orioles’ 40 man roster. Disappointed Orioles fans can tell you that Belle never played another MLB game however. Instead, he was added back onto the 40-man roster so the Orioles could collect insurance on his contract (some MLB insurance contracts do not cover a player unless they remain on the 40-man roster). At the time the MLB insurance contract covered the remainder of the salary owed on Belle’s contract. According to writer Michael Branda, the Orioles recovered an astounding 27.3 million out of a 39-million-dollar loss represented by Belle’s injury. Since the huge losses on Belle’s contract in the early 2000s, insurers in baseball have become much more stringent with their underwriting in baseball contracts.

Belle was not the only reason for teams being more stringent with their underwriting. The associated risks with insuring a MLB player have increased. One reason is PEDs. The MLB really did not enforce its ban on PEDs up until the late 2000s, but now being caught using PEDs can result in a significant loss in playing time. According to MLB Trade Rumors a syndicate of the MLB, Ervin Santana a first-time offender was suspended for 80 games on April 3rd of 2015. Santana was expected to make 13.5 million dollars this season. Despite being suspended, the Minnesota Twins are expected to still have to pay half of Santana’s salary (only players caught for the second time or more for steroids lose most of their salary).

To account for this insurers enforce a 60 to 90 day deductible policy in order to shield themselves from these sort of losses as well as claims made for short-term injuries. In addition to the 90 day deductible insurance policies are typically term policies of about three years with an option for renewal after the conclusion of the contract. As a result, if a contract proves to incur severe losses the liabilities will be only in the short term. Obviously to further protect themselves insurers only cover players with no preexisting injuries and all players must be inspected by the insurer. Furthermore, with wide variability and unpredictability in player health an insurer can in a way readjust its rate every three years to better reflect the player’s risk of not playing.

These more stringent underwriting practices have influenced the game, oftentimes when deciding when to take a pitcher off of the DL. Adam Kilgore of the Washington Post explained it best when he talked about how in 2012 Stephen Strasburg was “shut down” during a playoff push for the Nationals due to potential health concerns. In this decision there appeared to be a delicate balance between contention and the considerations of the insurance company that would not have covered Strasburg if he had been injured due to these health concerns. It is not crazy to think that finances come into play when making a decision on baseball players. Jeff Moorad, a decision maker for the Padres explained that in 2010 Chris Young was eligible to come off the DL. The Padres ultimately chose to put Young back on the field but Moorad added, “the accounting department much preferred that he [had stayed] on the disabled list.”

Baseball insurance has grown steadily more expensive. Although it is difficult to ascertain how much a team actually spends on insurance it is clear that it can be a burden for smaller teams. For instance, the Arizona Diamondbacks’ rotation once featured the dominant starting pitcher Brandon Webb. Between 2006 and 2008 Brandon Webb made three all-star games and finished first, second and second respectively in Cy Young voting. In this span Webb also led the league in innings pitched once. In essence Webb was due for a large payday once his contract expired. However contract negotiations between Webb and the Diamondbacks hit a snag in June of 2008. Despite a track record as a durable starter, no insurance company was willing to write a policy for injury risk to the Diamondbacks hurler. The reason is because insurance companies refused to accept the risk of Webb injuring his arm or if they were willing to, they were going to charge exorbitant rates.

As a result, the Diamondbacks could not get an insurance contract that did not have an exclusion on arm, shoulder or elbow injuries, all vital and injury-prone body parts for pitchers. According to AZCentral, a news outlet that covers the Arizona Diamondbacks, due to Webb not being insurable the Diamondbacks broke off all contract negotiations. In the long run, this proved to be a smart move since ten months later after contract talks ceased Brandon Webb never pitched in the majors again due to injuries. The point of this is that not only is baseball insurance so expensive it can prove to be impractical, it shows that most insurers are unwilling to insure some pitchers. Walt Jocketty, the former general manager for the middle-market St. Louis Cardinals explained that insurance has “become so expensive that it’s a cost item we really have to look at when you put your payroll together.”

In addition to insurance contracts altering how teams manage their rosters they influence how teams treat their players. In essence there is a human side to these contracts. Consider Josh Hamilton who is in the middle of a massive five-year, 125-million-dollar contract with the Angels and who has been playing quite ineffectively relative to his salary. Josh Hamilton who recently suffered a relapse on his drug addiction before the 2015 season is a prime example of insurance influencing teams to treat their players in a way they hopefully normally would not. Despite being a repeat offender, an arbitrator chose not to suspend Hamilton for any period of time. What makes this story scandalous is the Angels’ seemingly acerbic response to this news. It appears that the Angels almost wanted Hamilton to be suspended to spare them the expense of his failed contract. Clearly the Angels have little incentive to help Hamilton recover from his addiction. Instead, it is in their interest to see that Hamilton never plays another game of baseball again because if he does not play for an extended period of time the Angels can potentially collect insurance and definitely reduce their payroll.

Insurance influences baseball more than many people may realize. When it comes to playing time, DL decisions and contract negotiations, insurance seems to be an integral piece in the decision-making process. For me though, part of what makes baseball great is the inherent competition of players, often with disregard to their own body (*cough* Adam Eaton *cough*). There is little harm of teams protecting themselves from the inherent risks of baseball players becoming injured. The risk is that insurance becomes an incentive for teams to make decisions that may be bad for the game, such as not playing players for financial gain. Let’s hope that ultimately, teams do not get engulfed into this black hole.


Hardball Retrospective – The “Original” 1999 Texas Rangers

In “Hardball Retrospective: Evaluating Scouting and Development Outcomes for the Modern-Era Franchises”, I placed every ballplayer in the modern era (from 1901-present) on their original team. Therefore, Fergie Jenkins is listed on the Phillies roster for the duration of his career while the Pirates claim Barry Bonds and the Rays declare Carl Crawford. I calculated revised standings for every season based entirely on the performance of each team’s “original” players. I discuss every team’s “original” players and seasons at length along with organizational performance with respect to the Amateur Draft (or First-Year Player Draft), amateur free agent signings and other methods of player acquisition.  Season standings, WAR and Win Shares totals for the “original” teams are compared against the “actual” team results to assess each franchise’s scouting, development and general management skills.

Expanding on my research for the book, the following series of articles will reveal the finest single-season rosters for every Major League organization based on overall rankings in OWAR and OWS along with the general managers and scouting directors that constructed the teams. “Hardball Retrospective” is available in digital format on Amazon, Barnes and Noble, GooglePlay, iTunes and KoboBooks. The print edition is coming soon. Additional information and a discussion forum are available at TuataraSoftware.com.

Terminology

OWAR – Wins Above Replacement for players on “original” teams

OWS – Win Shares for players on “original” teams

OPW% – Pythagorean Won-Loss record for the “original” teams

Assessment

The 1999 Texas Rangers         OWAR: 50.4     OWS: 284     OPW%: .512

GM Tom Grieve acquired 79% (38 of 48) of the ballplayers on the 1999 Rangers roster. 38 of the 48 team members were selected through the Amateur Draft process. Based on the revised standings the “Original” 1999 Rangers placed six games behind the Mariners in the American League Western Division race. Texas (83-79) claimed the Wild Card by a one-game margin over Chicago and Kansas City.

Perennial All-Star backstop Ivan Rodriguez enhanced his trophy case with the 1999 A.L. MVP award. “Pudge” produced a .332 BA while notching career-bests in home runs (35), RBI (113), runs scored (116), base hits (199) and stolen bases (25). Rodriguez collected 13 Gold Glove Awards including 10 in consecutive seasons (1992-2001). “Slammin’” Sammy Sosa launched 63 moon-shots, drove in 141 baserunners and registered 114 tallies. Juan “Igor” Gonzalez belted 39 round-trippers, knocked in 128 runs and delivered a .328 BA after an MVP season in the previous campaign.

Fernando Tatis (.298/34/107) enjoyed a career year over at the hot corner, scoring 104 runs and swiping 21 bags. Rusty Greer clubbed 41 doubles, 20 big-flies and plated 101 baserunners while eclipsing the .300 mark for the fourth successive season. Rich Aurilia (.281/22/80) and Mike Stanley (.281/19/72) supplied additional thump towards the bottom of the lineup. Warren Morris parlayed a .288 BA and 15 long balls into a third-place finish in the Rookie of the Year balloting.

Rodriguez slots into 13th place in “The New Bill James Historical Baseball Abstract” among backstops. He certainly elevated his ranking after playing ten additional years following the publication of NBJHBA in 2001. Right fielders Sosa and Gonzalez are listed in 45th and 52th place, respectively.

LINEUP POS WAR WS
Warren Morris 2B 1.71 15.4
Ivan Rodriguez C 5.22 28.63
Fernando Tatis 3B 5.05 23.74
Sammy Sosa RF 4.98 26.64
Juan Gonzalez DH/RF 2.88 24.42
Rich Aurilia SS 3.06 18.11
Rusty Greer LF 2.32 21.03
Mike Stanley 1B 1.82 13.67
Terrell Lowery CF -0.17 3.21
BENCH POS WAR WS
Rey Sanchez SS 2.59 11.29
Jose Hernandez SS 2.3 16.33
Dean Palmer 3B 1.04 16.71
Hanley Frias SS 0.22 4.42
Edwin Diaz 2B 0.17 0.62
Kevin L. Brown C 0.12 0.58
Jon Shave SS 0.09 1.98
Bill Haselman C -0.04 3.84
Jeff Frye 2B -0.13 2.35
Ruben Mateo CF -0.26 1.9
Kelly Dransfeldt SS -0.26 0.8
Chad Kreuter C -0.58 3.51

Kevin J. Brown, the undisputed ace of the Texas rotation, compiled a record of 18-9 with a 3.00 ERA, 1.066 WHIP and 221 strikeouts. The balance of the starting staff submitted sub-par efforts in contrast to their career norms. Jeff Zimmerman (9-3, 2.36) fashioned a 0.833 WHIP and received an invitation to the Mid-Summer Classic during his rookie campaign.

ROTATION POS WAR WS
Kevin J. Brown SP 5.54 19.92
Darren Oliver SP 3.94 12.45
Rick Helling SP 3.78 12.52
Kenny Rogers SP 2.97 11.57
Wilson Alvarez SP 1.89 9.95
BULLPEN POS WAR WS
Jeff Zimmerman RP 3.67 14.64
Mike Venafro RP 1.19 7.36
Mark Petkovsek RP 0.84 9.49
Terry Mathews RP 0.31 2.54
Danny Kolb RP 0.13 1.9
Brian Bohanon SP 1.63 9.68
Ryan Dempster SP 1.45 6.98
Jim Brower SP 0.42 1.87
Robb Nen RP 0.07 7.89
Danny Patterson RP -0.08 2.64
Mike Cather RP -0.17 0
Corey Lee RP -0.2 0
Jonathan Johnson RP -0.26 0
Bobby Witt SP -0.28 4.52
Billy Taylor RP -0.3 5.54
Dan Smith SP -0.34 1.73
Tony Fossas RP -0.39 0
Ryan Glynn SP -0.42 0
Scott Eyre RP -0.66 0
Doug Davis RP -0.66 0
Julio Santana SP -1 0.17
Matt Whiteside RP -1.1 0

The “Original” 1999 Texas Rangers roster

NAME POS WAR WS General Manager Scouting Director
Kevin Brown SP 5.54 19.92 Tom Grieve Sandy Johnson
Ivan Rodriguez C 5.22 28.63 Tom Grieve Sandy Johnson
Fernando Tatis 3B 5.05 23.74 Tom Grieve Sandy Johnson
Sammy Sosa RF 4.98 26.64 Tom Grieve Sandy Johnson
Darren Oliver SP 3.94 12.45 Tom Grieve Sandy Johnson
Rick Helling SP 3.78 12.52 Tom Grieve Sandy Johnson
Jeff Zimmerman RP 3.67 14.64 Doug Melvin Chuck McMichael
Rich Aurilia SS 3.06 18.11 Tom Grieve Sandy Johnson
Kenny Rogers SP 2.97 11.57 Eddie Robinson Joe Klein
Juan Gonzalez RF 2.88 24.42 Tom Grieve Sandy Johnson
Rey Sanchez SS 2.59 11.29 Tom Grieve Sandy Johnson
Rusty Greer LF 2.32 21.03 Tom Grieve Sandy Johnson
Jose Hernandez SS 2.3 16.33 Tom Grieve Sandy Johnson
Wilson Alvarez SP 1.89 9.95 Tom Grieve Sandy Johnson
Mike Stanley 1B 1.82 13.67 Tom Grieve Sandy Johnson
Warren Morris 2B 1.71 15.4 Doug Melvin
Brian Bohanon SP 1.63 9.68 Tom Grieve Sandy Johnson
Ryan Dempster SP 1.45 6.98 Doug Melvin Sandy Johnson
Mike Venafro RP 1.19 7.36 Doug Melvin Sandy Johnson
Dean Palmer 3B 1.04 16.71 Tom Grieve Sandy Johnson
Mark Petkovsek RP 0.84 9.49 Tom Grieve Sandy Johnson
Jim Brower SP 0.42 1.87 Tom Grieve Sandy Johnson
Terry Mathews RP 0.31 2.54 Tom Grieve Sandy Johnson
Hanley Frias SS 0.22 4.42 Tom Grieve Sandy Johnson
Edwin Diaz 2B 0.17 0.62 Tom Grieve Sandy Johnson
Danny Kolb RP 0.13 1.9 Doug Melvin Sandy Johnson
Kevin Brown C 0.12 0.58 Tom Grieve Sandy Johnson
Jon Shave SS 0.09 1.98 Tom Grieve Sandy Johnson
Robb Nen RP 0.07 7.89 Tom Grieve Sandy Johnson
Bill Haselman C -0.04 3.84 Tom Grieve Sandy Johnson
Danny Patterson RP -0.08 2.64 Tom Grieve Sandy Johnson
Jeff Frye 2B -0.13 2.35 Tom Grieve Sandy Johnson
Terrell Lowery CF -0.17 3.21 Tom Grieve Sandy Johnson
Mike Cather RP -0.17 0 Tom Grieve Sandy Johnson
Corey Lee RP -0.2 0 Doug Melvin
Ruben Mateo CF -0.26 1.9 Doug Melvin Sandy Johnson
Jonathan Johnson RP -0.26 0 Doug Melvin Sandy Johnson
Kelly Dransfeldt SS -0.26 0.8 Doug Melvin
Bobby Witt SP -0.28 4.52 Tom Grieve Sandy Johnson
Billy Taylor RP -0.3 5.54 Eddie Robinson
Dan Smith SP -0.34 1.73 Tom Grieve Sandy Johnson
Tony Fossas RP -0.39 0 Eddie Robinson
Ryan Glynn SP -0.42 0 Doug Melvin Sandy Johnson
Chad Kreuter C -0.58 3.51 Tom Grieve Sandy Johnson
Scott Eyre RP -0.66 0 Tom Grieve Sandy Johnson
Doug Davis RP -0.66 0 Doug Melvin
Julio Santana SP -1 0.17 Tom Grieve Sandy Johnson
Matt Whiteside RP -1.1 0 Tom Grieve Sandy Johnson

Honorable Mention

The “Original” 2001 Rangers              OWAR: 48.4     OWS: 278     OPW%: .513

Sosa shredded opposition pitching to the tune of a .328 BA while launching 64 moon-shots, registering 160 RBI and scoring a League-best 146 runs. Aurilia delivered career-bests with a .324 BA, 37 dingers, 97 ribbies and 114 tallies as he topped the circuit with 206 safeties. Gonzalez swatted 35 big-flies and knocked in 140 baserunners. Zimmerman notched 28 saves and Brown furnished a 2.65 ERA in 19 starts.

On Deck

The “Original” 1924 Senators

References and Resources

Baseball America – Executive Database

Baseball-Reference

James, Bill. The New Bill James Historical Baseball Abstract. New York, NY.: The Free Press, 2001. Print.

James, Bill, with Jim Henzler. Win Shares. Morton Grove, Ill.: STATS, 2002. Print.

Retrosheet – Transactions Database

Seamheads – Baseball Gauge

Sean Lahman Baseball Archive


Are Two Opening-Day Homers Merely Dust-in the Wind?

As a Red Sox fan, I got very excited opening day when Dustin Pedroia hit two home runs. One of the big questions of this offseason is whether he has upper-single-digit homer power, or upper-teens homer power. Of course, as a thinking baseball fan, my head tells me to avoid getting overly excited about a small sample size. But does the two-HR outbreak actually tell us nothing? I think the expectations going into the season combined with Pedroia’s performance in his first game is a perfect situation to use Bayes’ Theorem.

To elaborate, I think Pedroia’s expectations going into this season have a bimodal distribution. If you look at his 2008-2012 seasons, he averaged 16 HR per year. His last two seasons averaged 8 HR per year. Was this due to a real decline, or due to injuries that sapped his power? While someone like Mike Trout might have a nice normally-distributed expectation around 35 HR, I expected Pedroia to have an either/or season: he’d either get back to 2008-2012 production, or continue as a 8-HR guy.

Now for a review of Bayes’ Theorem: it tells you how to update your prior beliefs given an observation. The formula for this is P(A|B) = P(B|A)*P(A)/P(B), where A and B are events, P(A) and P(B) are the probabilities of those events, and P(A|B) or P(B|A) should be read as “Probability of A given B,” or “Probability of B given A,” respectively. Specifically, in this case, A is “Dustin Pedroia is a 16-HR guy”, and B is “Dustin Pedroia hit 2 HR in his first game of the season”. I had a preseason belief about P(A), but I want to update it given that event B has occurred.

As implied above, I’m going to simplify Pedroia’s season outcomes into two possible outcomes: He is an 8-HR guy, or a 16-HR guy. Before the season, I’m going to guess that I had about a 50-50 belief that he was either one. Another assumption I’m going to make, to make the math easier, is that a season will see 640 plate appearances. You can make your own assumptions, but this is a demonstration of how much Bayes’ Theorem helps us update beliefs based on just one observation.

We need to determine three quantities to do our calculation now:
1. P(A)—probability that Pedroia is a 16-HR guy
2. P(B|A)—probability that we would see Pedroia hit 2 HR in his first 5 plate appearances, given that he is a 16-HR guy
3. P(B)—probability that we would see Pedroia hit 2 HR in his first 5 plate appearances (taking our 50-50 chance that he’s a 16 or 8-HR guy as a given)

1. Probability that Pedroia is a 16-HR guy

Easy. By assumption, P(A) is 50%.

2. Probability that we would see Pedroia hit 2 HR in his first 5 plate appearances, given that he’s a 16-HR guy

Tougher, but we can use a binomial probability model. That is 5C2*P(HR)^2*(1-P(HR))^3. When we have 16 HR in 640 plate appearances, P(HR) is 1/40, and 1-P(HR) is 39/40. This turns out to be .00579. P(B|A)= 0.579%.

3. Probability that we would see Pedroia hit 2 HR in his first 5 plate appearances, with preseason assumptions

This is the weighted average of all his possible season outcomes—so probability of 2HR in 5PA, given that he is a 16-HR guy, times the chance that he’s a 16-HR guy, PLUS, probability of 2HR in 5PA, times the chance that he’s an 8-HR guy. The same calculation as in number 2 can be done for if he’s an 8-HR guy, yielding an answer that the chance that he’d hit 2HR in 5PA is 0.151%. Given our calculation in the above paragraph, and our preseason assumption that it’s 50-50 that he’s an 8 or 16-HR guy, that gives us a weighted average P(B) = 0.365%.

So now we can mash all of those numbers into Bayes’ equation, and we find that .50*.00579/.00365 = .794, or 79.4%! Turns out that my Red Sox-loving lizard brain was not wrong! If you believed preseason that there was a 50%-50% chance that Pedroia would return to his 2008-2012 form, you should rationally update your beliefs to 80%-20% on the minuscule sample size of just two home runs in five plate appearances! Another note is that we should be forward-looking: since he has nearly a full season of plate appearances remaining, it might be rational to think that he’s likely to be an 18-HR guy, now that he has 2 in the bag.

This method could be adapted to a continuous expectation of outcomes, allowing a chance that Pedroia might be something besides an 8HR guy or a 16HR guy (although you and I know that that is clearly absurd).