Archive for Research

When Do Stars Become Scrubs?

April 30, 2015

Baseball is a game driven by stars. They create the most exciting highlight reels that captivate audiences and leave us all in awe. However, eventually every star player loses their battle with Father Time. The purpose of this research was to try and determine when a star player’s production declines to the point where they can become easily replaceable. I decided to use a process called survival analysis to determine when this event occurs.

Methodology

Survival analysis attempts to determine the probability of when an event will occur. In any survival analysis problem, you need to determine three things. You need to determine the requirements for your population, the variables to predict the time of event, and the event.

For this problem, I decided that I would include any player that had their first season of 4 WAR or higher between 1920 and 1999 in my population. I decided to use for my variables: the age when they recorded their first star season, body mass index, offensive runs above average per 150 games, and defensive runs above average per 150 games as my variables. The event I chose to predict was when the player would have his first season below 1 WAR following their star season. The cutoffs for determining stars and scrubs were fairly arbitrary, but I chose these cutoffs because the FanGraphs glossary loosely defines an All-Star season as 4-5 WAR and a scrub season as 0-1 WAR.

Determining the variables was much more difficult. I wanted to pick variables that would represent a player’s performance, age, and overall health. The age was simple enough to find, but it was difficult to find any injury history for players so I decided to calculate a player’s BMI from their listed height and weight. Obviously this isn’t a perfect representation, because a player’s weight is constantly changing throughout his career, but it’s the best that I could do given my limited resources. In order to limit my performance variables, I thought it was best to settle for the offensive runs and defensive runs component of WAR. However, since these are accumulating statistics, I had to recreate them as rate statistics in order to avoid creating correlation issues with the age variable in the model. I would have liked to use more offensive variables, but I feared that adding more inputs would make the model too convoluted and affect the accuracy of the player predictions. Alright, that’s enough preparation; let’s dive into the actual data.

Survival Rate Data

As a jumping off point, I’ll start by presenting a table of the survival rates for my population. Each season indicates the percentage of players from the original population that had not yet recorded a scrub season.

Season	1	2	3	4	5	6	7	8	9	10
Survival Function	87.62%	74.28%	65.20%	54.88%	45.80%	39.06%	32.32%	26.96%	22.15%	17.19%
Season	11	12	13	14	15	16	17	18	19	20
Survival Function	13.76%	10.73%	7.57%	5.50%	3.44%	2.06%	1.24%	0.69%	0.28%	0.00%

Let’s make some quick observations. The data shows that no star player has gone more than 20 seasons without recording a season below 1 WAR. It also appears that the survival function decays exponentially. I also found it interesting that over 50% of stars turn into scrubs by their fifth season and that only 17% of star players survive 10 years in the majors before they register a scrub season. Looking at this data really helps to appreciate how rare it is when players like Derek Jeter and Adrian Beltre perform at a consistent level on a year to year basis.

Hazard Rate Data

Next, we will look at the hazard rate of the players in the population. One of the purposes of examining the hazard rate is to see how the rate of failure changes in a population over time. To find the hazard rate for each time period, you divide the amount of events recorded during a time period by the amount of players that have not yet registered a scrub season. Below is the following calculation for each time period in table format.

Season	1	2	3	4	5	6	7	8	9	10
Hazard Function	12.38%	15.23%	12.22%	15.82%	16.54%	14.71%	17.25%	16.60%	17.86%	22.36%
Season	11	12	13	14	15	16	17	18	19	20
Hazard Function	20.00%	22.00%	29.49%	27.27%	37.50%	40.00%	40.00%	44.44%	60.00%	100.00%

As you can see by the table above, the hazard rate generally increases with each passing season. This makes sense, because as players age, their skill level decreases and their odds of registering a scrub season will increase. However, the hazard rates are fairly constant for the first ten years and then rapidly increase from then on. I’m rather surprised that the hazard rates stayed so consistent for the first ten or so years. I would have guessed that the hazard function would have increased much more rapidly with each passing season.

Determining the Model

It is important to identify the trend of the hazard function, because it helps determine which distribution to use when creating a parametric model. If the hazard rate increases exponentially, you are supposed to use a Weibull distribution. If the hazard rate is constant, you are supposed to use an exponential distribution. Since the hazard function was increasing, I originally attempted to the use the Weibull distribution for the model but I found that the model was predicting too many players to fail in the first few seasons, so I decided to try an exponential distribution instead.

I found that the exponential distribution model was more accurate at predicting survival rates in the first ten years, but severely under predicted the amount of players that would record a scrub season after ten years. I decided to use the exponential distribution, because I believe that it would be far more useful to accurately predict the first ten years instead of the last ten years, since only 17% of players survive ten years. I also believe that any franchise would be thrilled to obtain ten years of stardom from a player and anymore production is just an added bonus.

Survival Rate Estimates

Below is a table of each star player from 2000 to 2014 with the year they entered the population, the time until they became a scrub, every variable included in the model and their predicted survival rate for each of their first ten seasons since becoming a star.

Year Entered	Name	Time of Event	Age	BMI	Off	Def	Season 1	Season 2	Season 3	Season 4	Season 5	Season 6	Season 7	Season 8	Season 9	Season 10
2000	Bobby Higginson	2	29	25.1	14.7	-11.0	77.83%	60.58%	47.15%	36.70%	28.56%	22.23%	17.30%	13.47%	10.48%	8.16%
2000	Darin Erstad	6	26	25.0	8.2	8.0	83.94%	70.46%	59.14%	49.64%	41.67%	34.98%	29.36%	24.64%	20.68%	17.36%
2000	Jorge Posada	8	28	27.6	11.2	7.1	80.94%	65.51%	53.02%	42.91%	34.73%	28.11%	22.75%	18.41%	14.90%	12.06%
2000	Jose Vidro	4	25	24.4	3.1	-4.9	83.43%	69.61%	58.07%	48.45%	40.42%	33.72%	28.14%	23.47%	19.58%	16.34%
2000	Phil Nevin	2	29	23.1	5.1	-2.8	76.52%	58.55%	44.80%	34.28%	26.23%	20.07%	15.36%	11.75%	8.99%	6.88%
2000	Richard Hidalgo	2	25	27.5	19.6	9.5	87.32%	76.24%	66.57%	58.13%	50.75%	44.32%	38.69%	33.79%	29.50%	25.76%
2000	Shannon Stewart	5	26	23.7	9.9	-3.2	83.26%	69.33%	57.73%	48.07%	40.02%	33.32%	27.75%	23.10%	19.24%	16.02%
2000	Todd Helton	8	26	28.2	25.4	-1.4	86.03%	74.01%	63.67%	54.77%	47.12%	40.53%	34.87%	30.00%	25.81%	22.20%
2000	Troy Glaus	3	23	26.1	12.2	9.0	88.69%	78.66%	69.76%	61.87%	54.87%	48.66%	43.16%	38.28%	33.95%	30.11%
2001	Albert Pujols	12	21	28.7	47.2	0.8	93.62%	87.66%	82.07%	76.84%	71.94%	67.35%	63.06%	59.04%	55.27%	51.75%
2001	Aramis Ramirez	1	23	27.0	-3.7	-2.3	85.43%	72.99%	62.35%	53.27%	45.51%	38.88%	33.22%	28.38%	24.24%	20.71%
2001	Bret Boone	3	32	25.8	-4.0	-0.5	66.26%	43.91%	29.09%	19.28%	12.77%	8.46%	5.61%	3.72%	2.46%	1.63%
2001	Cliff Floyd	5	28	26.1	13.4	-6.7	79.99%	63.98%	51.18%	40.94%	32.75%	26.19%	20.95%	16.76%	13.41%	10.72%
2001	Corey Koskie	6	28	26.9	11.2	7.7	81.02%	65.64%	53.18%	43.08%	34.90%	28.28%	22.91%	18.56%	15.04%	12.18%
2001	Eric Chavez	6	23	28.4	8.8	3.4	87.79%	77.06%	67.65%	59.39%	52.13%	45.77%	40.18%	35.27%	30.96%	27.18%
2001	Ichiro Suzuki	10	27	23.7	26.6	7.5	85.65%	73.36%	62.83%	53.82%	46.10%	39.48%	33.82%	28.96%	24.81%	21.25%
2001	J.D. Drew	10	25	26.4	25.6	10.8	88.29%	77.95%	68.82%	60.76%	53.65%	47.37%	41.82%	36.92%	32.60%	28.78%
2001	Lance Berkman	11	25	29.0	35.8	-6.9	88.44%	78.21%	69.17%	61.17%	54.10%	47.84%	42.31%	37.42%	33.09%	29.27%
2001	Mike Sweeney	3	27	25.7	12.9	-7.4	81.67%	66.70%	54.48%	44.49%	36.34%	29.68%	24.24%	19.80%	16.17%	13.21%
2001	Paul Lo Duca	6	29	27.7	11.2	14.7	79.86%	63.78%	50.94%	40.68%	32.49%	25.95%	20.72%	16.55%	13.22%	10.56%
2001	Placido Polanco	5	25	28.1	-9.1	12.1	82.55%	68.14%	56.25%	46.43%	38.33%	31.64%	26.12%	21.56%	17.80%	14.69%
2001	Rich Aurilia	6	29	23.1	4.1	10.1	77.86%	60.63%	47.21%	36.76%	28.62%	22.28%	17.35%	13.51%	10.52%	8.19%
2001	Ryan Klesko	5	30	27.5	21.8	-10.8	77.39%	59.89%	46.35%	35.87%	27.76%	21.48%	16.63%	12.87%	9.96%	7.71%
2001	Torii Hunter	13	25	28.9	-11.5	7.3	81.57%	66.53%	54.27%	44.26%	36.10%	29.45%	24.02%	19.59%	15.98%	13.03%
2002	Adam Dunn	6	22	32.9	25.5	-4.2	90.45%	81.82%	74.01%	66.94%	60.55%	54.77%	49.54%	44.81%	40.54%	36.67%
2002	Adrian Beltre	N/A	23	30.7	-0.8	10.8	86.84%	75.41%	65.49%	56.87%	49.39%	42.89%	37.24%	32.34%	28.09%	24.39%
2002	Alfonso Soriano	7	26	25.7	11.8	-11.3	82.81%	68.57%	56.78%	47.02%	38.93%	32.24%	26.70%	22.11%	18.31%	15.16%
2002	Austin Kearns	2	22	30.0	31.7	17.8	92.26%	85.12%	78.53%	72.45%	66.84%	61.67%	56.89%	52.49%	48.43%	44.68%
2002	David Eckstein	5	27	27.4	4.8	5.3	81.22%	65.97%	53.58%	43.52%	35.34%	28.71%	23.31%	18.94%	15.38%	12.49%
2002	Edgar Renteria	7	25	26.4	-6.1	8.4	82.83%	68.61%	56.83%	47.08%	38.99%	32.30%	26.76%	22.16%	18.36%	15.21%
2002	Eric Hinske	2	24	30.2	21.6	4.6	88.41%	78.16%	69.10%	61.09%	54.00%	47.74%	42.21%	37.31%	32.99%	29.16%
2002	Jacque Jones	2	27	25.1	0.5	4.9	80.34%	64.54%	51.85%	41.66%	33.47%	26.89%	21.60%	17.35%	13.94%	11.20%
2002	Jose Hernandez	1	32	23.7	-8.5	8.1	66.31%	43.97%	29.16%	19.34%	12.82%	8.50%	5.64%	3.74%	2.48%	1.64%
2002	Junior Spivey	1	27	25.1	15.0	0.7	82.92%	68.76%	57.01%	47.28%	39.20%	32.50%	26.95%	22.35%	18.53%	15.37%
2002	Mark Kotsay	4	26	29.8	-0.1	11.5	82.51%	68.08%	56.18%	46.35%	38.25%	31.56%	26.04%	21.49%	17.73%	14.63%
2002	Miguel Tejada	8	28	32.5	3.5	1.8	78.40%	61.46%	48.19%	37.78%	29.62%	23.22%	18.20%	14.27%	11.19%	8.77%
2002	Pat Burrell	1	25	28.6	16.1	-15.1	84.76%	71.84%	60.90%	51.62%	43.75%	37.08%	31.43%	26.64%	22.58%	19.14%
2002	Randy Winn	8	28	22.5	-3.9	-0.8	76.68%	58.80%	45.09%	34.57%	26.51%	20.33%	15.59%	11.95%	9.17%	7.03%
2003	Bill Mueller	3	32	24.4	9.6	2.9	71.27%	50.80%	36.20%	25.80%	18.39%	13.11%	9.34%	6.66%	4.75%	3.38%
2003	Garret Anderson	1	31	23.7	2.7	0.7	71.50%	51.13%	36.56%	26.14%	18.69%	13.37%	9.56%	6.83%	4.89%	3.49%
2003	Hank Blalock	2	22	25.3	2.4	12.3	88.77%	78.80%	69.94%	62.09%	55.11%	48.92%	43.43%	38.55%	34.22%	30.37%
2003	Javy Lopez	3	32	23.1	8.8	7.8	71.83%	51.59%	37.06%	26.62%	19.12%	13.73%	9.87%	7.09%	5.09%	3.66%
2003	Jeff DaVanon	2	29	25.1	3.7	9.6	77.61%	60.23%	46.75%	36.28%	28.16%	21.85%	16.96%	13.16%	10.21%	7.93%
2003	Juan Pierre	5	25	25.8	-9.8	10.9	82.36%	67.84%	55.87%	46.02%	37.90%	31.22%	25.71%	21.18%	17.44%	14.37%
2003	Luis Castillo	5	27	20.2	0.5	3.3	80.34%	64.55%	51.86%	41.67%	33.48%	26.89%	21.61%	17.36%	13.95%	11.21%
2003	Marcus Giles	4	25	27.4	17.2	9.8	86.98%	75.66%	65.81%	57.24%	49.79%	43.31%	37.67%	32.77%	28.50%	24.79%
2003	Mark Loretta	2	31	23.7	-1.0	-1.9	69.97%	48.96%	34.25%	23.97%	16.77%	11.73%	8.21%	5.74%	4.02%	2.81%
2003	Melvin Mora	6	31	27.9	4.1	5.8	72.44%	52.48%	38.01%	27.54%	19.95%	14.45%	10.47%	7.58%	5.49%	3.98%
2003	Mike Lowell	2	29	23.7	7.4	2.3	77.70%	60.37%	46.90%	36.44%	28.31%	22.00%	17.09%	13.28%	10.32%	8.02%
2003	Milton Bradley	6	25	29.2	-2.7	7.1	83.29%	69.36%	57.77%	48.11%	40.07%	33.37%	27.80%	23.15%	19.28%	16.06%
2003	Morgan Ensberg	1	27	27.0	14.4	10.3	83.64%	69.96%	58.52%	48.95%	40.94%	34.25%	28.64%	23.96%	20.04%	16.76%
2003	Orlando Cabrera	1	28	28.0	-10.3	10.2	76.18%	58.04%	44.21%	33.68%	25.66%	19.55%	14.89%	11.34%	8.64%	6.58%
2003	Rafael Furcal	8	25	29.6	2.7	6.5	84.23%	70.94%	59.75%	50.33%	42.39%	35.70%	30.07%	25.33%	21.33%	17.97%
2003	Trot Nixon	4	29	25.7	16.8	-0.3	79.54%	63.27%	50.33%	40.04%	31.85%	25.33%	20.15%	16.03%	12.75%	10.14%
2004	Aaron Rowand	4	26	28.5	8.6	10.6	84.15%	70.81%	59.58%	50.14%	42.19%	35.50%	29.87%	25.14%	21.15%	17.80%
2004	Aubrey Huff	1	27	27.4	12.5	-11.3	81.13%	65.82%	53.40%	43.32%	35.15%	28.52%	23.14%	18.77%	15.23%	12.35%
2004	Brad Wilkerson	2	27	27.1	13.8	-3.1	82.23%	67.62%	55.61%	45.73%	37.61%	30.93%	25.43%	20.91%	17.20%	14.14%
2004	Carl Crawford	7	22	28.9	-3.4	13.0	87.93%	77.31%	67.98%	59.77%	52.56%	46.21%	40.63%	35.73%	31.41%	27.62%
2004	Carlos Guillen	5	28	28.4	4.7	5.7	79.30%	62.89%	49.87%	39.55%	31.36%	24.87%	19.72%	15.64%	12.40%	9.84%
2004	Carlos Lee	5	28	34.7	9.9	-3.6	79.19%	62.71%	49.66%	39.32%	31.14%	24.66%	19.53%	15.46%	12.25%	9.70%
2004	Coco Crisp	2	24	26.5	-4.6	9.9	84.85%	72.00%	61.09%	51.83%	43.98%	37.32%	31.67%	26.87%	22.80%	19.34%
2004	Corey Patterson	1	24	25.8	-5.1	8.8	84.68%	71.71%	60.73%	51.43%	43.55%	36.88%	31.23%	26.45%	22.40%	18.97%
2004	David Ortiz	5	28	28.0	14.6	-14.8	79.28%	62.85%	49.82%	39.50%	31.31%	24.82%	19.68%	15.60%	12.37%	9.80%
2004	Jack Wilson	2	26	27.1	-18.0	11.7	78.73%	61.99%	48.80%	38.42%	30.25%	23.82%	18.75%	14.76%	11.62%	9.15%
2004	Jason Varitek	2	32	29.5	1.4	8.6	69.34%	48.08%	33.34%	23.12%	16.03%	11.11%	7.71%	5.34%	3.70%	2.57%
2004	Jimmy Rollins	N/A	25	27.4	-3.1	6.4	83.21%	69.24%	57.61%	47.94%	39.89%	33.19%	27.62%	22.98%	19.12%	15.91%
2004	Mark Teixeira	9	24	26.9	11.9	-1.9	86.61%	75.01%	64.96%	56.26%	48.72%	42.20%	36.55%	31.65%	27.41%	23.74%
2004	Travis Hafner	4	27	30.0	26.7	-17.1	83.33%	69.44%	57.86%	48.22%	40.18%	33.48%	27.90%	23.25%	19.37%	16.14%
2004	Vernon Wells	5	25	30.3	9.2	-2.8	84.56%	71.50%	60.46%	51.12%	43.23%	36.56%	30.91%	26.14%	22.10%	18.69%
2005	Brian Roberts	6	27	25.8	4.8	6.0	81.34%	66.16%	53.82%	43.78%	35.61%	28.96%	23.56%	19.16%	15.59%	12.68%
2005	Chase Utley	N/A	26	26.4	15.7	13.2	85.66%	73.37%	62.85%	53.83%	46.11%	39.50%	33.83%	28.98%	24.82%	21.26%
2005	David DeJesus	9	25	26.5	6.8	2.8	84.73%	71.79%	60.82%	51.53%	43.66%	36.99%	31.34%	26.56%	22.50%	19.06%
2005	David Wright	N/A	22	27.8	31.4	1.5	91.48%	83.69%	76.57%	70.04%	64.08%	58.62%	53.63%	49.06%	44.88%	41.06%
2005	Derrek Lee	1	29	28.5	18.4	-11.5	78.52%	61.66%	48.42%	38.02%	29.85%	23.44%	18.41%	14.45%	11.35%	8.91%
2005	Felipe Lopez	2	25	27.8	-3.9	1.5	82.57%	68.17%	56.29%	46.47%	38.37%	31.68%	26.16%	21.60%	17.83%	14.72%
2005	Grady Sizemore	5	22	25.7	16.2	11.2	90.39%	81.71%	73.86%	66.76%	60.35%	54.55%	49.31%	44.57%	40.29%	36.42%
2005	Jason Bay	2	26	27.0	37.1	-15.6	86.82%	75.37%	65.44%	56.81%	49.32%	42.82%	37.17%	32.27%	28.02%	24.32%
2005	Jhonny Peralta	1	23	27.6	6.0	2.9	87.36%	76.33%	66.68%	58.26%	50.90%	44.46%	38.85%	33.94%	29.65%	25.90%
2005	Julio Lugo	2	29	23.1	-3.3	6.7	75.53%	57.05%	43.09%	32.55%	24.58%	18.57%	14.03%	10.59%	8.00%	6.04%
2005	Mark Ellis	9	28	27.3	6.3	8.1	79.97%	63.95%	51.14%	40.89%	32.70%	26.15%	20.91%	16.72%	13.37%	10.69%
2005	Michael Young	7	28	26.4	3.9	-4.8	77.96%	60.77%	47.37%	36.93%	28.79%	22.44%	17.50%	13.64%	10.63%	8.29%
2005	Miguel Cabrera	N/A	22	29.2	23.8	-13.8	89.77%	80.58%	72.33%	64.93%	58.28%	52.32%	46.96%	42.16%	37.84%	33.97%
2005	Nick Johnson	3	26	29.4	12.9	-7.3	83.28%	69.35%	57.75%	48.09%	40.05%	33.35%	27.77%	23.13%	19.26%	16.04%
2005	Richie Sexson	2	30	23.7	18.2	-12.9	76.37%	58.32%	44.54%	34.01%	25.98%	19.84%	15.15%	11.57%	8.84%	6.75%
2005	Victor Martinez	3	26	27.0	7.3	7.4	83.66%	70.00%	58.56%	49.00%	40.99%	34.30%	28.69%	24.01%	20.08%	16.80%
2006	Bill Hall	2	26	28.5	2.4	5.5	82.47%	68.01%	56.08%	46.25%	38.14%	31.45%	25.94%	21.39%	17.64%	14.55%
2006	Brandon Inge	2	29	26.5	-11.3	12.0	73.90%	54.62%	40.37%	29.83%	22.05%	16.29%	12.04%	8.90%	6.58%	4.86%
2006	Brian McCann	N/A	22	28.7	12.3	8.4	89.71%	80.47%	72.19%	64.76%	58.09%	52.11%	46.75%	41.94%	37.62%	33.75%
2006	Curtis Granderson	N/A	25	26.4	3.3	12.5	84.96%	72.18%	61.33%	52.11%	44.27%	37.61%	31.96%	27.15%	23.07%	19.60%
2006	Dan Uggla	7	26	29.3	13.1	7.2	84.62%	71.60%	60.58%	51.26%	43.38%	36.70%	31.06%	26.28%	22.24%	18.81%
2006	Freddy Sanchez	2	28	27.1	3.1	11.9	79.68%	63.49%	50.58%	40.31%	32.11%	25.59%	20.39%	16.25%	12.94%	10.31%
2006	Garrett Atkins	2	26	24.4	7.3	1.9	83.22%	69.26%	57.64%	47.97%	39.93%	33.23%	27.65%	23.02%	19.15%	15.94%
2006	Hanley Ramirez	5	22	28.9	22.4	-2.3	90.28%	81.50%	73.58%	66.43%	59.97%	54.14%	48.88%	44.12%	39.84%	35.96%
2006	Joe Mauer	N/A	23	27.3	23.2	7.6	89.97%	80.94%	72.82%	65.52%	58.94%	53.03%	47.71%	42.92%	38.62%	34.74%
2006	Jose Reyes	3	23	26.4	3.8	9.7	87.55%	76.66%	67.12%	58.76%	51.45%	45.05%	39.44%	34.53%	30.24%	26.47%
2006	Ramon Hernandez	2	30	29.8	-2.7	14.1	74.04%	54.81%	40.58%	30.05%	22.24%	16.47%	12.19%	9.03%	6.68%	4.95%
2006	Reed Johnson	1	29	27.3	1.8	-0.3	75.78%	57.43%	43.52%	32.98%	25.00%	18.94%	14.36%	10.88%	8.24%	6.25%
2006	Ryan Howard	6	26	30.4	39.3	-11.0	87.40%	76.38%	66.76%	58.34%	50.99%	44.56%	38.95%	34.04%	29.75%	26.00%
2007	Alex Rios	2	26	24.9	6.4	5.2	83.34%	69.46%	57.89%	48.25%	40.21%	33.51%	27.93%	23.28%	19.40%	16.17%
2007	B.J. Upton	6	22	23.1	14.7	-5.7	89.26%	79.67%	71.11%	63.47%	56.65%	50.57%	45.14%	40.29%	35.96%	32.10%
2007	Brandon Phillips	N/A	26	27.1	-11.3	7.9	79.86%	63.78%	50.93%	40.67%	32.48%	25.94%	20.72%	16.54%	13.21%	10.55%
2007	Carlos Pena	5	29	28.9	18.1	-16.3	77.86%	60.61%	47.19%	36.74%	28.61%	22.27%	17.34%	13.50%	10.51%	8.18%
2007	Chone Figgins	4	29	27.4	9.7	-3.0	77.46%	59.99%	46.47%	35.99%	27.88%	21.59%	16.73%	12.95%	10.03%	7.77%
2007	Corey Hart	1	25	26.6	10.8	-2.5	84.98%	72.21%	61.36%	52.14%	44.31%	37.65%	32.00%	27.19%	23.10%	19.63%
2007	Kevin Youkilis	6	28	29.0	12.3	0.3	80.40%	64.65%	51.98%	41.79%	33.60%	27.02%	21.72%	17.47%	14.04%	11.29%
2007	Matt Holliday	N/A	27	30.4	26.0	-7.6	84.05%	70.65%	59.38%	49.91%	41.95%	35.26%	29.64%	24.91%	20.94%	17.60%
2007	Nick Markakis	6	23	25.1	11.1	-2.0	87.81%	77.11%	67.71%	59.46%	52.21%	45.85%	40.26%	35.35%	31.04%	27.26%
2007	Nick Swisher	7	26	27.1	16.7	-4.8	84.28%	71.02%	59.86%	50.44%	42.51%	35.83%	30.19%	25.45%	21.44%	18.07%
2007	Prince Fielder	7	23	38.4	22.1	-17.8	87.95%	77.35%	68.03%	59.83%	52.62%	46.28%	40.71%	35.80%	31.49%	27.69%
2007	Robinson Cano	1	24	28.5	11.7	-6.1	86.21%	74.31%	64.06%	55.22%	47.61%	41.04%	35.38%	30.50%	26.29%	22.66%
2007	Russell Martin	N/A	24	30.8	10.5	14.4	87.50%	76.57%	67.00%	58.62%	51.30%	44.89%	39.28%	34.37%	30.07%	26.31%
2007	Ryan Zimmerman	N/A	22	27.5	9.1	10.4	89.46%	80.03%	71.60%	64.05%	57.30%	51.27%	45.86%	41.03%	36.71%	32.84%
2007	Troy Tulowitzki	1	22	26.9	2.2	15.8	88.92%	79.07%	70.32%	62.53%	55.60%	49.44%	43.97%	39.10%	34.77%	30.92%
2008	Carlos Quentin	1	25	31.0	14.5	-2.6	85.47%	73.05%	62.43%	53.36%	45.60%	38.98%	33.31%	28.47%	24.33%	20.80%
2008	Dustin Pedroia	N/A	24	25.1	13.5	6.3	87.51%	76.58%	67.01%	58.64%	51.32%	44.91%	39.30%	34.39%	30.09%	26.33%
2008	Evan Longoria	N/A	22	27.0	25.9	21.9	91.95%	84.55%	77.75%	71.49%	65.74%	60.45%	55.59%	51.11%	47.00%	43.22%
2008	Ian Kinsler	N/A	26	27.1	18.2	-6.5	84.40%	71.24%	60.13%	50.75%	42.84%	36.16%	30.52%	25.76%	21.74%	18.35%
2008	J.J. Hardy	N/A	25	25.1	-2.1	15.9	84.34%	71.13%	59.98%	50.59%	42.66%	35.98%	30.35%	25.59%	21.58%	18.20%
2008	Jacoby Ellsbury	2	24	25.7	7.4	16.9	87.36%	76.32%	66.67%	58.24%	50.88%	44.45%	38.83%	33.92%	29.63%	25.89%
2008	Jayson Werth	4	29	28.5	12.8	10.6	79.75%	63.60%	50.72%	40.45%	32.26%	25.73%	20.52%	16.36%	13.05%	10.41%
2008	Josh Hamilton	N/A	27	29.2	28.3	-9.6	84.33%	71.12%	59.97%	50.58%	42.65%	35.97%	30.33%	25.58%	21.57%	18.19%
2008	Mark DeRosa	2	33	28.4	-1.6	0.9	64.22%	41.24%	26.48%	17.01%	10.92%	7.01%	4.50%	2.89%	1.86%	1.19%
2008	Mike Aviles	1	27	29.4	20.3	21.5	85.59%	73.26%	62.70%	53.66%	45.93%	39.31%	33.65%	28.80%	24.65%	21.10%
2008	Ryan Braun	6	24	25.7	36.6	-17.5	89.07%	79.33%	70.66%	62.94%	56.06%	49.93%	44.47%	39.61%	35.28%	31.43%
2008	Ryan Ludwick	3	29	27.6	16.6	0.2	79.47%	63.15%	50.19%	39.88%	31.69%	25.19%	20.02%	15.91%	12.64%	10.04%
2008	Shane Victorino	6	27	28.1	4.0	9.1	81.43%	66.31%	53.99%	43.96%	35.80%	29.15%	23.74%	19.33%	15.74%	12.82%
2009	Aaron Hill	2	27	28.6	3.3	4.0	80.72%	65.16%	52.60%	42.46%	34.27%	27.67%	22.33%	18.03%	14.55%	11.75%
2009	Adrian Gonzalez	N/A	27	28.9	19.8	-10.1	82.68%	68.37%	56.53%	46.74%	38.65%	31.96%	26.42%	21.85%	18.07%	14.94%
2009	Ben Zobrist	N/A	28	26.2	11.9	9.9	81.42%	66.29%	53.98%	43.95%	35.78%	29.14%	23.72%	19.32%	15.73%	12.81%
2009	Casey Blake	3	35	26.3	5.8	0.1	60.57%	36.69%	22.23%	13.46%	8.15%	4.94%	2.99%	1.81%	1.10%	0.66%
2009	Denard Span	N/A	25	28.5	23.8	-1.6	87.10%	75.87%	66.08%	57.56%	50.13%	43.67%	38.03%	33.13%	28.86%	25.13%
2009	Franklin Gutierrez	3	26	25.0	-1.8	18.8	83.05%	68.97%	57.28%	47.57%	39.50%	32.81%	27.25%	22.63%	18.79%	15.61%
2009	Jason Bartlett	3	29	25.8	5.9	13.7	78.61%	61.79%	48.58%	38.19%	30.02%	23.60%	18.55%	14.58%	11.46%	9.01%
2009	Joey Votto	N/A	25	28.2	28.7	-8.1	87.36%	76.32%	66.67%	58.24%	50.88%	44.45%	38.83%	33.92%	29.64%	25.89%
2009	Justin Upton	N/A	21	26.3	13.3	-6.9	90.00%	81.01%	72.91%	65.62%	59.06%	53.16%	47.84%	43.06%	38.76%	34.88%
2009	Marco Scutaro	5	33	26.5	-5.2	3.3	63.45%	40.26%	25.55%	16.21%	10.29%	6.53%	4.14%	2.63%	1.67%	1.06%
2009	Matt Kemp	1	24	26.2	16.9	-4.8	87.18%	76.00%	66.26%	57.76%	50.36%	43.90%	38.27%	33.37%	29.09%	25.36%
2009	Michael Bourn	5	26	25.8	-2.5	7.8	81.80%	66.92%	54.74%	44.78%	36.63%	29.96%	24.51%	20.05%	16.40%	13.42%
2009	Nyjer Morgan	1	28	25.8	3.4	27.3	81.46%	66.35%	54.05%	44.03%	35.86%	29.21%	23.80%	19.38%	15.79%	12.86%
2009	Pablo Sandoval	N/A	22	34.2	29.1	-1.6	90.97%	82.76%	75.29%	68.49%	62.31%	56.68%	51.56%	46.91%	42.67%	38.82%
2009	Shin-Soo Choo	5	26	28.6	28.4	-5.3	86.20%	74.30%	64.05%	55.21%	47.59%	41.02%	35.36%	30.48%	26.28%	22.65%
2010	Alexei Ramirez	N/A	28	23.1	-3.3	6.6	77.71%	60.39%	46.93%	36.47%	28.34%	22.03%	17.12%	13.30%	10.34%	8.03%
2010	Andres Torres	3	32	28.0	6.1	14.2	71.73%	51.45%	36.90%	26.47%	18.99%	13.62%	9.77%	7.01%	5.03%	3.61%
2010	Angel Pagan	1	28	25.7	6.4	8.6	80.12%	64.20%	51.43%	41.21%	33.02%	26.46%	21.20%	16.98%	13.61%	10.90%
2010	Austin Jackson	4	23	24.4	8.2	7.5	88.08%	77.59%	68.34%	60.20%	53.02%	46.71%	41.14%	36.24%	31.92%	28.12%
2010	Brett Gardner	2	26	26.5	8.2	21.3	85.05%	72.34%	61.52%	52.33%	44.51%	37.85%	32.19%	27.38%	23.29%	19.81%
2010	Buster Posey	N/A	23	28.4	18.0	10.6	89.49%	80.08%	71.67%	64.13%	57.39%	51.36%	45.96%	41.13%	36.81%	32.94%
2010	Carlos Gonzalez	4	24	29.0	17.4	3.5	87.78%	77.06%	67.64%	59.38%	52.12%	45.75%	40.16%	35.26%	30.95%	27.17%
2010	Carlos Ruiz	N/A	31	29.4	-5.0	14.6	70.92%	50.30%	35.68%	25.30%	17.95%	12.73%	9.03%	6.40%	4.54%	3.22%
2010	Chase Headley	N/A	26	28.2	1.9	-2.1	81.63%	66.64%	54.40%	44.40%	36.25%	29.59%	24.15%	19.72%	16.09%	13.14%
2010	Chris Young	3	26	25.7	-1.1	0.5	81.34%	66.17%	53.82%	43.78%	35.61%	28.97%	23.56%	19.17%	15.59%	12.68%
2010	Colby Rasmus	1	23	25.0	12.8	3.8	88.46%	78.25%	69.21%	61.23%	54.16%	47.91%	42.38%	37.49%	33.16%	29.33%
2010	Daric Barton	1	24	27.8	11.8	-2.8	86.51%	74.84%	64.74%	56.00%	48.45%	41.91%	36.25%	31.36%	27.13%	23.47%
2010	Jason Heyward	N/A	20	29.0	28.5	-1.1	92.70%	85.94%	79.67%	73.86%	68.47%	63.47%	58.84%	54.55%	50.57%	46.88%
2010	Jay Bruce	4	23	26.9	7.5	5.6	87.79%	77.07%	67.66%	59.40%	52.15%	45.78%	40.19%	35.28%	30.98%	27.19%
2010	Jose Bautista	N/A	29	27.8	3.5	-9.0	75.07%	56.35%	42.30%	31.76%	23.84%	17.90%	13.43%	10.08%	7.57%	5.68%
2010	Justin Morneau	1	29	26.8	17.0	-7.5	78.72%	61.96%	48.78%	38.39%	30.22%	23.79%	18.73%	14.74%	11.60%	9.13%
2010	Kelly Johnson	2	28	26.4	9.5	2.2	80.07%	64.12%	51.34%	41.11%	32.92%	26.36%	21.11%	16.90%	13.53%	10.84%
2010	Marlon Byrd	2	32	33.2	0.9	1.7	67.87%	46.07%	31.27%	21.22%	14.41%	9.78%	6.64%	4.50%	3.06%	2.08%
2010	Nelson Cruz	N/A	29	29.5	10.2	4.0	78.35%	61.38%	48.09%	37.68%	29.52%	23.13%	18.12%	14.20%	11.12%	8.71%
2010	Rickie Weeks	3	27	31.6	12.0	-3.6	81.66%	66.68%	54.45%	44.47%	36.31%	29.65%	24.21%	19.77%	16.15%	13.18%
2010	Stephen Drew	2	27	25.8	-0.7	1.5	79.66%	63.46%	50.55%	40.27%	32.08%	25.56%	20.36%	16.22%	12.92%	10.29%
2011	Alex Avila	2	24	29.3	9.9	1.4	86.49%	74.80%	64.69%	55.95%	48.39%	41.85%	36.20%	31.31%	27.08%	23.42%
2011	Alex Gordon	N/A	27	29.0	7.0	1.0	81.18%	65.90%	53.49%	43.43%	35.25%	28.62%	23.23%	18.86%	15.31%	12.43%
2011	Andrew McCutchen	N/A	24	27.3	24.1	-1.9	88.39%	78.12%	69.05%	61.03%	53.95%	47.68%	42.14%	37.25%	32.92%	29.10%
2011	Cameron Maybin	2	24	25.6	4.7	6.9	86.19%	74.29%	64.03%	55.19%	47.57%	41.00%	35.34%	30.46%	26.26%	22.63%
2011	Elvis Andrus	N/A	22	27.1	-4.6	13.7	87.84%	77.16%	67.78%	59.53%	52.30%	45.94%	40.35%	35.44%	31.13%	27.35%
2011	Giancarlo Stanton	N/A	21	27.7	20.6	0.6	91.21%	83.19%	75.87%	69.20%	63.12%	57.57%	52.51%	47.89%	43.68%	39.84%
2011	Howie Kendrick	N/A	27	30.1	4.5	6.1	81.14%	65.84%	53.42%	43.34%	35.17%	28.54%	23.15%	18.79%	15.24%	12.37%
2011	Hunter Pence	N/A	28	26.8	15.2	-1.6	80.92%	65.47%	52.98%	42.87%	34.69%	28.07%	22.71%	18.38%	14.87%	12.03%
2011	Matt Wieters	3	25	28.5	-7.6	18.4	83.43%	69.60%	58.07%	48.45%	40.42%	33.72%	28.13%	23.47%	19.58%	16.34%
2011	Mike Napoli	N/A	29	29.8	20.5	2.3	80.50%	64.81%	52.17%	42.00%	33.81%	27.22%	21.91%	17.64%	14.20%	11.43%
2011	Peter Bourjos	2	24	24.4	4.6	20.5	87.24%	76.11%	66.40%	57.92%	50.53%	44.09%	38.46%	33.55%	29.27%	25.54%
2011	Yadier Molina	N/A	28	30.7	-14.6	20.1	76.20%	58.06%	44.24%	33.71%	25.69%	19.58%	14.92%	11.37%	8.66%	6.60%
2012	Adam Jones	N/A	26	28.1	4.2	-1.8	82.13%	67.46%	55.41%	45.51%	37.38%	30.70%	25.22%	20.71%	17.01%	13.97%
2012	Bryce Harper	N/A	19	28.1	18.0	9.0	92.98%	86.45%	80.38%	74.73%	69.48%	64.60%	60.07%	55.85%	51.93%	48.28%
2012	Edwin Encarnacion	N/A	29	30.3	10.1	-11.4	76.39%	58.36%	44.58%	34.06%	26.02%	19.88%	15.19%	11.60%	8.86%	6.77%
2012	Ian Desmond	N/A	26	26.9	0.3	2.6	81.81%	66.93%	54.75%	44.79%	36.65%	29.98%	24.53%	20.06%	16.41%	13.43%
2012	Josh Reddick	N/A	25	23.1	2.2	10.1	84.65%	71.66%	60.66%	51.35%	43.47%	36.80%	31.15%	26.37%	22.32%	18.90%
2012	Martin Prado	N/A	28	25.1	7.8	1.7	79.70%	63.52%	50.63%	40.35%	32.16%	25.63%	20.43%	16.28%	12.98%	10.34%
2012	Melky Cabrera	1	27	30.1	0.9	-5.4	79.08%	62.54%	49.46%	39.11%	30.93%	24.46%	19.35%	15.30%	12.10%	9.57%
2012	Miguel Montero	1	28	29.3	1.7	8.2	78.85%	62.17%	49.02%	38.65%	30.48%	24.03%	18.95%	14.94%	11.78%	9.29%
2012	Mike Trout	N/A	20	29.5	53.6	13.0	95.05%	90.35%	85.89%	81.64%	77.60%	73.76%	70.12%	66.65%	63.35%	60.22%
2013	Andrelton Simmons	N/A	23	25.0	-5.9	32.5	87.81%	77.10%	67.71%	59.45%	52.20%	45.84%	40.25%	35.35%	31.04%	27.25%
2013	Brandon Belt	1	25	26.1	16.7	-6.5	85.66%	73.37%	62.85%	53.83%	46.11%	39.50%	33.83%	28.98%	24.82%	21.26%
2013	Carlos Gomez	N/A	27	27.5	-1.4	15.1	80.92%	65.48%	52.98%	42.87%	34.69%	28.07%	22.72%	18.38%	14.87%	12.04%
2013	Chris Davis	1	27	28.7	13.6	-13.9	81.04%	65.67%	53.22%	43.13%	34.95%	28.33%	22.96%	18.60%	15.08%	12.22%
2013	Freddie Freeman	N/A	23	26.7	17.3	-14.6	87.77%	77.04%	67.62%	59.36%	52.10%	45.73%	40.14%	35.23%	30.92%	27.14%
2013	Gerardo Parra	1	26	27.9	-6.2	9.2	81.11%	65.78%	53.35%	43.27%	35.09%	28.46%	23.09%	18.72%	15.19%	12.32%
2013	Jason Castro	N/A	26	26.9	2.9	4.5	82.54%	68.12%	56.23%	46.41%	38.30%	31.61%	26.09%	21.54%	17.78%	14.67%
2013	Jason Kipnis	1	26	26.5	17.6	-2.3	84.69%	71.72%	60.74%	51.44%	43.56%	36.89%	31.24%	26.46%	22.41%	18.97%
2013	Josh Donaldson	N/A	27	29.8	19.0	10.9	84.45%	71.32%	60.23%	50.87%	42.96%	36.28%	30.64%	25.87%	21.85%	18.45%
2013	Juan Uribe	N/A	34	31.9	-12.1	12.1	58.89%	34.68%	20.42%	12.03%	7.08%	4.17%	2.46%	1.45%	0.85%	0.50%
2013	Kyle Seager	N/A	25	28.5	8.3	2.2	84.87%	72.03%	61.14%	51.89%	44.04%	37.38%	31.72%	26.92%	22.85%	19.39%
2013	Manny Machado	N/A	20	23.1	0.2	28.8	91.50%	83.73%	76.61%	70.10%	64.15%	58.69%	53.71%	49.14%	44.97%	41.15%
2013	Matt Carpenter	N/A	27	26.9	27.7	-3.7	84.82%	71.95%	61.03%	51.77%	43.91%	37.25%	31.60%	26.80%	22.73%	19.28%
2013	Paul Goldschmidt	N/A	25	30.6	30.0	-9.6	87.39%	76.38%	66.75%	58.34%	50.98%	44.56%	38.94%	34.03%	29.74%	25.99%
2013	Starling Marte	N/A	24	24.4	17.9	7.8	88.26%	77.90%	68.75%	60.68%	53.55%	47.26%	41.71%	36.82%	32.49%	28.68%
2013	Yasiel Puig	N/A	22	29.4	37.6	-0.9	91.96%	84.58%	77.78%	71.53%	65.78%	60.50%	55.64%	51.17%	47.05%	43.27%
2014	Anthony Rendon	N/A	24	26.4	18.4	6.2	88.17%	77.74%	68.55%	60.44%	53.29%	46.99%	41.43%	36.53%	32.21%	28.40%
2014	Anthony Rizzo	N/A	24	30.0	11.5	-3.0	86.37%	74.60%	64.44%	55.65%	48.07%	41.52%	35.86%	30.97%	26.75%	23.11%
2014	Brian Dozier	N/A	27	26.5	3.4	-0.5	80.33%	64.53%	51.84%	41.64%	33.45%	26.87%	21.59%	17.34%	13.93%	11.19%
2014	Christian Yelich	N/A	22	25.0	17.7	-0.7	89.89%	80.81%	72.64%	65.30%	58.70%	52.77%	47.44%	42.65%	38.34%	34.46%
2014	Devin Mesoraco	N/A	26	29.0	-2.0	7.8	81.78%	66.89%	54.70%	44.74%	36.59%	29.93%	24.47%	20.02%	16.37%	13.39%
2014	Erick Aybar	N/A	30	25.8	-1.6	7.6	73.64%	54.22%	39.93%	29.40%	21.65%	15.94%	11.74%	8.64%	6.37%	4.69%
2014	J.D. Martinez	N/A	26	27.5	1.8	-9.8	80.83%	65.33%	52.81%	42.68%	34.50%	27.89%	22.54%	18.22%	14.73%	11.90%
2014	Jonathan Lucroy	N/A	28	26.4	6.4	11.2	80.37%	64.60%	51.92%	41.73%	33.54%	26.96%	21.66%	17.41%	13.99%	11.25%
2014	Jose Abreu	N/A	27	31.9	42.9	-14.9	86.30%	74.48%	64.28%	55.48%	47.88%	41.32%	35.66%	30.77%	26.56%	22.92%
2014	Jose Altuve	N/A	24	28.2	5.0	-6.4	85.08%	72.39%	61.59%	52.40%	44.58%	37.93%	32.27%	27.46%	23.36%	19.87%
2014	Josh Harrison	N/A	26	30.4	5.4	3.4	82.79%	68.54%	56.75%	46.98%	38.90%	32.20%	26.66%	22.07%	18.27%	15.13%
2014	Juan Lagares	N/A	25	28.4	-5.1	28.9	84.82%	71.95%	61.03%	51.77%	43.91%	37.25%	31.60%	26.80%	22.74%	19.28%
2014	Kevin Kiermaier	N/A	24	25.7	13.1	21.3	88.48%	78.28%	69.26%	61.28%	54.22%	47.97%	42.44%	37.55%	33.22%	29.39%
2014	Lorenzo Cain	N/A	28	26.3	2.8	19.4	80.49%	64.78%	52.14%	41.97%	33.78%	27.19%	21.88%	17.61%	14.18%	11.41%
2014	Michael Brantley	N/A	27	25.7	10.0	-8.4	80.96%	65.55%	53.07%	42.97%	34.79%	28.17%	22.80%	18.46%	14.95%	12.10%
2014	Steve Pearce	N/A	31	29.3	6.6	-3.1	71.82%	51.58%	37.04%	26.60%	19.11%	13.72%	9.85%	7.08%	5.08%	3.65%
2014	Todd Frazier	N/A	28	27.5	11.0	4.4	80.61%	64.98%	52.38%	42.22%	34.03%	27.43%	22.11%	17.82%	14.37%	11.58%
2014	Yan Gomes	N/A	26	27.6	9.4	13.6	84.58%	71.54%	60.51%	51.18%	43.29%	36.62%	30.97%	26.20%	22.16%	18.74%

Conclusions

After looking at this table, we can draw several conclusions. First, this Mike Trout guy is really good at baseball. Secondly, age is the main variable in determining the time until failure. The players with the highest survival rates are all under twenty-five and all the lowest survival rates are over thirty. This makes sense, because it is much easier for a twenty-year-old star to remain effective until he is thirty compared to a thirty-year-old star attempting to remain effective until he is forty. This is because older players face more challenges such as eroding skills, an increased chance of sustaining injuries and having their playing time reduced to prevent injuries.

It also appears that offensive stars survive longer than defensive stars. This is probably due to the fact that defensive skills usually deteriorate faster than offensive skills. I also believe that since defensive statistics are more volatile than offensive statistics, that players that derive much of their value from their defense are more likely to have their WAR fluctuate from year to year. This makes it more likely that a defensive star could register a scrub season one year and then become a star again the next year. And this brings me to my next point.

Things to Keep in Mind

If a player records a scrub season that does not necessarily mean that he is finished. If this were the case, players like Aramis Ramirez, Robinson Cano and Troy Tulowitzki would have had much less productive careers. It is also important to remember that a player enters the population as soon as they record their first star season, so it is quite possible that a player could improve after their first star season and make it more likely that they can outlast their projected survival rate. The main thing to remember is that no model is perfect and no model is meant to replace the human decision-making process. Models are only meant to improve the decision-making process and it is my hope that this model has accomplished that goal.

Different Aging Curves For Different Strikeout Profiles

by Devin Jordan

April 22, 2015

What follows will look at aging curves as they relate to players with specific strikeout profiles. Specifically, we will look at how wOBA ages for players that strikeout more than the league-average strikeout rate and less than the league-average strikeout rate.

Through the research that is presented in this post, two points will be proven:

Players of different strikeout profiles age—their wOBAs change—at different rates.
The aging curve for players of different strikeout profiles has changed over time.

Before I present the methodology, the research that was conducted, and their conclusions, I want to give a big thank you to Jeff Zimmerman, who has not only done a lot of research around aging curves, but has also helped me throughout this process and pushed me in the right direction several times when I was stuck. Thank you.

Population

In order to give a non insignificant amount of time for a player’s wOBA to stabilize, but not place the playing time threshold for plate appearances so high that we artificially limit the population even more than it naturally is at the ends of the age spectrum, I looked at all player season from 1950 to 2014 where a player had a minimum of 600 plate appearances for the first aging curve in this post. The second aging curve in this post looks at all player seasons from 1990 to 2014 with a minimum of 600 plate appearances.

Now that we have our population, we need to split our population into two groups: players that strikeout more than league average and players that strikeout less than league average.

Because the league average strikeout rate of today is very different than it was 65 years ago, we can’t look at a player’s strikeout rate from 1950 and compare it to the league average strikeout rate of today.

In order to divide the population into two groups, I created a stat that weighs a player’s strikeout rate against the league average strikeout rate for the years that they played. For example, if a player played from 1970 to 1975, their adjusted strikeout rate would reflect how their strikeout rate compares to the league average strikeout rate from 1970 to 1975.

Players were then placed into two buckets based on their adjusted strikeout rate: players that struck out more than league average and players that struck out less than league average.

Methodology

There has been a lot of discussion over the years about the correct methodology to use for aging curves. This conversation has had altruistic intentions in the sense that it’s aim has been to minimize the survivorship bias that is inherent in the process, and, through the progress that has been made over the years, this study uses what the author has found to his knowledge to be the best technique to date. This article by Mitchell Lichtman summarizes a lot of the opinions.

While there is a survivorship bias inherent in any aging curve, the purpose of the different techniques used to create aging curves is to minimize the survivorship bias wherever possible.

What We Don’t Want In an Aging Curve

An aging curve is not the average of all performances by players of specific ages. For example, say you have a group of 30-year-old players that have an average of a .320 wOBA and group of 29-year-old players that have an average of a .300 wOBA.

The point of an aging curve is to see how a player aged, not how they played. The group of 30-year-old players has a high wOBA because they are a talented group of players; they lasted long enough to play until they are 30. As they aged from the previous year, when they were 29 to their current age 30 season, they lost the bottom portion of players from their player pool. These are the players that couldn’t hang on any longer, whether it be because of a decline in defense, offense, or a combination of both. This bottom portion of players lower the wOBA of the current 29-year-old population through their presence and raise the wOBA of the 30-year-old population through their absence.

At the same time, the current 30-year-olds aged from their age-29 season to their age-30 season. Sure, there may be players who had a better age-30 season than age-29 season, but the current group of 30-year-olds, as a whole, still played worse at 30 than they did at 29.

When you look at the average of a particular age group, in this case 30-year-olds, you only see the players that survived, and, because they no longer play, you leave behind the players that are hidden from you sample. The method that follows resolves this issue to an extent.

What We Do Want In an Aging Curve

This study uses the delta method which looks at the differences of player seasons (i.e. a players age 29 wOBA minus their age 28 wOBA) and weighs those differences by the harmonic mean of the plate appearances for each pair seasons in question.

I would explain this further, but Jeff Zimmerman does an excellent job of this in a post on hitter aging curves that he did several years ago. While Jeff Zimmerman looked at RAA, which is a counting state, the methodology is basically the same for our purposes and wOBA, which is a rate stat:

In a nutshell, to do accurate work on this, I needed to go through all the hitters who ever played two consecutive seasons. If a player played back-to-back seasons, the RAA values were compared. The RAA values were adjusted to the harmonic mean of that player’s plate appearances.

Consider this fictional player:

Year1: RAA = 40 in 600 PA age 25
Year2: RAA = 30 in 300 PA age 26

Adjusting to harmonic mean: 2/((1/PA_y1)+(1/PA_y2)) = PA_hm
/((1/600)+(1/300)) = 400

Adjust RAA to PA_hm: (PA_hm/PA_y1)*RAA_y1 = RAA_y1_hm
(400/600)*40 = 26.7 RAA for Year1
(400/300)*30 = 40 RAA for Year2

This player would have gained 13.3 RAR (40 RAA – 26.7 RAA) in 400 PA from ages 25 to 26. From then, I then would add all the changes in RAA and PA together and adjust the values to 600 PA to see how much a player improved as he aged.

Findings

Below is an aging curve by strikeout profile for all player seasons with over 600 plate appearances in a season from 1950 until 2015.

We can see several findings immediately:

Players do age differently based on their strikeout profile.
Players that strikeout more than league average peak at 23.
Players that strikeout less than league average take longer to hit their peak—their age 26 season.
Players that strikeout more than league average age better than players that strikeout less than league average.

From a historical perspective, this graph is fun to look at, but the way the game was played over half a century ago is eclipsed by societal evolutions that today’s players benefit from.

To give us a more realistic idea of how today’s players age relative to their strikeout rate, I made another graph the at looks at player seasons from 1990 to 2014.

What we find in this graph, which is more current with today’s style of play, is that players still age differently dependent on their strikeout profile, but not in the same way that they did in the previous sample.

Players that strikeout more than league average still peak earlier than players that strike out less than league average, but in this more current population of players, players that strikeout more than league average peak very early—their age 21 season. This information would reciprocate the sentiment that has been conveyed through recent work that suggests that the aging curve has changed to the point that players peak almost as soon as when they enter the league.

The peak age for players that strikeout at below league average rates is still 26, but whereas this group aged more poorly than the strikeout heavy group in our previous population, players that strikeout at below league average rates now age better than their counterparts.

Conclusions

This information can make material differences for our overall expectations and outlooks on players.

Previous knowledge would suggest that players like George Springer and Kris Bryant—players who have exorbitant strikeout rates—are still on the climb as far as their talent goes, but this information shows that these players may already be at/close to their peaks or on the decline as far a their wOBA is concerned.

This information also shows that we should be patient with prospects that have a penchant to put balls is play; while they peak more quickly than they did in the previous population, they take longer to develop than players with more swing and miss in their game, and when they do start to decline, there isn’t much need to worry, because their climb from their peaks will be gradual.

Like many other studies that have looked at new aging curves, this study confirms that players/prospects peak earlier now than at any other point throughout history, but it also shows that a player’s trajectory upward and downward is dependent on characteristics specific to their approaches at the plate.

Devon Jordan is obsessed with statistical analysis, non-fiction literature, and electronic music. If you enjoyed reading him, follow him on Twitter @devonjjordan.

Austin Jackson’s Bothersome Batted-Ball Bind

by maxe6

April 21, 2015

On July 30, 2014, the Seattle Mariners found themselves in the interesting position of being in playoff contention. The Mariners sported a 32-23 record, only 2.5 games back of the AL West-leading Los Angeles Angels, and owned the third-best Pythagorean record in the American League. Seattle’s newfound position as postseason hopefuls meant that they were suddenly buyers at the trade deadline – not drastically so, but in the sense that the Mariners were only a couple of upgrades away from assembling themselves a nicely well-rounded playoffs roster. Chief among these desired upgrades was a serviceable everyday center fielder, one who could replace a revolving door of below-average outfielders that included Abraham Almonte, James Jones, Stefen Romero, and Endy Chavez.

Jack Zduriencik sought to remedy the Mariners’ outfield issues with a pair of trade deadline deals. The first involved packaging Almonte and minor-league pitcher Stephen Kohlscheen to the Padres in return for Chris Denorfia, a rather unsexy deal to be sure, but one that was a success at the time in that the acquired player was not Almonte. The second deal, a three-way transaction between the Rays, Tigers, and Mariners, was collectively more sexy, but a large share of the sexy went to the Tigers, who landed Rays ace David Price. The other major components of the deal were the Rays’ acquisition of young Mariners middle infielder Nick Franklin and Tigers pitcher Drew Smyly, as well as Seattle’s prospective answer to its outfield problem: Detroit center fielder Austin Jackson.

Since his move to the Mariners, Jackson has racked up 277 plate appearances for Seattle, and the results have been fantastically underwhelming. Of the center fielders who amassed more than 100 plate appearances for the Mariners in 2014 – an uninspiring triumvirate of Jackson, Abraham Almonte, and James Jones – Jackson produced the worst offensive performance by wRC+. Jackson’s 2014 performance also disappointed even by more conventional measures:

Jackson totaled 34 extra-base hits in 416 plate appearances for the Tigers in 2014. For the rest of the year, in 240 plate appearances for the Mariners, Jackson managed 6.
Jackson’s ISO dropped from .127 to a paltry .031 with the move from Detroit to Seattle.
Jackson’s 2014 OBP/SLG/wOBA with Detroit: .330/.397/.321. With Seattle: .271/.264/.243.

Not great for a player only two seasons removed from a 5-win campaign.

Ostensibly, something was fundamentally different with Jackson in 2014, something that can hopefully be determined by closely examining his recent performance. Looking first to Jackson’s approach, it seems that there hasn’t been too much change over the course of his career. His K/BB ratio has generally hovered around league average and his contact rates haven’t fluctuated all that much from year-to-year. If anything, Jackson’s approach metrics look like they’re trending positively – he actually posted career bests in Z-contact% and SwStr% in 2014. If we examine Jackson’s batted ball data, however, we begin to get a little closer to the root of Jackson’s troubles of late. The most easily identifiable aspect of Jackson’s game can be somewhat distilled in the following graphic:

Over the course of his career, Jackson’s BABIP has been way above league average. He managed an absolutely ridiculous .396 BABIP in his 2010 rookie season over 675 PA, and his career-best 2012 season, in which he posted a 134 wRC+, was bolstered by a BABIP of .371. That figure would predictably fall after 2012, but between 2013 & 2014, Jackson’s BABIP only declined by .008, whereas in the same period, his wOBA fell from a very good .332 to a mediocre .292. This might suggest that in 2014 specifically, it may not have been the frequency with which Jackson was able to put balls in play so much as the quality of those batted balls that limited Jackson’s production.

Unfortunately, batted ball data is out of the scope of my access. The closest I can get to Jackson’s batted-ball profile is by pulling data from this pre-season piece on Jackson by Jake Mailhot over at Lookout Landing, and indirectly from Jeff Zimmerman’s work on hitter analytics at RotoGraphs (the relevant batted-ball spreadsheet now seems to be unavailable for some reason).

To quickly explain – this table charts batted-ball rates expressed as a percentage of league average. Batted balls are separated into three categories (line drive, groundball, fly ball) which are then further divided into subcategories of contact quality (Well-Hit, Medium, and Weakly-hit). These categories are ordered left-to-right from highest to lowest based on xBABIP.

Mailhot astutely notes an alarming drop in well-hit groundball rate – from 64% above league average in 2012 to 11% below league average in 2014. This is accompanied by a commensurate rise in weakly-hit groundballs. Jackson’s well-hit line-drive rate also drops by a sizable amount, hovering around league average in 2014, while his rate of medium-hit line drives balloons to 198% of league average in 2014. Mailhot also points out possibly the most substantial shift: an immense drop-off in well-hit fly-ball rate in 2014 to 56% of league average, a trend corroborated by data pulled from Baseball Heat Maps on Jackson’s average fly ball distance over his career:

Jackson’s high rate of well-hit line drives and ground balls prior to 2014 puts into perspective the aspects of his game that brought him success earlier on in his career, and his sharp decline in those metrics in 2014 even more so. To put it in exceedingly simple terms, Austin Jackson just didn’t really hit balls hard in 2014, something he was quite good at doing before that season. Judging by the splits, most of the not-hitting-balls-hard occurred after the move to Seattle.

Jackson’s 2013 was much better than his 2014, but it is the beginning of a short trend of BABIP decline. From examining batted-ball data, we can infer that quality of contact has a significant bearing on BABIP, and this makes sense using conventional logic as well. Hard-hit line ground balls are more likely to find gaps between defenders, hard-hit line drives are more likely to drop in for hits, and hard-hit fly balls are more likely to turn into extra-base hits (although BABIP ignores home runs). The easy explanation is that Jackson lost some power in 2014. I don’t have enough film on Jackson to know for sure if there’s a visually concrete reason for this (if, for example, there’s something off in his swing mechanics), but data from 2014 indicates that Jackson just hasn’t been making good contact.

Jackson’s issues are probably best explained by his batted-ball troubles, but park factors likely play some part as well, with Comerica Park being relatively more hitter-friendly than Safeco Field. Safeco’s pitcher-friendly park factor and the ‘dead ball effect’ of Seattle’s marine air probably have something to do with Jackson’s decline in fly-ball distance, although Jackson is himself contributing to that same decline in some measure.

At the time of his acquisition, a merely average performance from Jackson would have been a significant upgrade over the convoluted mishmash that had previously taken the field for the Mariners. Unfortunately, he was unable to even provide replacement-value production after coming to Seattle, totaling -0.4 wins above replacement in 2014. The Mariners traded for an above-average player and received the production level of a player who theoretically wouldn’t cut it in the big leagues altogether.

The prospect of 2015 being a bounceback year for Jackson has not gone over too well in these first few weeks. ZiPS (R) and Steamer (R) still think Jackson could manage 1.7-2.1 WAR on the season, which is a bit below his peak, but I think the Mariners would take that statline in a heartbeat. I’ve gone this far without mentioning Jackson’s other tools, but as a 28-year-old without a concerning injury history, there’s not as much reason to worry about his defense and baserunning as there is to worry about his offensive output. Jackson was a below-average defender by UZR in 2013/2014 and has been worth approximately 4 baserunning runs above replacement in each of the past couple of years, neither of which have dictated his value nearly as much as his offense, or lack thereof. Using those numbers as a serious predictive measure from year-to-year is simply not very useful at this point.

Lloyd McClendon was Jackson’s hitting coach back in Detroit, and suffice it to say he probably has a better grasp on Jackson’s habits as a batter than most. If anyone’s able to get Jackson back on track this season, it’s probably McClendon. At time of writing (the 18th of April), Jackson managed a slightly encouraging 2-hit, 1-walk performance against the Rangers. It’s early yet in the season, and there’s time for Jackson to hopefully figure things out. Alternatively, if Jackson can’t find some of his pre-2014 form this season, the Mariners might once again find themselves in the same trade deadline predicament from last year – only this time, there’s not an obvious trade chip à la Nick Franklin. Then again, 2015 is Jackson’s last year under team control, so the Mariners may simply choose to let him walk after the year is over if they’re not satisfied with his performance. If that’s the case, it’s hard to imagine looking back on the 2014 Jackson trade with anything but the same tinge of regret and frustration that has colored so many other Mariners transactions of the last decade.

Hardball Retrospective – The “Original” 1924 Washington Senators

by DerekBain

April 21, 2015

In “Hardball Retrospective: Evaluating Scouting and Development Outcomes for the Modern-Era Franchises”, I placed every ballplayer in the modern era (from 1901-present) on their original team. Therefore, Frank Robinson is listed on the Reds roster for the duration of his career while the Rangers claim Ivan Rodriguez and the Red Sox declare Jeff Bagwell. I calculated revised standings for every season based entirely on the performance of each team’s “original” players. I discuss every team’s “original” players and seasons at length along with organizational performance with respect to the Amateur Draft (or First-Year Player Draft), amateur free agent signings and other methods of player acquisition. Season standings, WAR and Win Shares totals for the “original” teams are compared against the “actual” team results to assess each franchise’s scouting, development and general management skills.

Expanding on my research for the book, the following series of articles will reveal the finest single-season rosters for every Major League organization based on overall rankings in OWAR and OWS along with the general managers and scouting directors that constructed the teams. “Hardball Retrospective” is available in digital format on Amazon, Barnes and Noble, GooglePlay, iTunes and KoboBooks. The print edition is coming soon. Additional information and a discussion forum are available at TuataraSoftware.com.

Terminology

OWAR – Wins Above Replacement for players on “original” teams

OWS – Win Shares for players on “original” teams

OPW% – Pythagorean Won-Loss record for the “original” teams

Assessment

The 1924 Washington Senators OWAR: 43.1 OWS: 287 OPW%: .615

Based on the revised standings the “Original” 1924 Senators obliterated the competition with the Tigers finishing a distant 13 games in arrears. Walter “Big Train” Johnson, approaching the final stop in his 21-year career, continued to blow smoke past American League batsmen. He whiffed the most batters in the Junior Circuit for the twelfth time and furnished a 23-7 mark with the best ERA (2.72) and WHIP (1.116) in the League. Johnson received the MVP Award for his efforts in ’24 and the future Hall-of-Famer retired three years later with 417 victories, a 2.17 ERA and a 1.061 WHIP along with 3,509 strikeouts and the most shutouts in Major League history (110). Johnson ranks first among pitchers in “The New Bill James Historical Baseball Abstract”.

Jack Bentley bolstered the Washington pitching corps, delivering 16 victories against 5 losses. Firpo Marberry split time between the rotation and bullpen, notching 11 wins and saving 15 contests (although saves were not officially tabulated until 1969).

ROTATION	POS	WAR	WS
Walter Johnson	SP	7.02	28.65
Jack Bentley	SP	1.96	11.78
Firpo Marberry	SP	1.48	17.72
Joe Martina	SP	0.35	5.74

BULLPEN	POS	WAR	WS
Ted Wingfield	RP	0.67	2.54
By Speece	RP	-0.25	3.52
Slim McGrew	SP	-0.27	0.32
Paul Zahniser	SP	-0.28	3.72

Goose Goslin (.344/12/129) topped the American League leader boards in RBI while recording 199 hits and 100 runs. The future Hall of Famer surpassed the century mark in ribbies 11 times and recorded a .316 lifetime batting average. Sam Rice batted .334 with 106 runs scored and 39 two-baggers while producing a League-best 216 base hits. A .322 career hitter, Rice concluded his career only 13 hits shy of 3,000.

Charlie Jamieson rapped 213 safeties and posted a personal-best .359 BA after leading the Junior Circuit in the previous campaign with 222 knocks. First-sacker Joe Judge clubbed 38 two-base hits and delivered a .324 BA. Goslin rated 16^th among left fielders in the “NBJHBA”. Rice (33^rd-RF), Judge (44^th-1B) and Bucky Harris (70^th-2B) also placed in the top 100 at their respective positions.

LINEUP	POS	WAR	WS
Charlie Jamieson	CF/LF	3.09	19.11
Sam Rice	RF	3.65	23.99
Goose Goslin	LF	5.69	28.91
Joe Judge	1B	2.12	19.08
Ossie Bluege	3B	0.72	10.42
Bucky Harris	2B	0.32	13.31
Eddie Ainsmith	C	0.11	0.45
Howie Shanks	SS	-0.02	5.21

BENCH	POS	WAR	WS
Frank Brower	1B	1.05	5.27
Irish Meusel	LF	0.98	16.78
Doc Prothro	3B	0.9	5.89
Bing Miller	RF	0.83	13.65
Earl McNeely	CF	0.3	5.84
Carl East	RF	0.09	0.36
Ike Davis	SS	0.02	0.35
Bennie Tate	C	-0.02	0.64
Carr Smith	RF	-0.13	0.04
Tommy Taylor	3B	-0.13	0.85
Showboat Fisher	RF	-0.14	0.4
Pinky Hargrave	C	-0.35	0.21
Mule Shirley	1B	-0.5	0.34
Frank Ellerbe	3B	-0.9	2.19

The “Original” 1924 Washington Senators roster

NAME	POS	WAR	WS	General Manager
Walter Johnson	SP	7.02	28.65	Thomas Noyes
Goose Goslin	LF	5.69	28.91	Clark Griffith
Sam Rice	RF	3.65	23.99	Clark Griffith
Charlie Jamieson	LF	3.09	19.11	Clark Griffith
Joe Judge	1B	2.12	19.08	Clark Griffith
Jack Bentley	SP	1.96	11.78	Clark Griffith
Firpo Marberry	SP	1.48	17.72	Clark Griffith
Frank Brower	1B	1.05	5.27	Clark Griffith
Irish Meusel	LF	0.98	16.78	Clark Griffith
Doc Prothro	3B	0.9	5.89	Clark Griffith
Bing Miller	RF	0.83	13.65	Clark Griffith
Ossie Bluege	3B	0.72	10.42	Clark Griffith
Ted Wingfield	RP	0.67	2.54	Clark Griffith
Joe Martina	SP	0.35	5.74	Clark Griffith
Bucky Harris	2B	0.32	13.31	Clark Griffith
Earl McNeely	CF	0.3	5.84	Clark Griffith
Eddie Ainsmith	C	0.11	0.45	Thomas Noyes
Carl East	RF	0.09	0.36	Clark Griffith
Ike Davis	SS	0.02	0.35	Clark Griffith
Howie Shanks	SS	-0.02	5.21	Thomas Noyes
Bennie Tate	C	-0.02	0.64	Clark Griffith
Carr Smith	RF	-0.13	0.04	Clark Griffith
Tommy Taylor	3B	-0.13	0.85	Clark Griffith
Showboat Fisher	RF	-0.14	0.4	Clark Griffith
By Speece	RP	-0.25	3.52	Clark Griffith
Slim McGrew	SP	-0.27	0.32	Clark Griffith
Paul Zahniser	SP	-0.28	3.72	Clark Griffith
Pinky Hargrave	C	-0.35	0.21	Clark Griffith
Mule Shirley	1B	-0.5	0.34	Clark Griffith
Frank Ellerbe	3B	-0.9	2.19	Clark Griffith

Honorable Mention

The “Original” 1915 Senators OWAR: 49.1 OWS: 272 OPW%: .565

“Big Train” Johnson (27-13, 1.55) completed 35 of 39 starts while leading the American League in wins, WHIP (0.933), innings pitched, shutouts and strikeouts. The rotation was supplemented by Doc Ayers (14-9, 2.21) and Bert Gallia (17-11, 2.29). Clyde “Deerfoot” Milan swiped 40 bags and Tom Long legged out 25 triples at the top of the lineup.

The “Original” 1965 Twins OWAR: 46.0 OWS: 280 OPW%: .644

Zoilo Versalles topped the leader boards with 126 tallies, 45 doubles, 12 triples and 308 total bases to capture the 1965 A.L. MVP Award. Teammate Tony Oliva (.321/16/98) finished runner-up in the MVP race and collected his second batting title. Bob Allison, Jimmie Hall and Harmon Killebrew slammed at least 20 circuit clouts apiece. Jim Kaat (18-11, 2.83) anchored the starting staff and Ted Abernathy led the League with 31 saves and 84 relief appearances.

On Deck

The “Original” 1992 White Sox

References and Resources

Baseball America – Executive Database

Baseball-Reference

James, Bill. The New Bill James Historical Baseball Abstract. New York, NY.: The Free Press, 2001. Print.

James, Bill, with Jim Henzler. Win Shares. Morton Grove, Ill.: STATS, 2002. Print.

Retrosheet – Transactions Database

Seamheads – Baseball Gauge

Sean Lahman Baseball Archive

Testing the Eye Test: Part 2

by vslyke

April 17, 2015

Sorry for the relatively long delay – sometimes life gets in the way of our best laid plans. In case you want a refresher, here is part 1: http://www.fangraphs.com/community/testing-the-eye-test-part-1/

In part 1, I found that, counter to my expectations, range correlated most strongly with FSR data of all the UZR components (UZR itself had a stronger correlation with FSR). I expected the strongest-correlated component to be errors, which was actually one of the least-correlated components. However, I wanted to go a little bit farther and look at the difference between correlations between the UZR components and FSR and the correlations between the UZR components and UZR itself to get a sense of what the fans weight more than UZR does. As a reminder, here is the data set I compiled for this analysis:

“I pulled the defensive stats of every player who qualified (minimum of 900 innings) at a position from 2009-2014 (FSR data is only available for those six seasons on FanGraphs). I then disregarded catchers, as UZR does not cover the position. Likewise, pitchers are left out because they are not covered by UZR or FSR. That left me with 761 player seasons across the other seven positions.”

Without further ado, here are the correlations between UZR and its components:

Position |# |ARM |DPR |RngR |ErrR
1B |118 |N/A |0.207 |0.930 |0.326
2B |117 |N/A |0.275 |0.907 |0.465
3B |107 |N/A |0.166 |0.948 |0.386
SS |130|N/A |0.459 |0.866 |0.384
LF | 71 |0.584 |N/A |0.895 |0.196
CF |115 |0.357 |N/A |0.935 |0.069
RF |103|0.310 |N/A |0.906 |0.061

I always had a suspicion that range was the most important component of UZR but these results are insane. It turns out range is far and away the most important component of UZR. Interestingly, the weakest correlation for range is at SS, perhaps because shortstops without proper range are moved to another position. ARM, although only calculated for outfielders (a real shame as Andrelton Simmons deserves credit for being able to make this throw), has the second-strongest correlation but lags range by a large amount. Like the FSR correlation, it is surprising that LF has a stronger ARM correlation than CF or RF. DPR narrowly edges out errors, although the correlation for errors is far stronger when you only consider infielders. Now, to get a sense of the difference, here’s the two sets of correlation subtracted from each other (positive numbers mean the correlation with UZR is higher and negative numbers mean the correlation with FSR is higher):

Position | # | ARM | DPR | RngR | ErrR
1B | 118 | N/A | -0.005 | 0.644 | 0.006
2B | 117 | N/A | 0.116 | 0.437 | -0.082
3B | 107 | N/A | 0.011 | 0.315 | 0.125
SS | 130 | N/A | 0.095 | 0.437 | 0.041
LF | 71 | 0.074 | N/A | 0.369 | 0.010
CF | 115 | 0.120 | N/A | 0.441 | -0.002
RF | 103 | 0.096 | N/A | 0.365 | -0.006

There are two different ways to look at this: one is that FSR has nearly the same correlation as UZR in most categories. That’s good! It lends a lot of credibility to FSR to know that you can predict FSR nearly as well as UZR with ErrR or DPR. On the other hand, look at the huge difference in the range column. It appears that the fans are severely underestimating the importance of having great range (or have different ideas of how to evaluate range). That’s a problem! As we just saw, range is the most important component of UZR for every position. It is also not terribly surprising as I hypothesized at the beginning of this series that the fans are underestimating the importance of range in favor of flashier tools. This also explains a lot of the discussion about Derek Jeter’s defensive ability (or lack thereof).

This sums up the research portion of this series. I think all of this does lend a lot of credence to FSR: it does reflect that range is the most important component of defense and it does a good job of properly ranking the importance of the other components. In addition, the correlation between FSR and UZR is fairly strong but not so strong that the two systems are interchangeable. However, when considering FSR, be sure to mentally adjust when a player has particularly good (or bad) range.

In part 3, I will examine some of the player seasons that produced the most disparity in the two rankings.

A Decade of DH: The Mariners Post-Edgar Martínez

by maxe6

April 16, 2015

Edgar Martínez is kind of a demigod in Seattle. If you drive west past Safeco Field, parallel to the first-base line, you’re doing so on Edgar Martínez Drive (hang a right at home plate and you’re on Dave Niehaus Way).

He’s the only one of the franchise’s most celebrated players, besides hopefully Felix, to have spent his entire career with the Mariners organization, something Ken Griffey Jr. and Ichiro can’t lay claim to. His game-winning double in the 1995 ALDS is the Seattle Sports Moment to many, and it was just a quarterfinal, Super Bowl/’79 NBA Finals/1917 Stanley Cup be damned.

He’s also the greatest designated hitter of all time, and if not the greatest (which he is, in a completely objective manner of speaking untainted by my personal preferences) the player who perfectly typifies the designated hitter position. For years he has been the barometer of the DH, the mark by which all who came before and all who shall come after will be evaluated – Frank Thomas and Jim Thome weren’t purebred designated hitters the way Edgar was. Although Edgar won his first batting title playing third base, a hamstring injury relegated him to the DH role, so he was the only one of the three to primarily play designated hitter for most of his career, unlike Thomas and Thome, who spent much more time in the field. Also, Frank Thomas was an absolutely filthy pitcher in Backyard Baseball 2003, as well as complete garbage as a hitter in that very same game, so don’t you tell me that designated “hitter” is his primary position.

But whatever Edgar Martínez meant to the DH position went tenfold for the Mariners. For me Edgar’s retirement was in a way the turning of a page for the franchise, but his loss as a Mariners icon could ostensibly be counteracted somewhat by roster stalwarts Ichiro, Dan Wilson, Bret Boone, and Jamie Moyer (duh), among others.

What would change almost irreparably was the Seattle Mariners’ designated hitter slot that Martínez vacated when he retired.

2015 marks the Mariners’ eleventh season since Martínez’s retirement, meaning that a full decade has passed in the majors without Edgar on the Mariners. The first step in our journey into an Edgar-less world begins with a ranking of cumulative fWAR at the DH position by team from 2005 (the Mariners’ first season without Martínez) to 2014.

The first thing you’ll notice seems pretty intuitive, which is that the top 15 teams are all American League teams – this obviously makes sense because the DH rule is only applicable in the American League, so it would logically follow that the fifteen American League teams would have accrued the most Wins Above Replacement in the major leag-

wait, hold up – philadelphia????????

where are the mariners

lemme just scroll down to find them, one sec

…

….

…………

uhhhhhhhhhhhh

So Mariners designated hitters rank 27th, which means that between the Mariners and the Astros, twelve – TWELVE National League teams, who only employ a designated hitter for a handful of games per season (interleague away games to be exact), have produced more fWAR at that position over the last decade than the Mariners, who use a designated hitter on pretty much a daily basis.

In fairness, the explanation for this seems logical: the defensive fWAR penalty for the designated hitter position (the highest of any defensive position) is cumulative – it increases with the amount of innings played at any given position. The Mariners have accrued ungodly of amounts value ~~above~~ below replacement by trotting out consistently bad DH production, whereas the magnitude of damage a National League DH can do to his team’s aggregate replacement value is limited by sample size. Something that I don’t completely understand is that FanGraphs’ data on National League players DHing seems to be incomplete, but maybe it just has something to do with eligibility not lining up with FanGraphs data or the fact the sample size of National League DHs is inadequte. Perhaps having all National League DHs accounted for just wouldn’t be worth the effort or be statistically significant.

Even if we remove the National League from the equation, the Mariners are still dead last in their own league by quite a bit. The next-worst team, Houston, has been in the American League for all of two full seasons and has managed to comfortably outpace Seattle (upon further examination this is made more impressive by the fact that one of those seasons, Chris Carter’s 37-dinger campaign in 2014, doubles Seattle’s cumulative fWAR over the entire decade in magnitude). But then again as stated before, Seattle have incurred a penalty for having adhered to the DH rule since its inception, whereas Houston have only had two seasons to let the DH penalty pile up.

In order to ascertain exactly what shenanigans could have gone down with the M’s DH position such that all Seattle DHs from 2005-2014 collectively managed to produce fewer wins than Edgar’s farewell 2004 season (his worst by fWAR, totaling -0.5), here’s a fond look back at some of the Mariners’ highlights at DH over the past decade (min. 100 PA), ordered by total fWAR produced for the Mariners.

1. Russell Branyan, 2009 & 2010, 125 wRC+, 3.4 fWAR

FanGraphs lists Branyan first in highest total accumulated fWAR post-Martínez for any Seattle DH, which makes sense, because his 2009 116-game stint with the Mariners was quite good-until you realize that he played all of those games for the Mariners at first base, meaning that 2.7 wins of these 3.4 weren’t even put up from the designated hitter slot. Fun fact: Branyan’s 2009 season, in which he swatted 31 homers and posted 126 wRC+, is the only season since 2006 in which a Mariners player has hit 30 homers (2013 Ibañez missed this mark by just one home run).

On the bright side, Branyan returned to the Mariners via trade in June of 2010, and this time he actually put in some time at DH. He managed to do quite well for himself, with 121 wRC+ in 238 plate appearances, and 25 of his 44 hits went for extra bases. After a couple 2011 stints with Arizona and Los Angeles, he then decided that playing for Seattle had ruined the major leagues for him and went on to play only in minor-league and Mexican league games for the rest of his career.

2. John Jaso, 2012, 143 wRC+, 2.6 fWAR

Jaso shared catching and DH duties with Jesus Montero in 2012, and in only 108 games became the second-most valuable position player on the team by fWAR behind Kyle Seager. Jaso also had the third-highest walk of rate of any position player in baseball (min. 350 PA) in 2012 and was Felix Hérnandez’s batterymate for his perfect game against the Tampa Bay Rays.

The Mariners then infamously dealt Jaso to division rival Oakland in a three-team deal that yielded a return of Michael Morse, former Mariner just coming off a career year with Washington in 2011. Morse played half of the 2013 season before injury caught up to him and he was shipped off to Baltimore.

Jack Z could lead the Mariners to 4 consecutive titles and be the executive of the year each of those years and Mariners fans (myself included) would probably still find time to complain about the Jaso-Morse Trade for some reason. Jaso is currently back with Tampa Bay and is currently injured, but his 2013 and 2014 seasons were still much better collectively than what the Mariners fielded during that same time period.

3. Mike Sweeney, 2009-10, 111 wRC+, 0.7 fWAR

At the same time Russell Branyan was busy dirtying himself in the field and hitting big hits, the Mariners extended 2 non-roster invites in 2 consecutive years to great Royals player, great hitter, and great all-around guy Mike Sweeney (who doesn’t have a picture on his Wikipedia profile, but Yuniesky Betancourt does….?). Making the team both times, Sweeney was a productive hitter for the Mariners in 2009 and 2010 and was traded to the Phillies, they of the 15th-ranked DH fWAR, in time for the NLDS, where he was able to collect a hit in his first and only postseason at-bat (against Aroldis Chapman, no less). He then signed a one-day contract to retire a Royal. It’s nice to feel good about something tangentially Mariners-related every once in a while, especially because after we move past Sweeney on this list things start to get a little dicey.

4. Kendrys Morales, 2013 & 2014, 108 wRC+, 0.2 fWAR

Morales was one of the better hitters on a largely uninspiring 2013 Mariners team, boasting a .342 wOBA and a 119+ wRC, both of which would have led the team if not for an incredibly strange Raul Ibañez season. The Twins signed Morales as a free agent the following offseason for $7.6 million, which was kind of strange but made sense if they could turn that value into something. It turns out that something was the Mariners’ Stephen Pryor, and Morales ended up back in Seattle, where he had effectively iced his chances of cashing in on a qualifying offer from the Mariners a year ago. In 2014, Morales put up -0.8 fWAR for Minnesota and -1.0 fWAR for Seattle, by far the worst year of his career.

5. Jack Cust, 2011, 97 wRC+, -0.1 fWAR

A profile of Jack Cust came up on the Jumbotron at Safeco Field once while I was in attendance at a Mariners home game, in which he stated that his favorite quote was “play hard”, a nugget of wisdom Cust attributed to himself. The combination of that Jumbotron quote and Cust’s .116 ISO for the Mariners in 2011 continues to be one of the more perplexing relationships I’ve observed to date, as is the 97 wRC+ (Cust’s wRC on the season was 20).

6. Jose Vidro, 2007-08, 94 wRC+, -0.2 fWAR

Vidro, who leads the players on this list in plate appearances with 955, spent almost two seasons with the Mariners (2007 & 2008). His 2007 season with Seattle wasn’t bad – his .775 OPS was just above league average, his 10.1 BB% was just above league average, and his .345 wOBA was just above league average. Vidro decided to use 2008 to erase most of the solid work he had done in the previous year, cutting his walks in half, getting on base 70% as often, and found himself designated for assignment and later released that summer.

7. Greg Dobbs, 2004-06, 74 wRC+, -0.3 WAR

Dobbs didn’t really play enough to be remembered as outright terrible, usually taking backup duty at third base and then being relegated to a pinch-hitting/DH role in 2005/2006. Before we move on, there’s a couple interesting notes about Dobbs.

First, Greg Dobbs hit a home run in his first at-bat with the Mariners (and his first major league at-bat), which would turn out to be the only home run he would hit in 2004. This isn’t particularly notable except for the fact that it always reminds me that Miguel Olivo also homered in his first major league at-bat, which I will never forget for some incredibly frustrating reason. “Miguel Olivo homered in his first major league at-bat. He was with the Chicago White Sox and the home run was off Andy Pettitte”. This useless piece of information has been wasting my neural capacity ever since I read it on some Mariners gameday program, which I think had Richie Sexson on the cover, so I’m jointly blaming Sexson and Olivo both for forcing me to remember that information and also for being pretty underwhelming with the Mariners.

The second thing is Dobbs’ 2006 season, in which he was only around for 28 PA in 23 games, 18 of which he came in as a pinch-hitter. In those games he managed 150 wRC+ on a cool .435 BABIP. He also walked 0 times. Ultimately it was only 28 PA, an absurdly silly sample size, and clearly the Mariners felt similarly, because they waived out of him in 2007, whereupon Pat Gillick, now with the Phillies, decided to take a second chance on Dobbs long enough for him to earn a World Series ring in 2008.

8. Jeff Clement, 2006-09, 90 wRC+, -0.3 fWAR

Yeah, let’s just not go there.

9. Milton Bradley, 2010-11, 83 wRC+, -0.5 fWAR

Bradley contributed -0.5 fWAR in his time with the Mariners and interestingly enough, being bad at baseball may have been one of the better things he did in Seattle.

10. Carl Everett, 2006, 72 wRC+, -0.8 WAR

Carl Everett was a two-time All-Star in Boston, put up a six-win season with the Astros, and found himself signed by the Mariners for the 2006 season, where he had his worst offensive season by far. Everett was league-average in B% and K% and that was about it. Everett was released in July of 2006 at the age of 35, having posted -0.8 fWAR in 92 games. You could say he was getting to be a bit of a dinosaur, but don’t tell Carl Everett anyone said that about him.

11. Ben Broussard, 2006-07, 88 wRC+, -0.8 fWAR

Broussard was acquired by the Mariners from Cleveland in the second half of the 2006 season, filling in mostly at DH after Carl Everett’s dismissal; in that time-frame Broussard’s figure of 78 wRC+ is not particularly inspiring, and only slightly bests Everett’s. Broussard then spent most of his first and only full season (2007) for the Mariners switching between first base and the corner outfield spots, His overall 88 wRC+ was a slight improvement but still not great, especially while the man Bill Bavasi gave up to acquire Broussard, Shin-Soo Choo, has produced 24.3 fWAR since the 2007 season.

Back to Broussard, though:

12. Eduardo Perez (2006), 48 wRC+, -0.8 fWAR

I don’t think I really have to say anything here.

13. Jesus Montero (2012-), 83 wRC+, -0.9 fWAR

Jesus Montero is now somewhat of a tragic figure among Marinerds. In the 2012-2013 offseason, Montero claimed that he had a coach who had helped him ‘learn to run’, which I guess if you haven’t yet is probably a good idea. He then cut his 2013 season short by getting suspended for his involvement in the Biogenesis snafu. Last August, he got into a bizarre altercation. Montero’s role in the organization has strayed far from the top hitting prospect the Mariners traded Michael Pineda(who’s now the #3 in New York) for. Montero continues to toil away in the Mariners’ farm system (reports out of Tacoma yesterday were that he legged out an infield single), in the hopes he can top 2013’s terrible 64 wRC+. Montero is still only 25, so it’s not as if all hope is lost, but the fact remains that his on-field production at the major league level has been nothing short of disappointing.

14. Ken Griffey, Jr. [1989-99(omitted), 2009-10], 84 wRC+, -0.9 fWAR

Nobody was expecting 90s Griffey Jr. the ballplayer, which was convenient because he didn’t show up. Griffey was only mediocre in 2009, with a wRC+ of 97 and a wOBA of .324, rendering him merely replacement-level (0.0 fWAR). 2010 was a different story. Griffey Jr. posted a depressingly bad 32 wRC+ in 108 plate appearances. Amid issues with Don Wakamatsu restricting his playing time and the bizarre rumor of Griffey napping during a potential pinch-hit opportunity, things came to a head in June of 2010, when Griffey abruptly left the club, drove home, and announced his retirement before the next days game. The Kid’s return to Seattle was a welcome dip into the nostalgia-drenched coffers of yesteryear for a struggling ballclub, and before anyone had time to process it, the sweetest swing in baseball was silenced in a flash.

15. Corey Hart (2014), 70 wRC+, -1.1 fWAR

Hart has the lowest single-season WAR of any player on this list. The Mariners paid $6 million with $7 million in undisclosed incentives for a year of Hart coming off a knee surgery that caused him to miss all of 2013, which at the time seemed to be a reasonable gamble for a high-risk commodity that could potentially have paid great dividends. Unfortunately, Hart struggled to stay healthy and perform. playing only 68 games in the season; the midseason acquisition of Kendrys Morales certainly didn’t help. Ultimately the Mariners’ front office decided to take a gamble with Hart and lost.

BONUS ROUND (fun with small sample sizes): Scott Spiezio in 2005

Scott Spiezio in 2004 was simply a bad infielder and player, posting a miserable .288 OBP, 67 wRC+, among other poor statistics on his way to putting up a below replacement-level season of -0.1 fWAR.

Big deal though – it’s one bad season. Besides, if you look up on that same list, Carl Everett put up -0.8 fWAR in 92 games. Eduardo Perez did it in 43!

This is nothing to Scott Spiezio.

Again, the sample size here is ridiculously small, but the absurdity of the numbers he managed to log in 29 games is honestly kind of impressive. The Mariners released him from here and he then went on to win the World Series with St. Louis, even hitting a game-tying triple in Game 2 of the 2006 NLCS, so fortunately he was of some use to a team after that 2005 season.

In fairness, the designated hitter is not an incredibly stable position in today’s game. It doesn’t make sense to pay a premium for a skill that can be replicated by other hitters who can also play competent defense. There are only a handful of “conventional” designated hitters in the league, and even Victor Martinez and his -31.1 UZR/150 in 2014 are called upon to play defense every once in a while. For whatever reason – be it sentimentality with Griffey or having the odds stacked against them with Hart or just because Bavasi (Clement, Broussard), the Mariners have gotten an extraordinarily poor level of performance out of their designated hitters in an Edgar-less world, with some bright spots (Jaso, Branyan, 1 season of Kendrys Morales).

This season, Seattle will call on 2014 home-run king Nelson Cruz to fill in most nights at DH (at least as long as Seth Smith and Justin Ruggiano are healthy enough to man right so Cruz doesn’t have to). In 10 years of trying to fill Edgar’s place, the Mariners haven’t quite succeeded and probably won’t ever do so, but wouldn’t it be something to see them come close?

Gausmanian Distribution

by 1908

April 16, 2015

At the end of spring training, Buck Showalter banished Kevin Gausman from the rotation in favor of Ubaldo Jimenez, a pitcher with a much higher salary and much less talent. Many assumed that Jimenez’ salary largely dictated the move. Yes, he outpitched Gausman in spring training (4.44 ERA to 7.04), but it’s hard to believe that Showalter invests very much in spring training stats, and in any case if you put “4.44” into Google Translator, “success” is unlikely to be one of the resulting character strings.

One Orioles fan of my acquaintance heard that Showalter’s decision had more forethought: Buck’s intent may be to use Gausman much as the fireman reliever of old, and bring him in to critical situations in ballgames regardless of today’s ossified reliever usage patterns. Bill James long ago established that this is the most effective way to use a top-flight reliever, but it is less clear that this is the best way to use a potential #1 starter. Gausman is the only pitcher on the Orioles 25-man roster who has even a prayer of turning into an ace, and it seems unlikely he’ll do it from the pen.

Gausman’s had a somewhat unusual start to his career. In his first two years as a major leaguer, he started 25 games and made 15 relief appearances. There are a total of 15 active pitchers who had at least 25 starts and 15 relief appearances in their first two years:

Rk	Player	bWAR	GS	GR
1	Mark Buehrle	6.6	35	25
2	Roberto Hernandez	6.4	39	31
3	Alexi Ogando	5.6	29	46
4	Chad Billingsley	5.1	36	25
5	Alex Wood	4.8	35	31
6	Jeremy Affeldt	4.4	25	45
7	Joe Kelly	3.3	31	30
8	Josh Collmenter	3.2	35	24
9	Justin Masterson	3.2	25	53
10	Lance Lynn	2.8	31	22
11	David Hernandez	1.6	27	34
12	Kevin Gausman	0.7	25	15
13	Derek Holland	-0.1	31	16
14	Craig Stammen	-0.3	38	16
15	Bronson Arroyo	-0.8	25	19

(Table courtesy of the invaluable Baseball Reference Play Index)

It’s certainly an eclectic mix. Only Buehrle established himself as an ace, though Arroyo has had a good career as a mid-rotation workhorse, and Masterson and (to a lesser extent) I-Can’t-Believe-It’s-Not-Fausto-Carmona have made useful contributions. For other starters on this list (Wood, Kelly) it’s too soon to tell. Affeldt and Stammen wisely gave up starting and have become bullpen mainstays. More sobering, many of the names on this list have had their careers derailed by injuries. It’s hard to know whether the mixed usage contributed to injury problems for guys like Ogando, Billingsley, and Holland; it is equally possible that conserving these young arms early may have averted even more serious or earlier arm trouble.

Gausman sits uneasily here; he is by far the highest drafted pitcher on this list (fourth overall in 2012). It is unsurprising to see a club experiment with a 38th-round pick who struggles to break a pane of glass, like Buehrle. Such tinkering is less common with a player drafted to be a rotation anchor. Indeed, there are only two other first-rounders on this list, Billingsley and Lynn.

In his first season (2006), Billingsley started 16 games and came in from the bullpen twice. He put up a respectable 3.80 ERA, but with atrocious peripherals (5.8 BB/9, 5,9 K/9). The Dodgers understandably exiled him to the bullpen to start the 2007 season, but Dresden-like pyrotechnics from Proven Veterans Mark Hendrickson, Brett Tomko, and Jason Schmidt forced the Dodgers to put Billingsley back in the rotation in June, and he acquitted himself reasonably the well the rest of the way. He would go on to have uneven success over the next four seasons until diagnosed with a torn UCL in September 2012. He has pitched in two major league games since.

Lance Lynn offers a happier comp for Gausman. He appeared largely in relief (2 starts in 18 games) in 2011. Despite Kyle McClellan’s runtastic performance as the Cardinals’ fifth starter, LaRussa elected not to insert Lynn into the rotation; the Cardinals instead traded for Edwin Jackson, who stabilized the fifth spot. This seems similar to Showalter’s choice: go with the established if not necessarily dominant veteran in lieu of the risky young flamethrower. Lynn had put good numbers in 2011 at AAA, but not in 2010. The Cards’ reluctance to turn over a rotation spot to him in the midst of a playoff run was understandable. Lynn has been in the rotation since 2012, and has consistently produced very close to his career marks of 3.32 FIP and 2.71 K/BB, despite some jumpiness in his ERA.

Both these examples tend to suggest Showalter is making a mistake. The Dodgers finally ran out of Jason Schmidts, while the Cards went with the good-enough E-Jax (and, to be fair, won the World Series). But in each case the young replacement would quickly prove himself superior to the older and supposedly safer option when finally given the chance. There are very few who would predict that, over the course of 30 starts, Jimenez will outperform Gausman in any significant statistical category.

But Showalter has other things on his mind. Specifically, this:

#27 Orioles

Name	IP	ERA	FIP	WAR
Chris Tillman	184.0	4.10	4.40	1.4
Wei-Yin Chen	169.0	4.04	4.17	1.5
Miguel Gonzalez	157.0	4.42	4.84	0.5
Bud Norris	154.0	4.15	4.30	1.1
Ubaldo Jimenez	146.0	4.28	4.38	0.9
Kevin Gausman	91.0	3.97	4.00	0.9
Dylan Bundy	18.0	4.40	4.56	0.1
Total	919.0	4.17	4.38	6.4

Yep, this is the FanGraphs Depth Chart projection for the Orioles starting rotation, with the O’s ranked 27th out of 30. Not a single starter checks in with a FIP under 4.00. This is a shaky rotation, and the Orioles have no quick way of making it better. Eventually, perhaps as early as next year Gausman, Bundy, and Hunter Harvey will form an enviable top 3, but there’s another problem on Buck’s plate. Next year, much of the current roster may be lost to free agency, including Chris Davis, Matt Wieters, Chen, and Norris. The Orioles are under enormous pressure to win now.

And Gausman can help! Because at this stage of his career, he is a much better reliever than starter. The big difference is in strikeouts:

AL average starter K/9: 7.1

AL average reliever K/9: 8.3

Gausman as starter K/9: 7.0

Gausman as reliever K/9: 11.7

That there is some major whiffage for a staff in dire need of it. Put Gausman together with Zach Britton, Darren O’Day, and Tommy “Big Game” Hunter, and the Orioles have a fully weaponized bullpen. Buck’s plan is to hold on for the first five or six innings, and them shut down the opponent’s offense while the Orioles bats bludgeon their way to victory. And with Gausman acting as a mobile reserve, Showalter can shrink the innings for which the starters are responsible, but do so on a game-to-game basis. On those days when the starters happen to be effective they can go longer, and on those days (more often than not, one suspects) that they get into trouble, Showalter will be able to address some of that trouble with the best arm on the roster.

This isn’t the way I would ordinarily do it, but then again, this isn’t the roster I would have assembled. Showalter has repeatedly shown an ability to work with the tools he has rather than impose some prefabricated tactical rule set that disregards the strengths and weaknesses of his players. Baltimore’s road to the playoffs is neither straight nor sure, but at least it’s Showalter behind the wheel.

Insurance in Baseball is Like a Black Hole

by Titubs

April 11, 2015

How much gravity does insurance have in Major League Baseball front office decisions?

Puns aside, let me tell you the funny thing about a black hole. You see astronomers cannot really see one, instead they are detected through their effects on the universe around them. Although less extreme, insurance is similar in this regard on its impact on baseball teams. Most teams insure some of their larger contracts in case their players cannot play due to an exterior factor such as injury. Perhaps the impacts of this major facet of the game does not cross our mind often because it is not eminently visible. However, make no mistake that insurance is a major factor when teams make major decisions regarding the DL, contract extensions, playing time and so on.

First, consider the history of baseball insurance to better understand why it impacts baseball. It can be said that by the late 1990s it had become common place for teams to insure their larger contracts. The first time baseball and insurance first truly started getting media attention was with Albert Belle in 2001 due to confusion over his insurance contract. Albert Belle had suffered a career-ending hip injury with the Orioles. Fans grew excited however despite the disheartening news when in 2002 the all-star slugger was added back onto the Orioles’ 40 man roster. Disappointed Orioles fans can tell you that Belle never played another MLB game however. Instead, he was added back onto the 40-man roster so the Orioles could collect insurance on his contract (some MLB insurance contracts do not cover a player unless they remain on the 40-man roster). At the time the MLB insurance contract covered the remainder of the salary owed on Belle’s contract. According to writer Michael Branda, the Orioles recovered an astounding 27.3 million out of a 39-million-dollar loss represented by Belle’s injury. Since the huge losses on Belle’s contract in the early 2000s, insurers in baseball have become much more stringent with their underwriting in baseball contracts.

Belle was not the only reason for teams being more stringent with their underwriting. The associated risks with insuring a MLB player have increased. One reason is PEDs. The MLB really did not enforce its ban on PEDs up until the late 2000s, but now being caught using PEDs can result in a significant loss in playing time. According to MLB Trade Rumors a syndicate of the MLB, Ervin Santana a first-time offender was suspended for 80 games on April 3^rd of 2015. Santana was expected to make 13.5 million dollars this season. Despite being suspended, the Minnesota Twins are expected to still have to pay half of Santana’s salary (only players caught for the second time or more for steroids lose most of their salary).

To account for this insurers enforce a 60 to 90 day deductible policy in order to shield themselves from these sort of losses as well as claims made for short-term injuries. In addition to the 90 day deductible insurance policies are typically term policies of about three years with an option for renewal after the conclusion of the contract. As a result, if a contract proves to incur severe losses the liabilities will be only in the short term. Obviously to further protect themselves insurers only cover players with no preexisting injuries and all players must be inspected by the insurer. Furthermore, with wide variability and unpredictability in player health an insurer can in a way readjust its rate every three years to better reflect the player’s risk of not playing.

These more stringent underwriting practices have influenced the game, oftentimes when deciding when to take a pitcher off of the DL. Adam Kilgore of the Washington Post explained it best when he talked about how in 2012 Stephen Strasburg was “shut down” during a playoff push for the Nationals due to potential health concerns. In this decision there appeared to be a delicate balance between contention and the considerations of the insurance company that would not have covered Strasburg if he had been injured due to these health concerns. It is not crazy to think that finances come into play when making a decision on baseball players. Jeff Moorad, a decision maker for the Padres explained that in 2010 Chris Young was eligible to come off the DL. The Padres ultimately chose to put Young back on the field but Moorad added, “the accounting department much preferred that he [had stayed] on the disabled list.”

Baseball insurance has grown steadily more expensive. Although it is difficult to ascertain how much a team actually spends on insurance it is clear that it can be a burden for smaller teams. For instance, the Arizona Diamondbacks’ rotation once featured the dominant starting pitcher Brandon Webb. Between 2006 and 2008 Brandon Webb made three all-star games and finished first, second and second respectively in Cy Young voting. In this span Webb also led the league in innings pitched once. In essence Webb was due for a large payday once his contract expired. However contract negotiations between Webb and the Diamondbacks hit a snag in June of 2008. Despite a track record as a durable starter, no insurance company was willing to write a policy for injury risk to the Diamondbacks hurler. The reason is because insurance companies refused to accept the risk of Webb injuring his arm or if they were willing to, they were going to charge exorbitant rates.

As a result, the Diamondbacks could not get an insurance contract that did not have an exclusion on arm, shoulder or elbow injuries, all vital and injury-prone body parts for pitchers. According to AZCentral, a news outlet that covers the Arizona Diamondbacks, due to Webb not being insurable the Diamondbacks broke off all contract negotiations. In the long run, this proved to be a smart move since ten months later after contract talks ceased Brandon Webb never pitched in the majors again due to injuries. The point of this is that not only is baseball insurance so expensive it can prove to be impractical, it shows that most insurers are unwilling to insure some pitchers. Walt Jocketty, the former general manager for the middle-market St. Louis Cardinals explained that insurance has “become so expensive that it’s a cost item we really have to look at when you put your payroll together.”

In addition to insurance contracts altering how teams manage their rosters they influence how teams treat their players. In essence there is a human side to these contracts. Consider Josh Hamilton who is in the middle of a massive five-year, 125-million-dollar contract with the Angels and who has been playing quite ineffectively relative to his salary. Josh Hamilton who recently suffered a relapse on his drug addiction before the 2015 season is a prime example of insurance influencing teams to treat their players in a way they hopefully normally would not. Despite being a repeat offender, an arbitrator chose not to suspend Hamilton for any period of time. What makes this story scandalous is the Angels’ seemingly acerbic response to this news. It appears that the Angels almost wanted Hamilton to be suspended to spare them the expense of his failed contract. Clearly the Angels have little incentive to help Hamilton recover from his addiction. Instead, it is in their interest to see that Hamilton never plays another game of baseball again because if he does not play for an extended period of time the Angels can potentially collect insurance and definitely reduce their payroll.

Insurance influences baseball more than many people may realize. When it comes to playing time, DL decisions and contract negotiations, insurance seems to be an integral piece in the decision-making process. For me though, part of what makes baseball great is the inherent competition of players, often with disregard to their own body (*cough* Adam Eaton *cough*). There is little harm of teams protecting themselves from the inherent risks of baseball players becoming injured. The risk is that insurance becomes an incentive for teams to make decisions that may be bad for the game, such as not playing players for financial gain. Let’s hope that ultimately, teams do not get engulfed into this black hole.

Hardball Retrospective – The “Original” 1999 Texas Rangers

by DerekBain

April 11, 2015

In “Hardball Retrospective: Evaluating Scouting and Development Outcomes for the Modern-Era Franchises”, I placed every ballplayer in the modern era (from 1901-present) on their original team. Therefore, Fergie Jenkins is listed on the Phillies roster for the duration of his career while the Pirates claim Barry Bonds and the Rays declare Carl Crawford. I calculated revised standings for every season based entirely on the performance of each team’s “original” players. I discuss every team’s “original” players and seasons at length along with organizational performance with respect to the Amateur Draft (or First-Year Player Draft), amateur free agent signings and other methods of player acquisition. Season standings, WAR and Win Shares totals for the “original” teams are compared against the “actual” team results to assess each franchise’s scouting, development and general management skills.

Terminology

OWAR – Wins Above Replacement for players on “original” teams

OWS – Win Shares for players on “original” teams

OPW% – Pythagorean Won-Loss record for the “original” teams

Assessment

The 1999 Texas Rangers OWAR: 50.4 OWS: 284 OPW%: .512

GM Tom Grieve acquired 79% (38 of 48) of the ballplayers on the 1999 Rangers roster. 38 of the 48 team members were selected through the Amateur Draft process. Based on the revised standings the “Original” 1999 Rangers placed six games behind the Mariners in the American League Western Division race. Texas (83-79) claimed the Wild Card by a one-game margin over Chicago and Kansas City.

Perennial All-Star backstop Ivan Rodriguez enhanced his trophy case with the 1999 A.L. MVP award. “Pudge” produced a .332 BA while notching career-bests in home runs (35), RBI (113), runs scored (116), base hits (199) and stolen bases (25). Rodriguez collected 13 Gold Glove Awards including 10 in consecutive seasons (1992-2001). “Slammin’” Sammy Sosa launched 63 moon-shots, drove in 141 baserunners and registered 114 tallies. Juan “Igor” Gonzalez belted 39 round-trippers, knocked in 128 runs and delivered a .328 BA after an MVP season in the previous campaign.

Fernando Tatis (.298/34/107) enjoyed a career year over at the hot corner, scoring 104 runs and swiping 21 bags. Rusty Greer clubbed 41 doubles, 20 big-flies and plated 101 baserunners while eclipsing the .300 mark for the fourth successive season. Rich Aurilia (.281/22/80) and Mike Stanley (.281/19/72) supplied additional thump towards the bottom of the lineup. Warren Morris parlayed a .288 BA and 15 long balls into a third-place finish in the Rookie of the Year balloting.

Rodriguez slots into 13^th place in “The New Bill James Historical Baseball Abstract” among backstops. He certainly elevated his ranking after playing ten additional years following the publication of NBJHBA in 2001. Right fielders Sosa and Gonzalez are listed in 45^th and 52th place, respectively.

LINEUP	POS	WAR	WS
Warren Morris	2B	1.71	15.4
Ivan Rodriguez	C	5.22	28.63
Fernando Tatis	3B	5.05	23.74
Sammy Sosa	RF	4.98	26.64
Juan Gonzalez	DH/RF	2.88	24.42
Rich Aurilia	SS	3.06	18.11
Rusty Greer	LF	2.32	21.03
Mike Stanley	1B	1.82	13.67
Terrell Lowery	CF	-0.17	3.21
BENCH	POS	WAR	WS
Rey Sanchez	SS	2.59	11.29
Jose Hernandez	SS	2.3	16.33
Dean Palmer	3B	1.04	16.71
Hanley Frias	SS	0.22	4.42
Edwin Diaz	2B	0.17	0.62
Kevin L. Brown	C	0.12	0.58
Jon Shave	SS	0.09	1.98
Bill Haselman	C	-0.04	3.84
Jeff Frye	2B	-0.13	2.35
Ruben Mateo	CF	-0.26	1.9
Kelly Dransfeldt	SS	-0.26	0.8
Chad Kreuter	C	-0.58	3.51

Kevin J. Brown, the undisputed ace of the Texas rotation, compiled a record of 18-9 with a 3.00 ERA, 1.066 WHIP and 221 strikeouts. The balance of the starting staff submitted sub-par efforts in contrast to their career norms. Jeff Zimmerman (9-3, 2.36) fashioned a 0.833 WHIP and received an invitation to the Mid-Summer Classic during his rookie campaign.

ROTATION	POS	WAR	WS
Kevin J. Brown	SP	5.54	19.92
Darren Oliver	SP	3.94	12.45
Rick Helling	SP	3.78	12.52
Kenny Rogers	SP	2.97	11.57
Wilson Alvarez	SP	1.89	9.95
BULLPEN	POS	WAR	WS
Jeff Zimmerman	RP	3.67	14.64
Mike Venafro	RP	1.19	7.36
Mark Petkovsek	RP	0.84	9.49
Terry Mathews	RP	0.31	2.54
Danny Kolb	RP	0.13	1.9
Brian Bohanon	SP	1.63	9.68
Ryan Dempster	SP	1.45	6.98
Jim Brower	SP	0.42	1.87
Robb Nen	RP	0.07	7.89
Danny Patterson	RP	-0.08	2.64
Mike Cather	RP	-0.17	0
Corey Lee	RP	-0.2	0
Jonathan Johnson	RP	-0.26	0
Bobby Witt	SP	-0.28	4.52
Billy Taylor	RP	-0.3	5.54
Dan Smith	SP	-0.34	1.73
Tony Fossas	RP	-0.39	0
Ryan Glynn	SP	-0.42	0
Scott Eyre	RP	-0.66	0
Doug Davis	RP	-0.66	0
Julio Santana	SP	-1	0.17
Matt Whiteside	RP	-1.1	0

The “Original” 1999 Texas Rangers roster

NAME	POS	WAR	WS	General Manager	Scouting Director
Kevin Brown	SP	5.54	19.92	Tom Grieve	Sandy Johnson
Ivan Rodriguez	C	5.22	28.63	Tom Grieve	Sandy Johnson
Fernando Tatis	3B	5.05	23.74	Tom Grieve	Sandy Johnson
Sammy Sosa	RF	4.98	26.64	Tom Grieve	Sandy Johnson
Darren Oliver	SP	3.94	12.45	Tom Grieve	Sandy Johnson
Rick Helling	SP	3.78	12.52	Tom Grieve	Sandy Johnson
Jeff Zimmerman	RP	3.67	14.64	Doug Melvin	Chuck McMichael
Rich Aurilia	SS	3.06	18.11	Tom Grieve	Sandy Johnson
Kenny Rogers	SP	2.97	11.57	Eddie Robinson	Joe Klein
Juan Gonzalez	RF	2.88	24.42	Tom Grieve	Sandy Johnson
Rey Sanchez	SS	2.59	11.29	Tom Grieve	Sandy Johnson
Rusty Greer	LF	2.32	21.03	Tom Grieve	Sandy Johnson
Jose Hernandez	SS	2.3	16.33	Tom Grieve	Sandy Johnson
Wilson Alvarez	SP	1.89	9.95	Tom Grieve	Sandy Johnson
Mike Stanley	1B	1.82	13.67	Tom Grieve	Sandy Johnson
Warren Morris	2B	1.71	15.4	Doug Melvin
Brian Bohanon	SP	1.63	9.68	Tom Grieve	Sandy Johnson
Ryan Dempster	SP	1.45	6.98	Doug Melvin	Sandy Johnson
Mike Venafro	RP	1.19	7.36	Doug Melvin	Sandy Johnson
Dean Palmer	3B	1.04	16.71	Tom Grieve	Sandy Johnson
Mark Petkovsek	RP	0.84	9.49	Tom Grieve	Sandy Johnson
Jim Brower	SP	0.42	1.87	Tom Grieve	Sandy Johnson
Terry Mathews	RP	0.31	2.54	Tom Grieve	Sandy Johnson
Hanley Frias	SS	0.22	4.42	Tom Grieve	Sandy Johnson
Edwin Diaz	2B	0.17	0.62	Tom Grieve	Sandy Johnson
Danny Kolb	RP	0.13	1.9	Doug Melvin	Sandy Johnson
Kevin Brown	C	0.12	0.58	Tom Grieve	Sandy Johnson
Jon Shave	SS	0.09	1.98	Tom Grieve	Sandy Johnson
Robb Nen	RP	0.07	7.89	Tom Grieve	Sandy Johnson
Bill Haselman	C	-0.04	3.84	Tom Grieve	Sandy Johnson
Danny Patterson	RP	-0.08	2.64	Tom Grieve	Sandy Johnson
Jeff Frye	2B	-0.13	2.35	Tom Grieve	Sandy Johnson
Terrell Lowery	CF	-0.17	3.21	Tom Grieve	Sandy Johnson
Mike Cather	RP	-0.17	0	Tom Grieve	Sandy Johnson
Corey Lee	RP	-0.2	0	Doug Melvin
Ruben Mateo	CF	-0.26	1.9	Doug Melvin	Sandy Johnson
Jonathan Johnson	RP	-0.26	0	Doug Melvin	Sandy Johnson
Kelly Dransfeldt	SS	-0.26	0.8	Doug Melvin
Bobby Witt	SP	-0.28	4.52	Tom Grieve	Sandy Johnson
Billy Taylor	RP	-0.3	5.54	Eddie Robinson
Dan Smith	SP	-0.34	1.73	Tom Grieve	Sandy Johnson
Tony Fossas	RP	-0.39	0	Eddie Robinson
Ryan Glynn	SP	-0.42	0	Doug Melvin	Sandy Johnson
Chad Kreuter	C	-0.58	3.51	Tom Grieve	Sandy Johnson
Scott Eyre	RP	-0.66	0	Tom Grieve	Sandy Johnson
Doug Davis	RP	-0.66	0	Doug Melvin
Julio Santana	SP	-1	0.17	Tom Grieve	Sandy Johnson
Matt Whiteside	RP	-1.1	0	Tom Grieve	Sandy Johnson

Honorable Mention

The “Original” 2001 Rangers OWAR: 48.4 OWS: 278 OPW%: .513

Sosa shredded opposition pitching to the tune of a .328 BA while launching 64 moon-shots, registering 160 RBI and scoring a League-best 146 runs. Aurilia delivered career-bests with a .324 BA, 37 dingers, 97 ribbies and 114 tallies as he topped the circuit with 206 safeties. Gonzalez swatted 35 big-flies and knocked in 140 baserunners. Zimmerman notched 28 saves and Brown furnished a 2.65 ERA in 19 starts.

On Deck

The “Original” 1924 Senators

References and Resources

Baseball America – Executive Database

Baseball-Reference

James, Bill. The New Bill James Historical Baseball Abstract. New York, NY.: The Free Press, 2001. Print.

James, Bill, with Jim Henzler. Win Shares. Morton Grove, Ill.: STATS, 2002. Print.

Retrosheet – Transactions Database

Seamheads – Baseball Gauge

Sean Lahman Baseball Archive

Are Two Opening-Day Homers Merely Dust-in the Wind?

by jacaissie

April 9, 2015

As a Red Sox fan, I got very excited opening day when Dustin Pedroia hit two home runs. One of the big questions of this offseason is whether he has upper-single-digit homer power, or upper-teens homer power. Of course, as a thinking baseball fan, my head tells me to avoid getting overly excited about a small sample size. But does the two-HR outbreak actually tell us nothing? I think the expectations going into the season combined with Pedroia’s performance in his first game is a perfect situation to use Bayes’ Theorem.

To elaborate, I think Pedroia’s expectations going into this season have a bimodal distribution. If you look at his 2008-2012 seasons, he averaged 16 HR per year. His last two seasons averaged 8 HR per year. Was this due to a real decline, or due to injuries that sapped his power? While someone like Mike Trout might have a nice normally-distributed expectation around 35 HR, I expected Pedroia to have an either/or season: he’d either get back to 2008-2012 production, or continue as a 8-HR guy.

Now for a review of Bayes’ Theorem: it tells you how to update your prior beliefs given an observation. The formula for this is P(A|B) = P(B|A)*P(A)/P(B), where A and B are events, P(A) and P(B) are the probabilities of those events, and P(A|B) or P(B|A) should be read as “Probability of A given B,” or “Probability of B given A,” respectively. Specifically, in this case, A is “Dustin Pedroia is a 16-HR guy”, and B is “Dustin Pedroia hit 2 HR in his first game of the season”. I had a preseason belief about P(A), but I want to update it given that event B has occurred.

As implied above, I’m going to simplify Pedroia’s season outcomes into two possible outcomes: He is an 8-HR guy, or a 16-HR guy. Before the season, I’m going to guess that I had about a 50-50 belief that he was either one. Another assumption I’m going to make, to make the math easier, is that a season will see 640 plate appearances. You can make your own assumptions, but this is a demonstration of how much Bayes’ Theorem helps us update beliefs based on just one observation.

We need to determine three quantities to do our calculation now:
1. P(A)—probability that Pedroia is a 16-HR guy
2. P(B|A)—probability that we would see Pedroia hit 2 HR in his first 5 plate appearances, given that he is a 16-HR guy
3. P(B)—probability that we would see Pedroia hit 2 HR in his first 5 plate appearances (taking our 50-50 chance that he’s a 16 or 8-HR guy as a given)

1. Probability that Pedroia is a 16-HR guy

Easy. By assumption, P(A) is 50%.

2. Probability that we would see Pedroia hit 2 HR in his first 5 plate appearances, given that he’s a 16-HR guy

Tougher, but we can use a binomial probability model. That is 5C2*P(HR)^2*(1-P(HR))^3. When we have 16 HR in 640 plate appearances, P(HR) is 1/40, and 1-P(HR) is 39/40. This turns out to be .00579. P(B|A)= 0.579%.

3. Probability that we would see Pedroia hit 2 HR in his first 5 plate appearances, with preseason assumptions

This is the weighted average of all his possible season outcomes—so probability of 2HR in 5PA, given that he is a 16-HR guy, times the chance that he’s a 16-HR guy, PLUS, probability of 2HR in 5PA, times the chance that he’s an 8-HR guy. The same calculation as in number 2 can be done for if he’s an 8-HR guy, yielding an answer that the chance that he’d hit 2HR in 5PA is 0.151%. Given our calculation in the above paragraph, and our preseason assumption that it’s 50-50 that he’s an 8 or 16-HR guy, that gives us a weighted average P(B) = 0.365%.

So now we can mash all of those numbers into Bayes’ equation, and we find that .50*.00579/.00365 = .794, or 79.4%! Turns out that my Red Sox-loving lizard brain was not wrong! If you believed preseason that there was a 50%-50% chance that Pedroia would return to his 2008-2012 form, you should rationally update your beliefs to 80%-20% on the minuscule sample size of just two home runs in five plate appearances! Another note is that we should be forward-looking: since he has nearly a full season of plate appearances remaining, it might be rational to think that he’s likely to be an 18-HR guy, now that he has 2 in the bag.

This method could be adapted to a continuous expectation of outcomes, allowing a chance that Pedroia might be something besides an 8HR guy or a 16HR guy (although you and I know that that is clearly absurd).

« Previous Page — « Previous entries

Next entries » — Next Page »

BAL	CHW	ATH
BOS	CLE	HOU
NYY	DET	LAA
TBR	KCR	SEA
TOR	MIN	TEX

ATL	CHC	ARI
MIA	CIN	COL
NYM	MIL	LAD
PHI	PIT	SDP
WSN	STL	SFG