2013 in Baseball: Without the Luck

DISCLAIMER: I know certain players are more likely to outperform/underperform the league-average BABIP based on their specific player profiles. This is just a fun exercise to consider if everyone’s “luck” was the same.

With that disclaimer out of the way, I began wondering who the best/worst hitters are in baseball if batted-ball luck didn’t figure into the equation. We hear analysis frequently about Player X who’s having a breakout year, and the refrain is consistently that he is having better luck on batted balls than he had been having in the past. For example, BABIP was one of the main reasons cited for how Chris Johnson batted .321 in 2013 after hitting .268 over the previous two seasons. Many people look for fantasy sleepers based on a much lower than normal BABIP. The effects of BABIP are undeniably real and have been well documented. If we take BABIP out of the equation though, who rises to the top?

Before we get to the results, let me go over my methodology. It’s extremely simple, and you can probably guess how this is done. If you don’t want the boring details, please skip ahead. The first step to these calculations was to keep all factors not included in the BABIP formula constant. Each player still hits the same number of home runs. Each player still walks at the same rate. Each player still strikes out the same amount. The only component that changes is hits that don’t leave the yard (1B, 2B, 3B). I took the denominator of the BABIP equation for each player (AB-K-HR+SF) and multiplied it by the league-average BABIP (.297). This gives us the number of non-HR hits a player would have tallied if luck was removed. To get the number of singles, doubles, and triples each player hit, I took the ratio of Actual Hit Type/Actual Total non-HR and multiplied by expected non-HR. For example, Mike Trout hit 115 singles, 39 doubles, and 9 triples in 2013. That means that 70.6% of his non-HRs were singles, 23.9% were doubles, and 5.5% were triples. When adjusted for BABIP, we would expect Trout to hit roughly 129 non-HRs this past season. Multiplying 129 by the component percentages gives us roughly 91 singles, 31 doubles, and 7 triples. Everything else remains the same.

Batting Average Leaders

To answer the question posed in the introduction, we can look at many different stats. We already discussed how much an effect BABIP can have on a batting average, so maybe we should start there. For what it’s worth, the MLB leader in BABIP in 2013 was Chris Johnson at .394, and the worst BABIP belonged to Darwin Barney at .222.

AL Adjusted Batting Average Leaders – 2013 (min. 500 PA)

Player

2013 AVG (Adjusted)

2013 AVG (Actual)

Difference

Edwin Encarnacion

.313

.272

+.041

Miguel Cabrera

.304

.348

-.044

Adrian Beltre

.295

.315

-.020

Coco Crisp

.294

.261

+.033

J.J. Hardy

.291

.263

+.028

 

NL Adjusted Batting Average Leaders – 2013 (min. 500 PA)

Player

2013 AVG (Adjusted)

2013 AVG (Actual)

Difference

Andrelton Simmons

.291

.248

+.044

Martin Prado

.290

.282

+.008

Norichika Aoki

.288

.286

+.002

Jonathan Lucroy

.287

.280

+.007

Yadier Molina

.283

.319

-.035

Looking at those tables, the first thing that jumps out to me is that only two players (Edwin Encarnacion and Miguel Cabrera) in all of Major League Baseball would have hit .300 last year if luck is removed. The American League seems to possess better luck-independent hitters as the NL “batting champ” would have finished tied for fifth in the AL. Also, if Andrelton Simmons could actually hit .291 each season, he’d be an MVP candidate. I also find it interesting to look at which players benefited and suffered the most from their respective BABIPs.

Most Positive Batting Average Changes – 2013 (min. 500 PA)

Player

2013 AVG (Adjusted)

2013 Average (Actual)

Difference

Darwin Barney

.273

.208

+.065

Andrelton Simmons

.292

.248

+.044

Dan Uggla

.220

.179

+.042

Edwin Encarnacion

.313

.272

+.041

Matt Wieters

.275

.235

+.040

 

Most Negative Batting Average Changes – 2013 (min. 500 PA)

Player

2013 AVG (Adjusted)

2013 Average (Actual)

Difference

Chris Johnson

.248

.321

-.073

Joe Mauer

.257

.324

-.067

Michael Cuddyer

.267

.331

-.064

Mike Trout

.265

.323

-.058

Freddie Freeman

.264

.319

-.055

On-Base Percentage Leaders

Perhaps we shouldn’t limit ourselves to just simply batting average. Isn’t it more important to avoid outs that it is to just get hits? Let’s look at the OBP results.

AL Adjusted On-Base Percentage Leaders – 2013 (min. 500 PA)

Player

2013 OBP (Adjusted)

2013 OBP (Actual)

Difference

Edwin Encarnacion

.406

.370

+.035

Miguel Cabrera

.404

.442

-.037

Mike Trout

.384

.432

-.047

Jose Bautista

.383

.358

+.025

David Ortiz

.379

.395

-.016

 

NL Adjusted On-Base Percentage Leaders – 2013 (min. 500 PA)

Player

2013 OBP (Adjusted)

2013 OBP (Actual)

Difference

Shin-Soo Choo

.399

.423

-.024

Joey Votto

.399

.435

-.037

Paul Goldschmidt

.373

.401

-.027

Matt Holliday

.372

.389

-.017

Troy Tulowitzki

.366

.391

-.025

Once again, only two hitters (Encarnacion and Cabrera) would have reached based at a .400 clip. A trend is definitely starting to emerge. The gap between the AL and the NL is much less pronounced here though. As for the biggest changes in the MLB, consider the following tables.

Most Positive On-Base Percentage Changes – 2013 (min. 500 PA)

Player

2013 OBP (Adjusted)

2013 OBP (Actual)

Difference

Darwin Barney

.325

.266

+.059

Andrelton Simmons

.337

.296

+.041

Matt Wieters

.323

.287

+.036

Edwin Encarnacion

.406

.370

+.036

Dan Uggla

.344

.309

+.035

 

Most Negative On-Base Percentage Changes – 2013 (min. 500 PA)

Player

2013 OBP (Adjusted)

2013 OBP (Actual)

Difference

Chris Johnson

.289

.358

-.069

Joe Mauer

.345

.404

-.059

Michael Cuddyer

.331

.389

-.058

Allen Craig

.323

.373

-.050

Freddie Freeman

.347

.396

-.049

As you might expect, these tables don’t look all that much different from the batting average change tables. Other than some reordering, the only difference here sees Allen Craig replace Mike Trout on the most negative change table.

On-Base + Slugging Leaders

Getting on base a lot is a promising start, but you win baseball games by scoring runs. What hitters were best at driving the ball while avoiding outs? Let’s look at the OPS results.

AL Adjusted On-Base + Slugging Leaders – 2013 (min. 500 PA)

Player

2013 OPS (Adjusted)

2013 OPS (Actual)

Difference

Edwin Encarnacion

.993

.904

+.088

Miguel Cabrera

.988

1.078

-.090

Chris Davis

.953

1.004

-.051

David Ortiz

.918

.959

-.041

Jose Bautista

.918

.856

+.062

 

NL Adjusted On-Base + Slugging Leaders – 2013 (min. 500 PA)

Player

2013 OPS (Adjusted)

2013 OPS (Actual)

Difference

Paul Goldschmidt

.883

.952

-.069

Troy Tulowitzki

.871

.931

-.060

Jayson Werth

.839

.931

-.092

Matt Holliday

.837

.879

-.042

Domonic Brown

.835

.818

+.017

Once again, our Top 2 are Encarnacion and Cabrera, with a considerably gap between Cabrera and third place Chris Davis. The AL/NL split is at its most pronounced here. To see if our trend in the biggest changes tables continues, consider the following tables.

Most Positive On-Base + Slugging Changes – 2013 (min. 500 PA)

Player

2013 OPS (Adjusted)

2013 OPS (Actual)

Difference

Darwin Barney

.712

.569

+.143

Andrelton Simmons

.790

.692

+.098

Edwin Encarnacion

.993

.904

+.089

Dan Uggla

.759

.671

+.088

Matt Wieters

.790

.704

+.086

 

Most Negative On-Base + Slugging Changes – 2013 (min. 500 PA)

Player

2013 OPS (Adjusted)

2013 OPS (Actual)

Difference

Chris Johnson

.657

.816

-.159

Joe Mauer

.736

.880

-.144

Michael Cuddyer

.779

.919

-.140

Mike Trout

.863

.988

-.125

Allen Craig

.712

.830

-.118

The trend continues as expected. Also, the negative regressers are harder hit than the positive regression candidates.

Weighted Runs Created Plus Leaders

This is FanGraphs though, so we can’t simply look at traditional stats. We need something that’s park-adjusted and comparative to league average. Let’s look at wRC+. (NOTE: These numbers aren’t adjusted for individual leagues as is normally done with wRC+. I’m lazy and didn’t take the time to do that extra step, so the wRC+ values won’t make up exactly with what is listed elsewhere on this site.)

AL Adjusted wRC+ Leaders – 2013 (min. 500 PA)

Player

2013 wRC+ (Adjusted)

2013 wRC+ (Actual)

Difference

Edwin Encarnacion

161

137

+24

Miguel Cabrera

155

180

-25

David Ortiz

155

167

-12

Coco Crisp

154

132

+22

Chris Davis

154

168

-14

 

NL Adjusted wRC+ Leaders – 2013 (min. 500 PA)

Player

2013 wRC+ (Adjusted)

2013 wRC+ (Actual)

Difference

Paul Goldschmidt

148

168

-20

Hunter Pence

142

148

-6

Andrew McCutchen

141

170

-29

Shin-Soo Choo

141

158

-17

Buster Posey

140

149

-9

As you might expect, Encarnacion and Cabrera top the charts again. Paul Goldschmidt is once again the National League leader. As for the biggest movers, they look very similar as well as you might expect.

Most Positive wRC+ Changes – 2013 (min. 500 PA)

Player

2013 wRC+ (Adjusted)

2013 wRC+ (Actual)

Difference

Darwin Barney

79

40

+39

Andrelton Simmons

127

97

+30

Dan Uggla

122

97

+25

Matt Wieters

111

86

+25

Edwin Encarnacion

161

137

+24

 

Most Negative wRC+ Changes – 2013 (min. 500 PA)

Player

2013 wRC+ (Adjusted)

2013 wRC+ (Actual)

Difference

Chris Johnson

85

135

-50

Joe Mauer

105

147

-42

Allen Craig

113

150

-37

Michael Cuddyer

88

125

-37

Mike Trout

147

183

-36

Since we looked at the leaders in each category, let’s look at those who failed to meet such lofty standards in 2013.

Batting Average Laggards

AL Adjusted Batting Average Laggards – 2013 (min. 500 PA)

Player

2013 AVG (Adjusted)

2013 AVG (Actual)

Difference

Chris Carter

.216

.223

-.007

Mike Napoli

.219

.259

-.040

Mark Reynolds

.230

.220

+.010

Michael Bourn

.232

.263

-.031

Stephen Drew

.237

.253

-.016

 

NL Adjusted Batting Average Laggards – 2013 (min. 500 PA)

Player

2013 AVG (Adjusted)

2013 AVG (Actual)

Difference

Dan Uggla

.220

.179

+.042

Starling Marte

.234

.280

-.046

Chase Headley

.235

.250

-.015

Giancarlo Stanton

.240

.249

-.009

Gregor Blanco

.241

.265

-.024

The most startling thing I notice from these tables is that Dan Uggla gained .042 points in his batting average and still finished last in the league. Now, that’s impressive.

On-Base Percentage Laggards

AL Adjusted On-Base Percentage Laggards – 2013 (min. 500 PA)

Player

2013 OBP (Adjusted)

2013 OBP (Actual)

Difference

Alcides Escobar

.287

.259

+.028

Michael Bourn

.288

.316

-.028

Manny Machado

.294

.314

-.019

Leonys Martin

.297

.313

-.015

Torii Hunter

.300

.334

-.035

 

NL Adjusted On-Base Percentage Laggards – 2013 (min. 500 PA)

Player

2013 OBP (Adjusted)

2013 OBP (Actual)

Difference

Adeiny Hechavarria

.288

.267

+.021

Chris Johnson

.289

.358

-.069

Starlin Castro

.290

.284

+.006

Zack Cozart

.294

.284

+.010

Marlon Byrd

.300

.336

-.036

Michael Bourn is our only carryover from the batting average tables that appears on the OBP tables as well. Probably not a great sign for Cleveland.

On-Base + Slugging Laggards

AL Adjusted On-Base + Slugging Laggards – 2013 (min. 500 PA)

Player

2013 OPS (Adjusted)

2013 OPS (Actual)

Difference

Michael Bourn

.609

.676

-.066

Alcides Escobar

.621

.559

+.062

Elvis Andrus

.633

.659

-.026

Jose Altuve

.643

.678

-.035

Leonys Martin

.661

.698

-.037

 

NL Adjusted On-Base + Slugging Laggards – 2013 (min. 500 PA)

Player

2013 OPS (Adjusted)

2013 OPS (Actual)

Difference

Adeiny Hechavarria

.615

.565

+.050

Eric Young

.638

.645

-.007

Gregor Blanco

.639

.690

-.051

Starlin Castro

.644

.631

+.013

Chris Johnson

.657

.816

-.158

Uh-oh, Bourn is back again, and the only player relatively close to him is Adeiny Hechavarria. Hechavarria is a fine defensive shortstop who has noted offensive woes. Bourn was a big free agent signing for Cleveland expected to jump start the Indians offense. Those represent completely different expectations.

Weighted Runs Created Plus Laggards

AL Adjusted wRC+ Laggards – 2013 (min. 500 PA)

Player

2013 wRC+ (Adjusted)

2013 wRC+ (Actual)

Difference

Alcides Escobar

64

45

+19

Jose Altuve

70

80

-10

Ichiro Suzuki

75

68

+7

Michael Bourn

77

98

-21

Elvis Andrus

81

89

-8

 

NL Adjusted wRC+ Laggards – 2013 (min. 500 PA)

Player

2013 wRC+ (Adjusted)

2013 wRC+ (Actual)

Difference

Starlin Castro

62

58

+4

Adeiny Hechavarria

68

53

+15

Nolan Arenado

70

70

0

Darwin Barney

79

40

+39

Eric Young

80

82

-2

Nothing here is meant to be used to draw conclusions about any hitters. I’m not advocating for Edwin Encarnacion as the best regular in baseball or Starlin Castro as the worst. I just thought this would be an interesting simple exercise to consider. Just for fun though, let’s look at the AL MVP race one more (“luck-independent”) time.

Statistic

Miguel Cabrera

Mike Trout

AVG

.304

.265

SLG

.584

.479

OBP

.404

.384

OPS

.988

.863

wOBA

.418

.374

wRC+

155

147

If we take luck out of the debate, Cabrera is an 8% better hitter compared to league average than is Trout. I guess the BBWAA doesn’t think Trout is an 8% better fielder and base runner than Cabrera. Surely they know what they’re talking about though. I mean they do get to decide who belongs in the Hall of Fame after all. They’re the smartest baseball folks out there.


A Brief Follow-Up on Elite RP and @Ottoneu LWTS

The debate on relievers in fantasy baseball continues to rage in our Ottoneu league – and today, RotoGraphs’ Brad Johnson joined the fray with an article on the subject inspired by our intrepid commissioner @Fazeorange.  To be fair to Brad, he stated up front that his goal was not to resolve the argument, but rather to present a framework to answering the question.  Given my trademark immodesty, I’d like to offer a high-level (and what I consider rather obvious) answer to the question.

Beyond my endless rants on the subject, this question arises from a real-world situation: one of our best and most active owners (and a former champion) has, at least on the message boards, been championing a strategy whereby he stacks his bullpen with “elite” relievers, thereby projecting an enormous bullpen advantage that not only confers extra points, but added flexibility elsewhere (because elite RPs are cheaper than elite position players or starting pitchers).  His bullpen currently includes 3 of the top 5 RPs from last year, and were LWTS scoring retrospective (like a Strat-o-Matic league, perhaps), this makes sense.  However, the question is whether it is likely his bullpen is filled with elite 2014 RPs.  Unfortunately, that is unlikely the case.

To attempt to gather data that might answer the question (if you want the entire spreadsheet, please email me), I went back and gathered the list of Top 10 projected players at each position (25 for the OF) in March 2012 and March 2013 (i.e., immediately prior to the beginning of the season).   I used ESPN rankings, though I’m not sure that the lists matter, the variation at the top is minimal each year.  I then pulled each players Ottoneu LWTS point total for that season, and compared that number to the replacement level for his position in that year.  Brad’s replacement level for RPs was 72 (on the assumption that each team rostered about 5 RPs, and perhaps a bench player – in other words, sorting by RP scoring the 72nd-ranked player).  For my positions, I defined the “replacement” player as follows:

C 18
1B 24
2B 24
SS 24
3B 18
OF 72
SP 72
RP 72

[** Interestingly, the replacement 3B in each year, finishing 18th each year, was Alberto Callaspo – at the gut level, this gives me some confidence we are defining replacement level appropriately]

Next, I summed the PAR (or, in some cases, the points below replacement) for each position.  Broken down by position and year, the results are as follows:

2012 Total PAR PAR/Player 2013 Total PAR PAR/Player
C 2706 271 C 2018 202
1B 1118 112 1B 1978 198
2B 3301 330 2B 2457 246
SS 3295 330 SS 2476 248
3B 3091 309 3B 3126 313
OF 6805 272 OF 6924 277
SP 3431 343 SP 3537 354
RP 686 69 RP 528 53

What do we see?  Surprisingly, the level of production for the top 10 projected players across the positions versus our replacement level is remarkably constant – except for RP.  If you invest in Robinson Cano, or Adrian Beltre, or Clayton Kershaw, you can be expecting 300-350 Points Above Replacement.  The only outlier here is RP – if you invest in an elite RP, you can expect to receive and extra 50-60 points from that investment.  Why?  Well, because even amongst the top 10, the flame-out rate is significant.  In 2012, here is the projected top 10, along with their actual production and PAR:

Player Points PAR
Craig Kimbrel 747 377
Mariano Rivera 79 -291
Jonathan Papelbon 582 212
John Axford 480 110
Brian Wilson 7 -363
Rafael Betancourt 449 79
Joel Hanrahan 439 69
Jose Valverde 498 128
Jason Motte 620 250
J.J. Putz 485 115

Craig Kimbrel was the #1 ranked RP and performed like it.  Mariano Rivera blew out his knee shagging flies, while Brian Wilson wrecked his elbow as pitchers do, and both delivered far below replacement-level value.  Rafael Betancourt and Joel Hanrahan managed to deliver replacement-level to slightly above performance – but presumably at elite prices.

Was 2012 an outlier?  Here is the same table from 2013:

Player Points PAR
Craig Kimbrel 722 323
Aroldis Chapman 603 204
Jonathan Papelbon 449 50
Rafael Soriano 501 102
Fernando Rodney 553 154
Mariano Rivera 547 148
J.J. Putz 215 -184
Joe Nathan 649 250
Joel Hanrahan -11 -410
John Axford 290 -109

Is there a pattern?  Sort of.  Craig Kimbrel?  Monster – go get him if you can.  3 of the 10 posted wildly below replacement level.  A couple other big names Papelbon and Rodney) managed slightly above replacement, and a couple (Chapman and Nathan) were excellent.

Of course, this is a first-level review – as always in Ottoneu, price matters, so a $2 Brian Wilson headed to Tommy John doesn’t affect things very much.  However, the question we’re trying to answer here is whether or not investing in the best RPs pre-season makes sense.  Regardless of price, in my view it doesn’t because there is such little likelihood that we can identify them if they’re not wearing a Braves jersey and closing in Atlanta.  Why does this debate rage, at least in our league?  My suspicion is that, as we look back year over year, it can be difficult to remember which relievers were expected to be elite – those sitting on Koji Uehara now can scarcely remember a time when he wasn’t atop the RP leaderboards.  Nevertheless, if we look at the numbers, RP is the one position that investing in anyone not named Craig Kimbrel makes little sense from the perspective of Points Above Replacement.

Thoughts?  Issues?  Problems with my methodology?  General screeds?  All are welcome, either though the site or via email at bill dot porter at gmail dot com.

(Also posted on my blog:  sportsbythenumbers.wordpress.com


Adjusted Quality Starts

Ah, the quality start. It’s one of several stats (along with Bill James’ Game Score and even the venerable, much-maligned pitcher win) designed to answer an age-old question: How well did the starting pitcher do his job? Most would agree that a pitcher’s job is to keep his team in the game and give his offense a good chance to win, and most would agree that even the bare-minimum quality start (6 IP, 3 ER) is at least an acceptable performance.

That said, the criteria for a quality start are pretty arbitrary, and they’ve invited a bit of criticism. Why is a six-inning, three-run performance a QS, but an eight-inning, four-run outing is not? Why do we say a pitcher had a “quality” performance if he pitched to the tune of a 4.50 ERA? What if a pitcher has a quality start through six, then blows it in the seventh inning or later?

The usual response to those criticisms is that, hey, they tend to get worked out in the aggregate. Overall, pitchers actually do post very good numbers in their quality starts. The 4.50 ERA “quality” pitcher is a myth.

Then again, if we want something that works out in the aggregate, why bother with QS? Most of the issues with plain old pitcher wins get worked out over time, too. Aren’t they a good enough proxy for quality performances?

Of course not. The idea behind quality starts is a good one. All we need is a clearer look at the question the stat is designed to answer, and we’ll have a better definition for the stat itself.

The question: “How many times did the pitcher give his team a good chance to win?”

The definition: A pitcher is awarded an Adjusted Quality Start (AQS) if he:

  1. Starts the game.
  2. Pitches at least six innings.
  3. Posts a run average (RA9) no worse than the league average.

#1 is, well, a requirement for something with the word “start” in its name. Moving on.

#2 is admittedly still arbitrary, but it’s a pretty good criterion, I think. A six-inning performance leaves only three innings to the bullpen, which isn’t all that much strain for most teams.

#3 is the change that gets to the heart of the issue. If the starter gives his team a decent number of at least league-average innings, then his teammates (assuming an average offense and average bullpen) should have at least a 50/50 shot at winning.

I use RA9 rather than ERA partly because of the well-documented issues with the definition of “earned” runs and partly because, as far as winning is concerned, it doesn’t especially matter whether a run is “earned” or not. A team that loses 4-3 because of four unearned runs still loses.

So, let’s put this metric to the test. In the American League in 2013, the league-average RA9 was 4.29, yielding three ways to post an AQS.

1) Pitch at least 6 innings, give up 2 or fewer runs.

This is by far the most common AQS because it includes all the zero-, one- and two-run starts. The top 10 in this sort of AQS were:

James Shields 20
Max Scherzer 20
Hisashi Iwakuma 19
Felix Hernandez 19
Bartolo Colon 19
Derek Holland 17
Ervin Santana 16
Anibal Sanchex 16
Justin Masterson 16
5 tied with 15

2) Pitch at least 6.1 innings, give up 3 runs.

Chris Sale was the master of the exactly-three-run AQS, as he did it 8 times in 2013. The top 10:

Chris Sale 8
Justin Verlander 7
James Shields 7
CC Sabathia 6
Jarrod Parker 6
Doug Fister 6
Yu Darvish 6
C.J. Wilson 5
7 tied with 4

3) Pitch at least 8.1 innings, give up 4 runs.

Unsurprisingly, this was by far the least common sort of AQS. It’s not often that a starter who gives up that many runs is allowed to pitch into the ninth. In fact, it only happened twice in the AL last year: CC Sabathia’s four-run, complete-game victory on June 5, and Corey Kluber’s 8.2-inning, four-run win on July 31.

Overall, your 2013 AL leaders in AQS:

James Shields 27
Max Scherzer 23
Hisashi Iwakuma 22
Chris Sale 22
Bartolo Colon 21
Doug Fister 21
Jarrod Parker 21
Justin Verlander 21
Yu Darvish 20
Felix Hernandez 20

Over in the National League, the average RA9 was a tick lower at 4.04. That’s not going to affect the two-and-fewer starts, but it’ll set the bar for the three- and four-run AQS a little higher.

1) Pitch at least 6 innings, give up 2 or fewer runs.

I doubt anyone will be surprised by the name at the top of the list.

Clayton Kershaw 22
Cole Hamels 20
Patrick Corbin 20
Jordan Zimmermann 19
Travis Wood 19
Madison Bumgarner 19
Zack Greinke 18
Gio Gonzalez 18
6 tied with 17

2) Pitch at least 7 innings, give up 3 earned runs.

Adam Wainwright 6
Cliff Lee 6
Mike Minor 4
Kris Medlen 4
Clayton Kershaw 4
Kyle Lohse 3
Cole Hamels 3
Andrew Cashner 3
Bronson Arroyo 3
13 tied with 2

(As an aside, the 6.2-inning, 3-run start just missed the cut-off in the NL, as that would be a 4.05 RA9 against a league average of 4.04. There were 22 starts that met those criteria in the NL last year, and I debated including them, but it would only make minor changes to the leaderboard. Mat Latos takes home the Just Missed Award with three such starts.)

3) Pitch at least 9 innings, give up 4 earned runs.

Only one NL pitcher pulled this one off in 2013. That was Brandon McCarthy, who gave up four in a complete-game loss on September 2.

Finally, your 2013 NL leaders in AQS:

Clayton Kershaw 26
Cole Hamels 23
Adam Wainwright 23
Cliff Lee 22
Madison Bumgarner 21
Patrick Corbin 21
Travis Wood 21
Jordan Zimmermann 21
Matt Cain 18
Zack Greinke 18
Gio Gonzalez 18
Lance Lynn 18

(If we include the 6.2-inning, 3-run starts, Gonzalez and Lynn take sole possession of ninth and tenth place with 20 and 19 AQS, respectively.)

There’s more that could be done with Adjusted Quality Starts – park factors, for instance – but that’s probably too much work for a stat that’s supposed to answer a pretty narrow question. If you want to know how often a starter kept his team in the game, this is a good, er, start.


Are $20 Million Per Season Contracts Ever Worth It?

When I saw that Clayton Kershaw signed a seven-year contract with an average salary of over $30 million per season, my first thought was: that’s definitely going to be an albatross contract. In my mind, anything over $20 million per season has always seemed to be that threshold where a player no longer has a realistic chance of performing to the value of the contract. But, I am smart enough to know that what is in my mind is not always the same as what is in reality, so this brief post will look at every player during the 2013 MLB season that collected a salary of $20 million or more. My goal was to get a rough idea of exactly how many players either outperformed, adequately performed, or underperformed their salary.

I collected data from the website baseballplayersalaries.com. In the table below, I’ve reported the names, teams, and estimated salaries of each $20 million plus player in the 2013 MLB season. I’ve also reported the percent of their team’s payroll each player received and the percent contribution that player made to the team’s on-field performance.

Table 1: Players with $20 million or greater salaries for the 2013 MLB Season

Player Name Team Salary % Team’s Payroll % Team’s On-field Performance
Alex Rodriguez New York Yankees

$28,000,000

11.76%

0.97%

Johan Santana New York Mets

$25,500,000

32.93%

0.00%

Cliff Lee Philadelphia Phillies

$25,000,000

14.80%

40.56%

CC Sabathia New York Yankees

$23,000,000

9.66%

0.97%

Joe Mauer Minnesota Twins

$23,000,000

32.15%

27.00%

Prince Fielder Detroit Tigers

$23,000,000

14.84%

3.27%

Mark Teixeira New York Yankees

$22,500,000

9.45%

-0.65%

Tim Lincecum San Francisco Giants

$22,000,000

15.19%

-2.14%

Vernon Wells New York Yankees

$21,000,000

8.82%

-0.65%

Miguel Cabrera Detroit Tigers

$21,000,000

13.55%

13.85%

Adrian Gonzalez Los Angeles Dodgers

$21,000,000

9.19%

9.09%

Carl Crawford Los Angeles Dodgers

$20,000,000

8.75%

3.86%

Barry Zito San Francisco Giants

$20,000,000

13.81%

-9.29%

Matt Kemp Los Angeles Dodgers

$20,000,000

8.75%

1.14%

Roy Halladay Philadelphia Phillies

$20,000,000

11.84%

-5.00%

Ryan Howard Philadelphia Phillies

$20,000,000

11.84%

3.33%

Matt Cain San Francisco Giants

$20,000,000

13.81%

1.79%

Justin Verlander Detroit Tigers

$20,000,000

12.90%

8.85%

 

The first thing I noticed was the number of players that underperformed their salary — 13 of 18. That’s just over 72%!

When it comes to players that I consider underperformed, there are too many to list. So, instead I’ll list the players who I consider adequately performed to their salary for the team they were on: Joe Mauer, Miguel Cabrera, Adrian Gonzalez, and Justin Verlander. That’s only 4 of 18, or about 22%. I did not include Cliff Lee in the list because he clearly outperformed his salary based on this measure. He was the only one of 18 players to do so. That translates to only 6% of players with $20 million plus salaries outperforming.

Even more staggering was when I calculated the average percent of team payroll an individual player on my list made, and compared it to the average percent of team on-field performance. The average player on the list made 14% of their team’s payroll, but only contributed to 5% of their team’s performance.

I realize that the criteria I have used is limited in many ways. For example, players on a poor-performing team (ex. Cliff Lee) will have a higher percent of on-field team performance, and vice versa. Or players on a team with a low total payroll will have a higher percentage of team payroll. However, I feel these numbers are so overwhelmingly lopsided that I’m not sure if you would be able to find any objective criteria that would show an opposite trend.

Given that these high-paid players consistently underperform their salary, an entire new set of questions arise. Why are teams still so willing to hand out these contracts? Do underperforming ‘star’ players really generate enough additional team revenue to justify their cost? What would happen if a large-market team properly valued their players?

With the precedent set by the Kershaw contract, maybe in the not-so-distant future $30 million will be the new $20 million, but as of the 2013 season a $20-million salary almost guarantees the player will not be getting the short end of the stick on that deal (in terms of performance at least).


Can Studies of Bosses Help us Figure Out How Good Sports Managers Are?

This post originally appeared on my blog Biotech, Baseball, Big Data, Business, Biology…

The world would be a simpler place, although maybe a much more boring and predictable one, if every aspect of performance could be measured directly. My completely unoriginal thought here is that one of the reasons sports appeal to so many people is because they provide clarity. In a confusing, complex world where the NSA is sucking up our information like a Dyson vacuum sucks feathers in a henhouse, and we’re told this is for our own good, clarity can be refreshing.

The simple view of an athlete’s performance is that all the accolades (or jeers), all the milestones (or flops), all the accumulated statistical totals (or lack thereof) are because of that athlete’s ability: his or her drive, passion, training, and natural ability. And that performance is measured via the statistics each sport collects and chooses to honor and promote. Performance is right there, what more do you need? What more could you want?

But much as we might find the simple view intuitive and appealing, it’s also incorrect. Not only are some of those statistics, at best, clearly crude proxies for true ability, they are also often (always?) dependent upon context. Where and to whom did that quarterback throw all those touchdown passes? Which coach directed that basketball player during the prime of her career to play in a style that complimented (or confounded) her natural tendencies and strengths? What elements of that slugger’s personal life were in shambles the year he broke into the major leagues and thrived/struggled, and what difference did it make?*

If sports analysis is moving in any direction, I like to think it’s moving towards a nuanced, humble view of sports performance that accepts the statistics, the measured performance, the team won-loss record, as proxies at best, distant cousins twice-removed from what we are most curious about: who’s good? How good? Was/were he/she/they ever the best? What does this record mean in absolute terms, if such a Platonic thing could ever exist? We might try to find better ways to measure performance and context, but we’ll always be approaching the asymptote, never quite getting there.

And if athletic performance is so hard to measure, how much harder is it to measure those whose actions are another step yet away from the statistics, the solid measurements produced on the field?

What makes a good manager or coach, and how can we tell?

This is a topic of endless debate, and for good reason. Although they are not often paid like it, there is among many a feeling that the manager or head coach is one of the key elements underlying athletic and team success. As Bum Phillips said of Bear Bryant, “He could take his’n and beat your’n, or take your’n and beat his’n.” This is maybe the ultimate expression of the belief that a coach is what makes the team what it is.

Whether that’s true or not is the question, though. It’s clearly not simple. There have been efforts in the sports analysis community to try and figure out how much coaches and managers matter although sometimes these efforts suffer a little too much from retrospective analysis. For example, “these managers’ teams had winning won-loss records, so therefore they are better managers. Let’s look at the traits they have in common and say those are the traits that let us classify managers into good and bad.” These kinds of analyses are, I believe, over-fitting the data, and it’s often not long before contrary examples pop up.

So what to do? Well, a working paper put out by the National Bureau of Economic Research showed one possible way (thanks to @freakonomics and @marketplaceAPM)

The methodology may not be completely applicable to the sporting environment, or even to most business environments but sports I think is a closer match than most because of the nature of player and coach movements (as we’ll see in a bit).

This study, by researchers at Stanford and the University of Utah, attempted to answer the question of how much bosses are worth to employee performance. And the method they used, frankly, was based on brute force. They first had to find a business situation that would offer them a huge sample size (23,878 workers, 1940 bosses, and 5,729,508 worker-day measurements of productivity) and a clearly quantifiable and electronically captured measure of productivity: technology-based services (TBS). Think of jobs like retail clerks or call center operators where specific actions are repeated and logged; the specific business that was studied remains nameless as a condition of the research. And the third characteristic that made this work is that this particular company also moves employees from boss to boss on a regular basis — in general once or twice a year.

Let me digress for just a second to expand on why this is so important. In clinical research the gold standard is the double-blinded, placebo controlled trial. Which this was not. But it’s a good deal more rigorous than an anecdotal, under-powered observational study. Essentially, their study design is a retrospective, (effectively) randomized crossover study. This allows the performance of each individual to be compared both within the period of time he or she is working for a given boss as well as across different bosses. The accumulation of so many data points allowed the researchers to build statistical models that could isolate the effect of specific bosses on performance even given the vast amounts of noise that are inherent in the day-to-day performance of these employees.

In addition, their model is designed to discovery, a priori, which bosses are best rather than relying on any information from the company under study. In other words, factors such as won-loss records and championships and media savvy don’t enter into the equation. Whether the company or the researchers are going back to their data and corroborating it now with surveys and opinions of the employee, bosses and upper management I don’t know, but that would be fascinating, wouldn’t it?

To give a very high level of summary of their work, they created a mixed model of human capital as the product of talent and effort. Each of those two elements was then further broken down into components that are and are not under the influence of one’s boss. Next, estimation methods were used to approximate the relative effect of different components within the model, including those due to the boss, based on the shape of the overall dataset.**

They uncovered several possible effects in their analysis, the primary one being that top bosses can result in about a 10% increase in the productivity of his or her group relative to the worst bosses. They also found that a good boss seemed to affect worker retention, and that there was a small but significant effect of pairing good workers with good bosses.

Generalizing the specific findings directly in any way to sports management is completely unwarranted. There are several key differences between the situation they analyzed and the team environment; these should not be overlooked, such as: the diversity of actions taken by any individual athlete in a team setting (as opposed to rote, repetitive work like taking reservations in a call center); the effect of peers in a sports environment likely being greater than the work situation described in this study (i.e., workers being generally autonomous in their tasks), and that athletes often get different bosses by moving between establishments (teams) whereas the current study examined a single company.

However, what I think is worth exploring is the question of whether a similar methodology could be applied to sports teams. Let me just say that I will not attempt an exploration myself, I’m just pointing out the possibilities. So anyone hoping for a big take-home message can stop now. Sorry for taking five minutes of your life!

Here are what I see as the requirements of a sport that would allow generation of a large and diverse enough dataset.

1) Specific measurements of output. As discussed above in the way-too-long-winded introduction, one thing sports has plenty of are measurements of output. Except soccer. What do people measure in soccer? YouTube video highlights of great runs followed by missed kicks?***

2) A large number of transitions of coaches/managers and players between situations. Fortunately in this age of free agency, trades and hot seats, there are routinely numerous changes of players and coaches/managers every season. Also fortunately, players and coaches/managers often get multiple chances with different teams and situations.

3) Enough data. This is a tough one. Off the top of my head it seems baseball and basketball are really the only sports that have enough granularity, a long enough season, and sufficient numbers of teams and players to make this work. Maybe hockey. American football, probably not.

However, it seems worth a try. To explore this idea further with baseball as an example, one could choose to isolate one component of performance such as a hitting statistic. Since we would want to measure something that is both generally agreed upon as positive and also something that stabilizes relatively quickly to best reflect effect of coaching, one could pick strikeout rate (60 plate appearances (PA)), walk rate (120 PA), or singles rate (290 PA). It should be stated up front that this means the effect of the manager only on that particular skill will be seen. Probably the entire analysis would need to be repeated for each of several offensive statistics to create a composite and granular picture of how a given manager influences players under his direction.

One could then use individual game performance as the time component of the model and collect data on that specific metric over time and relate that to which managers a given player had and for how long. The null hypothesis would be that the effect of managers would be nothing, and so the result of the model we would look for are signs that specific managers do make a substantial difference in performance by the players under his instruction compared to those same players before and after being on that manager’s team.

Is there enough data for a signal to be seen? You know, in my day job as a genomics researcher this is probably the main question I get from scientists wanting to perform an experiment: is the number of experimental subjects big enough? And my answer is always the unsatisfying, “We won’t know for sure until we do the experiment and compare the natural variation to the effect size.” Same answer here.

And why bother? Well, I go back to what I said earlier about approaching the asymptote and trying to learn more. Not just in sports, but in so many other parts of life, there are elements that right now are in the realm of intuition and anecdote and subjectivity. Who’s a good CEO? What public policy interventions do good and, more important perhaps, are the most cost effective? Wired magazine just had a nice article about the use of controlled trials to measure the actual effect of public policy interventions in the developing world. In our search to make the world a better and more understandable place, we owe it to ourselves to keep asking questions and trying to come up with ways to answer them.

*Notice here, by the way, that you can take either situation–thriving or failing–and make up a completely believable story in your head about how that player’s personal life played a role in his performance. How he rose above the conflict, or the field was his refuge, or his anger or frustration fed his on-field performance. Alternatively, how he’s a tragic figure, his potential derailed by drugs/philandering/emotions, making him an all too human and very sympathetic figure. This is because our minds are programmed to make up stories, to find cause and effect, to indulge in the narrative fallacy. Be careful of that. It will screw up your thinking faster than anything else.

**Just for fun, here’s one of the equations from their model.

Equation for effort

This roughly translates as: An individual worker (i)’s output (q)  at time t is equal to the ability of the mean worker (alpha sub zero) plus the specific worker’s innate ability (alpha sub i) plus the set of variables outside the worker-boss interaction (X sub it times capital Beta) plus the ability of an average boss (d sub 0t) divided by team size (N sub jt) to the theta power, where theta is related to public versus private time with the boss plus the ability of the current boss (d sub jt) divided by the team size to theta. This equation relates to the current effect. A longer version of the equation tries to take into account the effect of past bosses and the persistence of boss effects.

***I am reminded of one of the fine haikus inspired by the 1994 World Cup tournament in the US, source sadly lost to me although if anyone remembers it, let me know:
“Run, run, run, run, run
Run, run, run, run, run, run, run
Run, run, pass, shoot, miss.”


Joey Votto: 6-WAR Player

In spite of the ridiculous scrutiny regarding his 2013 season, Joey Votto is one of the best hitters in baseball. He also has a large contract that begins in 2014. Big contracts have seemingly brought bad baseball voodoo to some of baseball’s best (injury to Pujols noted). As Votto begins his mega-deal, both ZiPS and Steamer show him declining by over a win in WAR in 2014.  Because of how good he has been for the last four years, I found this projected decline somewhat surprising. I decided to dive into the numbers and found some really interesting things.

Votto has had four straight years of excellence. Since 2010, he has the third-best WAR among position players (25.1), the second-best wRC+ (164), the second-best wOBA (418) and the best OBP (434). Votto is at least in the conversation as the best hitter in baseball over this four-year stretch. But for some more perspective, let’s look at his last four years in detail.

Year

AVG

OBP

SLG

BB%

ISO

BABIP

wRC+

wOBA

Def

WAR

2010

324

424

600

14%

276

361

172

438

-10.1

6.8

2011

309

416

531

15.3%

222

349

157

406

-5.2

6.5

2012

337

474

567

19.8%

230

404

178

438

-2.1

5.6

2013

305

435

491

18.6%

186

360

156

400

-10.1

6.2

Quite impressive. Votto has earned over six WAR every year except 2012, the year he injured his knee and played in only 111 games. He played many of those games injured as well. The only reasonable complaints are that Votto’s power dropped some in 2013, and his defense was poor (decent for a first baseman). The extent to which Votto’s knee surgery affected his power in 2013 remains to be seen, but it isn’t like his power numbers fell off a cliff.  So what do our beloved projection systems say about Votto’s age-30 season in 2014? They say he will be good but not quite as elite as he has been.

Projection System

AVG

OBP

SLG

BB%

ISO

BABIP

wRC+

wOBA

Def

WAR

ZiPS

289

416

506

17.5%

217

334

N/A (149 OPS+)

386

3

4.8

Steamer

296

424

507

17.8%

211

341

156

400

-9.7

5.0

The encouraging part for Reds fans is that both Steamer and ZiPS think Votto’s power will tick up a little. The hope is that Votto’s knee will be healed, and he will return to the doubles machine he once was, with a few more home runs as well. But Votto is also entering his age-30 season. His best power days may very well be behind him. Or maybe not. I’ll get to that shortly.

The other numbers are similar to the four previous years with one noticeable difference. Both projection systems predict Votto’s batting average to drop below 300 for the first time since his rookie season in 2008, where he struggled to a 297 average.  The cause of this decline in batting average is our old friend BABIP. ZiPS has Votto’s BABIP dropping to 334 even though Votto has averaged a 368.5 BABIP for the last four years. BABIP can fluctuate wildly from year to year,  but Votto has shown the ability to maintain a high BABIP throughout his career. We can expect him to do a little better than these projections. If every 10 points of BABIP equals about 0.3 in WAR, Votto is likely to gain between half a win and one full win.

But the equation 10 points of BABIP=0.3 WAR is with all other stats being equal. If Votto’s BABIP is higher than these projections, he is likely to also have some more extra-base hits, including a couple more home runs. This added power would raise his value even more. Both projection systems already have his power rising from last year. Votto could have a few years of solid power left, especially if his knee is fully healthy.

This puts Votto around six-WAR territory. The other important factor will be his defense. Votto’s defense was poor in 2013 compared to his previous two seasons. He was a top-five defensive first baseman in 2011 according to FanGraphs’ Def and would have been top-three in 2012 had he played enough games to qualify. To remain a six-WAR player, Votto will likely need to return to an above-average defensive first baseman. Steamer has his defense at about the same level as 2013. I do not believe ZiPS Def adjusts for position, but it appears they think he will be average to slightly above-average defensively for a first baseman.

The Reds have much bigger problems than their superstar first baseman. They lack the ability to get on base consistently. They have serious question marks in left field and center field. The reality is that the Reds are a borderline playoff team right now and need Votto to be an elite player to have a legitimate chance of returning to the postseason. After looking at the numbers and with the prospect of a fully healthy knee, It is easy to see Votto continuing his run of excellence.


Do International Players Contribute More than Domestic Players?

I have always wondered what the contribution of international players, players signed as amateur free agents, was compared to that of domestic players, players who went through the Rule 4 Draft process. So much money is spent annually on academies in the Dominican Republic and, to a lesser extent, in Venezuela. Of course, each team has a different budget for these international operations. The Yankees’ complex in the Dominican Republic is much more extravagant than the Marlins’, for example. Regardless, a question I have always asked is what the return on investment (ROI) is for these teams, seeing as greater than 90% of the players that come through these academies don’t ever reach the big leagues or develop into true prospects.

A little background on why I am so interested in this topic: I spent a year in the Dominican Republic, initially volunteering at a successful amateur agency in San Pedro de Macoris (an hour east of Santo Domingo), then helping out with the Dominican Prospect League’s showcases and tournaments, eventually landing with the Yankees as a Player Development/Video Operations intern.

Without access to financial statements, it is nearly impossible to determine a ROI for each team. Instead, I decided to do something much more simple. I looked at the WAR contributions for each team from international players and from domestic players.

I used Baseball-reference.com for all my information, sorting position players by plate appearances and used an arbitrary minimum of 400PA in order to include players that had enough opportunity to contribute in 2013, either positively or negatively.

On the extremes, in 2013 the Cardinals, Orioles, Nationals, and Phillies all had zero international players with at least 400PA, whereas the Diamondbacks, Rangers, Tigers, and Brewers each had four international players with the minimum plate appearances. Overall, 48 international players with at least 400PA combined for 141.6 WAR in 2013. On the other side, 151 domestic players combined for 396.5 WAR. Translated into WAR per player, international players contributed a rate of 3.0WAR/player and domestic players at a rate of 2.6WAR/player.

While going through the players of each team, I realized that I am leaving out players who contributed a significant WAR even though they did not accumulate 400PA, so I decided to lower the minimum to 300PA and change the rate statistic to WAR per 600PA, instead of per player. Players such as Hanley Ramirez were previously left out due to injury. Also, players who were traded midseason and did not have sufficient playing time to post 400PA with one team were previously excluded, such as Alfonso Soriano, are now included with the lowered minimum. Here is what the new results show:

Table 1: WAR per 600PA for international and domestic players during 2013 season. Minimum 300PA. WAR values taken from Baseball-reference.com.

PA

WAR

WAR/600PA

Int’l Players

32851

154.1

2.8

Domestic Players

101805

432.8

2.6

 

 

 

 

The results show that international players contributed a slightly higher rate of WAR per 600PA in 2013. The 0.2 greater WAR/600PA is not significant enough to conclude that international players contribute more talent per PA than did domestic players.

The next question I had was to determine what percentage of players who had 300PA were international and what percentage of WAR they contributed out of the total players with 300PA. What I found was that 24% of players with at least 300PA were international and they contributed 26% of WAR out of a total of 586.9 WAR. The percentage of players that are international seem to have contributed a similar percentage of overall WAR in 2013.

One small issue I came across was that there were a handful of players that went through the draft even though they are international players. A few examples are Jose Bautista (Dominican), Edwin Encarnacion (Dominican), Yan Gomes (Brazilian), Pedro Alvarez (Dominican), and Yonder Alonso (Cuban). I decided to switch this group of players from domestic to international. Table 2 shows WAR per 600PA, while changing this group of players from domestic to international.

Table 2: WAR per 600PA for international and domestic players during 2013 season, taking into account international players who were part of Rule 4 Draft. Minimum 300PA. WAR values taken from Baseball-reference.com.

PA

WAR

WAR/600PA

Int’l Players

36495

171.9

2.8

Domestic Players

98161

415.0

2.5

 

 

 

 

The data from Table 2 shows that the gap between international and domestic players of WAR/600PA increased to 0.3, but this gap is still not significant. The question about percentage of WAR contributed changes slightly, but also not significantly. International players contribute 29% of total WAR while international players only make up 27% of total players who had at least 300PA in 2013.

In conclusion, from this short study, I cannot say that international players contributed significantly more WAR than do domestic players in 2013, but there was a difference of 0.3 WAR/600PA in favor of international players. Furthermore, 27% of players with at least 300PA were international and they contributed 29% of the total WAR in 2013 of all players with at least 300PA.

I did not look at pitchers yet, but am open to hear thoughts, criticism, and possible future directions to continue this brief study!


Ten Most Valuable Hitting Fantasy Seasons Since 1920

One of the best features of the wins above replacement (WAR) statistic is that it allows us to compare the greatest single-season performances across different eras in baseball history.  Anyone who has browsed the FanGraphs Leaders page should know that Babe Ruth had the top-five WAR seasons in history, all in the 1920s.  In terms of offensive runs added (batting and base running), Ruth’s 1921 season ranks as the best ever, followed by Barry Bonds’ 73 home run “asterisk” season of 2001.  But what about fantasy baseball?  Were these also the greatest (read most valuable) rotisserie seasons ever recorded?  That’s the question I set out to answer.

Using a slightly modified version of Zach Sander’s fantasy value above replacement (FVAR) system for valuing fantasy players, I estimated the auction value for every hitting season from 1920-2013.  First, I determined every player’s position eligibility based on some simple assumptions (meant to reflect Yahoo’s approach) whereby a player is eligible for a position if they meet any of the following criteria:

  • Played at least 20 games at the position in the previous season.
  • Started at least 5 games at the position during the current season.
  • Played at least 10 games at the position in the current season.

With that established, I proceeded to calculate the z-scores, FVAR and  auction values (FVAR$) for roto leagues.  Based on a 5×5 12-team mixed league with $260 budget per team (and quite a few other assumptions) here are the ten most valuable fantasy seasons for hitters since 1920 (5×5, 12-team mixed):

Rank

Season

Name

POS

PA

AVG

R

HR

RBI

SB

FVAR$

1

2007

Alex Rodriguez

3B

708

0.314

143

54

156

24

$56

2

1997

Larry Walker

OF

664

0.366

143

49

130

33

$55

3

1985

Rickey Henderson

OF

654

0.314

146

24

72

80

$55

4

1983

Tim Raines

2B

720

0.298

133

11

71

90

$53

5

1963

Hank Aaron

OF

714

0.319

121

44

130

31

$53

6

1988

Jose Canseco

OF

705

0.307

120

42

124

40

$53

7

1993

Barry Bonds

OF

674

0.336

129

46

123

29

$52

8

1982

Rickey Henderson

OF

656

0.267

119

10

51

130

$52

9

1921

Babe Ruth

OF

693

0.378

177

59

171

17

$51

10

1974

Lou Brock

OF

702

0.306

105

3

48

118

$51

At this point you’re probably asking: “What, A-Rod?!?!”  I know, as a Red Sox fan and sentient being I was not happy to see A-Rod at the top of the heap.  As much as you may like or dislike A-Rod in real life, if you drafted him first overall in your 2007 fantasy league you were not disappointed with his across-the-board production.  But, you might also be asking, as I did, how was A-Rod’s 2007 season worth $5 more than Babe Ruth’s 1921 season?   Ruth’s hitting and base running in 1921 added 119 runs compared to 75 runs added for A-Rod in 2007, so what gives?  As best I can tell, here are some reasons why A-Rod-2007 had a higher FVAR$ than Ruth-1921:

  • Ruth’s replacement in 1921, Ralph Miller, was much worse than A-Rod’s replacement in 2007, Melky Cabrera.
  • As a result Ruth-1921 had a much higher FVARz score than A-Rod-2007, but the average above-replacement player in 1921 also had a higher FVARz than the average above-replacement player in 2007.
  • As shown in Zach Sanders’ third post on FVAR, the auction values are a function of FVARz divided by the average FVARz for above-replacement players.  Hence, Ruth’s FVARz was divided by a larger number to calculate FVAR$.

Does this make sense?  Yes actually, I think it does.  What it means is that in 2007 A-Rod and Melky Cabrera together were worth more than Babe Ruth and Ralph Miller together in 1921.  In a fantasy auction in 1921 it would have been unwise to spend too many fake dollars on the best players like Ruth and Hornsby (or drink in public because of that Prohibition thing) because you would have been stuck with the bottom players, like Ralph Miller, who were really, really bad (there were only 16 teams back then and no DH).

For fun, below is a dream fantasy lineup with the best hitters since 1920 at each position (5×5, 12-team mixed).  Enjoy.

Order

Season

Name

POS

PA

AVG

R

HR

RBI

SB

FVAR$

1

1983

Tim Raines

2B

720

0.298

133

11

71

90

$53

2

1985

Rickey Henderson

OF

654

0.314

146

24

72

80

$55

3

2007

Alex Rodriguez

3B

708

0.314

143

54

156

24

$56

4

1927

Lou Gehrig

1B

717

0.373

149

47

175

10

$48

5

2006

David Ortiz

Util

686

0.287

115

54

137

1

$34

6

1997

Larry Walker

OF

664

0.366

143

49

130

33

$55

7

1963

Hank Aaron

OF

714

0.319

121

44

130

31

$53

8

1997

Mike Piazza

C

633

0.362

104

40

124

5

$45

9

1998

Alex Rodriguez

SS

748

0.31

123

42

124

46

$48

I’m hoping to write more posts like this using historical FVAR, especially if readers/commenters think it worthwhile.

Twitter: @FVARBaseball

Website: fvarbaseball.wordpress.com


How Would You Produce if You Sometimes Swung the Bat?

Jeff Sullivan, esteemed overlord of the Community blog, wrote a fun article this past week looking at how the average schlub like you or me would produce if given an opportunity to bat and chose to literally never swing. Pitchers, as it turns out, are surprisingly fallible when it comes to striking out realistic simulacrums of hitters, and the expected production would be nonzero by a hair. What a couple people claimed in the comments was that they could be better than Jeff’s predicted .000/.073/.000, that they could swing blindly a couple times and occasionally get on base, somehow. Maybe that was you! I call shenanigans. Swinging obviously leads to negative outcomes as well, and I think the decreased chance of a walk would outweigh the nonzero chance of a hit. I took Jeff’s frequently given advice (if you want to know more, do the research yourself) and decided to see what would happen if you did decide to swing sometimes.

Throughout this, I’m going to follow a fairly similar methodology to Jeff and use a lot of his numbers whenever possible, so we’ll assume pitchers have the same abilities to throw strikes he gave. It’s true that it might change a bit if they knew what you were planning, but Jeff assumed the pitchers thought you were at least thinking about swinging, so it’s not too big a stretch. To extend the conclusions, we have to decide a few things — when you swing, how likely you are to make contact, and what happens when you do.

In making these assumptions, I’m trying to channel what you or I, a reader of FanGraphs, would do in the batter’s box. If I decided to swing ever, it would be just that, a conscious decision, one I would probably have to make before the ball ever left the pitcher’s hand. There is no way I have the ability to determine if a pitch is a ball or a strike or at all hittable while it’s approaching me rapidly. Therefore, in our scenario, you have a predetermined chance of swinging at any pitch, ball or strike. We’ll call that swing rate, and it depends entirely on how aggressive you’re feeling.

Now we need contact percentages, and for the next few sections, I’m again going to follow Jeff’s lead by looking for some of the historically worst contact percentages ever, and adjust those down somewhat. Looking over the last six years for players with at least 20 plate appearances, there are some pretty terrible numbers for both in-zone and out-of-zone contact rates. Sean West of the Marlins and Alex Wood of the Braves both have O-Contact rates of 0.00%. The lowest non-zero rate belongs to Sean Gallagher, at 14.3%. The lowest from a position player is that of Reid Gorecki, outfielder for the Braves, at 18.8%. It seems fairly reasonable to set your O-Contact% at 10%. Z-Contact% is better, but we see some of the same names. The lowest is Rick VandenHurk, 36.8%. Sean Gallagher ranks third-worst here as well, at 55.6%. Mike Costanzo is the worst position player at 59.3%, though good ol’ Reid Gorecki isn’t far behind, the fifth-worst position player at 68.6%. 50% seems like a good round number to choose for Z-Contact%.

So now we know when you swing, and whether or not you make contact. What happens then? There are 10 players with at least 20 PAs who have a BABIP of .000, so it’s presumably possible (maybe even likely) to ground out every time we even manage to make contact. Brandon McCarthy is the lowest-nonzero rate, at .037, or one hit in the 27 times he’s put the ball in play. The lowest position player is Oswaldo Navarro, shortstop, at .063. We’ll be generous, and assume some of these awful numbers are bad luck and need to be duly regressed, and put your true-talent BABIP at .050.

The floor for your hypothetical power abilities is even lower. Lots of position players have .000 ISO, with all of Luis Durango’s 19 career hits in 74 PAs (the largest sample size among players at .000) being singles. That’s boring, however. We want at least some chance for extra bases. ISO, however, measures extra bases per at-bat, when really what we want is extra bases conditional on the ball being a hit. We can safely set that pretty low as well. Johnny Cueto has a surprising number of PAs, at 340, and the lowest nonzero ISO at .004. Of his 26 hits, 25 were singles, with one double. There are lots of similarly bad numbers, so we’ll put your chance at extra bases at .02, or once per every fifty hits. Sounds about right. They’ll all be doubles — no way you’re hitting the ball hard enough for a triple or dinger, inside-the-park or otherwise.

Now it’s time to plug and chug. (Before continuing, an aside: we’re obviously assuming unrealistic things here, like that our BABIP will be the same on pitches in the zone and on those not, and that the probabilities will be the same regardless of the count, but we’re also assuming that you’re in a major-league game, so let’s not quibble over realism.) Continuing to shamelessly appropriate Jeff’s numbers and methodology, when we step up to the plate, there’s about a .24% chance we’ll be hit in the at-bat and automatically reach base. Nice! Jeff set an expected strike-rate of 70%, and we’ll use that as well. If we didn’t swing, we walked or were hit enough for an OBP of .073. How does swinging sometimes change your production?

Oof. Not well, is the answer. You can see that while BA and SLG increase when you swing more, it’s not nearly enough to cover the increased strikes and in-play outs, and reasonable swing rates in the 25-60 percent range cause lead to some even worse wOBAs in the .020s and .030s. Interestingly, the minimum wOBA comes right around a swing rate of 45%, after which it is better to swing more no matter what your rate is, until you’re deciding to swing before literally every single pitch. They should probably learn your strategy fairly quickly, but until they do, your OBP is about .040 (you get on base once every 25 at-bats!) and your wOBA is about .036. For comparison, Jon Lester (my go-to terribly-hitting pitcher) has a lifetime OBP of .030 and a wOBA of .021.

How good would you have to be for swinging sometimes to be better than doing nothing? It doesn’t take a lot. If you can make slightly more solid contact, and we up your BABIP to .150 and the chance of a double given a hit to 15%, swinging more is a very good thing, with your wOBA topping out at .110 (!!!) when you swing 100% of the time. To be fair, though, while this is a small increase in magnitude, it’s better contact than a lot of pitchers make, so I think the original assumptions are probably closer to the truth.

So now you know. If you’re ever dropped into a baseball game somehow, remember Eddie Gaedel, and keep the bat firmly fixed on your shoulder. Because while you would be terrible, really really terrible, it would be even worse if you for some second you forgot how terrible you were and tried swinging.

PS — I put together a whole spreadsheet where you can mess with all these numbers and see how it changes the results, and if that’s something people would be interested in I could provide a link so you can make assumptions of your own.


A Hall of Fame Moment, Parsed and in Context

There were two outs. Many baseball stories, whether real or fantasy, begin with this situation. It’s when the stakes are highest for both the offense and the defense. An out means reprieve and perhaps a win for the team on the field, while reaching base means an extended opportunity for the hitters and possibly a win as well. Such stories tell moments of baseball history. They freeze the instance in time that might otherwise get lost in the accumulation of statistics, games played, and sometimes even wins and losses. Baseball moments can encapsulate a player’s entire career, or the essence of baseball at a particular time. Moments are what we remember. They are not always heroic—sometimes a great baseball moment does not capture the essence of a player as much as a quotidian aspect of the game—but they can nonetheless be what I would consider a Hall of Fame moment. While these stories will not, and should not, be part of the calculations when determining a Hall of Fame player, they have a way of remaining with the observer.

In one of my own early baseball memories, there were two outs. It was, in my estimation, a Hall of Fame moment. What makes it notable is that there were only two outs. In an April 24th game of the strike-shortened 1994 season, the Montreal Expos played a nationally-televised away game against the Los Angeles Dodgers at Chavez Ravine. In the bottom of the first inning, with a runner on first base and one out, there was a pop-up to Expos right fielder Larry Walker. Being the quality defender that he was and given the relative ease of the pop-up from a right-hander, Walker tracked down the ball for the second out. So far unremarkable, but what he did next stood out.

Walker handed the ball to a receptive young baseball fan in the front row and started trotting back to the dugout. The runner on first, Jose Offerman, tagged first and took off running. Noticing that he just turned a live ball into a souvenir, Walker ran back to the fan, took the ball, and threw it to the infield. It turns out he didn’t need to retrieve the ball. According to the rules, once the ball left the field of play the runner was awarded two bases. In the moment, though, neither Offerman nor Walker mentally flipped through the rulebook. Offerman tried to score before Walker could get the ball back in play, all while Larry Walker pilfered a material object from a fan and in the process made a memorable moment for one fan unforgettable for everyone watching.

In 1994 Larry Walker was in his fifth full major-league season. His career was off to a fantastic start. Through the 1994 season, Walker slashed .281/.357/.483. He had hit 99 home runs, stolen 98 bases, had a wRC+ of 128, and accumulated a WAR of 20.9—about seven wins more than Jose Offerman produced in his 15-year career. Not only that, but in the strike-shortened 1994 season, Walker put up numbers that would be All-Star worthy over the course of 162 games. In 452 plate appearances, he hit .322/.394/.587 with 19 home runs, 15 stolen bases, a wRC+ of 149, and put up 4.4 WAR. While WAR is the only statistic cited that includes defensive value, it should be noted that over this time he played above-average defense, had a cannon for an arm, and had already won two Gold Gloves—something a cheeky producer reminded the audience of after Walker forgot how many outs there were.

That particular play is so memorable because it showed a famous athlete, a group so often abstracted as herculean, as eminently human. Incidentally, in the time-span from 1989-1994, admittedly chosen only because of my focus on Larry Walker’s early career, the major-league leader in WAR was well ahead of not only Walker, but everyone else in Major League Baseball. At 50.8 WAR and 15 wins ahead of Hall of Famer Ricky Henderson, it would only be later that Barry Bonds’ all too human actions, including mistakes, would be cause for the indictment of a generation and withholding him and others from the Baseball Hall of Fame.

But back to the play at Chavez Ravine in 1994. It was not just a Hall of Fame moment because a right fielder who in my opinion should be in the Hall of Fame fielded a routine pop fly and brain farted it into the stands. The other part of the story is the hitter, who happens to be someone else who I believe belongs in Cooperstown: Mike Piazza. In 1994, Piazza was a year removed from a Rookie of the Year award. Earning this award is anything but a guarantee of a Hall of Fame worthy career or even years of average play. For instance, Piazza was second in a string of five consecutive National League Rookies of the Year for the Los Angeles Dodgers from 1992 until 1996. Eric Karros, Raul Mondesi, Hideo Nomo, and Todd Hollandsworth have garnered little attention and even fewer votes for the Hall of Fame, despite solid to above-average careers. Piazza, like Walker, had a shortened 1994 season that would be valuable for an entire 162-game season. In 441 plate appearances, he slashed .319/.370/.541, hit 24 home runs, had a wRC+ of 139, and accumulated 3.8 WAR.

Like Walker’s case, only the most aspirational observer would have been thinking about the Hall of Fame and Mike Piazza in April of 1994 when he popped out to right field. Also like Walker, it’s the rest of his career that makes him a Hall of Famer. Piazza accrued 7745 plate appearances and hit 427 home runs over the course of the rest of his career, which ended after the 2007 season; his triple slash was .308/.377/.545; he had a wRC+ of 140; he ended his career with 63.7 WAR. He was the best-hitting catcher of all time, and the only real debate left, as Eno Sarris compellingly demonstrates, is whether he or Johnny Bench was the best catcher of all time. Larry Walker concluded his career after the 2005 season with more plate appearances than Piazza, 8030—his final tallies were a line of .313/.400/.565, a wRC+ of 140 (the same as Piazza’s), and 69 WAR. That Walker was able to do all of this despite missing quite a bit of time due to injury should buoy his Hall of Fame chances rather than diminish them, contrary to what Denver Post writer Troy Renck suggests. Additionally, that wRC+ and WAR are park-adjusted stats should mitigate the stigma of Coors Field.

Finally, somebody had to pitch the ball to Mike Piazza for Larry Walker to run it down, catch it, and give it away—that somebody was, in fact, future Hall of Famer Pedro Martinez. It was Martinez’s second full season as a major-league pitcher, and it was his first with the Montreal Expos. Prior to the beginning of the 1994 season, the Expos traded second basemen Delino Deshields to the very same Los Angeles Dodgers for Martinez. 1994 was also Martinez’s first great season as a starting pitcher. He tossed 144.2 innings in the short season, had an ERA of 3.42, which was slightly worse than his FIP of 3.32, and he struck out 8.83 batters per nine innings, all contributing to a 3.4 WAR mark. Martinez’s 1994 season in the context of the rest of his career was a blip akin to a pop out. He ended his career after the 2009 season with a 2.93 ERA, a 2.91 FIP, 10 K/9, 87.1 WAR, and a near guaranteed first-ballot selection to the Baseball Hall of Fame when he becomes eligible next year.

That play at Chavez Ravine in April of 1994 was as extraordinary as it was banal. Pedro Martinez got Mike Piazza to pop out to Larry Walker in right field in the first inning of an April game and within a shortened season when there wouldn’t even be a World Series. At the time, they were just baseball players. They are now potential and future Hall of Famers. One will be a first-ballot selection, another will likely be elected in the next two years, and I can reasonably see the third as either being the victim of an overcrowded ballot and losing candidacy—or sneaking in somewhere around his 13th-15th year of eligibility. The pop out that resulted in the second out of an inning was a Hall of Fame moment not just due to the players involved, but because it was a story that lodged itself into the collective memory of baseball fans. Stories and moments accumulate to create baseball legacies just as much as statistics do. Viewed in context, they are counter-narratives that illustrate that great baseball moments, and the greatness of players, are told through the stories of exceptional athletes and flawed human beings. Take a look.