Looking into Differences in Exit Velocity

Statcast has revolutionized the way we look at batted-ball data. We have been spoiled with exit velocity, launch angle and so much more. After looking into this treasure trove of data, I began to wonder, how closely is a hitter’s overall production tied to their exit velocity? More specifically, I wanted to uncover whether production was tied to differences between Air EV and Ground EV. First, I calculated the difference between Air EV and Ground EV from Baseball Savant. Next, I filtered the list to only include those with at least 100 batted-ball events to not skew the sample. I also calculated AIR% by adding together LD% and FB% to see who is maximizing their contact and see who may need a change in approach.

This first chart illustrates which players have the largest difference between Air and Ground EV:

Player Difference, Air EV and Ground EV (MPH) AIR%
Byung-ho Park 17.3 58.7%
Nick Castellanos 13.6 68.6%
Brett Eibner 13.0 57.6%
Ryan Schimpf 12.9 80.4%
Mike Napoli 12.9 63.6%
Oswaldo Arcia 12.9 58.2%
Adam Duvall 12.5 66.1%
Brian Dozier 12.3 63.6%
Sean Rodriguez 12.3 60.2%
Brandon Belt 12.1 73.8%

Byung-ho Park leads the way by nearly 4 MPH, with a difference of 17.3 MPH. With the exception of Eibner, Park and Arcia, this is a list of hitters whose primary BIP type is FBs. Each of these hitters has an AIR% over 60%, with Ryan Schimpf pacing the group at an incredible 80%. With such a stark difference in EV, each these players should focus on hitting the ball in the air to maximize their overall production. For Park, Eibner and Arcia, putting the ball on the ground severely limits how often they can make harder contact. All things equal, hard contact is better than soft contact and these players should adjust their approach accordingly to maximize hard contact, which could help their overall production.

As we move on, the next chart displays players with the smallest differences in Air and Ground EV:

Player Difference, Air EV and Ground EV AIR%
Billy Burns -2.4 46.8%
Melky Cabrera -2.2 56.9%
Max Kepler -1.5 52.8%
Matt Szczur -1.4 57.4%
Martin Prado -1.2 52.6%
Jose Peraza -1 56.5%
Lorenzo Cain -1 52.7%
Ryan Rua -0.9 47.9%
Miguel Rojas -0.7 46.0%
Tyler Holt -0.5 48.0%

The speedy Billy Burns tops this list, complemented by a group of players no one will mistake for sluggers. This group comes with considerably less ceiling and overall production. Of this group, only three guys managed to post a league-average or better wRC+ (Prado, Cabrera and Peraza). Lorenzo Cain has been better in the past but was hampered by injuries this past year. Cain and the three previously mentioned provide the blueprint for how this profile can work. By spraying the ball and making enough contact, these guys maximize their limited power but have a razor-thin line between their bats being productive and unplayable.

As an aside, there was only one player who had zero difference in his EVs. The culprit? Nick Markakis, which for some reason makes perfect sense. Anyways!

So now that I have shown the extremes we can begin to answer the original question: does EV difference even matter for overall production? To find out, I ran a couple different tests. First, I took the data and divided them evenly into quarters. The results look like this:

Group Average EV Difference Average wRC+  

Best Hitter

Top 25% 9.4 102 Joey Votto
25-50% 6.3 100 Mike Trout
50-75% 4.5 95 Miguel Cabrera
Bottom 25% 1.8 94 Daniel Murphy

The top 50% of hitters with large differences in EVs hit average or slightly better. Meanwhile, hitters in the bottom 50% produced slightly below average. To give each group a face, I took the best hitter by wRC+ and here we have four elite hitters. So far we have a very minor indication that says players with larger EV differences hit better than those with smaller differences. What we do not have is a concrete reason to disqualify a hitter from being elite based on their EV differences.

Next, I took the data and plotted players’ EV Differences and wRC+ to see if there was any correlation.  The graph is about as random as it gets with an R squared value of .022. This shows that there is a relationship between EV differences and overall offensive production but nothing significant.

All things equal, you probably take the guy with the larger differences but that does not guarantee any kind of success. We now know that their differences of how hard they hit balls in the air or on the ground do not preclude them from being elite. Hitting is both art and science and what we have learned today only reinforces that hitters can have very different profiles and still have excellent results.


Ervin Santana’s Sneaky Good Career

As of right now there are only eight active pitchers with at least 150 major-league wins on their resume: CC Sabathia, Bartolo Colon, John Lackey, Justin Verlander, Zack Grienke, Felix Hernandez, Jake Peavy, and Jered Weaver. Unsurprisingly, Jon Lester is only four wins away from joining the group. Cole Hamels is 14 off the mark. With a little bit of run support from the Minnesota Twins’ juggernaut of an offense, Ervin Santana can also join this exclusive group in 2017. Without a little research, it would’ve taken me at least a couple dozen guesses before I arrived on Mr. Santana as a candidate to join this group. He has flown under the radar for years now and it is about time he got his due credit as a solidly above-average major-league starting pitcher.

The problem with Santana is when he’s bad, he’s extremely bad. His disastrous seasons in 2007, 2009, and 2012 left us wondering when his next implosion would arrive. With those seasons well in the rear-view mirror, we can look at them as anomalies rather than Ervin’s reality. His 2012 in particular looks like a result of huge misfortune. His HR/FB shot up to 18.9%, 6.1% higher than any other season of his. As a result, his HR/9 approached 2. While Ervin has always been semi-homer prone, it is safe to say that a season like his 2012 was either a fluke or could be attributed to some kind of injury.

Early in his career, what plagued him was his low GB% and high walk rate. Slowly, as his career has progressed, he has become much more of a groundball pitcher and gotten control over his ballooning walk rate. His groundball rate has been above 40% every year since 2011 and his walk rate has been around or below three walks per nine every year since 2007, something that could not be said for his first three years in the league.

He is at 25.3 fWAR for his career, which is good for 19th among active pitchers, hovering around names with a much more successful connotation such as Ubaldo Jimenez and Scott Kazmir. It is easy to look past Santana and more towards guys such as Jimenez and Kazmir because the latter have done it with much more flash. Santana’s only standout season was way back in 2008, and since then he has only posted one season above 3.0 fWAR, his 2016 season. In other words, Santana has done it with under-the-radar consistency a la Bartolo Colon.

In the hypothetical world where there exists a Hall Of Solidly Good, Ervin Santana would be a first-ballot Hall-Of-Gooder. What strikes me is how different the baseball world seems to view him from what the numbers say about him.


The Season’s Least Likely Home Run

Jeff recently ran two articles about the season’s worst and best home runs, as measured by exit velocity.  As a small addendum to that, I’d like to include both exit velocity and launch angle to try to determine the season’s least likely home run.  So how do we do such a thing?  Warning!  I’m going to spend a bunch of time talking about R code and machine learning.  If you want to skip all that, feel free to scroll down a bit.  If, on the other hand, you’d like a more in-depth look at running machine learning on Statcast data, hit me up in the comments and I’ll write some more fleshed-out pieces.

As usual, we’re going to rely heavily on Baseball Savant.  Thanks to their Statcast tool, we can download enough information to blindly feed into a machine-learning model to see how exit velocity and launch angle affect the probability of getting a home run.  For instance, if we wanted to make a simple decision tree, we could do something like this.

# Read the data
my_csv <- 'hr_data.csv'
data_raw <- read.csv(my_csv)
# Make training and test sets
library(caret)
inTrain <- createDataPartition(data_raw$HR,p=0.7,list=FALSE)
training <- data_raw[inTrain,]
testing <- data_raw[-inTrain,]
# rpart == decision tree
method <- 'rpart'
# train the model
modelFit <- train(HR ~ ., method=method, data=training)
# Show the decision tree
library(rattle)
fancyRpartPlot(modelFit$finalModel)

 

That looks like what we would expect.  To hit a home run, you want to hit the ball really hard (over 100 MPH) and at the right angle (between 20 and 40 degrees).  So far so good.

Now, decision trees are pretty and easy to interpret but they’re no good for what we want to do because (a) they’re not as accurate as other, more sophisticated methods and (b) they don’t give meaningful probability values.  Let’s instead use boosting and see how well we did on our test set.

method <- 'gbm' # boosting
modelFit <- train(HR ~ ., method=method, data=training)
# How did this work on the test set?
predicted <- predict(modelFit,newdata=testing)
# Accuracy, precision, recall, F1 score
accuracy <- sum(predicted == testing$HR)/length(predicted)
precision <- posPredValue(predicted,testing$HR)
recall <- sensitivity(predicted,testing$HR)
F1 <- (2 * precision * recall)/(precision + recall)

print(accuracy) # 0.973
print(precision) # 0.792
print(recall) # 0.657
print(F1) # 0.718

The accuracy number looks nice, but the precison and recall show that this is far from an amazingly predictive algorithm.  Still, it’s decent, and all we really want is a starting point for the conversation I started in the title, so let’s apply this prediction to all home runs hit in 2016.

Once you throw out some fairly clear blips in the Statcast data, the “winner”, with a 0.3% chance of turning into a home run, is this beauty from Darwin Barney.*  This baby had an exit velocity of 91 MPH and launch angle of 40.7 degrees.  For fun, let’s look at where similarly-struck balls in the Rogers Centre ended up this year.

* I’m no bat-flip expert, but I believe you can see more of a flip of “I’m disgusted” than “yay” in that clip.

Congrats Darwin Barney!  There are no-doubters, then there are maybes, and then there are wall-scrapers.  They all look the same in the box score, but you can’t fool Statcast.


The Brewers Will Steal More Bases than Anyone In 20 Years

Checking out the Brewers’ team dashboard from 2016, and — HOLY HELL THEY STOLE A TON OF BASES. That’s 181, to be exact. Forty-two more than anyone else in the league. The seventh-most since 1996. The craziest thing about the Brew-Crew’s stolen-base total is that they didn’t even steal as many as they could. A huge chunk of the stolen bases came from break-out star Jonathan Villar, but an even bigger chunk came from three young up-and-comers in Hernan Perez, Orlando Arcia, and Keon Broxton. These three combined for fewer than 1000 plate appearances and all are expected to be starters at the outset of the 2017 season. The Milwaukee Brewers are going to challenge the 1996 Rockies’ number of 201 stolen bases for most team stolen bases in two decades.

Sensationalist title aside, it will take a little luck for the Brewers to break the 201 mark. Jonathan Villar alone will have a hard time repeating his 62-steal output, but right now I’m going to figure out just how the Brewers can make this work.

Firstly, they’re going to need health. Have a catastrophic injury to Villar and their chances of breaking the record go out the window. Same can pretty much be said about Broxton or Perez. Injuries are never fun so I’m going to put them aside just for this exercise in the name of entertainment. In a miracle by the Brewers training staff, all of their speedsters have a clean bill of health on the season and with that play in 150+ games. Same goes for solid stolen-base contributors in Ryan Braun and Scooter Gennett.

Jonathan Villar’s 2016 turns out not to be a fluke. He comes slightly back down to earth and steals only 50 bases in 2017, mostly attributed to his lower batting average/on-base percentage. Boom. Just like there we’re a quarter of the way there. As you can see, there’s no making it past 201 without Villar. In reality, I’d be pretty confident in betting the over of 50 steals for Villar.

Next is Hernan Perez. It’d be easy to extrapolate and and say in a full season, Perez would surpass 50 steals. The problem is it is hard to believe Perez will repeat his 2016 success. If Perez manages to stay in the lineup all season, he could easily make it past 40 steals, with 50 not out of the picture. Let’s play it safe and pencil Perez in for 40 steals. Okay, we made it to 90 after just two players.

Here’s where we can have a little fun. Brewers top prospect Keon Broxton is a strikeout machine. Even with those strikeouts, he made a big splash in his rookie 2016 season, hitting nine homers, stealing 23 bases, and sporting a .354 OBP in 254 plate appearances. Although Broxton has always been a big strikeout guy, there is reason to believe he might see some improvement. His K% of over 36% is bound to fall at least a few points. Age is also on his side. With a hypothetical decrease in whiffs, we can expect an increase in his already steady on-base percentage. I’m going out on a limb and predicting 50 steals for Broxton in a breakout sophomore campaign. That’s 140. A ton needs to go perfect for the team from Wisconsin but there is at least reason to believe these players can get 140 steals between the three of them.

The speed does not stop there. The Brewers’ top prospect, Orlando Arcia, stole 23 bases combined in Triple-A and the major leagues last year, his debut season. In Double-A in 2015, he stole 25 in 129 games. The young shortstop is expected to begin the season as Milwaukee’s starter. He should easily surpass the 20-steal mark assuming he holds onto the full-time job. With the running environment afforded to him in Milwaukee, I’d expect at least 25, with room for more. Twenty-five steals would have been the most on 19 different teams in 2016. Twenty-five might be the fourth-most on the 2017 Brewers alone.

From this point on the Brewers need to steal fewer than 40 bases to surpass the 201 mark. You don’t have to look far for those steals. Ryan Braun is getting older but he still stole 16 bases last year, and 24 in 2015. Scooter Gennett stole eight bases last year and reached double-digit stolen bases three times in his minor-league career. Kirk Niewenhuis stole eight bases in fewer than 400 plate appearances last year. Domino Santana is expected to see a full year of playing time barring injury, and has breached the double-digit mark in stolen bases in his minor-league career. And then there’s the occasional catcher steal or maybe even a steal from a pitcher or two. This is also not including any off-season deals the Brewers might pull off.

Adding all of this together, somehow it is even easier to see the Brewers blowing past the 1996 Rockies’ mark of 201 steals. Of course, with predictions like this a lot of things have to go right. They are largely dependent on a few speedsters, they have to avoid injuries, and the players still have to perform well enough to even have a chance to steal their bases. All in all, the 2017 Brewers have as much, if not more, of a shot of passing the 201 mark than anyone in the last 20 years.


Travis Jankowski and a Sub-Optimal Approach

The Padres have had an interesting year both on and off the field from the GM getting suspended for hiding medical information to trying to convert Christian Bethancourt into a Swiss Army Knife. One of the least interesting things about the Padres was the on-field product. In Year 2 of Prellermania, the Padres lost 94 games and spent time stocking the farm for 2018 and beyond. At present, there are few players of interest on the Padres. The one I find most interesting, however, is Travis Jankowski.

Jankowski just delivered 2.1 WAR playing in 131 games, primarily in CF, making him a useful player. A majority of this production comes from his defense and base-running, where he racked up 8 DRS and 3.1 BsR to make up for a below-average 82 wRC+. With that said, I believe Jankowski has additional upside that teams looking to upgrade in CF could target. Jankowski has some interesting metrics once you look past his surface stats that indicate there could be more upside than initially meets the eye.

This past season, Jankowski rated sixth in GB% at 58.4%, putting him in the company of Dee Gordon and Eric Hosmer. Jankowski also paced the league in going the other way, with an Oppo% of 39.1%. It seems obvious that Jankowski’s approach is to smack the ball the other way and let his 70-grade speed do the rest. Diving into Statcast, we find that Jankowski had an average launch angle of 2.4 degrees, and an average exit velocity of 86.2 MPH, backing up what we know about Jankowski’s groundball tendencies.

Diving deeper is where it gets interesting. I found that Jankowski had an average EV of 83.5 MPH on balls above 10 degrees compared to 90.8 MPH on balls below 10 degrees. In simpler terms, Jankowski makes a majority of his hard contact on the ground. Given his status as a lefty-hit / righty-throw guy, this makes some sense as a guy whose bottom, more dominant hand pulls the bat through the zone early, making it more difficult to get the ball up in the air. Still intrigued, I wanted to find hitters with similar EVs on launch angles below 10 degrees and came up with the following:

Name Exit Velocity 2016 wRC+
Jason Heyward 91.52 72
Ben Revere 90.99 47
Tyler Saladino 90.88 93
Travis Jankowski 90.84 82
Xander Bogaerts 90.69 113
Jace Peterson 90.56 95
Alexei Ramirez 90.44 63
Cesar Hernandez 90.4 108
Martin Prado 90.35 109
James Loney 90.26 89
J.T. Realmuto 90.25 107
Denard Span 90.24 96
Average 90.6 90

 

This is an interesting group of players, headlined by superstar Xander Bogaerts and the solid Martin Prado, and yet it also includes the disappointing Jason Heyward and the DFA’d Alexei Ramirez. If anything, it shows how razor-thin the margin is for players with this type of profile.

Next, I looked at the same group’s batted-ball profile and took the average of their hit type and hit distribution to compare to Jankowski’s and came up with the following:

 Name LD% GB% FB% PULL% CENT% OPPO%
Jankowski 26% 58% 16% 24% 37% 39%
Average 22% 51% 27% 38% 36% 27%

 

Looking at their batted-ball distributions, Jankowski stands as somewhat of an outlier in this group. Most notable is his how he rarely pulls the ball in favor of going the other way. With these tendencies, Jankowski is actually depressing his own value with the bat. When Jankowski goes the other way, he has an average EV of 84.5 MPH. Compare this to pulled balls, where his average EV is 89.1 MPH. In most cases, pulled balls are always going to be hit harder, but for someone as extreme as Jankowski, the opposite-field approach may be suppressing his overall offensive production. If Jankowski shifts his approach to drive more balls with authority to the pull side, he could push his bat closer to the league-average mark. As noted, Jankowski is a 70 runner and the additional chances on base would only serve to increase his base-running value. With these changes we are looking at a potential league-average bat from a guy who already has above-average defensive and base-running skills. This would be an insanely valuable piece. As we have seen, the trade market does not value defensive value the same as offense, so the acquisition price shouldn’t be prohibitive.

To have maximum value, Jankowski has to play CF, but with top prospect Manny Margot about ready to take over the position full-time, the Padres may deem Jankowski expendable. For teams not willing to pay the prospect price for Charlie Blackmon or not wanting to see Yoenis Cespedes play CF again, Jankowski represents an under-the-radar acquisition that could be had for a reasonable price. Given his skillset, Jankowski shouldn’t rack up the traditional counting stats rewarded in arbitration, and he could provide excellent value throughout his years of control. The Padres as a whole may not be very interesting but they have an interesting player in Travis Jankowski, who could provide immense value to a team with the foresight to acquire his services.


Hardball Retrospective – What Might Have Been – The “Original” 1979 Mets

In “Hardball Retrospective: Evaluating Scouting and Development Outcomes for the Modern-Era Franchises”, I placed every ballplayer in the modern era (from 1901-present) on their original team. I calculated revised standings for every season based entirely on the performance of each team’s “original” players. I discuss every team’s “original” players and seasons at length along with organizational performance with respect to the Amateur Draft (or First-Year Player Draft), amateur free agent signings and other methods of player acquisition.  Season standings, WAR and Win Shares totals for the “original” teams are compared against the “actual” team results to assess each franchise’s scouting, development and general management skills.

Expanding on my research for the book, the following series of articles will reveal the teams with the biggest single-season difference in the WAR and Win Shares for the “Original” vs. “Actual” rosters for every Major League organization. “Hardball Retrospective” is available in digital format on Amazon, Barnes and Noble, GooglePlay, iTunes and KoboBooks. The paperback edition is available on Amazon, Barnes and Noble and CreateSpace. Supplemental Statistics, Charts and Graphs along with a discussion forum are offered at TuataraSoftware.com.

Don Daglow (Intellivision World Series Major League Baseball, Earl Weaver Baseball, Tony LaRussa Baseball) contributed the foreword for Hardball Retrospective. The foreword and preview of my book are accessible here.

Terminology

OWAR – Wins Above Replacement for players on “original” teams

OWS – Win Shares for players on “original” teams

OPW% – Pythagorean Won-Loss record for the “original” teams

AWAR – Wins Above Replacement for players on “actual” teams

AWS – Win Shares for players on “actual” teams

APW% – Pythagorean Won-Loss record for the “actual” teams

 

Assessment

The 1979 New York Mets 

OWAR: 50.7     OWS: 262     OPW%: .479     (78-84)

AWAR: 24.8      AWS: 188     APW%: .389     (63-99)

WARdiff: 25.9                        WSdiff: 74  

The “Original” 1979 Mets ended the season in the cellar, yet the club outpaced the “Actuals” by fifteen victories! Ken Singleton earned runner-up status in the MVP balloting on the strength of a .295 BA with 35 circuit clouts and 111 ribbies. Lee “Maz” Mazzilli (.303/15/79) nabbed 34 bags and merited his lone All-Star appearance. Tim Foli set personal-bests in batting average (.288), base hits, runs and RBI. John “The Hammer” Milner contributed a .276 BA with 16 jacks while splitting time between left field and first base. “Actuals” right fielder Joel Youngblood posted a .275 BA and raked 37 doubles. Richie “The Gravedigger” Hebner added 25 two-base knocks and drove in 79 baserunners.

Tom Seaver and Nolan Ryan rated sixth and twenty-fourth, respectively, among pitchers in the “The New Bill James Historical Baseball Abstract” top 100 player rankings. “Original” Mets teammates registered in the “NBJHBA” top 100 ratings include Ken Singleton (18th-RF) Paul Blair (66th-CF) and Bud Harrelson (88th-SS). “Actuals” third baseman Richie Hebner ranked fifty-sixth while center fielder Jose Cardenal placed seventh-sixth.

  Original 1979 Mets                                  Actual 1979 Mets

STARTING LINEUP POS OWAR OWS STARTING LINEUP POS AWAR AWS
John Milner LF 1.8 13.03 Steve Henderson LF 2.18 11.79
Lee Mazzilli CF 3.56 24.14 Lee Mazzilli CF 3.56 24.14
Ken Singleton RF 4.49 31.68 Joel Youngblood RF 3.75 17.31
Mike Jorgensen 1B -0.09 2.56 Willie Montanez 1B -1.71 2.45
Bud Harrelson 2B 0.55 3.1 Doug Flynn 2B -1.92 6.85
Tim Foli SS 1.88 17.19 Frank Taveras SS -0.83 11.83
Ted Martinez 3B -0.34 1.38 Richie Hebner 3B 2.32 14.43
Alex Trevino C 0.36 5.04 John Stearns C 1.28 10.89
BENCH POS OWAR OWS BENCH POS AWAR AWS
Joe Nolan C -0.02 3.57 Alex Trevino C 0.36 5.04
Jerry Morales RF -1.96 3.43 Elliott Maddox RF 0.67 4.88
Duffy Dyer C 0.11 3.21 Dan Norman RF -0.1 2.22
Benny Ayala LF 0.3 3.01 Jose Cardenal RF 0.36 1.99
Paul Blair CF -1.12 1.41 Ron Hodges C -0.24 1.14
Ron Hodges C -0.24 1.14 Ed Kranepool 1B -0.58 0.86
Ed Kranepool 1B -0.58 0.86 Kelvin Chapman 2B -0.7 0.67
Kelvin Chapman 2B -0.7 0.67 Gil Flores RF -0.36 0.34
Bruce Boisclair RF -0.88 0.29 Bruce Boisclair RF -0.88 0.29
Ike Hampton 1B 0.03 0.19 Sergio Ferrer 3B -0.1 0.16
Roy Staiger 3B 0.06 0.17 Tim Foli SS -0.08 0.1

Jerry Koosman reached the 20-win plateau for the second time in his career. Tom “The Franchise” Seaver (16-6, 3.14) led the National League with 5 shutouts and finished fourth in the Cy Young Award balloting. Nino Espinosa delivered 14 victories with a 3.65 ERA. Nolan Ryan aka the “Ryan Express” tallied 16 victories and struck out 223 batsmen. Craig Swan augmented the “Originals” and “Actuals” rotation with 14 wins and a 3.29 ERA after securing the National League ERA title during the previous campaign.

  Original 1979 Mets                                  Actual 1979 Mets 

ROTATION POS OWAR OWS ROTATION POS AWAR AWS
Jerry Koosman SP 6.06 22.76 Craig Swan SP 3 15.36
Tom Seaver SP 3.68 16.04 Kevin Kobel SP 1.16 7.87
Craig Swan SP 3 15.36 Pete Falcone SP 0.49 6.15
Nino Espinosa SP 2.15 14.6 Tom Hausman SP 1.69 5.95
Nolan Ryan SP 2.88 13.52 Andy Hassler SP 0.54 4.87
BULLPEN POS OWAR OWS BULLPEN POS AWAR AWS
Neil Allen RP 0.19 6.26 Skip Lockwood RP 1.89 6.86
Tug McGraw RP -1.53 4.62 Neil Allen RP 0.19 6.26
Jeff Reardon RP 0.29 2.33 Ed Glynn RP 0.67 4.5
Roy Lee Jackson RP 0.43 1.77 Jeff Reardon RP 0.29 2.33
Dwight Bernard RP -0.51 0.44 Dale Murray RP -1.34 1.87
Steve Renko SP 2.68 11.18 Pat Zachry SP 0.28 2.94
Jim Bibby SP 2.85 11.06 Juan Berenguer SP 0.35 1.84
Ed Figueroa SP 0.98 5.38 Roy Lee Jackson RP 0.43 1.77
Jon Matlack SP 0.81 4.31 Ray Burris SP 0.13 0.85
Juan Berenguer SP 0.35 1.84 Wayne Twitchell RP -1.31 0.84
John Pacella SP 0.05 0.33 Jesse Orosco RP -0.33 0.57
Kim Seaman RP 0.05 0.29 Dwight Bernard RP -0.51 0.44
Jackson Todd RP -0.64 0.01 John Pacella SP 0.05 0.33
Mike Scott SP -0.83 0 Dock Ellis SP -1.6 0
Mike Scott SP -0.83 0

 Notable Transactions

Ken Singleton 

April 5, 1972: Traded by the New York Mets with Tim Foli and Mike Jorgensen to the Montreal Expos for Rusty Staub.

December 4, 1974: Traded by the Montreal Expos with Mike Torrez to the Baltimore Orioles for Bill Kirkpatrick (minors), Rich Coggins and Dave McNally. 

Jerry Koosman 

December 8, 1978: Traded by the New York Mets to the Minnesota Twins for a player to be named later and Greg Field (minors). The Minnesota Twins sent Jesse Orosco (February 7, 1979) to the New York Mets to complete the trade. 

Tom Seaver

June 15, 1977: Traded by the New York Mets to the Cincinnati Reds for Doug Flynn, Steve Henderson, Dan Norman and Pat Zachry.

Nino Espinosa

March 27, 1979: Traded by the New York Mets to the Philadelphia Phillies for Richie Hebner and Jose Moreno.

Nolan Ryan

December 10, 1971: Traded by the New York Mets with Frank Estrada, Don Rose and Leroy Stanton to the California Angels for Jim Fregosi.

Honorable Mention

The 2012 New York Mets 

OWAR: 27.7     OWS: 262     OPW%: .492     (80-82)

AWAR: 24.1       AWS: 221      APW%: .457    (74-88)

WARdiff: 3.6                        WSdiff: 41

The “Original” 2012 Mets placed third, fourteen games in arrears to the Nationals. David “Captain America” Wright (.306/21/93) raked 41 two-base hits and received his sixth All-Star invite. Angel “Crazy Horse” Pagan topped the circuit with 15 triples and set career-highs with 38 two-baggers and 95 runs scored. Jose B. Reyes swiped 40 bags and rapped 37 doubles while double-play partner Daniel Murphy contributed a .291 BA with 40 two-base knocks. Nelson R. Cruz nailed 45 doubles and jacked 24 round-trippers. First-sacker Ike B. Davis established personal-bests with 32 taters and 90 ribbies. A.J. Burnett paced the starting staff with 16 victories along with a 3.51 ERA and 180 strikeouts.

On Deck

What Might Have Been – The “Original” 2013 Marlins

References and Resources

Baseball America – Executive Database

Baseball-Reference

James, Bill. The New Bill James Historical Baseball Abstract. New York, NY.: The Free Press, 2001. Print.

James, Bill, with Jim Henzler. Win Shares. Morton Grove, Ill.: STATS, 2002. Print.

Retrosheet – Transactions Database

The information used here was obtained free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at “www.retrosheet.org”.

Seamheads – Baseball Gauge

Sean Lahman Baseball Archive

 


The Real Best Reliever in Baseball

The best relief pitcher in baseball is not who you think he is. Most of you probably would not even include him in the top 10. If I were to take a poll on who is the best relief pitcher in baseball, the top voted would likely be Zach Britton, Dellin Betances, Aroldis Chapman, Kenley Jansen, and Andrew Miller. I will say that it is none of them. To illustrate my point, I will compare this mystery pitcher’s numbers to all of their numbers. Nothing too scary, just xFIP, K/9, and ERA. I also will not just tell you which pitcher produced which numbers. Where would be the fun in that? I will compare the numbers of all six pitchers and walk you, the reader, through determining which one is the best.

Pitcher A: 1.18 xFIP; 14.89 K/9; 1.45 ERA
Pitcher B: 1.92; 13.97; 1.55
Pitcher C: 1.17; 16.84; 1.16
Pitcher D: 1.75; 15.53; 3.08
Pitcher E: 2.41; 13.63; 1.83
Pitcher F: 2.09; 9.94; 0.54

At first glance, Pitcher F’s ERA of 0.54 is likely what stands out most. Alas, even calling him only by a letter cannot mask Britton. He has the lowest K/9 by far and the second-highest xFIP, so Britton is effectively taken out of consideration.

Pitcher D has an ERA over a run higher than any of the others. His K/9 and xFIP fit in the range but do not stand out. Thus, Dellin Betances is out as well.

Of the remaining four, Pitcher E rates the worst in each of the three categories. Goodbye, Kenley Jansen.

That leaves us with Pitcher A, Pitcher B, and Pitcher C. In this group, B is the worst across the board. Aroldis Chapman leaves the conversation.

Pitcher C is better than Pitcher A in all three statistics. Andrew Miller bows and exits.

Carter Capps stands victorious.

Yes, I know Capps did not pitch in 2016. I used his 2015 numbers. They stack up just as well against the elite relievers from that year as well. It is true that Capps pitched only 31 innings in 2015, but the stats I used are rates. Maybe a larger sample would have dragged him into mediocrity, but I doubt it. Capps was ahead of the field by such a large margin that even with regression in his 2017 return he would be #1.

I am crazy for saying Carter Capps is the best relief pitcher in baseball. Or am I, really? If Capps pitches as well in 2017 as he did in 2015, just over a larger sample, I believe many of you will agree with me. Some of you may even agree with me after reading this.

So, let me be the first to say it: Carter Capps is the real best relief pitcher in baseball.


Do Teams that Strike Out a Lot Steal More Bases?

This is a question that intuitively would seem to be answered by: Sure, why not?  The assumption was recently made in the comments section of this article by an FG writer:

Think about it — if you are Rougned Odor and you are on first base and, say, Joey Gallo is at the plate, there’s a good chance he’s going to cool down the stadium with some high-powered fanning.  He’s not exactly known as a high-contact guy.  There’s a roughly one-in-three chance that his at-bat is going to end in a backwards K sign being held up by someone in the stands.  So ‘Ned might decide this is a good time to steal because the ball isn’t likely to be put into play in the air, where, if caught, he would have to double back to tag.  Maybe he’s also thinking that, like Brad Johnson alluded, the break-even point for a steal (famously ~75% success rate as calculated by Bill James in Moneyball, ~66% in this more recent FG article) is lower if the guy at the plate is likely to cause an out, specifically a strikeout which normally doesn’t allow a runner to advance like a bunt, grounder or long fly might.

On the other hand, maybe Odor doesn’t have such a cynical view of Gallo, and doesn’t change his mindset on the basepaths.  Maybe he doesn’t try to assume what Gallo might do, so he doesn’t go for any more risky of a steal than he otherwise might.  So maybe he isn’t stealing at a higher rate than normal if the guy at the plate is a K machine.  Heck, maybe Joey Gallo is a specifically bad example here, because, though he does whiff a lot, he also hits a lot of home runs, which might cause a runner to take fewer risks when waiting on the outcome of his plate appearance.

So, let’s looks at what the numbers have to say.  I ran a simple correlation analysis between team stolen-base totals and team K%.  Here’s what I got:

So, no real correlation to be seen here.  But perhaps that shows that it could be a market inefficiency.  In 2016, the Brew Crew led the league in both K% and stolen bases.  Even without John Villar’s big SB season, they are a top-five SB team.  Below is a chart from last year — in yellow are the top five teams in both total SBs and K%.

Perhaps the Rays should have been trying to steal some more?  Though some of these anomalies could just simply be explained by personnel issues — maybe teams like the Orioles just have no one who can steal on the entire squad?

Here’s the same chart, for 2015, just for sugar and giggles:

For the Astros, this is starting to look like a trend — Orioles too.  I think my final answer to the question posited by this post is — Hmm, not sure exactly.  But maybe?


Don’t Tread On Dyson

Browsing through the unqualified FanGraphs WAR leaders for 2016, one may come across what seems like an anomaly at No. 69. Just ahead of certified breakout stars such as Jonathan Villar and Trevor Story, grizzled veterans such as Asdrubal Cabrera and Troy Tulowitzki, and an All-Star catcher in Yasmani Grandal, sits Royals back-up outfielder Jarrod Dyson, at 3.1 fWAR in just 337 plate appearances. If a well-educated-baseball individual were asked to name this mystery outfielder who placed just above these solidly above-average everyday players, Jarrod Dyson wouldn’t be one of the first 30 outfielders most would name. How did Dyson make it so far up this list? And what is he doing rotting on the bench behind the likes of Paulo Orlando, or even the corpses of Alex Gordon and Lorenzo Cain for that matter?

Jarrod Dyson ended up being the most valuable Royal this year. Even more valuable than Danny Duffy and Salvador Perez, despite having the ninth-most plate appearances on the team. So what’s the problem? The problem is Dyson’s profile is far from sexy. He owns a career .325 OBP and only seven home runs in over 1500 plate appearances. His wRC+ is below average for an American League outfielder at 86. Where Dyson extracts his value is in his defense and baserunning, two ways of evaluating a player that are still slow to catch on.

Since Dyson starting seeing semi-regular playing time in 2012, he ranks fourth in FanGraphs BsR behind Mike Trout, Billy Hamilton, and Rajai Davis, all of whom had more plate appearances than Dyson. If stolen bases are your cup of tea, Dyson ranks sixth since 2012, behind five guys who all had more plate appearances. The numbers are there to show how great of a baserunner Dyson is; the problem is getting front offices to realize just how valuable baserunning can be, especially when it comes to a player like Dyson who owns a decent, but not great career OBP.

It doesn’t stop at Mr. Dyson’s baserunning. If the Royals don’t use him as a pinch-runner off the bench, he is used as a defensive replacement. Obviously the Royals think highly of Dyson’s defense, and the numbers agree. Of the outfielders with at least 1000 innings, Dyson ranks fourth since 2012 in FanGraphs’ UZR/150. Even in limited playing time, competing against some who have played twice as many innings, Jarrod ranks 15th in FanGraphs’ defensive value. Jason Heyward, the man who just signed an eight-year, $184-million contract last offseason, is the only other player who ranks in the top 15 in both baserunning and outfield defense according to FanGraphs. What’s perplexing about this is that it’s not as if Heyward is a slugger on top of his outstanding defense and baserunning; he would only be considered a slightly above-average hitter by most measurements. So why isn’t Dyson considered in the same vein as Heyward? Sure, Jason Heyward, former first-round pick and All-Star, has more of a track record, but Jarrod Dyson should at least have been given a chance to start by this point.

Jarrod Dyson shows there is still progress to be made on the analytics front. The inexplicable handling of Dyson can be attributed to a mistrust in advanced statistics. If we are going to consider Mike Trout to be the best player in baseball based on metrics such as WAR, then players such as Dyson need to be given the same consideration. What separates Mike Trout from David Ortiz, Miguel Cabrera, and Josh Donaldson is what makes Jarrod Dyson at least an above-average starting outfielder, if given the chance.


Bucking the Trends

As Cubs fans and non-Cubs fans alike celebrate the end of the 108-year drought, we have overlooked the fact that in winning, the Cubs also bucked two trends in major league baseball:

  1. 100+ win teams struggle in the postseason and rarely win the World Series, especially since the wild-card era began in 1995
  2. Losers of the ALCS and NLCS (Cubs lost 2015 NLCS) historically decline the following season, both in win total and playoff appearance/outcome

Below is a table to quantify a team’s performance in the playoffs:

Playoff

Result

Playoff Result Score
Win WS 4
Lose WS 4-3 3.75
Lose WS 4-2 3.5
Lose WS 4-1 3.25
Lose WS 4-0 3
Lose LCS 4-3 2.75
Lose LCS 3-2* 2.666666667
Lose LCS 4-2 2.5
Lose LCS 3-1* 2.333333333
Lose LCS 4-1 2.25
Lose LCS 4-0 or 3-0* 2
Lose LDS 3-2 1.666666667
Lose LDS 3-1 1.333333333
Lose LDS 3-0 1
Lose Wild Card Game 0.5
Miss Playoffs 0

*The LCS was a best-of-five-game series from 1969 through 1984

It is important to acknowledge how close a team comes to winning a particular round. Based on a 0 to 4 scale, with 0 indicating the team missed the playoffs and 4 indicating the team won the World Series, the table credits fractions of a whole point for each playoff win. For example, in a best-of-seven-game series, each win (four wins needed to clinch) is worth 0.25. In a best-of-five-game series, each win (three wins needed to clinch) is worth 0.333 (1/3). Any mention of playoff result or average playoff result in this article is derived from this table.

THE STRUGGLE OF 100+ WIN TEAMS IN THE POST-SEASON

Playoff baseball, due to its small sample size and annual flair for the dramatic, historically has not treated exceptional regular season teams well. Jayson Stark recently wrote an article for ESPN titled, “Why superteams don’t win the World Series.” He noted that only twice in the first 21 seasons of the wild-card era had a team with the best record in baseball won the World Series (1998 and 2009 Yankees). Those two Yankee teams are also the only two 100-win ball clubs in the wild-card era to win the World Series. Research in this article will span the years 1969 to 2015, with 1969 being the first year of the league championship series (LCS).

Entering the 2016 season there had been 47 100+ win teams since the start of the 1969 season. Of those, 10 (21.3%) won the World Series. Other than those 10 World Series winners, how did 100+ win teams fare in the post-season?

Below are the average playoff results for 100+ win teams in each period of the major league baseball playoff structure from 1969 to 2015. The playoff structures were as follows:

1969-1984: LCS (best of 5 games) + World Series (best of 7 games)

1985-1993: LCS (best of 7 games) + World Series (best of 7 games)

1995-present: LDS (best of 5 games) + LCS (best of 7 games) + World Series (best of 7 games)

The wild-card game (2012-present) is omitted because a 100+ win team has yet to play in that game, although it certainly would be rare if we ever see a 100+ win team playing in the wild-card game.

Teams Average Playoff Result WS Titles % WS Titles
1969-1984 18 3.07 7 38.9%
1985-1993 7 2.75 1 14.3%
1995-2015 22 2.27 2 9.1%
1969-2015 47 2.65 10 21.3%

As the data shows, 100+ win teams during the 1969-1984 period on average made a World Series appearance. This could be partly due to the fact there was only one round of playoffs (the LCS) ahead of the World Series, with the LCS being a best of five games. It was certainly a much easier path to the World Series once a team made the playoffs, yet on average 100+ win teams were finishing with a World Series sweep.

Changing the LCS from a best-of-five-game series to a best-of-seven-game series had a negative impact on team post-season performance, as 100+ win teams during the 1985-1993 span on average lost a deciding Game Seven in the LCS.

When the league added the wild card and LDS in 1995, it expanded the opportunity to make the playoffs but made the path to a World Series title more difficult, for a team now had to win 11 games to hoist the trophy. In the wild-card era, 100+ win teams are on average losing 4-1 in the LCS. This period also has the lowest percentage of 100+ win teams winning the World Series.

Average Playoff Result Likelihood to Win WS
1969-1984 3.07 25.3%
1985-1993 2.75 19.4%
1995-2015 2.27 6.8%
1969-2015 2.65 17.1%

Using average playoff result standard deviation and a normal distribution, we can also see that the likelihood of a 100+ win team to win the World Series has had a significant decrease over the past several decades, left at under 7% during the wild-card era. The longevity of 100+ win teams in the playoffs has been trending downward over the past several decades. Despite being on the verge of a World Series defeat, the Cubs were able to successfully break through and buck a trend that had haunted outstanding regular-season teams for decades, especially since the wild-card era began in 1995.

THE CURSE OF THE LCS DEFEAT

The 2015 Cubs lost to the Mets in the NLCS yet bounced back in 2016 to have an even better regular season and win the World Series. This, however, was a rare feat. Teams that lose in the LCS historically win fewer regular-season games and perform worse on average in the post-season (if they make it) the following year. Below are two charts (1969-2015 and 1995-2015) that display average win differential, average playoff result, likelihood win differential is greater than +5 (2016 Cubs were +6), and the likelihood of winning the World Series.

1969-2015 American League National League MLB
Average Win Differential -7.27 -5.73 -6.5
Average Playoff Result 1.02 1.07 1.05
Likelihood Win Differential is >(+5) 13.7% 13.7% 13.8%
Likelihood to Win WS 2.9% 2.7% 2.8%
1995-2015 American League National League MLB
Average Win Differential -5.42 -2.32 -3.87
Average Playoff Result 1.00 1.46 1.23
Likelihood Win Differential is >(+5) 18.1% 21.6% 20.0%
Likelihood To Win WS 1.4% 5.2% 3.2%

Due to the 1981 and 1994 strikes, a few data points for win differential and playoff result are not included in the calculation. The data set includes 82 LCS losers for win differential and 88 LCS losers for average playoff result. The 1980-81, 1981-82, 1993-94, 1994-95, 1995-96 win differentials are not included for LCS losers in both leagues. The 1994 and 1995 playoff results are not included for LCS losers in both leagues because there was no post-season in 1994, hence no LCS loser. Regardless, there is a notable trend among LCS losers to perform worse the following season.

The 2016 Cubs not only won six more regular-season games than in 2015, but they became only the seventh team in history to lose the LCS one season and win the World Series the following season (1971 Pirates, 1972 Athletics, 1985 Royals, 1992 Blue Jays, 2004 Red Sox, 2006 Cardinals). Two of the previous six teams repeated as champions: 1973 Athletics and 1993 Blue Jays. Most recently, the 2005 Red Sox lost 3-0 in the ALDS and the 2007 Cardinals failed to make the playoffs.

LOOKING FORWARD

The Cubs have already been pegged favorites to win the 2017 World Series, which isn’t surprising given the fact nearly every key player is under team control. Is history on their side? Winning back-to-back titles is difficult in today’s competitive league, as new baseball thinking has somewhat evened the playing field and the small sample size of post-season baseball has the ability to lend unexpected results.

The 10 100+ win teams who have won the World Series since 1969 historically have not been successful in their attempts for back-to-back titles. Below are the average win differentials and average playoff result for these teams in the season following their championship:

Win Differential From 100+ Win WS Team Playoff Result
1970 Mets -17 0
1971 Orioles -7 3.75
1976 Reds -6 4
1977 Reds -14 0
1978 Yankees 0 4
1979 Yankees -11 0
1985 Tigers -20 0
1987 Mets -16 0
1999 Yankees -16 4
2010 Yankees -8 2.5
Average -11.5 1.83

Only three of these 10 teams (1975-76 Reds, 1977-78 Yankees, 1998-99 Yankees) have repeated as champions. Can the 2017 Cubs be the fourth? No matter the numbers, the 2017 Cubs still have to perform on the field. They were on the brink of losing the World Series in 2016, so we must not take anything for granted. But despite this, there’s no doubt the 2017 Cubs will be in a good position for a repeat. The Cubs are expected to be MLB’s best regular season team in 2017, according to FanGraphs and Jeff Sullivan’s analysis in his November 11, 2016 article. Only time will tell.