Archive for Research

David Price Is About to Go Off

View post on imgur.com

On June 25, this was David Price’s tweet to family, friends and fans.  It was a clear signal that he knew the patience of the Boston fans and media was wearing thin.

Fast forward to the All-Star break and his “Made for TV” stats (those that casual fans know best) are underwhelming: a 9-6 record with a 4.34 ERA, which is worse than the MLB average of 4.23.  It’s not so much his ERA that’s the problem to fans, but more his inability to be consistent from start to start.  Price has three starts of six-plus innings allowing two or fewer runs, but also has four starts of allowing six or more runs.  With the rest of the rotation producing an atrocious 4.86 ERA, the Sox desperately needed Price to be the one to stop the bleeding, something he hasn’t been able to do.  But that doesn’t mean his underlying skills have deteriorated and all of a sudden he’s become a league-average pitcher.  In fact, the advanced metrics say he’s been extremely unlucky and that he’s due for a big second half. 

View post on imgur.com

* Rank is solely being used to establish a baseline for Price as a top 10 pitcher.

In 2014 and 2015 combined, Price was ranked in the top 10 of all pitchers in four of the skill-based statistics: K%, BB%, xFIP and SIERA (the latter two being ERA estimators with a weighting towards more pitcher-controlled outcomes).  Through the 2016 All-Star break, Price has maintained or improved his top-10 rank in K%, xFIP and SIERA but dropped a few spots in walk rate.  Despite the move from 9th to 10th in K% rank, his K rate is actually up from 26.2% to 27.1%.  The reason for the drop in rank is that 2016 newcomers to the list Jose Fernandez, Noah Syndergaard and Drew Pomeranz did not meet the minimum innings qualifier for the 2014/2015 combined list.  On the flip side, Price’s xFIP and SIERA are higher than they were the past two years, but he has improved his ranking versus his peers.  This is because xFIPs and SIERAs are both up 10% league-wide versus last year (due to all the home runs being hit) while Price’s increases are smaller.

So what is happening?  If his base skills are fine, why is his ERA so high and his performance so inconsistent?

View post on imgur.com

So everyone is familiar with ERA and can easily infer that 4.34 is no bueno for a $217-million pitcher.  But there is a reason these stats are labeled “Non Skill-Based” — that’s because these stats are influenced by factors outside of the pitcher’s direct control (defense, luck, sequencing, variance, etc…) and therefore have wide variability over small samples.  Three of these stats (HR/FB%, BABIP and LOB%) explain why David Price is a great rebound candidate for the second half.

HR/FB%

Price’s current HR/FB (home runs per fly ball) rate is 15.2% — which is good for being ranked 76th out of 97 qualified starting pitchers.  The past two years combined he ranked 19th.  To put this in context, Price’s career average is 9.4% while the 2016 league average is 12.9%.  Price has never recorded a full season (>150 IP) HR/FB rate higher than 10.5%.  Also, on balls hit into play against Price this year, 31.3% of them are fly balls, the second-lowest rate of his career.  The only season in which he allowed a lower fly ball rate was in 2012 when he won the AL Cy Young award.  Price is giving up fewer fly balls this year, but of the fly balls he is allowing, they are going over the fence at the highest rate of his career.  Those that remember Price giving up a HR in 10 consecutive starts this year are nodding violently right now.  His HR/FB% will regress towards his career norm (9.4%) and this should be the main reason for a big second half.

BABIP

Price is also suffering from an unsustainable BABIP (batting average on balls in play).  His current mark of .321 is well above his career rate (.289) and even above his highest full-season rate (.306).  Once a ball is put into play it is out of the pitcher’s control what happens from there.  This is why defense and luck influence this stat more than skill.  And with that said, statistical outliers here tend to regress towards career norms.  Even though Price is allowing ground balls at a higher rate than the past two years, his 2016 GB% is still lower than his career average.  BABIP can be influenced by the number of ground balls a pitcher allows, but he’s not allowing vastly more than his career average.  His BABIP should have some positive regression in it, which is another predictor of improved second-half performance.

LOB%

Price’s Left-On-Base% (percentage of runners a pitcher strands over the course of a season) is currently 70.9%, which is also below his career rate (74.7%) and would be his second worst full-season rate (70.0%) if the season ended today.  Similar to HR/FB%, he is ranked 73rd out of 97 qualified starting pitchers.  The past two years he ranked 22nd.  A pitcher with a higher than average strikeout rate should be able to sustain a slightly higher than average LOB%, but it’s playing out the exact opposite way for Price.  This is partly due to his inflated BABIP and HR/FB%; as these statistics continue to regress towards his career norms, the LOB% will creep up to expected levels.


Much has been made of Price’s velocity being down this year compared to any point in his career.  At the start of the season, his velocity was over 2.0 MPH lower than his career average (94.1).  He has since closed this gap almost entirely.  Here is his average fastball velocity by month (with number of starts):

April: 92.0 (5)

May: 92.5 (6)

June: 92.9 (6)

July: 94.0 (2)

If this upward trend in velocity stabilizes somewhere at or above 93.5, then nearly all the performance metrics within his control — velocity, K%, BB%, xFIP and SIERA — will be at or near his career norms.

Let’s dive a little deeper into that early-season velocity issue.  Below are two charts.  The first shows combined performance of 2014 and 2015 for ERA-qualifying starters while the second chart is the same data for the 2016 season through the All-Star break.  The orange circle is David Price.  The red circle (if shown) represents Price’s career average.  The blue circles are a hand selected peer group of the top 10 pitchers in the game (Kershaw, Sale, Arrieta, Scherzer, Bumgarner, Greinke, Strasburg, Syndergaard, Salazar and Fernandez).  Remember those rankings where Price was right around the top 10 — these are the guys usually outperforming him.  The gray circles represent everyone else.  Note: For these first two charts the top-right quadrant is Good, and the bottom-left quadrant is Bad (unless you’re a knuckleballer).

2014-2015 K/9 vs FBv

View post on imgur.com

 

2016 K/9 vs FBv

View post on imgur.com

The first graph shows David Price clustered where you would expect him — right at the middle-to-bottom of his top-10 peer group, with a healthy average fastball velocity and K/9.  The second graph (2016) shows Price in a similar relationship to his peers, but with slightly lower velocity and a higher K/9.  Note the gap between the orange (Price’s 2016) and red (Price’s career average) dots depicting his improved strikeout numbers this year despite the slightly lower velocity.  This graph also shows what freaks Noah Syndergaard, Jose Fernandez and (to a lesser degree) Jered Weaver are.

The final two graphs show the relationship between ERA and xFIP where xFIP is the more predictive estimator of a pitcher’s skill.  The bottom-left quadrant is Good (think Kershaw) and the upper-right quadrant is Bad (think Buchholz).  Anyone in the upper-left quadrant (Price in 2016) is a candidate for positive regression.

2014-2015 ERA vs xFIP

View post on imgur.com

 

2016 ERA vs xFIP

View post on imgur.com

The first graph again shows Price in his usual place — at the tail end of the top 10.  In 2014 and 2015 combined he had a very similar ERA (2.88) and xFIP (2.98).  The second graph (2016) shows the disparity between his ERA (4.34) and xFIP (3.16).  Pitchers with this large of a gap between ERA and xFIP are great candidates for regression.  The important takeaway is that his xFIP, relative to his peers, has stayed in that top-10 range.  This supports the point that some bad luck is the main element depressing his ERA.

David Price can easily be the best pitcher in the American League over the next two and a half months.  He already owns the lowest xFIP in the AL at 3.16 — the next-closest is Corey Kluber, at 3.34.  The skills above show he can sustain the xFIP level, but with some change in luck and maintaining his improved velocity, he doesn’t need to “pitch better”; he just needs to keep pitching — and the results will follow.


Can First-Half (x)FIP Predict Second-Half ERA?

This article was originally published on Check Down Sports

Predictions are hard. Getting them right is harder. But everyone loves them, so I’m going to attempt to predict which starting pitchers will improve in the second half of the season, and which are poised to put up worse numbers. This information may be especially helpful for a GM thinking about acquiring a pitcher before the trade deadline, or, maybe more applicably, a fantasy owner trying to surge his team into playoff position.

How do you exactly predict starting-pitcher performance in MLB? Well, it’s pretty commonly known among baseball-thinkers that FIP is more accurate at predicting a subsequent year’s ERA than ERA itself. FIP is a statistic on an ERA-scale that only accounts for what the pitcher can control (strikeouts, walks, and home runs). There’s been a lot of research that looks at differences between ERA and FIP, but to my knowledge, there’s nothing out there to see if it can predict second-half performance. So that’s what I’m going to do here.

I compiled all the starting pitchers who were qualified in both the first and second halves of 2015 (57 total), and ran a basic scatter plot of their first-half ERA, FIP, and xFIP against second-half ERA, to see which of the former was best at predicting the latter.

First-Half ERA and Second-Half ERA

ERA_ERA

First up is first-half ERA and second-half ERA. A fairly weak correlation — 7% of a pitcher’s second-half ERA is explained by his first-half ERA — albeit significant (p-value < 0.10).

First-Half FIP and Second-Half ERA

FIP_ERA

Next is first-half FIP and second-half ERA. It’s hard to tell but the dots are, on average, a bit closer to the fit line — 11% of second-half ERA is explained by first-half FIP (p-value < 0.05).

First-Half xFIP and Second-Half ERA

xFIP_ERA

Lastly, we have first-half xFIP and second-half ERA. While FIP uses a pitcher’s actual home-run totals, xFIP uses league-average totals because home run rates fluctuate year-to-year. You can clearly see the dots are much closer to the fit line than in the previous two graphs — 15% of second-half ERA is predicted by first-half xFIP (p-value < 0.01).

Is 15% good? Using the same method as above, I looked at the correlation between 2014 xFIP and 2015 ERA — and found an r² of 27%. So while half-season predictions don’t seem to be as accurate as season-to-season predictions, if MLB teams are making real moves based on a 27% correlation, I’m going to take a leap and say my fantasy team can makes moves based on a 15% correlation.

Now the part you (and I) have been waiting for: Here are the top 10 pitchers poised for second-half improvement followed by the top 10 pitchers who may get worse (sorted by the difference between ERA and xFIP, as of 7/9).

Screen Shot 2016-07-11 at 5.04.32 PM

Screen Shot 2016-07-11 at 5.06.34 PM

Some interesting things to note on the first list:

  • Smyly is owned in 48% of Yahoo Fantasy leagues, Nola in 47%, Ray in 11%, and Bettis in 4%. Pick them up.
  • The rest could be solid buy-low trade options (minus Eovaldi, unless your league values middle relievers).
  • A common theme among the members are high BABIPs and home-run rates (>.300, >15%) — which suggests they have been victims of bad luck.

And the second list, where the opposites are mostly true:

  • While Teheran’s name has come up in trade talks, his numbers suggest he may regress in the second half.
  • Sell-high trade options in fantasy leagues.
  • Low BABIPs and home-run rates (<.275, <10%).

Updating Hitter xISO and Second-Half Predictions

In late May, I posted a version of expected ISO (xISO), inspired by Alex Chamberlain’s work, which incorporated the publicly available Statcast data, easily accessible from the Baseball Savant leaderboard. I’ve been tinkering with it since, and figured I would post an updated version, as well as some second-half predictions based on the current “leaders and laggards”.

MODEL UPDATE

The original version of xISO was a simple linear regression model using GB% and average LD/FV exit velocity (LDFBEV). The only feature of any real note was the inclusion of the square of LDFBEV as an additional term. I knew then that I could get better correlation to data if I used LD% and FB% and removed GB% from the model, but I thought the simpler model would be better. I also thought it would be weird to have LD% and FB% as separate terms, and then one combined term for average exit velocity. I guess I just changed my mind. Whatever, it’s all empirical, and the only rule is it has to…predict better. Let’s examine the model, again trained on 2015 qualified hitters, and using LD% and FB% instead of GB%.

New xISO Model, Trained on 2015 Data

As you can see, the coefficient of determination went up a little bit from the previous version. It’s not a big deal, but it’s basically free, so we’ll take it. The updated model equation is as follows:

Now, we also have a fair bit of data for this year. I don’t yet want to update the model parameters using 2015 and 2016 data to train, but I will at least check how the model correlates to this year’s outcomes so far. I arbitrarily selected a minimum of 175 batted ball events (BBE), which limits the pool to 141 players, as of July 8th.

2016 xISO

Look at that! Not too bad overall. Armed with some confidence in the method, let’s now take a look at some of the hitters who most over- and under-performed xISO in the first half (numbers current as of July 9). I will also attempt to avoid talking about any of the players I mentioned previously, or that Alex mentioned in his June xISO report.

 

OVERPERFORMERS

Jay Bruce: ISO = .274,  xISO = .187

Bruce is actually hitting his line drives and fly balls with less authority than last year (92.8 mph down from 93.2). His overall batted-ball profile looks similar as well. After a couple down years, it’s nice to see Bruce succeeding, but I’m not betting on it to continue.

Anthony Rizzo: ISO = .282,  xISO = .201

At the risk of enraging my pal, league-mate, and curator of Harper Wallbanger, we might need to calm down a little bit on Rizzo. Don’t get me wrong, I think he’s a very good player, but odds are he won’t continue to hit for quite this much power.

Jake Lamb: ISO = .330,  xISO = .256

Right now, Jake Lamb is second in the majors in ISO behind David Ortiz. He does hit the ball hard (97.9 mph LDFBEV), but he hits 46% of his balls on the ground. Even a .256 ISO would be quite good, given his decent walk rate. This will likely go down as a true breakout season for Lamb.

Wil Myers: ISO = .242,  xISO = .188

While some of the guys on this list play in hitters’ parks, Myers is an example of a first half overperformer in a pitcher’s park. Between expected power regression and his spotty injury history, I’m nervous about the second half.

 

UNDERPERFORMERS

Andrew McCutchen: ISO = .165,  xISO = .233

Now, ‘Cutch is hitting more popups this year than last year, which could be fooling xISO a bit. Still, I like his ISO to get back to around .200. Of more concern might be his spike in strikeouts.

Ryan Zimmerman: ISO = .181,  xISO = .236

Zimmerman’s exit velocity is up from last year (96.8 mph from 95.0). He probably won’t hit for average, but if he continue making hard contact, he should accumulate plenty of RBIs in the second half.

Yasiel Puig: ISO = .133,  xISO = .188

xISO basically expects Puig to get back to his career average of .183. My main worry with the burly Cuban is his struggle to maintain a healthy pair of hamstrings.

Colby Rasmus: ISO = .157,  xISO = .211

At this point, we basically know who Rasmus is. He is a player who consistently sports an ISO over .200. After a bump in fly balls last year, he’s sitting below his career average this season. That’s not ideal for power output, but he’s also hitting the ball a bit harder. I’ll still bet on him doubling his homer total over the remainder of the season, and surpassing 20 for the second season in Houston.

 

That’s it! Please feel free to to leave comments, questions, or suggestions for improvement. I’m working on a public document with the xISO calculation available for every player, updated daily-ish. Feel free to follow me on Twitter for updates, or badger me in the comments.


Hardball Retrospective – What Might Have Been – The “Original” 2004 Royals

In “Hardball Retrospective: Evaluating Scouting and Development Outcomes for the Modern-Era Franchises”, I placed every ballplayer in the modern era (from 1901-present) on their original team. I calculated revised standings for every season based entirely on the performance of each team’s “original” players. I discuss every team’s “original” players and seasons at length along with organizational performance with respect to the Amateur Draft (or First-Year Player Draft), amateur free agent signings and other methods of player acquisition.  Season standings, WAR and Win Shares totals for the “original” teams are compared against the “actual” team results to assess each franchise’s scouting, development and general management skills.

Expanding on my research for the book, the following series of articles will reveal the teams with the biggest single-season difference in the WAR and Win Shares for the “Original” vs. “Actual” rosters for every Major League organization. “Hardball Retrospective” is available in digital format on Amazon, Barnes and Noble, GooglePlay, iTunes and KoboBooks. The paperback edition is available on Amazon, Barnes and Noble and CreateSpace. Supplemental Statistics, Charts and Graphs along with a discussion forum are offered at TuataraSoftware.com.

Don Daglow (Intellivision World Series Major League Baseball, Earl Weaver Baseball, Tony LaRussa Baseball) contributed the foreword for Hardball Retrospective. The foreword and preview of my book are accessible here.

Terminology

OWAR – Wins Above Replacement for players on “original” teams

OWS – Win Shares for players on “original” teams

OPW% – Pythagorean Won-Loss record for the “original” teams

AWAR – Wins Above Replacement for players on “actual” teams

AWS – Win Shares for players on “actual” teams

APW% – Pythagorean Won-Loss record for the “actual” teams

Assessment

The 2004 Kansas City Royals 

OWAR: 40.4     OWS: 264     OPW%: .483     (78-84)

AWAR: 16.8      AWS: 173     APW%: .358     (58-104)

WARdiff: 23.6                        WSdiff: 91  

The “Original” 2004 Royals placed third in the American League Central division, 12 games behind the Indians. The “Actual” 2004 Royals lost 104 contests. Carlos Beltran (.267/38/104) enjoyed a monster campaign as he narrowly missed the 40/40 club. The Royals center fielder compiled 121 tallies and swiped 42 bags in 45 attempts. However he only earned 11.4 Win Shares for the “Actual” Royals (vs. 29 WS for the “Originals) due to a mid-season trade to the Houston Astros. Fellow outfielder Jeff Conine contributed 35 doubles while first-sacker Mike Sweeney went yard on 22 occasions.

Juan Gonzalez of the “Actuals” placed 52nd in the “The New Bill James Historical Baseball Abstract” top 100 player rankings. 

  Original 2004 Royals                                    Actual 2004 Royals

LINEUP POS OWAR OWS LINEUP POS AWAR AWS
Jeff Conine LF 2.29 14.93 David DeJesus LF/CF 0.65 8.92
Carlos Beltran CF 6.77 29.02 Carlos Beltran CF 2.78 11.47
Michael Tucker RF 1.25 14.12 Matt Stairs RF 0.12 10.96
Johnny Damon DH/CF 4.34 25.1 Ken Harvey DH/1B 0.42 9.33
Mike Sweeney 1B 1.9 12.49 Mike Sweeney 1B 1.9 12.49
Ruben Gotay 2B -0.41 2.79 Tony Graffanino 2B 0.27 6.56
Ramon Martinez SS 0.21 5.64 Angel Berroa SS 0.38 10.55
Joe Randa 3B 0.35 13.1 Joe Randa 3B 0.35 13.1
Brent Mayne C -0.39 3.69 John Buck C 0.32 4.67
BENCH POS OWAR OWS BENCH POS AWAR AWS
Ken Harvey 1B 0.42 9.33 Desi Relaford 3B -1.07 3.69
David DeJesus CF 0.65 8.92 Benito Santiago C 0.04 3.4
Andres Blanco SS 0.5 2.32 Alberto Castillo C 0.6 2.96
Juan Brito C -0.83 2.29 Calvin Pickering DH 0.3 2.94
Dee Brown LF -0.71 2.24 Ruben Gotay 2B -0.41 2.79
Kit Pellow RF -0.59 1.08 Juan Gonzalez RF 0.12 2.69
Shane Halter 3B -0.19 1.05 Abraham Nunez RF -0.47 2.58
Alex Prieto 2B -0.03 0.75 Andres Blanco SS 0.5 2.32
Matt Treanor C -0.11 0.51 Kelly Stinnett C 0.48 2.27
Byron Gettis LF -0.08 0.38 Dee Brown LF -0.71 2.24
Alexis Gomez LF -0.07 0.29 Aaron Guiel LF -0.55 0.49
Mendy Lopez 2B -0.5 0.22 Ruben Mateo RF -0.72 0.43
Brandon Berger LF -0.33 0.2 Byron Gettis LF -0.08 0.38
Donnie Murphy 2B -0.25 0.2 Alexis Gomez LF -0.07 0.29
Raul Gonzalez RF -0.16 0.12 Jose Bautista 3B -0.23 0.27
Paul Phillips C 0 0.1 Mendy Lopez 2B -0.5 0.22
Mike Tonis C -0.11 0.03 Brandon Berger LF -0.33 0.2
Larry Sutton 1B -0.01 0.03 Donnie Murphy 2B -0.25 0.2
Wilton Guerrero 2B -0.22 0.18
Paul Phillips C 0 0.1
Adrian Brown LF -0.07 0.08
Mike Tonis C -0.11 0.03
Rich Thompson RF -0.03 0.02
Damian Jackson RF -0.12 0.01

Jon Lieber recorded 14 victories and yielded only 18 bases on balls in 27 starts. Glendon Rusch fashioned a 3.47 ERA as he split time between starting and relief roles. Zack Greinke delivered 8 victories and a 3.97 ERA in his inaugural season. Tom “Flash” Gordon (9-4, 2.21) whiffed 96 batsmen in 89.2 innings and achieved All-Star status.

  Original 2004 Royals                                  Actual 2004 Royals

ROTATION POS OWAR OWS ROTATION POS AWAR AWS
Jon Lieber SP 2.87 10.43 Zack Greinke SP 3.62 9.73
Glendon Rusch SP 3.02 10 Jimmy Gobble SP 0.87 5.37
Zack Greinke SP 3.62 9.73 Dennys Reyes SP 0.79 4.58
Jimmy Gobble SP 0.87 5.37 Jeremy Affeldt SP 0.11 4.42
Jeremy Affeldt SP 0.11 4.42 Darrell May SP -0.05 4.07
BULLPEN POS OWAR OWS BULLPEN POS AWAR AWS
Tom Gordon RP 3.66 15.47 Shawn Camp RP 0.17 4.15
Dan Miceli RP 0.73 7.13 Jaime Cerda RP 0.69 4.05
Lance Carter RP 0.76 6.53 Nate Field RP 0.07 3.02
Kiko Calero RP 0.7 5.7 Scott Sullivan RP 0.12 2.85
Orber Moreno RP 0.08 2.84 Jason Grimsley RP 0.59 2.55
Wes Obermueller SP -0.01 2.98 Brian Anderson SP -0.71 2.84
Ryan Bukvich RP 0.12 0.82 Mike Wood SP 0.24 1.91
Chad Durbin RP -1.03 0.39 Jimmy Serrano SP 0.5 1.57
Rodney Myers RP 0.06 0.29 D. J. Carrasco RP -0.12 1.54
Jason Simontacchi RP -0.28 0.26 Rudy Seanez RP 0.32 1.45
Mike MacDougal RP -0.13 0.23 Ryan Bukvich RP 0.12 0.82
Kevin Appier SP -0.44 0 Mike MacDougal RP -0.13 0.23
Chris George SP -0.82 0 Kevin Appier SP -0.44 0
Jorge Vasquez RP -0.19 0 Denny Bautista SP -0.07 0
Chris George SP -0.82 0
Justin Huisman RP -0.51 0
Matt Kinney RP -0.43 0
Curt Leskanic RP -0.64 0
Jorge Vasquez RP -0.19 0
Eduardo Villacis SP -0.22 0

Notable Transactions

Carlos Beltran

June 24, 2004: Traded as part of a 3-team trade by the Kansas City Royals to the Houston Astros. The Oakland Athletics sent Mark Teahen and Mike Wood to the Kansas City Royals. The Houston Astros sent Octavio Dotel to the Oakland Athletics. The Houston Astros sent John Buck and cash to the Kansas City Royals.

Johnny Damon

January 8, 2001: Traded as part of a 3-team trade by the Kansas City Royals with Mark Ellis to the Oakland Athletics. The Oakland Athletics sent Ben Grieve to the Tampa Bay Devil Rays. The Oakland Athletics sent Angel Berroa and A.J. Hinch to the Kansas City Royals. The Tampa Bay Devil Rays sent Cory Lidle to the Oakland Athletics. The Tampa Bay Devil Rays sent Roberto Hernandez to the Kansas City Royals.

November 5, 2001: Granted Free Agency.

December 21, 2001: Signed as a Free Agent with the Boston Red Sox.

Tom Gordon

October 30, 1995: Granted Free Agency.

December 21, 1995: Signed as a Free Agent with the Boston Red Sox.

November 1, 2000: Granted Free Agency.

December 14, 2000: Signed as a Free Agent with the Chicago Cubs.

August 22, 2002: Traded by the Chicago Cubs to the Houston Astros for players to be named later and Russ Rohlicek (minors). The Houston Astros sent Travis Anderson (minors) (September 11, 2002) and Mike Nannini (minors) (September 11, 2002) to the Chicago Cubs to complete the trade.

October 29, 2002: Granted Free Agency.

January 23, 2003: Signed as a Free Agent with the Chicago White Sox.

October 27, 2003: Granted Free Agency.

December 16, 2003: Signed as a Free Agent with the New York Yankees.

Honorable Mention

The 2009 Kansas City Royals 

OWAR: 45.7     OWS: 268     OPW%: .544     (88-74)

AWAR: 25.3       AWS: 194      APW%: .401    (65-97)

WARdiff: 20.4                        WSdiff: 74

Kansas City clinched the American League Central division title by a lone game over Minnesota. Zack Greinke (16-8, 2.16) merited the 2009 AL Cy Young Award as he paced the Junior Circuit in ERA and WHIP (1.073) while posting career-highs in strikeouts (242) and innings pitched (229.1). Johnny Damon (.282/24/82) tied his personal-best in home runs, slashed 36 two-base hits and registered 107 tallies. Billy Butler aka “Country Breakfast” drilled 51 doubles and swatted 21 big-flies. David DeJesus contributed 13 jacks and knocked in 71 runs. Carlos Beltran supplied a .325 BA but missed more than two months of the season due to injury. J.P. Howell saved 17 contests and collected 7 victories as the Royals’ relief ace.

On Deck

What Might Have Been – The “Original” 1969 Reds

References and Resources

Baseball America – Executive Database

Baseball-Reference

James, Bill. The New Bill James Historical Baseball Abstract. New York, NY.: The Free Press, 2001. Print.

James, Bill, with Jim Henzler. Win Shares. Morton Grove, Ill.: STATS, 2002. Print.

Retrosheet – Transactions Database

The information used here was obtained free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at “www.retrosheet.org”.

Seamheads – Baseball Gauge

Sean Lahman Baseball Archive


The Good and the Bad: David Price Isn’t Sinking

You know his story: David Price is a $217-million man with a 4.74 earned run average, and the people of Boston aren’t happy. It’s another Crawford-Sandoval-Ramirez waste of money. Things are headed downhill for the 31-year old veteran. Or are they?

First, the bad news: the 2016 version of David Price has been worse than the 2015 David Price, and way worse than the top-caliber pitcher Boston signed him to be. And the ERA shows it.

The suspect is pitch selection, and the culprit is a sinker that doesn’t sink. Price has a two-seam fastball that over his seven-year career he has thrown some 30% of the time. In his prime, it clocked in at 94-95 mph, but since then he’s dropped almost two mph.

Usually, that level of velocity leak wouldn’t be a big deal, because if there’s enough movement and deception, batters will be fooled either way. But Price’s sinker is different.

Brooks Baseball reports that “His sinker has well above-average velocity, but has little sinking action compared to a true sinker and results in more fly balls compared to other pitchers.” Uh-oh. “Little sinking action?” There needs to be at least some element of vertical movement for a sinker to be fully effective, or, in Price’s case, a little extra velocity. But now he has neither.

The results show it. Last month, he surrendered 10 home runs, more than the previous two months combined. Also in June: 31% of his pitches were sinkers, nearly 10% more than the month before. Coincidence? I think not. He’s also allowing a .241 Isolated Power on sinkers, only three points less than Mike Trout this season. And maybe the most convincing statistic: hitters are pulling the ball 10% more than they did last year, which means they are making more solid contact and not having to stay back on his fastball. Price’s pitches are slower, and it’s making a difference.

Why is he losing velocity? There’s two possibilities and they point in completely opposite directions. The first is age. Price is 31 and he’s nearing the point where most starting pitchers start to fall on the aging curve and eke velocity. If this is the case, it’s going to be a long seven years for the Red Sox. But there is another possibility. Price has played in Tampa Bay for most of his career, where the temperatures are never 40 degrees like Boston in April. It’s entirely possible that the cold ‘froze’ him up this spring and as the season continues, he’ll regain his speed. Most likely, it’s a combination of both. But either way, it’s never a good sign when pitchers slow down.

Price has always gotten away with leaving sinkers up in the zone because they showed 94-95 mph on the radar gun. But now hitters are seeing 92mph fastballs fly straight down the middle of the plate and stay there.  Why doesn’t he just put the ball on a tee? Nine out of 10 major-league hitters will knock that pitch into the stands every time. Just look at the stats: He’s surrendered just two fewer home runs than he did last season even though he’s pitched 112 fewer innings (2015: 17, 2016: 15), and he’s allowed an average of 1.25 home runs per nine innings, which is 32 percent worse than his career average (0.84). Sinkers are sending the man to his grave.

They’re also killing his ERA. 38% of his earned runs are from home runs, and if you set his home runs to eight instead of 17, his ERA would be 4.01 instead of 4.74, a 0.73 difference. (8 is the number he had allowed last year at this point in the season.) In fact, his strikeout and walk totals are even better than last season, but the home runs negate all of it.

But we can’t blame everything on the sinker, either. Price has definitely been unlucky this season. His home run to fly ball ratio is 15.5%, an unsustainable mark, his .323 BABIP .035 more than his career average, and his LOB% 10 percent less than the 2016 league average. These will balance out in time. But his sinker is the real problem.

The only way to truly limit home runs is to limit fly balls, and for Price, the only way to limit fly balls is to stop throwing sinkers that don’t sink. The solution is (1) throw harder, or (2) find another pitch to replace his sinker. Option one is still TBD. Option two could be filled with either a change or slider — two pitches that he has used to complement his fastball but never to the level that he uses his sinker. The outlook is grim either way.

Price is still a very experienced pitcher, and once his HR/FB, LOB%, and BABIP rates come down to earth, things will even out. But if he wants to be successful for the Red Sox for the entirety of his stay, there’s a longer-term issue at stake, and if his velocity continues to leak, I’m not sure what type of David Price we’ll be looking at a year from now.


Over- and Under-achieving FIP

I have always been fascinated by pitchers that consistently post ERAs that differ significantly from their FIPs.  As a Braves fan, this interest is particularly relevant in the valuation of ace/not ace Julio Teheran.  Unfortunately for me — but very fortunately for readers — Eno Sarris tackled the specific case of Teheran and the more general case of FIP-beaters with high pop-up rates here before I could finish this post.  Regardless, the research is done, and I believe it is still relevant.

While Eno focused on a specific subset of FIP-beaters in his discussion of Teheran, I wanted to examine pitchers with extreme ERA/FIP gaps more broadly.  I included not only pitchers who overachieved based on FIP, but also those who underachieved.  I began with a sample of all pitchers since 1960 who reached 500 IP through age 25.  I then calculated the difference between ERA- and FIP- for each pitcher (FIP overachievers would have a negative number, underachievers positive).  I selected these metrics 1) because they were readily available here at FanGraphs, and 2) because I was interested in the gap relative to league average — hopefully stripping out any differences in era (should any even exist).  

I chose this age cutoff so that I had a sample of three “in-prime” seasons afterwards (age 26-28) to compare to the initial numbers below.  After I found Z-Scores for all of the u25 pitchers, I set the threshold for over/underachiever at +/- 1 standard deviation from the mean, which turned about to be an ERA- / FIP- difference of right around eight.  It is certainly arbitrary, but I felt like this adequately separated the sample so I could examine the ends of the population.

Extreme FIP Over/Underachievers
Group ERA- minus FIP- n
ALL u25 -.02 297
Overachievers (Z<1) -11.91 48
Underachievers (Z>1) 11.35 47
Since 1960, min. 500 IP through age 25.  Average ERA- minus FIP- weighted for IP.

As you can see, the spread in ERA- between over/underachievers is pretty large.  Overachievers posted ERAs 12% lower (relative to league average) than expected based on FIP, while underachievers posted ERAs over 11% higher (relative to league average) than expected based on FIP.  The group as a whole posted an ERA- nearly identical to its FIP-, which is more in line with DIPS theory expectations.

The big question remains: how “sticky” is the gap between ERA- and FIP-?  To determine this, I compared the ERA- / FIP- gap for these same samples from age 26-28.

Extreme FIP Over/Underachievers Age Comparison
Group u25 E-F- o25 E-F- Raw Diff Diff Adj. for Sample Avg. % Retained
ALL -.02 .42 -.44
Overachievers (Z<1) -11.91 -3.41 -8.50 -8.06 32.3%
Underachievers (Z>1) 11.35 4.64 6.71 7.15 37.0%
Since 1960, min. 500 IP through age 25.  Average ERA- minus FIP- weighted for IP.

From age 26-28, the sample as a whole posted an ERA- above its FIP-.  Even adjusting for that change, the over/underachievers both regressed heavily towards the mean, retaining 32.3% and 37.0% of their difference in ERA- and FIP- respectively.  While regression is powerful, both samples did continue to post differences in ERA- and FIP-.  The overachievers continued to post lower ERAs than FIPs, while the underachievers kept on allowing more runs than FIP suggested they deserved.  Interestingly, the percentage of the gap retained is similar for over and underachievers, though it is slightly smaller for FIP beaters.

The methodology isn’t perfect, but I found the results very compelling.  It does seem like consistently beating FIP is partially skill (which jibes with Eno’s results), and consistently allowing ERAs above FIPs is more than just bad luck.  As usual, this analysis leads to more questions than answers.  How many innings are needed before one can be considered a DIPS outlier?  Do FIP underachievers actually regress less than FIP beaters?  How does age-related decline affect the gap in ERA- and FIP-?  As the sample for a DIPS outlier grows, does he retain more of the difference going forward?  Etc.  I may try to dive into one or more of those questions later.  For now, hopefully this analysis is helpful as you consider how likely a pitcher on your team is to continue over/underperforming his FIP.


We’re In a Golden Age of the Lefty Fastball

The 2016 baseball season is well underway and we’re seeing an even more drastic version of the trends that we saw last year: There are more strikeouts, more home runs, and more challenges. And, notably, there has also been a steady increase in velocity across the league, assisted by the guys I’ll be highlighting here.

A “steady increase in velocity” might not be reason to stop the presses, but just soak in this Tweet real quick:


We’re basically seeing twice as many pitches thrown 95+ as we were in 2008. ¡2008!

Even left-handers, typically a step behind (always a bit of a quirky species, lefties), are chucking it. Across the league, lefties are throwing the ball 95+ mph just around 7.5% of the time. That’s way more often than the stereotype of the Tom Glavine-y, soft-tossing corner-nibbler would have you believe, but it’s 2016 and elite velocity isn’t just left to the elite pitchers anymore (Chris Sale is joined in that 95+ lefty fastball club by some guy named Buddy Boshers out of the bullpen for the Twins).

So…I’m not just interested in guys that throw hard; I want guys who throw hard and make the ball move, and I want them to be left-handed. (Truth: that lefty requirement is mostly an excuse so I can hopefully talk about Danny Duffy more, James Paxton for the first time, and because I already covered the right-handed side of things with my Charlie Morton post from the start of the season (The Unbelievable Emergence of Charlie Morton), and basically because lefties are more fun.)

A common refrain among pitching coaches is that movement is just as important as velocity. Velocity can get you to the majors, but big-league hitters will turn around 95+ fast if it’s straight. But when combined with some movement (and even better, control/command) 95+ is a high value commodity.

I’m after what I want to dub the best lefty fastball. Let’s start with the simple stuff: Who out there is throwing it 95+ most frequently? Note that the percentages here are for all pitches thrown, including the off-speed stuff.

Player Name Number of Pitches 95+ % of Pitches Thrown 95+
Zach Britton 152 93%
Sean Doolittle 118 84%
Aroldis Chapman 125 80%
Jake Diekman 111 73%
James Paxton 332 64%
Justin Wilson 81 62%
Josh Osich 58 57%
Enny Romero 86 54%
Tony Cingrani 102 53%
Jake McGee 32 51%
Danny Duffy 211 49%
Ian Krol 68 45%
Robbie Ray 208 41%
Felipe Rivero 61 35%
Sammy Solis 51 30%
Andrew Miller 52 30%
Andrew Chafin 6 26%
Carlos Rodon 90 23%
Blake Snell 45 23%

There are a number of relievers in there that I should probably get to know better. Zach Britton, Sean Doolittle, and Aroldis Chapman have all been flame-throwers for a while now; somehow their gas no longer brings the flicker to my eye that it once did. But Josh Osich and Enny Romero? Those are new guys that throw quite hard and are likely on their way to relevance.

The starters on the list are the most fun for me. James Paxton is there. Danny Duffy, too. But so are Carlos Rodon and Blake Snell. I’m not going to anoint any of these young guys just yet, but I’d venture that it’s been a long time since we’ve had four lefty starters out there throwing 95+ mph heaters at least 23% of the time. But…Carlos Rodon has a 4.16 ERA, and the other three all have fewer than 10 starts on the season. Let’s see if movement has anything to do with it.

We’re in search of the best lefty fastball and the best lefty fastball must move sideways, while also moving quickly. 10 inches of run seems like a pretty good place to set up camp.

Player Name Number of Pitches 95+
& 10+ inches of run
% of All Pitches
Jake Diekman 90 59%
James Paxton 171 33%
Josh Osich 24 24%
Cody Reed 18 20%
Chris Sale 90 16%
Sammy Solis 22 13%
Robbie Ray 58 11%
Brad Hand 26 11%
Clayton Richard 7 11%
Mike Montgomery 22 9%
Martin Perez 42 9%
Steven Matz 23 6%
Ian Krol 8 5%
Andrew Miller 9 5%
Ashur Tolliver 3 5%
Enny Romero 8 5%
Tony Cingrani 7 4%
Zach Britton 6 4%
T.J. McFarland 3 3%
Aroldis Chapman 4 3%
Sean Doolittle 3 2%
Carlos Rodon 8 2%

Look at that: Mr. Rodon and his 4.16 ERA bring up the rear, while Snell and Duffy dropped right off. But man, James Paxton is still up top there just behind Jake Diekman. Diekman is a very good reliever, who seems to be realizing his potential since his trade to Texas. Basically, by exclusively pounding the zone with that hard, running fastball, he’s posted an ERA below 2.00 since getting out of Philly.

Oh! Chris Sale, how did I forget to include him in my love fest of the young lefty starters in the league? Sale has thrown 110 pitches at least 95 mph, and of those, 90 have moved at least 10 inches. That’s nuts. His stuff is incredible.

We also see Steven Matz creep in there as 6% of his pitches are these 95 mph fastballs that move an unfair amount. Matz and his 2.96 ERA definitely belong in that quartet of young insanely talented left-handed starting pitching that I talked about before. He’ll be the fifth member of that group, and we instantly have to expand our Mount Rushmore of tantalizing excellence.

This is starting to feel a bit like the NBA where so much Amazing is happening. But it’s true: there’s a lot of amazing happening across the MLB landscape right now. These lefty fastballs are but one, tiny iota of all that is going on.

Let’s refine the batch of fastballs once more to include only those that have at least 10 inches of vertical movement, too. This admittedly feels like a laughable exercise. There’s no way that pitchers are actually throwing pitches that go 95 mph, while also running and rising that much….

Player Name Number of Pitches 95+
10+ inches of run
10+ inches of rise
Robbie Ray 15
James Paxton 14
Enny Romero 5
Rest of League 25

Oh. Damn. I see you Robbie Ray, James Paxton, and Enny Romero. I also see you Rest of League. That group included Danny Duffy, Sean Doolittle, Aroldis Chapman, Matt Moore, and Chris Sale. But really this is about those top three guys.

Ray was once a prospect known more for his feel and pitchability than a premier fastball. He’s starting for the Diamondbacks now and he’s striking out over 10 per game. His ERA sites at 4.59 and his WHIP is over 1.50, which are both significantly worse than his 2015 campaign, but still, if that pitchability from his earlier career outlook meets with his clearly impressive fastball, things could turn around quickly for the 24-year-old. I’m frankly surprised to see him here.

As for James Paxton, we know he’s throwing way harder now that he’s dropped his arm slot. I’ll save my full review of his stuff for the lengthier look that it deserves.

Then there’s Enny Romero. Romero isn’t well known in baseball circles just yet. He started a single game as a 22-year-old for the Rays back in 2013, spent 2014 throwing a 4.93 ERA in Triple-A, and hasn’t exactly torn things up in the majors since then. But he’s a young player, with a solid baseball name and a clearly electric fastball. He’s 25 and capable of figuring it out just like any other 25-year-old.

To be totally honest, I’m not entirely sure what to do with this group of pitchers. The guys atop this 95/10/10 club clearly have electric fastballs, but the electric fastball has not equated to big-league success so far. I guess that’s OK, and feeds back into the last bit of the the old pitching coach refrain: Velocity is nothing without movement…and control. But control is not sexy.

Speed is sexy, and all these guys throwing 95 are great, but Aroldis Chapman is the only one guy who’s ever thrown it 105 mph. He keeps the crown of best fastball. (All this talk of horizontal and vertical movement was really just an attempt to crown the best non-Chapman lefty fastball.)

So what is the takeaway?

This discussion mostly serves as a friendly reminder that we’re in the midst of a great revolution of left handed pitchers — all of whom make Clayton Kershaw old by comparison. These guys are throwing fastballs harder than we’ve ever seen before and there’s so many of them doing it.

Stat of the Day: I feel like I should also note that I unearthed an insane Andrew Miller pitch where he effectively threw a 95 mph slider on June 6th to some poor soul.


MLB’s Qualifying Offer: A King’s Ransom

With the MLB draft just past, I thought it would be appropriate to examine one of the most controversial topics surrounding the draft: the qualifying offer. Essentially, the qualifying offer intends to reward teams — presumably the small-market, low budget ones — that lose players in free agency. This reward comes in the form of an additional first-round draft pick for every player that signs with another team.

Only it isn’t that simple. Once a player reaches the end of his contract, the team can decide whether or not to offer the player a 1-year extension known as the qualifying offer. This new contract is equal to the average of the highest 125 salaries in MLB ($15.8 million in 2016). The player then chooses to either accept the qualifying offer or decline it — and thus, enter free agency with the assumption that he can earn more than a 1-year, $15.8 million contract. Once the player signs on with another team, his former team is awarded a first-round draft pick (to go along with the one(s) they already have, assuming they do) as compensation. Additionally, the player’s new team loses their first-round pick in the draft so long as it is outside the top 10 (in which case their second-round pick would be forfeited).

So, one would assume that, more often than not, a small-market team with a low payroll would benefit from this system. A budding star player reaches the end of his contract and commands a new contract worth hundreds of millions and spread over 5+ seasons. His current team does not have the financial resources to resign him, and another big-market team does. The cash-strapped team receives an additional first-round pick as compensation, while his new team willfully forfeits its first-round pick in exchange for his services over the next half-decade. And that’s that.

Not quite. I went back over the draft order for every year since 2013 (when the qualifying offer was first introduced) and summed the number of draft picks gained and lost. Results are shown below. I sorted the teams by their average payroll over the span in descending order. As you can see, the compensation is not in line with the assumption I presented above. In any way you shape it, the high-payroll teams are the ones benefiting from the current system. The 10 highest-payroll teams have received 19 additional draft picks over the four seasons — highlighted by the Cardinals who have gained four and lost none. The 10 teams with the lowest payrolls have received eight additional picks. The high payroll teams have a net draft pick gain of four, while the low payroll teams have a net loss of two.

Screen Shot 2016-05-29 at 5.27.18 PM

Now, I’m not coming up with any revolutionary solutions here — I’m not that smart and I don’t get paid enough. I am simply presenting data that supports that MLB’s current free-agent compensation system doesn’t benefit the teams that need it the most. In fact, this seems to be a story of “the rich are getting richer” — big-money teams are receiving the extra draft picks that were seemingly meant for the low-budget ones. Maybe MLB scraps the compensation system altogether, maybe they extend the time frame for when a player can accept the qualifying offer (they currently have seven days), or maybe they come up with some other solution. In any case, the current CBA ends after the 2016 season so us fans will likely know the answer before next year’s draft.


Hardball Retrospective – What Might Have Been – The “Original” 1984 Giants

In “Hardball Retrospective: Evaluating Scouting and Development Outcomes for the Modern-Era Franchises”, I placed every ballplayer in the modern era (from 1901-present) on their original team. I calculated revised standings for every season based entirely on the performance of each team’s “original” players. I discuss every team’s “original” players and seasons at length along with organizational performance with respect to the Amateur Draft (or First-Year Player Draft), amateur free agent signings and other methods of player acquisition.  Season standings, WAR and Win Shares totals for the “original” teams are compared against the “actual” team results to assess each franchise’s scouting, development and general management skills.

Expanding on my research for the book, the following series of articles will reveal the teams with the biggest single-season difference in the WAR and Win Shares for the “Original” vs. “Actual” rosters for every Major League organization. “Hardball Retrospective” is available in digital format on Amazon, Barnes and Noble, GooglePlay, iTunes and KoboBooks. The paperback edition is available on Amazon, Barnes and Noble and CreateSpace. Supplemental Statistics, Charts and Graphs along with a discussion forum are offered at TuataraSoftware.com.

Don Daglow (Intellivision World Series Major League Baseball, Earl Weaver Baseball, Tony LaRussa Baseball) contributed the foreword for Hardball Retrospective. The foreword and preview of my book are accessible here.

Terminology

OWAR – Wins Above Replacement for players on “original” teams

OWS – Win Shares for players on “original” teams

OPW% – Pythagorean Won-Loss record for the “original” teams

AWAR – Wins Above Replacement for players on “actual” teams

AWS – Win Shares for players on “actual” teams

APW% – Pythagorean Won-Loss record for the “actual” teams

Assessment

The 1984 San Francisco Giants 

OWAR: 42.9     OWS: 294     OPW%: .508     (82-80)

AWAR: 27.7      AWS: 198     APW%: .407     (66-96)

WARdiff: 15.2                        WSdiff: 96  

The “Original” 1984 Giants ended the season with a winning record but merely earned a fifth place finish, 9 games behind the Astros. Gary “Sarge” Matthews established a career-best with 101 runs scored while pacing the circuit with 103 walks and a .410 OBP. Chili Davis contributed a .315 BA and merited his first All-Star invitation. Dave “Kong” Kingman walloped 35 four-baggers and knocked in a personal-best 118 baserunners. Bob Brenly achieved his lone All-Star nod with a .291 BA, 20 dingers and 80 ribbies. Jack Clark supplied a .320 BA with 11 long balls prior to a season-ending injury in mid-June. Dan “Dazzle” Gladden ignited the offense following his recall from the minor leagues in late June, posting a .351 BA and swiping 31 bags.

Jack Clark is ranked 27th among right fielders according to Bill James in “The New Bill James Historical Baseball Abstract.” “Original” Giants teammates listed in the “NBJHBA” top 100 rankings include George Foster (34th-LF), Gary Matthews (46th-LF), Garry Maddox (56th-CF), Chili Davis (64th-RF), Chris Speier (68th-SS) and Dave Kingman (98th-LF).  Al Oliver (31th-CF), Manny Trillo (49th-2B) and Dusty Baker (54th-LF) make the register for the “Actual” Giants. 

  Original 1984 Giants                              Actual 1984 Giants

LINEUP POS OWAR OWS LINEUP POS AWAR AWS
Gary Matthews LF 2.68 22.93 Jeffrey Leonard LF 2.38 20.37
Dan Gladden CF 2.81 16.46 Dan Gladden CF 2.81 16.46
Chili Davis RF/CF 4.19 21.58 Chili Davis RF/CF 4.19 21.58
John Rabb 1B -0.14 1.01 Scot Thompson 1B 0.35 6.89
2B Manny Trillo 2B 0.76 8.83
Johnnie LeMaster SS -0.47 7.23 Johnnie LeMaster SS -0.47 7.23
Chris Brown 3B 0.31 2.26 Joel Youngblood 3B -0.89 9.5
Bob Brenly C 3.58 21.32 Bob Brenly C 3.58 21.32
BENCH POS OWAR OWS BENCH POS AWAR AWS
Dave Kingman DH 2.49 21.48 Jack Clark RF 2.01 11.84
George Foster LF 1.16 18.27 Dusty Baker RF 1.19 8.81
Jack Clark RF 2.01 11.84 Al Oliver 1B -0.85 6.56
Bob Kearney C 0.26 8.63 Steve Nicosia C 0.79 4.99
Garry Maddox CF 0.53 6.67 Brad Wellman 2B -0.45 3.74
Chris Speier SS -0.24 2.96 Chris Brown 3B 0.31 2.26
Rob Deer LF 0.28 1.24 Fran Mullins 3B 0.24 2.08
Randy Gomez C -0.02 0.18 Gene Richards LF -0.04 1.92
Tom O’Malley 3B -0.5 0.03 Rob Deer LF 0.28 1.24
Jose Morales -0.19 0 John Rabb 1B -0.14 1.01
Casey Parsons -0.01 0 Duane Kuiper 2B -1.06 0.82
Randy Gomez C -0.02 0.18
Joe Pittman SS -0.17 0.12
Alejandro Sanchez RF -0.4 0.08
Tom O’Malley 3B -0.29 0.01

Bob Knepper rebounded from an 11-28 mark in the previous two campaigns to achieve a 15-10 record with a 3.20 ERA and 1.190 WHIP. Gary Lavelle notched 12 saves and fashioned a 2.76 ERA as the primary closer. Frank Williams collected 9 victories in a long relief role during his rookie year.

  Original 1984 Giants                                   Actual 1984 Giants

ROTATION POS OWAR OWS ROTATION POS AWAR AWS
Bob Knepper SP 2.16 12.43 Bill Laskey SP -0.02 4.8
Pete Falcone SP 0.91 5.33 Mike Krukow SP -1.04 3.94
John Montefusco SP 0.58 3.27 Jeff D. Robinson SP -0.67 2.84
Jeff D. Robinson SP -0.67 2.84 Atlee Hammaker SP 0.96 2.28
Mark Calvert SP -0.39 0.22 George Riley SP 0.19 0.98
BULLPEN POS OWAR OWS BULLPEN POS AWAR AWS
Gary Lavelle RP 1.78 7.85 Gary Lavelle RP 1.78 7.85
Frank Williams RP 0.43 5.7 Greg Minton RP -0.02 6.29
John Henry Johnson RP 1.22 4.39 Frank Williams RP 0.43 5.7
Scott Garrelts SW -1.13 0 Randy Lerch RP 0.11 2.58
Gorman Heimueller RP -0.7 0 Bob Lacey RP -0.07 1.51
Mark Grant SP -1.1 0 Renie Martin RP -0.09 0.99
Mark Calvert SP -0.39 0.22
Mark W. Davis SP -1.91 0.18
Jeff Cornell RP -1.25 0
Scott Garrelts SW -1.13 0
Mark Grant SP -1.1 0

Notable Transactions

Gary Matthews

November 17, 1976: Signed as a Free Agent with the Atlanta Braves.

March 25, 1981: Traded by the Atlanta Braves to the Philadelphia Phillies for Bob Walk.

March 26, 1984: Traded by the Philadelphia Phillies with Porfi Altamirano and Bob Dernier to the Chicago Cubs for Bill Campbell and Mike Diaz.

Dave Kingman

February 28, 1975: Purchased by the New York Mets from the San Francisco Giants for $150,000.

June 15, 1977: Traded by the New York Mets to the San Diego Padres for Paul Siebert and Bobby Valentine.

September 6, 1977: Selected off waivers by the California Angels from the San Diego Padres.

September 15, 1977: Traded by the California Angels to the New York Yankees for Randy Stein and cash.

November 2, 1977: Granted Free Agency.

November 30, 1977: Signed as a Free Agent with the Chicago Cubs.

February 28, 1981: Traded by the Chicago Cubs to the New York Mets for Steve Henderson and cash.

January 30, 1984: Released by the New York Mets.

March 29, 1984: Signed as a Free Agent with the Oakland Athletics.

George Foster

May 29, 1971: Traded by the San Francisco Giants to the Cincinnati Reds for Frank Duffy and Vern Geishert.

February 10, 1982: Traded by the Cincinnati Reds to the New York Mets for Greg Harris, Jim Kern and Alex Trevino.

Bob Knepper

December 8, 1980: Traded by the San Francisco Giants with Chris Bourjos to the Houston Astros for Enos Cabell.

Honorable Mention

The 1906 New York Giants 

OWAR: 65.9     OWS: 361     OPW%: .591     (91-63)

AWAR: 50.8       AWS: 287      APW%: .632    (96-56)

WARdiff: 15.1                        WSdiff: 74

The New York Giants secured the organization’s fourth consecutive pennant in 1906 with a record of 91-63, placing three games in front of the St. Louis Cardinals. Third-sacker Art Devlin pilfered 54 bases and delivered a .299 BA. Harry H. Davis topped the leader boards with 12 big-flies and 96 ribbies. Converted outfielder Cy Seymour nabbed 29 bags and drove in 80 baserunners while “Wee” Willie Keeler batted .304 with 23 steals. Christy Mathewson furnished 22 victories along with a 2.97 ERA. Left-hander Hooks Wiltse recorded 16 wins with an ERA of 2.27 and a WHIP of 1.143.

On Deck

What Might Have Been – The “Original” 2004 Royals

References and Resources

Baseball America – Executive Database

Baseball-Reference

James, Bill. The New Bill James Historical Baseball Abstract. New York, NY.: The Free Press, 2001. Print.

James, Bill, with Jim Henzler. Win Shares. Morton Grove, Ill.: STATS, 2002. Print.

Retrosheet – Transactions Database

The information used here was obtained free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at “www.retrosheet.org”.

Seamheads – Baseball Gauge

Sean Lahman Baseball Archive


Who Has Performed Better In the Draft?

The MLB draft has passed but its impact will last. Some selections will go down as busts (e.g. Matt Anderson by the Tigers in 1997). Others will be real bargains such as Carlos Beltran with the 49th pick in 1995. I decided to look at the numbers in an attempt to answer the following questions I read over the last few weeks:

  1. How many Round 1 picks do end up in the big leagues? What’s the average impact of a Round 1 pick? How does that compare to Round 2? Are there differences between pitcher and batters?
  2. What has been the best draft class for the 1993-2008 period? (per three first rounds)
  3. What teams have done a better job?
  4. What is the best round (top 10 overall picks)?

As I usually do, let’s define the data sources and assumptions. First, my data source is Baseball-Reference. There are many assumptions and disclaimers in this process, but the most important ones are:

  1. I am using data from 1993 to 2008 to give ample time for players to reach MLB. As I am using career WAR, I don’t want to over-penalize players that have been selected in the recent years and therefore have not accumulated MLB service time.
  2. Organizations change and so do their ways of conducting business, which evidently includes draft strategy. We are looking at teams rather than specific front offices or general managers.
  3. WAR refers to Baseball-Reference WAR (i.e. bWAR).
  4. Teams may have more than one pick per round due to compensation and supplemental picks.
  5. This methodology does not take into account the overall quality of the draft pool i.e. total WAR per draft year is not constant.
  6. All WAR is allocated to the team that drafts the player. Understandably, that is not true but let’s toy with the idea through this post.

Let’s get to it.

Question 1 – How many Round 1 picks do end up in the big league? What’s the average impact of a Round 1 compare to a Round 2 pick? Are there differences between pitcher and batters?

The table below outlines how many players have been/were called up to the majors and how many actually have had a positive career WAR i.e. over 0.0. I have also added the average career WAR per player and I have broken down the data by round and by position (pitcher and batter) to grasp the differences easily. Just take a moment with this table:

 

Round Pos Total players Players that reached MLB % of Total players Positive WAR % of players who reached MLB Average WAR per player
Round 1
Pitchers 372 242 65% 161 67% 9.7
Batters 320 225 70% 157 70% 14.4
Sub-Total 692 467 67% 318 68% 12.1
Round 2
Pitchers 247 121 49% 60 50% 8.1
Batters 244 127 52% 70 55% 13.1
Sub-Total 491 248 51% 130 52% 10.8
Round 3
Pitchers 244 99 41% 59 60% 5.5
Batters 235 88 37% 50 57% 7.3
Sub-Total 479 187 39% 109 58% 6.3
Total 1662 902 54% 557 62% 10.6

 

Three things come to my mind:

First, this provides some empirical validation of what we intuitively thought: First-round picks produce greater WAR values than the others. While I only have data for the first three rounds, it’s worth noting that the gap between Round 1 to Round 2 (10%) is smaller than from Round 2 to Round 3 (41%).

Second, I actually found surprising that 67% of first-rounders reached MLB at some point. That is two players out of three and it’s a testament to how important raw skills are when it comes to moving up through the minors.

Lastly, the answer to the question of whether t draft pitchers or batters looks like an easy one. Batters not only reached MLB at a higher pace but delivered better results as a group and as individuals. While these results are not statistically significant, they provide a pragmatic answer to the question and suggest a sound strategy might be to draft batters and trade for pitchers later down the road.

Question 2 – What has been the best draft class for the 1993-2008 period?

This table should provide guidance on how to answer this question but does not fully explain it. If we think of it as the number of players that got to MLB, then 2008 is the best year. That year highlights Eric Hosmer, Buster Posey, Brett Lawrie, Craig Kimbrel and Gerrit Cole as the most prominent stars, but offers a very low career total WAR as most of its players are still playing – they’re the youngest generation of my sample. In this class, 27 out of the top 30 picks have reached MLB, though a few for a very short stint e.g. Kyle Skipworth or Ethan Martin.

Year Total war Total players that reached MLB Average WAR per player
1993 476.3 54 8.82
1994 243.4 54 4.51
1995 484.9 41 11.83
1996 280.0 45 6.22
1997 409.5 59 6.94
1998 397.6 53 7.50
1999 402.1 52 7.73
2000 236.8 47 5.04
2001 350.9 55 6.38
2002 508.1 54 9.41
2003 297.1 60 4.95
2004 393.2 63 6.24
2005 458.1 63 7.27
2006 282.7 62 4.56
2007 325.4 69 4.72
2008 213.2 71 3.00

 

If we think of the highest total career WAR, then the winner is 2002. This class is led by two of the best picks on the sample (Zack Greinke and Joey Votto) but also features Prince Fielder, Jon Lester and Curtis Granderson. If we think of highest concentration of skills, then the 1995 class has to be the first one with an average of 11.8 WAR per MLB player. On the other hand, only 41 players got the MLB call, the lowest among the sample. While Carlos Beltran and Roy Halladay are the most notable names in that draft, player such as Darin Erstad, Kerry Wood, Randy Winn and Bronson Arroyo enjoyed nice peaks.

 

Question 3 – What teams have done a better job?

Evidently, not every team has selected in the same combination of draft slots e.g. some teams have had the opportunity to choose top picks (Rays, for example), while other have frequently picked from mid-bottom draft slots (Yankees).  It would not be fair to compare total career WAR for players the Yankees has selected against those that the Rays has because the latter had more options and access to a different pool of players than that the Yankees had. How to fix that? I am comparing what each team did on the overall pick they were slotted. If we use 2016 as an example, I would be comparing how good Philadelphia was in choosing Mickey Moniak as pick 1 against the average of all other first picks in the timeframe (1993-2008). Once I know the WAR gap between a particular team and the average WAR per pick, I need to standardize that number by the standard deviation i.e. calculating Z scores. In simple terms, this is understanding how good or bad a pick was in relation to the entire distribution of a particular draft slot. The Z-score number allows us to compare how good a 14th pick was in relation to a third pick, for example. Finally, to identify which teams have fared better, I am calculating the average of Z-scores for all picks.

Again, there are many caveats here, but this should give us a ballpark estimate on how well teams have drafted from 1993-2008. Keep in mind, this methodology does not produce a linear WAR per draft slot. That would mean, for example, that overall pick 4 will produce greater WAR than pick 5. On average, the 4th pick has produced 6.2 WAR on average, while the 5th one has produced 14.3. While this might be counter-intuitive (it is at least for me), the empirical evidence of this sample size shows that.

 

Batter Pitcher    
Teams # of batters drafted Average of OvPck – Zscore # Pitchers drafted Average of OvPck – Zscore Total Count of Name Total Average of OvPck – Zscore
Phillies 26 -0.81 24 -0.46 50 -0.64
Nationals 9 -0.70 6 -1.14 15 -0.88
Athletics 40 -0.99 30 -0.75 70 -0.89
Twins 34 -0.57 32 -1.31 66 -0.93
Diamondbacks 18 -0.84 26 -1.06 44 -0.97
Angels 18 -1.10 27 -0.88 45 -0.97
Rays 14 -0.50 20 -1.31 34 -0.97
Rangers 26 -1.06 28 -1.05 54 -1.06
Cardinals 28 -1.03 34 -1.25 62 -1.15
Giants 34 -1.23 28 -1.10 62 -1.17
Braves 32 -1.24 35 -1.12 67 -1.18
Royals 25 -1.40 32 -1.04 57 -1.20
White Sox 24 -0.65 40 -1.54 64 -1.20
Reds 28 -0.73 27 -1.70 55 -1.21
Blue Jays 32 -1.46 27 -0.91 59 -1.21
Red Sox 29 -1.33 35 -1.14 64 -1.23
Brewers 26 -0.87 27 -1.72 53 -1.30
Dodgers 21 -1.13 32 -1.44 53 -1.32
Rockies 18 -0.85 33 -1.60 51 -1.33
Pirates 27 -1.72 23 -0.88 50 -1.33
Mariners 25 -1.33 20 -1.45 45 -1.38
Mets 17 -1.14 35 -1.61 52 -1.45
Tigers 20 -0.81 32 -1.88 52 -1.46
Orioles 28 -1.05 28 -1.88 56 -1.46
Padres 40 -1.47 24 -1.54 64 -1.49
Marlins 30 -1.59 23 -1.41 53 -1.51
Astros 23 -1.45 26 -1.61 49 -1.53
Expos 26 -1.30 22 -1.85 48 -1.56
Yankees 24 -1.94 29 -1.37 53 -1.63
Cubs 24 -1.46 29 -1.95 53 -1.73
Indians 33 -2.13 29 -1.49 62 -1.83
Total 799 -1.19 863 -1.35 1662 -1.27

 

Perhaps surprisingly, the Phillies come at the top of the list. The Phillies advantage came in three picks: First, Chase Utley was drafted in 2000 with the high 15th pick and has had a great career that is up to 63.4 WAR. Second, in 1993, the Phillies chose Scott Rolen (70 career WAR) with the 46th overall pick – which seems like a bargain now. Finally, Randy Wolf in 1997 was selected in the 54th position and went on to have a 23.1 career WAR. The Nationals have had very much success on their first few years as a franchise with both Jordan Zimmermann and Ryan Zimmerman. The sample size does not include Bryce Harper or Stephen Strasburg, which may push the Nats to the top of the list in the near future.

Astros, Expos, Yankees, Cubs and Indians are the bottom five teams. Coincidentally or not, these teams have long droughts (Yankees exempted). Interesting to see if there is a relationship between draft performance and wins but I guess that’s is another post.

We could go and dig deeper for each team into what they’ve done well and not so much but that would not make sense. Teams make mistakes and it looks like the draft selection is pretty damn hard with an extremely high WAR standard deviation (11.6 WAR through the first 30 picks).

 

Question 4 – What is the best round (top 10 overall picks)?

This question is about finding the best selection on each of the first 10 picks. I’ve used the Z-score which pick was really ahead of the curve.

OvPck Year Tm Player Pos WAR Average WAR of pick OvPck – Zscore
1 1993 Mariners Alex Rodriguez SS 118.8  22.73 3.16
2 1997 Phillies J.D. Drew OF 44.9  16.23 1.88
3 2006 Rays Evan Longoria 3B 43.3  9.00 2.46
4 2005 Nationals Ryan Zimmerman 3B 34.8  6.21 2.67
5 2001 Rangers Mark Teixeira 3B 52.2  14.26 2.02
6 2002 Royals Zack Greinke SP 52.3  4.76 3.63
7 2006 Dodgers Clayton Kershaw SP 52.1  11.86 2.42
8 1995 Rockies Todd Helton 1B 61.2  6.41 3.56
9 1999 Athletics Barry Zito SP 32.6  8.70 2.24
10 1996 Athletics Eric Chavez 3B 37.4  11.31 2.04

 

Well, this is quite a nice group of players. A-Rod is the WAR leader of our sample. Even as a first pick, which on average has yielded the highest WAR, he manages to be three standards deviations above the mean. Five other players are active and two of them (Greinke and Kershaw) still are among the best starting pitchers in the game. They will continue to cement their position as great draft picks for the Royals and Dodgers. Interestingly enough, Barry Zito and Eric Chavez were part of the A’s Moneyball team that frequently over-performed a few years ago — a reminder of how important it is to build a strong core of players.

As a bonus question – these are the top 10 picks, according to this methodology:

Year OvPck Tm Player Pos WAR Drafted Out of OvPck – Zscore
2002 44 Reds Joey Votto C 42.7 Richview Collegiate Institute (Toronto ON) 3.74
2007 34 Reds Todd Frazier 3B 16.8 Rutgers the State University of New Jersey (New Brunswick NJ) 3.71
1997 70 Rockies Aaron Cook RHP 15.9 Hamilton HS (Hamilton OH) 3.71
1995 69 Pirates Bronson Arroyo RHP 26.5 Hernando HS (Brooksville FL) 3.67
1995 53 Indians Sean Casey 1B 16.3 University of Richmond (Richmond VA) 3.67
2007 27 Tigers Rick Porcello RHP 12.2 Seton Hall Preparatory School (West Orange NJ) 3.63
2002 6 Royals Zack Greinke RHP 52.3 Apopka HS (Apopka FL) 3.63
1996 18 Rangers R.A. Dickey RHP 21.1 University of Tennessee (Knoxville TN) 3.61
1997 91 Royals Jeremy Affeldt LHP 10.5 Northwest Christian HS (Spokane WA) 3.61
1995 31 Angels Jarrod Washburn LHP 28.5 University of Wisconsin at Oshkosh (Oshkosh WI) 3.60
1998 33 Expos Brad Wilkerson OF 11 University of Florida (Gainesville FL) 3.60
1995 49 Royals Carlos Beltran OF 68.8 Fernando Callejo HS (Manati PR) 3.59

 

As always, feel free to share your thoughts and comments in the section below or through our twitter account @imperfectgameb.

Note: This analysis is also featured in our emerging blog www.theimperfectgame.com