Archive for Research

What Went Wrong With Chihiro Kaneko

In the 2014 offseason, many free agents changed teams, some even changed leagues. Hiroki Kuroda went back to Japan to pitch for his hometown team, the Hiroshima Toyo Carp, while the Yankees got an upgrade (when healthy) in Masahiro Tanaka on a seven-year, $155-million deal (with a $20-million posting fee that they spent to talk to him), which he can opt out of after this season.

There was a second pitcher who was almost as good as Tanaka, who had worse stuff but excellent command. He also had some injury concerns after his 2011 injury where he missed a few starts, and in 2012 where only pitched nine starts, albeit with 63 1/3 IP in those starts though. Heading into the 2014 offseason, he had two excellent seasons, with ERAs of around 2 in 2013 and 2014, pitching 223 1/3 IP, with 200 strikeouts and 58 walks allowed, then 191 IP with 199 K and only 42 BB respectively in those seasons. He had a 1.98 ERA in those 191 innings in 2014, and a 2.01 ERA in 2013, generating interest from big-league teams and making an appearance in Bradley Woodrum’s article as a pitcher of note that might come over. He ultimately re-signed with the Orix Buffaloes on a four-year deal.

The injury bug bit him again in 2015 as he pitched in 16 starts, throwing 93 IP, and he had a lower strikeout rate than he had in 2013 and 2014 (7.6 K/9) with an ERA of 3.19. He pitched in 2016 and had a mostly healthy season, save for a declining strikeout rate (6.9 K/9) and an increased walk rate (3.3 BB/9), with an ERA of 3.83 in 162 IP. This year his strikeouts (5.7 K/9) and walks (3.0 BB/9) have stayed bad, with a slightly better 3.57 ERA in 116 IP.

What has caused this drastic downturn in performance? It seems that some of his downturn is because he’s getting older, but that doesn’t explain his increased walk rate or his severe decrease in strikeouts. Most of this is likely due to injuries he sustained in the 2015 season. And given that he hasn’t gotten better, it seems as if he’s been pitching despite an injury which has been sapping his effectiveness. He went from being as good as Alex Cobb was in 2014 (considering the thought of the average active hitter in Japan being slightly better than AAA quality) to performing like Ervin Santana this year.

He was a great pitcher with some downside, like Jered Weaver was, but Kaneko hasn’t declined that far yet. Weaver is too bad to even be on an MLB team until he gets medical help to fix his hip and/or shoulder. Weaver is one of the other pitchers who had declined that quickly. So far, he hasn’t rebounded and has continued to get worse, worse than he was last year when he was the second-worst pitcher qualified for the ERA title. It appears that Weaver is virtually unfixable. I think that Kaneko’s issues can be fixed, though, and if they are fixed, he could be an interesting buy-low opportunity.

After the 2014 season, if I were Dayton Moore (armchair GM ideas away), I would’ve signed him to a three-year, $30-million deal with lots of incentives, which could’ve raised the value to $51 million if all were reached. And I think he would’ve done quite well; we might not have this article at all. I must digress, as what-ifs are all around us. (Look at Yordano Ventura, who died far too young with so much untapped potential left.)

He looks like a potential project for the Pirates if he can show signs of improvement in his performance and peripheral stats. The Pirates and Ray Searage could definitely turn Kaneko into something of value, like they did with A.J. Burnett, Edinson Volquez, JA Happ, Ivan Nova, Juan Nicasio, Joel Hanrahan, Mark Melancon, Tony Watson and more. There’s a good amount of upside in trying for this — some prospects that can help the team in the future.

Here is a link to his player page so you can see it for yourself and make your own conclusions about him, and what he can do to remedy himself.

I don’t own any stats used; all stats are from either FanGraphs or the NPB website linked above.


Follow-Up: Which Player Would You Rather Have For the Rest of the Season?

Last week I offered a poll in the Community Blog. The poll compared three anonymous players — Frank, Tom, and Dan, asking: which player would you rather have for the rest of the season?

The descriptions of each player provided a brief background of their performance in the first half of this season, some non-relevant details of how they have been described by others, and their history of performance, to the extent that there was any. Additionally, the poll provided the major-league averages of certain offensive statistics for the first half of this season. These stats were comparable to the stats given about the individual players.

The poll was not meant to take defense into account and the descriptions were quiet on any defensive characteristics of the players, including the position they played. There was also no indication that one player was more susceptible to injury than another. Therefore, the poll selection should have been focused solely on the player’s offensive potential for the second half of this season.

I came into the poll thinking that Dan is the player I would prefer to have for the rest of the season. I started leaning towards Tom as responses to the poll came in. I never considered Frank a viable option.

After doing some research, I think all three players are viable options. However, I think Tom stands above the rest and resembles the closest thing to an objective choice when faced with a decision to take only one of these players for the rest of the season. Before explaining why, the results of the poll can be found here. Here is a summary of the 62 responses:

Question 1: Which Player Would You Rather Have For The Rest of This Season?

Dan: 37.1% (23)

Frank: 32.3% (20)

Tom: 30.6% (19)

Question 2: What Best Describes You?

I am a professional. I get paid to assess baseball players for a team, media, or other company: 3% (2)

I am extremely knowledgeable in sabermetric analytics, but not a professional: 22% (13)

I am knowledgeable in sabermetric analytics: 53% (31)

I am familiar with sabermetric concepts: 22% (13)

No Response: (3)

The Analysis of Dan

There are likely three scenarios you have in mind if you would choose Dan for the rest of the season. They all revolve around the idea that he will likely perform at a level that he has over the course of his career or above that level, bringing his total season number closer to his career average.

Below are the results of the three likely scenarios you could play out in your mind when you choose Dan.

The “Good” result is Dan performing at career averages.

The “Better” result is Dan performing 50% better or worse than his under-/over-performance in the first half of the season, on top of his career averages. For example, Dan’s BABIP of .234 was .067 points lower than his career average. Therefore, his BABIP in this scenario is .0335 better than his career average of .301, bringing it to .334 in this scenario. Conversely, his BB% was 1.6% better in the first half, so in this projection it would be .08% worse than his career average, or 6.2%.

The “Best” result is Dan performing 100% better or worse than his under-/over-performance in the first half of the season, on top of his career average. For example, his .234 BABIP, .067 point lower than his career average, is reversed completely in this projection, where his BABIP is .368. His 1.6% improvement on his career BB% is reversed completely, and his BB% is projected to be 5.4%. 

PA BABIP K BB HR BIP 1B 2B 3B wOBA
Good 360 0.301 61 25 14 259 58 18 2 0.336
Better 360 0.334 56 22 13 269 67 21 2 0.358
Best 360 0.368 50 19 12 278 76 24 2 0.380

The Analysis of Tom

The analysis for Tom isn’t quite as complicated. That may be why you chose Tom.

Tom’s numbers are very close to his career averages. The three likely scenarios you have for Tom were probably one where he hits at his career averages, one where he hits as he did in the first half, or one where he performs as Dan did in the “best” case scenario, described above.

This is what those three scenarios look like:

PA BABIP K BB HR BIP 1B 2B 3B wOBA
Same 352 0.299 84 37 23 208 46 15 1 0.373
Career Average 352 0.320 99 39 26 188 44 14 1 0.383
Best 352 0.341 113 39 29 172 44 14 1 0.399

The Analysis of Frank

The analysis of Frank is the most difficult because we have very little information about what we should expect from him. You should be confident that, despite his first half, he will not go on to have one of the luckiest and best baseball seasons in history, only because those seasons are extremely rare.

The prospect of someone having something good happen over 50% of the time his bat touches the ball is untenable. So is Frank’s .427 BABIP, which you could have backed into or just ballparked by the numbers given. In light of the league averages, and our general knowledge of baseball, we know that these results are on the extreme of a spectrum and are a product of a great talent coupled with a large amount of luck.

So, these numbers tell us Frank is talented and that he has been really lucky, but we have no context of historical performance to place that talent and luck in. Therefore, I thought the following three scenarios would be most appropriate for Frank.

The “League Average” scenario, where Frank’s performance reverts to league average for the rest of the season. These numbers coupled with his first-half numbers still result in an impressive rookie season.

The “Towards Average” scenario, where Frank’s  performance comes back toward, but not all the way to the league average. In this scenario I have brought all his numbers back half-way. Therefore, his 30% strikeout rate, 8.6% above league average, is scaled back to 25.7%, which is 4.3% lower than it was during the first half of the season.

The “Best” case scenario, where Frank’s performance from the first half of the season continues.

PA BABIP K BB HR BIP 1B 2B 3B wOBA
League Average 352 0.301 76 30 12 234 51 16 1 0.314
Towards Average 352 0.334 91 45 21 195 49 15 1 0.377
Best 352 0.427 104 59 29 160 51 16 1 0.468

Which Player Would I Rather Have For the Rest of The Season?

I’d imagine everyone knew Frank was Aaron Judge. The other two may have been more mysterious, but Tom is Giancarlo Stanton and Dan is Manny Machado.

The one scenario that I didn’t account for in my analysis is things going very poorly for any of these players in the second half. That is a real possibility, but it’s unlikely things will get much worse than what I projected for these players (I’ll discuss that a little more for each player below).

I thought Machado would be the best answer when I created the poll. A lot of that was based on bias, not the information given. Machado’s most recent seasons have been much better than his career averages suggest. That probably shaded my thoughts about how he would perform for the rest of this season. In reality, the career numbers look right, particularly in light of the struggles Machado faced in the first half of the season, which is factored into those career numbers.

I mentioned the lack of exploration of a “worst” case scenario above. In my opinion, the projection for Machado is most vulnerable to this omission. I don’t think the vulnerability is that large, though. Machado’s .234 BABIP is on the opposite, yet nearly as extreme, end of the spectrum as Aaron Judge’s .427 BABIP. While it’s possible that the bad luck continues, it’s probable it does not. The BABIP number from the first half says a lot more about luck, not Machado’s talent level.

Machado’s main issue, in a comparison with these players, is that his best-case scenario is needed to get him in the conversation. The mean wOBA of his three scenarios is .358, which is very good, but it’s not on the level of the others. His wOBA in the best scenario is .380. It is a level where the risk is not worth the reward (in the context of this poll).

In actuality, Machado has another asset: he is a very good third baseman, but for purposes of this poll that is irrelevant. Based on this, Manny Machado is not the player I would want for the rest of the season.

I’m an Aaron Judge skeptic. I think he’s likely to remain an All-Star player, but I don’t think he is one of the best players ever.  The average wOBA of his three scenarios is .386, with a high of .468 in the “best” scenario, replicating his first-half performance. The potential of such high performance tempers the risk of Judge’s floor of a .314 wOBA laid out in the “League Average” scenario.

There are a lot of scenarios that I’m leaving out here. I have brought all of Judge’s numbers down to league average, or half-way to league average. That predicts regression in areas such as BABIP and power, but it also attributes a fake ability to not swing and miss to Judge.  However, even if we said that the “League Average” scenario has a 20% chance of happening, the “Towards Average” scenario has a 70% chance of happening, and the “Best” has a 10% chance of happening, Judge’s average wOBA would be .374. This does not necessarily eliminate the issue of attributing “fake” qualities to Judge, but those “fake” qualities run both ways, as the “League Average” scenario severely underestimates his ability to hit home runs and draw walks. Either way, I hesitantly will take Aaron Judge over Manny Machado for the rest of the year.

That leaves Stanton. Why is he the best bet? Because he is not much of a gamble at all. Stanton is performing very close to his career averages, if not a shade under many of them. His projected scenarios reflect this. Stanton is close enough to his career averages that it’s not unreasonable to believe he can perform above those averages in the second half of this season and create a season meeting his career averages. It’s certainly not an unreasonable thought that he will close out the year performing in line with his career averages, nor is it unreasonable to think that his first half represents a new, slightly lower level of baseline performance for Stanton. All of this adds up to very little uncertainty. The average wOBA of the three scenarios is .385. If you had to take one of these player for this second half of the season you would take Stanton. He’s much of the upside and none of the downside. You know what’s coming and it’s going to be very good to great.

Notes:

  • These projections aren’t very scientific or complex. They are based on three scenarios that come to mind and then a basic application of standard baseball stats.
  • I used wOBA to measure the players projected success in the scenarios laid out. This version of wOBA does not account for the value of  a stolen base, caught stealing, hit by pitch, or sacrifice fly. I used the 2017 weights from FanGraphs’ GUTS to calculate wOBA. I used the weights that were available around July 21st.
  • I projected how many hits were singles, doubles, and triples by determining the percentage of non-home-run hits that were singles, doubles, and triples, respectively, between 2012-2016 and applying that percentage to each player’s overall hits (which is calculated using BABIP).
  • I projected home runs using HR/PA.

Thank you to everyone that voted in the poll!


Giancarlo Stanton Is on Fire

The millionaire slugger from Miami has had some misfortune in the past few seasons of his promising career, from common injuries to freak accidents. However, while these unfortunate happenings ended his season early each year, they are no testament to the achievements he accumulated during that time and the possibilities shown by his ability.

With a somewhat slow start to this 2017 season, the G-train seems to be picking up speed. In the month of July, he is hitting and fielding better than all three previous months of the season. For reference, I will put up his stat line that I am basing this interpretation off and add some more graphs later for easier visual interpretation.

Month G PA HR K% ISO BABIP AVG wOBA Def HR/FB Hard%
April 23 100 7 27.0% 0.264 0.296 0.264 0.366 -1 28.0% 39.3%
May 27 115 7 20.0% 0.28 0.325 0.299 0.382 -1.1 23.3% 33.3%
June 27 112 7 25.9% 0.274 0.271 0.242 0.365 -1.1 29.2% 34.9%
July 15 67 9 20.9% 0.526 0.235 0.298 0.487 -0.7 45.0% 44.2%

The first thing that grabs my attention is his home-run numbers in the month of July. Compare them to the home-run totals from each previous month and it does not seem like much of a difference, but when you take into consideration the plate appearances the difference is more discernible. On average it took Stanton 109 at-bats throughout the first three months of the season to reach the seven-home-run mark. In July however, at only 67 at-bats, he has already passed his previous monthly home run total by two home runs. At that rate, by the time he reaches that 109th plate appearance he could have 14 home runs in total for the month of July. That is double the home-run production that he has given in any other previous month this season (after writing this he just put up another two home runs in one game!).

To speak more on his power, take a look at his ISO, which is a metric that basically measures just that, his power.

View post on imgur.com

As you can see, it has shot up tremendously in the month of July, far higher than any previous month (in which they were still very high. The .270 mark is still far above league average). And while it is almost certain that he will not be able to maintain an above .500 ISO for the rest of the season, it is still a remarkable achievement to obtain throughout the duration of a whole month, as July is almost over. Another stat to look at is his weighted On Base Average (wOBA). League-average wOBA consistently sits around .315 – .320 season to season, and Giancarlo’s is currently at .487 (at the time of writing this article). The explanation for that can be summed up in two words. He’s mashing.

View post on imgur.com

He is hitting the ball harder, higher and farther. More consistently too, and the proof is all in the numbers. His home-run-to-fly-ball ratio is up, along with the percentage of balls he hits hard.

View post on imgur.com

And while I, like most other people in the world, would attribute this surge of excellence to a lucky hot streak, this might not be the case. In fact, he might not be getting lucky at all. Batting Average on Balls in Play, or BABIP, is a statistic that is useful for getting a sense of how “lucky” or “unlucky” a position player has been in terms of offense. League-average BABIP usually sits around, again, .300. Anything far above or below that number could point to a batter being “lucky” or “unlucky,” respectively. Stanton’s BABIP is .235, far below the league average and even further below his career average (.318). This means that when he puts the ball in play, excluding home runs, he only gets on base roughly two out of ten times. Sounds pretty unlucky to me, especially for a player of his known caliber, which would explain his lackluster batting average that sits at .298. When his BABIP starts regressing back to the normal .300 area, who knows just how good he could be playing.

View post on imgur.com

I also thought it would be a good idea to take a look at some of the charts and heat maps that FanGraphs offers to see if I can gather some more information, and what I found was pretty interesting. I have seen a lot of Stanton’s at-bats, and through visual memory, I can recall that most of the bad ones end with him striking out on a breaking ball low and away. After taking a look at the heat maps for the percentage of pitches he gets in specific locations of the zone, my memory served me pretty well.

View post on imgur.com

As you can see, during the first three months of the season Stanton got a lot of pitches low and away. Pitchers would pitch him there because, well, that is a hard place to hit, and for the most part it worked for them. But during the month of July, they are no longer going after that weak spot. More of the pitches that Stanton has seen this month have been concentrated in the middle and upper part of the zone, the part of the zone that he thrives in. This serves to further explain his monstrous July!

The future for the Marlins slugger is beyond bright, and as a Marlins fan, I cannot wait to sit by and watch.

 

*Side note*

This is my second post in the FanGraphs community! And while I am very excited, I at the same time want to be sure to improve with each and every post and write about things that people want to hear. If you, the readers, do not have anything to say about the content of the articles but do have some constructive criticisms please feel free to leave a comment! Have a good one!


Do Sluggers Really Swing More on 3-0?

According to Baseball Prospectus’s set of Run Expectations matrices, when Evan Gattis stepped up to the plate in the fifth inning of Houston’s June 27 game against Oakland, the Astros were expected to score an average of 2.26 runs (taking the three-year mean from 2014-2016). The bases were loaded for Houston, which was down 1-0, and in over four innings of work thus far, opposing pitcher Sean Manaea had already walked a trio of batters, including the previous hitter. Against Gattis, Manaea had gotten himself into even more trouble, with the first three pitches missing outside. On the fourth pitch, the Astros’ mighty slugger, evidently with the green light to swing, did exactly that, with the following result:

gattis_gif

By the time Houston’s next batter, Brian McCann, stepped to the plate with two outs and a runner on third, the Astros were only projected to add an average of 0.352 runs to the one that had already crossed home plate. As it turns out, McCann grounded out to shortstop, and the ‘Stros ended up scoring only one total run from a bases loaded, no-out situation. As calculated by Baseball Reference’s wWPA metric, Gattis’s run-scoring double play actually decreased the team’s chances of winning by ten whole percentage points, and they’d eventually drop the game to Oakland, 6-4.

To an innocent MLB.TV subscriber who happened to see the preceding events play out, it seemed an odd scenario for Gattis to get a 3-0 green light. After all, Manaea had been relatively wild up to that point in the game, and he’d even walked the previous batter, Carlos Correa. It got me thinking about league-wide trends on 3-0 swings, and thanks to Baseball Savant, there’s a wealth of 3-0 count-related data to pore through.

First, there are some interesting trends involving the overall league frequency of 3-0 pitches and swings. Using R’s ggplot data visualization package, I graphed the frequency of pitches in a 3-0 count, relative to pitches in other counts, as well as the swing rates on those pitches:

totalandswingpct

Batters are swinging more and more at 3-0 pitches, even though those pitches are steadily becoming less common. In a league that’s been increasingly prioritizing power, it’s possible that batters are responding accordingly to the fact that they know, with a high degree of accuracy, what pitch they’re about to see. Of the 4,721 3-0 pitches through the 2017 All-Star break, nearly 87% of them were categorized as some type of fastball, as categorized by Baseball Savant.

But which batters are most frequently given the go-ahead to swing away in a 3-0 count? Common perception is that the more powerful the batter, the more likely he is to be given a green light from his manager; this would certainly fit the Gattis anecdote above, and a 2014 Beyond the Box Score article noted that “the guys who swing 3-0 are sluggers,” citing Albert Pujols and Ryan Howard’s high swing numbers as evidence.

I graphed the number of each batter’s 2016 3-0 swings against his 2015 SLG, limiting the data set to batters who saw at least ten 3-0 pitches to avoid outliers (among these outliers: pitchers Jose Fernandez, Jake Arrieta, and Madison Bumgarner, each of whom swung at at least one 3-0 pitch). If powerful hitters really do swing more often on a 3-0 count, we’d expect to see a positive relationship in the data. Of course, this analysis does come with the caveat that batters, once given the freedom to swing, still can choose not to, and pitchers are likely less inclined to groove a 3-0 fastball to hitters they know are more likely to punish them for doing so.

As it turns out, the five batters with the highest number of swings were all notable sluggers — Joey Votto (17), David Ortiz (14), Mike Napoli (14), Giancarlo Stanton (14), and Josh Donaldson (12). Take a look at the below graph:

batter_SLG_2015_swings

Evidently, there is some sort of relationship between the number of 3-0 swings a batter takes, and that batter’s power. It might, however, make more sense to look at the rate of 3-0 pitches a batter swings at, rather than the absolute amount. After all, one might expect a batter with a high slugging percentage to have (a) more at-bats that reach 3-0, as the pitcher would be more likely to pitch around him; and (b) more at-bats total, as his high slugging percentage would warrant more frequent appearances in the starting lineup.

As illustrated below, I charted each batter’s 2016 3-0 swing rate against his 2015 SLG:

batter_SLG_2015

Interestingly, there doesn’t seem to be much of a correlation between a batter’s prior-year slugging percentage and his current-year 3-0 swing rate, although we should acknowledge the small sample sizes of the pitches driving each individual batter’s swing rate. (For what it’s worth, I performed the same analysis using ISO, rather than SLG, and limiting the data to batters who saw at least twenty 3-0 pitches, rather than ten, and got very similar results.) A list of the top batters by swing rate, again including only those batters facing at least ten 3-0 pitches, doesn’t exactly comprise an All-Star team, either — while Stanton and Pujols are numbers four and five, respectively, the swing leaders also include Rickie Weeks Jr. (1), Wilson Ramos (2), and Hernan Perez (7).

There’s also not much reason to believe that the batters who do swing most often at 3-0 pitches tend to make any better contact than those who don’t. The following chart compares batters’ 3-0 swing rates with their 3-0 swings’ expected wOBA, and, as the R^2 indicates, their relationship is nonexistent:

results_swingVSxwOBA

Finally, let’s observe the relationship, if there is one, between 3-0 swing rates and player age. Sam Miller, now an ESPN writer, penned an excellent 2014 article for Baseball Prospectus in which he listed a few managers’ respective 3-0 strategies. Ned Yost, for example, did only grant the green light to the most powerful members of his lineup — but only with one out, and only in certain game situations. On the other hand, Davey Johnson, then in charge of Washington, was far more liberal. My hunch, though, is that managers are generally most inclined to let their veteran players swing away. What follows is a plot of 2016 3-0 swing rates against player age:

swingVSage2

As it turns out, there’s not much of a reason to suspect a relationship here, either. But again, each analysis comes with the caveat that batters don’t have to swing when given a 3-0 green light, and some batters may not even need an explicit green light signal to know that they’re allowed to swing.

Even so, we can conclude the following: although 3-0 counts are occurring less often, relative to other counts, batters are swinging at a steadily increasing rate — perhaps to take advantage of the grooved fastball they’re virtually guaranteed to see. Pitchers, therefore, shouldn’t necessarily take for granted that the hitter won’t swing at their 3-0 pitch, and shouldn’t necessarily expect younger and/or less powerful hitters to refrain from swinging. And while batters’ wOBA on 3-0 is significantly higher than on any other pitch, in a stark contrast to common perception, it surprisingly doesn’t appear that powerful batters make any better contact than weaker hitters. I may eventually replicate this analysis with a focus on different game scenarios — for example, whether sluggers behave differently in blowouts, or in high-leverage situations — but for now, I’ll definitely be paying extra attention next time I see a power hitter or veteran up with a 3-0 count.


Maikel Franco Is Adjusting

Baseball Prospectus, in their 2015 scouting report of Maikel Franco, had this to say:

“Extremely aggressive approach; will guess, leading to misses or weak contact against soft stuff; gets out in front of ball often—creates hole with breaking stuff away; despite excellent hand-eye and bat speed, hit tool may end up playing down due to approach…”

We saw early this year, and even last year, that exact prediction come to life. Franco seemed to be flailing about vs the soft stuff, beating too many pitches into the ground, and even popping too many up. He never really stopped hitting the ball hard, but we saw too many of those hit in non-ideal ways. For most of the first part of this year the slider gave him absolute fits, and Alex Stumpf wrote about that here. He’s striking out at a career-low rate (13% on the year), but he still isn’t really walking that much although it’s bounced up a percentage point from last year (7.3% in 2017).

Here’s a rundown of his career batted-ball profiles:

ballprofile

I was watching the Phillies game vs. the Marlins on the 18th, and Franco went 3-4 with the go-ahead HR off Dustin McGowan. His HR came on a slider middle-away — literally the exact pitch that’s done nothing but given him fits all year. I also noticed that his batting stance seemed to be different. More upright, quieter. I pulled up a highlight video of an at-bat from early May. Here’s a screencap of his stance just before the pitcher starts his delivery:

francold

That AB ended in an RBI line drive to right. Here’s a screencap of the HR in question from Tuesday, at a similar point in the pitcher’s delivery:

franconew

Now if that’s not a mechanical change, I don’t know what is. He’s closed off his stance, eliminated a lot of the knee bend, and seems to have raised his hands juuuuuust a touch. It could be the difference in the camera angle though. Phillies hitting coach Matt Stairs mentioned they’d been trying to get Franco to cut down on his leg kick, so let’s look at that too:
Old leg kick:

oldlegkick

New:

newlegkick

Shortly after contact, old:

pocold

and the recent HR, similar point:

newpoc

The “leg kick” seems to be more of a toe tap, and hasn’t changed. What did change, though, is the quality of his follow-through. His head is on ball, he’s better transferred his weight to his front foot, and the results follow. The old AB was a line-drive single opposite field, which looks less of an intentional opposite-field hit and more of a product of bad mechanics. Being so open, he really could only go to right field with authority. If he tried to pull it he’d roll over the pitch. That also would cause him to struggle with the breaking pitch away, which he’d bounce to second. Closing off has allowed him to better get the bat head into a more ideal position to cover the whole plate with authority. He’s always had the bat control to make contact everywhere, but it looks now like he’s improved his chances of making quality contact all over the zone. Here’s the same look at his batted-ball profile since the start of July:

bballnew

Here’s some assorted metrics, same time period:

kbbnew

vs. his career metrics:

metricscareer

He’s cut his grounders by over 10%, raised his liners by 3%, and turned the rest into fly balls (8%). He’s likely always going to have a pop-up issue, but his pull/center/oppo profile is back to where he was at in 15/16, and he’s hitting the ball hard at a higher rate than ever. Also, his strikeout rate is 6%(!!!!!!)!!!!! He’s making more contact than ever, and that contact is better than ever.

We’ve seen Franco get us hyped before, but never before has there been this type of major mechanical change to point to. Miguel Sano did something similar preseason by raising his hands and quieting his pre-swing load, and it’s paid dividends. Since I started this article, Franco went 2-4 with a single, double, and sac fly; and three of those batted-ball events were hit at 100+mph (the single and double; he was robbed by the 3B on a sharp liner as well).

Going back to his 2015 scouting report: Franco’s still aggressive, if not slowly becoming less aggressive the more he’s in the majors. By changing up his stance, however, he’s closed up the two major holes in his report: getting out in front of the breakers away, and bad contact on soft stuff. Keep an eye on this. One of the more frustrating hyped prospects seems to have made the transformation we all hoped he would, right in front of our eyes.


The Opportunity Baseball Organizations Are Missing: Part II

In Part I, I suggested the “How” and “Why” as to what organizations might be missing. In Part II, I will give you the “What” – the specific findings that underpin an unprecedented opportunity for an organization to capture a significant competitive advantage.

At the major league level, approximately 20-30% of players suffer from significant swing path issues. Since path issues are likely the single largest factor in player failure and underperformance, an organization with a systematic approach to “cure path” would be at significant advantage relative to the remaining teams. Not only would the club benefit directly through improved offensive production, but having an effective cure to path would provide superior insight into “true talent” as the largest remaining factor.  There are also logical extensions into how this could be further monetized in potential areas such as player arbitrage, draft selection, etc.

In summary, the comprehensive solution can be simply stated as follows:

1) The optimal swing paths that exist in the muscle memory of the best hitters can be quantified and visually represented to players performing below potential.

2) A systematic process can be built around the above for significantly improved performance.

Before getting into the key findings, I would like to talk about process using another parallel to investing. There is a popular view that all hitters are different and a “systematic process” cannot be utilized for fixing the ones performing below potential. Successful investors are also presented with a significant amount of uniqueness in their decision process. They have all developed a process that effectively deals with uniqueness, not avoid it all together. So if a process isn’t effective in getting hitters to potential, it’s not that you need to abandon process, you just need a different one. Swing path is a core mechanic that can be systematized and as you will see, there is plenty of room for customization within a systematic process.

The findings below are presented with an extremely high level of conviction based on several years of research including a patent filing in 2013. History will be the ultimate judge but based on communication with a handful of organizations, there is a strong possibility that many clubs will be unwilling to consider such non-traditional sources of value.

The Findings – Quantifying the Optimal Swing Path

Variables:
1) X Axis – Swing Loft
2) Y Axis – Bat Angle (vertical)
3) Z Axis – Swing Direction (Very small changes – Not considered here)
4) Timing (technically, horizontal Bat Angle but we’ll just call it timing for simplicity)
5) Ball Contact Point (Relative to ball equator – Not considered here in terms of loft)

Constraint – The Bat Angle for any given pitch height represents a straight line from the ball to the chest area such that the intersection between the body and the swing plane is a point in the chest area not the mid-section or waist.

Nothing major yet – just some visual representations of the Variables and the Constraint.

Although not a major finding, many are still of the opinion that the bat should be relatively level resulting in a much lower swing plane/body intersection as in the illustration above. The importance of Bat Angle will become more clear shortly. For now, I’ll use two extreme Infield Fly Ball Rates (IFFB%) as a general proxy for a path quality. Below, you will see the Bat Angle (and plane/body intersection) for one of the lowest IFFB rates (Joey Votto) and one of the highest (Kevin Kiermaier). Note the significant 10° difference between the Bat Angles for the same height pitch.

Looking more broadly, the average Bat Angle on low-middle pitches for the players with the five lowest IFFB rates (2015 and 2016) was 34° while the average Bat Angle on low middle pitches for the players with the five highest IFFB rates  was 26°. An example application is screening for players with high IFFB rates on low pitches. Since a relatively flat bat is required for an infield fly ball, this type of screen can highlight players with consistently insufficient Bat Angle.  However, this is only a small part of a more comprehensive approach to identifying players with poor paths as discussed below.

Timing is a Separate Loft Factor

To illustrate Timing Loft, consider the model below in which Swing Loft has been set to zero, thus 100% of the loft is a result of Timing Loft (Bat Angle set to theoretical maximum to illustrate the point).

Conversely, for a high pitch (Bat Angle set to theoretical minimum), 100% of the loft comes from Swing Loft while Timing determines the direction of the hit – pull, center, or oppo (below).

 

Loft Goals, Angle Mix, and Loft Contribution

Given the principles previously illustrated, optimal angle combinations can be constructed for each pitch location. Each location will have different loft contributions from Timing and Swing Loft based on the height of the pitch. So ball height determines Bat Angle which determines the mix of loft contribution. We will discuss customization shortly. For now, let’s just assume a launch angle goal of 15 degrees for a particular player.

 

High Pitch Low-Middle Pitch* Very Low Pitch
Bat Angle 20° 45° 65°
Swing Loft 11.7° 7.5° 4.2°
Timing Loft 3.3° 7.5° 10.8°
Total (Goal) Loft 15° 15° 15°

 

*Note – A middle to low pitch was used to illustrate a 50/50 mix of loft contribution which occurs at a 45° Bat Angle. A true middle pitch has approximately 30-35° of Bat Angle

 

Thus, the optimal Swing Loft for a low-middle-height pitch for a player with a 15 degree loft goal is only 7.5 degrees – the other 7.5 degrees comes from timing

 

AND Swing Loft is not the major loft factor for low pitches – which is a lot of them.

 

Extending these concepts, the optimal loft goal and loft contribution mix can and should be adjusted for different hitter types. Pull hitters will have a greater relative contribution from Timing Loft while opposite field hitters will have a greater relative contribution from Swing Loft. While adjusting the loft goal higher for more powerful hitters is an option, there are potential drawbacks that should be carefully considered and are discussed below.

 

Reconciling the Swing Up / Swing Down Views

It is interesting to consider the above in light of baseball ignoring Ted Williams for so many years. Usually, there is some rationale for significant movement in a particular direction. In hindsight, it is not too difficult to consider that the “swing down” movement came about (in part) because the bat path / ball path matching issue can’t be solved with a simple one-factor (i.e. Swing Loft) solution as Williams proposed (specifically, page 67 of The Science of Hitting where he shows front knee bend to hit low pitches). While Swing Loft should never be negative, one can certainly appreciate how minimal Swing Loft and relying on Timing Loft for low pitches might feel down to a player. A few takeaways:

1) I am convinced that the failure of Williams’ teachings to fully catch on as well as the highly variable success rates today in adding loft is because some players are able to intuitively arrive at the correct mix of loft contribution while others are forcing too much Swing Loft.

2) Since “timing adjustments” are clearly required to achieve high levels of loft using an optimal mix of angles, excessive loft goals are problematic for many hitters, particularly those who are not natural “pull” (early timing) hitters.

3) Given the significant number of consistent hitters in the 13-15 average LA range, this indicates a Swing Loft on average, of 6.5 to 7.5 degrees for a low-middle-height pitch – nowhere close to what many believe is required to become a card carrying member of the “Fly Ball Revolution.”

Visually Representing Optimal Swing Paths to Hitters

I am by no means an expert in neuroscience; however, it seems relatively straightforward that the more information hitters can transfer out of conscious thought into subconscious/muscle memory, the better they will perform. Hitters don’t want to (and shouldn’t) think about complex angle combinations. Consequently, a visual representation of the optimal combination of angles tailor made for each considering their power (i.e. to determine goal loft) and pull vs oppo tendencies can quickly correct a consistently poor swing path. Yes, “Keep it Simple” but solve the complexity first.

Below is the device that allows a hitter to train for optimal angle mixes through seeing and feeling the optimal paths for different pitch locations. The ball joint allows flexibility for any mix of angles while the angle guide provides compound angle settings based on a hitter’s customized loft goal and pull/oppo preference.

I will keep the product plug to a minimum. Additional information may be found here.

 

Finding Great Paths in the Data

Statcast data confirms that the best (most consistent) hitters hit the ball significantly flatter (in terms of bat/ball contact, not launch angle) than average. You can read more about the details of this here. I refer to estimated spin impact as Mean Unexpected Distance (MUD).

In addition to hitting the ball flat, one of the best indicators of a great swing path is low variability (Standard Deviation) of a player’s launch angles. After witnessing significant reduction in launch-angle variability through focused training, I had a significantly high level of conviction that this was a key indicator of “quality of path” several years ago. The availability of Statcast data through Baseball Savant changed everything. The data not only confirmed the benefits of flat contact previously discussed but also proved that data combined with video analysis can assess  “quality of path” with a very high degree of accuracy. Considering no other data than (low) MUD scores and (low) Standard Deviation,  the following hitters are returned (based on 2015 and 2016 data):

Player Avg MUD Avg Std Dev
Chris Davis -16.0 21.5
Freddie Freeman -14.3 20.3
Joe Mauer -13.2 21.4
Joey Votto -12.8 20.1
Brandon Belt -9.9 19.6
Miguel Cabrera -9.9 20.4
Paul Goldschmidt -8.7 21.5
J.D. Martinez -8.6 21.6
Nick Castellanos -7.8 19.4
Adrian Gonzalez -6.9 21.0
Matt Carpenter -6.6 20.0
Yan Gomes -4.5 22.0
Christian Yelich -3.9 21.1
Mike Trout -3.0 20.6
Logan Forsythe -2.7 21.0
Howie Kendrick -2.4 21.8
Daniel Murphy -2.0 21.9
Justin Turner -1.2 21.4

The average wRC+ and BABIP for the hitters above are 129 and .330, respectively. Given the return of Cabrera, Votto, Mauer, Trout, Freeman, J.D. Martinez, and several others considering no other performance factors, the benefits of low Standard Deviation of LA and flat contact seem relatively clear.

 

Putting It All Together – A Better Training Approach

I believe it would be a mistake to ignore the iterative training process that the best hitters have utilized up to this point. Hit a lot of balls, keep what works, discard what doesn’t and repeat the process over a very long period of time. In many cases, the only thing differentiating good from bad paths is that a player’s “filter” allowed something to get through that should have been discarded.

The drawback of the iterative approach is that it takes a very long time. If a player gets off track, it can take a frustratingly long time to go through the process to fix something that may have a very simple solution. Combining what we know from the discussion above, the variability of launch angles can be used in training sessions to quickly determine if a path is moving in the right direction or not before a player fully commits to a contemplated change.

In other words, path issues can be effectively addressed from opposite directions – static/device training  to improve path and dynamic (pitched balls)  training focused on reducing LA volatility. In training sessions with BP-type pitching, players with good paths are able to get down to a 13-14 degree standard deviation.

Implications From The Findings

Part I  pointed out the first step in the research process (as I have known it) is “Identify the Key Drivers”. I believe it is fairly safe to say that most, if not all organizations, believe that path is a key factor. This leads to the logical question of – Why didn’t MLB organizations “go deep” on one of the largest factors of player performance?  I believe there are two primary reasons:

1)  They assumed there was nothing of significance to be found – they concluded before they considered.

2)  It was outside the scope of responsibilities for employees on both the data/analytics side and the player development side of the organization.

Looking more broadly, it would be my guess that most organizations would say they do not believe significant “Moneyball-size opportunities” exist. It is this type of thinking that suggests that they likely do. I’m currently looking into another key driver of performance that could possibly be addressed through a similar systematic approach. Until baseball organizations change their thinking (and possibly their organizational structure), it’s likely that these opportunities will continue to exist. The source is the same – the “Gap in the Middle” that was outlined in Part I.

Clearly, all of the findings presented here are “known” in the muscle memory of the best hitters with great paths.  To this point, however, this muscle memory knowledge had not been understood or quantified in a way that could be systematically transferred to other players. By separating the loft factors, quantifying optimal paths for each location, and presenting a simplified visual representation of the optimal combination of angles, hitters can correct path issues with a very high rate of success.

Going forward, it will be interesting to see how organizations change in regard to considering non-traditional sources of value such as the “Gap in the Middle” previously discussed. At least one, the Houston Astros, announced in March (two days after Part I but likely just a coincidence) that they were moving their lead analyst, Sig Mejdal, to get “on-field experience.” This move, combined with his title of “Process Improvement,” indicates that they might be ahead of the pack in terms of considering new ideas. If you are aware of other clubs moving in this direction, please indicate in the comments. Based on my communication with a few organizations, I believe several are going to be late to the party as they appear unwilling to challenge existing assumptions.

Given the possibility of more “meat on the bone” for the findings above, I will likely take another short break before publishing the next article. However, for those interested in considering opportunities where data and mechanics intersect such as what has been presented above, there is considerably more material for your future consumption.


The Bad Aaron Judge Comps

Aaron Judge is good.  Some might say he is great.  The front-runner for AL Rookie of the Year and MVP is the face of MLB for 2017, but the face of MLB for the future?  Unfortunately, maybe not.

It’s hard to find something negative to say about the New York Yankees right fielder, but in order to play devil’s advocate and not get our hopes up too high about Aaron Judge, just in the event that he has a down season, I was able to find some rather unflattering comps for the slugger.

First, there’s his minor-league career.  Aaron Judge was a pretty good prospect ranking first in the Yankees’ system in 2015 and 17th in baseball according to MLB Pipeline.  However, just because a prospect is ranked highly does not mean they are without flaws.  Judge would strike out in at least 21 percent of his plate appearances in all levels in the minor leagues.  This article from 2016 even identified Judge’s proficiency to strikeout:  

Judge’s Triple-A debut at the end of 2015 did not go well. He slashed .224/.308/.373, well below both his career levels and expectations. More alarming, he struck out a career high 28.5-percent of the time (74 times in 260 plate appearances). [The 2016 season] has been more of the same. His batting average is a bit deceiving sitting at .284 (heading into this weekend), considering he currently has a nice .354 BABIP compared to last seasons .289. His plate discipline is troubling.

Perhaps the lofty expectations of Judge have him pressing. You simply can’t overlook the fact that his strikeout rate is nearly identical to the small sample size of last season’s Triple-A numbers (27.2-percent). It has to be at least a slight bit worrisome that this is a trend and not a slump. His walk right is dropping daily to a new career low (6.8-percent or eight walks in 103 plate appearances).

The article seems to point to his plate discipline as his main flaw — as other evaluators have — but is overall positive with his prospect status.  But his strikeout tendency should not be overlooked.  He has failed to improve on that statistic in his short major-league career, where he has struck out in 32 percent of his plate appearances between his call-up in 2016 and now.  However, because he also takes his walks, his walk percentage is rather high, which puts him in exclusive company.

Since 2000, there have only been four players with at least 300 plate appearances who have struck out in over 29 percent of their plate appearances and walked in at least 16 percent of them: Jack Cust (2007, 2008, 2010, 2011), Ryan Howard (2007), Adam Dunn (2012), and Aaron Judge (2017).  All of these seasons resulted in wRC+ well above 100, which means that they were productive players; however, these player were known to be the embodiment of the “three-true-outcome” hitters.  Dunn had five consecutive seasons of 40 or more home runs, but also led the league in strikeouts four times; Cust led the league in walks once and strikeouts three times; and Howard led the league in home runs twice and strikeouts twice.  Admittedly, these comps are not encouraging.  Although these players were not horrible in the simplest definition, their careers were short-lived and their production sharply declined.  For Cust and Dunn, it forced an early retirement, and Howard a well-publicized and sad end to an illustrious career.

But it’s not just Aaron Judge’s strikeout and walk percentage — it’s also his raw strikeout numbers.  Judge is on pace to strike out over 200 times this season.  While it’s already been established that he is strikeout-prone, it does not serve him justice that the 200-strikeout threshold is upon him.  No player who has struck out 200 or more times in a season has had a very high average.  As the legendary Pete Rose noted, the highest single-season average for a player with 200 or more strikeouts was .262 (Chris Davis holds that honor).  The short list of 200 single-season strikeout players is a whopping five players long: Mark Reynolds, Adam Dunn, Chris Davis, Chris Carter, and Drew Stubbs.  Kris Bryant had 199 in his rookie season (he was called up late to the bigs due to service-time considerations, so it’s likely that he would have joined this club), and Ryan Howard had 199 twice and Jack Cust had 197 once.  Dunn, Howard, and Cust again…

I love Aaron Judge, and I love 500-plus foot home runs, but we also have to be realistic and rational in our love and praise for the slugger.  The worst thing that the New York sports world can do is rattle this kid if, and when, he goes from being an All-Star to the 25th man on a roster.  There is nothing I want to see more, as a Yankees fan and a baseball fan, than Judge succeed; it’s good for the sport.  But I also don’t want to get my hopes up too high, because nothing stings more than a player of his caliber going down the path of Adam Dunn, Jack Cust, or Ryan Howard.


dSCORE: Starting Pitcher Evaluations

Early this spring I did a writeup on dScore (“Dominance Score), an algorithm that aims to identify early on pitcher “true talent.” That article reviewed RP performance for 2016.

Here’s a quick review of dScore and how it works:

dScore takes each pitcher and divides them up into a bunch of stats (K-BB%, Hard/Soft%, contact metrics, swinging strikes; as well as breaking down each pitch in their arsenal by weights and movements). We then weight each metric based on indication of success–for relievers, having one or two premium pitches, missing bats, and minimizing hard contact are ideal; whereas starters tend to thrive with a better overall arsenal, minimizing contact, and minimizing baserunners. Below is a breakdown of the metrics we used in our SP evaluations:

Performance metrics: WHIP, K/BB%, Soft%, Hard%, GB%, Contact%, SwStk%, Z-Contact%, O-Contact%

Pitch metrics: wPitch, vPitch (where “Pitch”= FA, FT, CU, SL, CH)

Our current weighting for SPs is a bit more subjective and complex than our RP weighting system, but I’m looking to implement a similar weighting system to the way we weight RP metrics in this evaluation in the near future.

dScore has been around for a year or so now, and one thing I was asked when I initially posted was whether or not it has any “predictive” tendencies. The answer is a pretty clear “no”–BUT what it does do very, very well is validate performance. There’s a fine line between saying “the numbers say pitcher X’s going to stay good” and saying “pitcher X has been good, and this confirms he’s been good”. The problem with the metric is it uses per-pitch statistics, rather than Fielding-Independent metrics. What that means is at a technical level, dScore views the pitcher as directly responsible for everything that happened after a pitch is thrown. There’s been a few outside cases that I’ll get into in a later article; but generally if a pitcher’s been bad, he’s generally viewed as having been bad, or vice versa. It seems particularly bad at projecting regression from underperformance, although I haven’t been tracking pitcher movement as well as I should. I’ll look to implement some sort of evaluation by next year.

 

Top Performing SP by Arsenal, 2017
Rank Name Team dScore
1 Max Scherzer Nationals 55.73
2 Alex Wood Dodgers 55.54
3 Corey Kluber Indians 49.15
4 Chris Sale Red Sox 46.43
5 Clayton Kershaw Dodgers 43.53
6 Dallas Keuchel Astros 38.90
7 Noah Syndergaard Mets 33.45
8 Lance McCullers Astros 32.17
9 Randall Delgado Diamondbacks 30.50
10 Zack Godley Diamondbacks 29.69
11 Stephen Strasburg Nationals 26.92
12 Jacob deGrom Mets 25.13
13 Luis Severino Yankees 24.38
14 Luis Castillo Reds 23.65
15 Trevor Cahill Padres 23.63
16 James Paxton Mariners 21.46
17 Kenta Maeda Dodgers 20.61
18 Zack Greinke Diamondbacks 20.48
19 Nate Karns Royals 20.42
20 Carlos Carrasco Indians 19.96
21 Rich Hill Dodgers 17.86
22 Masahiro Tanaka Yankees 17.43
23 Danny Salazar Indians 17.06
24 Brad Peacock Astros 16.51
25 Marcus Stroman Blue Jays 15.48

 

The Studs

The top eight guys are really a who’s-who. Scherzer, Wood, Kluber, Sale, Kersh, Keuchel, Syndergaard…Only guy I’m touching on here is Thor, who’s close to begin throwing again. Lat injuries are a whole lotta “?????” for pitchers, but he’s certainly worth a buy if someone is (stupidly) wanting to sell.

 

The Loaded Teams

Astros – Dallas Keuchel (6), Lance McCullers (8), Brad Peacock (24) / McCullers has broken out. Consider him a stud going forward.

Diamondbacks – Randall Delgado (9), Zack Godley (10), Zack Greinke (18) / Delgado is likely more of a bullpen option at this point. Godley had an awful first outing off the break, but dScore really believes in him.

Dodgers – Alex Wood (2), Clayton Kershaw (5), Kenta Maeda (17), Rich Hill (21) / Come on, really? Give some other team a chance!

 

The Young Breakouts

Zack Godley (10) – I touched on him above. Although I’m pretty sure he’s due for regression, dScore continues to think he’s got premium stuff. Continue to roll with him.

Luis Castillo (14) – He’s 29 innings into his big-league career, but that’s also 29 innings vs. the Nationals (twice), Rockies (once, in Coors), and the Diamondbacks (once, in Chase). All three teams rank in the top five in the NL in runs scored. BUY. / FUN FACT: The Rockies rank third in runs scored, but are tied with the Padres for dead last in the NL in wRC+ at 81.

James Paxton (16) – He is who we thought he is.

 

The Still Believin’

Kenta Maeda (17)

Masahiro Tanaka (22)

Danny Salazar (23)

Tanaka’s been god-awful. dScore agrees with his 3.73 xFIP though, and says he should’ve been significantly better than he is. Salazar has somehow been worse, but once again dScore sides with his 3.57 xFIP and says BUY when he comes back from the minors, although I feel like that’s what Salazar’s always been. Every metric says he should be significantly better than he actually is. In 10 years I feel like his career is going to spawn the ultimate sabermetric “what could have been” from FanGraphs.

 

The Just Missed

Jacob Faria (26)

Jose Berrios (28)

Mike Clevinger (29)

Jordan Montgomery (30)

Chris Archer (31)

A whole bunch of kids and Archer, aka the pitcher we all want Danny Salazar to be.

 

R.I.P

Nathan Karns (19) – Thoracic Outlet Syndrome. Well, it was a good idea for the Royals…

 

Notes From Farther Down

Newly-minted Cubs ace Jose Quintana is sitting at 76th. Remember how I said this metric was bad at projecting regression from underperformance? Quintana was sitting just inside the top 100 before his last start. Even though dScore agrees he’s been bad, I’m still buying Quintana in bulk. Old Cubs ace Jon Lester is still getting love from dScore, even after his absolute meltdown vs the Pirates. He’s at 39th. Fellow lefties Sean Manaea and Eduardo Rodriguez bookend him at 38th and 40th respectively. Manaea was sitting in the high-teens for most of the season, then seemed to lose feel for his slider and effectively stopped throwing it. That really hurt his hittability and K’s. It came back around last start vs. Cleveland. I’m continuing to buy him as a #2 ROS. Boston activated Rodriguez recently. Adam Wainwright (104), Julio Teheran (108), Jake Odorizzi (123), Matt Harvey (137), Aaron Sanchez (140), Cole Hamels (143) are a whole bunch of ughhhhh. I’m out on all but Hamels, who I’d argue to hold. His strikeouts disappeared before getting shelved with an oblique strain, then got shelled in his first start back vs. Cleveland. His last three starts have been vintage, and I’m anticipating dScore to catch back up.


Introducing XRA: The New Results-Independent Pitching Stat

There are a multitude of ways that we can judge pitchers. Most people look at earned run average to gauge whether a pitcher has been successful, while many old school announcers will still cite a pitcher’s win-loss record. ERA is a nice, easy way of looking at how a pitcher has performed at limiting runs, but it doesn’t come close to telling the whole story. In the early 2000s, Voros McCracken created the idea of Defense Independent Pitching Stats or DIPS, which credited the pitcher only with what he could actually control. Fielding Independent Pitching was born from this theory and only took into account a pitcher’s strikeouts, walks and home runs allowed. It turns out that a pitcher’s home run rate is not terribly consistent, thus xFIP was created by Dave Studeman to normalize the home run aspect of the FIP equation by using the league home run per fly ball rate and the pitcher’s fly ball rate.

In 2015, a new metric was developed by Jonathan Judge, Harry Pavlidis and Dan Turkenkopf called Deserved Run Average or DRA. This new stat attempts to take into account every aspect that the pitcher has control over and control for everything that he does not, thus crediting the pitcher only for the runs that he actually deserves. DRA, however, is still dependent on the result of each batted ball. If the batter hits a ball deep in the gap and it rolls to the wall, the pitcher is charged with a double, but if the center fielder lays out and makes a remarkable catch, the pitcher is credited with an out. When evaluating pitchers, why should it matter whether they have a Gold Glove caliber defender behind them or not? It shouldn’t, and that’s where Expected Run Average comes in.

Expected Run Average or XRA gives pitchers credit for what they actually can control. FIP attempts to do this as well but assumes that pitchers have no control over batted balls. While the pitcher does not control how the fielders interact with the live ball, he does have an impact on the type of contact that he allows. XRA is based on a modified DIPS theory that the pitcher controls three things: whether he strikes the batter out, whether he walks the batter and the exit velocity, launch angle combination off the bat. After the ball leaves the batter’s bat, the play is out of the pitcher’s hands and should no longer have any effect on his statistics. The goal is to figure out a way to measure, independently of the defense and park, how each pitcher performs on balls in play. Since 2015, StatCast has tracked the exit velocity and launch angle of every batted ball in the majors. Each batted ball has a hit probability based on the velocity off of the bat and its trajectory. The probability for extra bases can also be determined. These batted ball probabilities have been linearly weighted for each event including strikeouts and walks to give each player’s xwOBA, which can be found on Baseball Savant. This is the perfect way to look specifically at how well a pitcher has performed on a per plate appearance basis.

Once xwOBA is found, then XRA can be calculated. The first objective is to find the pitcher’s weighted runs below average. To do this, I used the weighted runs above average formula from FanGraphs except I made it negative since fewer runs are better for pitchers.

wRBA = – ((xwOBA – League wOBA) / wOBA Scale) * TBF

For example, Max Scherzer has had a .228 xwOBA so far this season and has faced 487 batters. After finding the league wOBA and wOBA scale numbers at FanGraphs I can plug these numbers into the formula.

– ((.228 – .321) / 1.185) * 487 = 38.22

Max Scherzer has been 38.22 runs better than average so far this season, but now I need to figure out what the average pitcher would do while facing the same number of batters. To find this I need the league runs per plate appearance rate and multiply that number by the number of batters that Scherzer has faced.

League R/PA * TBF = Average Pitcher Runs
.122 * 487 = 59.41

So a league average pitcher would have been expected to surrender 59.41 runs facing the number of batters that Scherzer has so far this season. Now that we know how the average pitcher should have performed we can find the expected number of runs that Scherzer should have surrendered so far this season by subtracting his wRBA of 38.22 from the average pitcher’s runs.

Average Pitcher Runs – Weighted Runs Below Average = Expected Runs
59.41 – 38.22 = 21.19

Based on Scherzer’s xwOBA, he should have only given up 21.19 to this point in the season. If this sounds incredible it’s because this is the lowest mark of any starting pitcher though the first half of the season. Finally, XRA is found by using the RA/9 formula by multiplying the expected number of runs allowed by 9 and then dividing by innings pitched.

(9 * Expected Runs) / Innings Pitched = XRA
(9 * 21.19) / 128.33 = 1.49

Max Scherzer’s XRA of 1.49 is easily the lowest of any starter through the first half. The second best starter has been Chris Sale who has a 2.15 XRA. Of course these names are not surprising as they each started the All Star Game and are both currently the front runners for their leagues’ respective cy young award.

Here is a list of the top ten qualified pitchers:

Pitcher XRA
Max Scherzer 1.49
Chris Sale 2.15
Zack Greinke 2.26
Corey Kluber 2.33
Clayton Kershaw 2.34
Dan Straily 2.87
Lance McCullers 2.89
Chase Anderson 3.11
Luis Severino 3.17
Jeff Samardzija 3.23

And the bottom ten:

Pitcher XRA
Matt Moore 6.58
Kevin Gausman 6.47
Derek Holland 6.32
Matt Cain 6.26
Ricky Nolasco 6.26
Wade Miley 6.17
Johnny Cueto 6.10
Martin Perez 5.97
Jason Hammel 5.95
Jesse Chavez 5.84

Full First Half XRA List

It is interesting to see that three members of the Giants rotation rank in the bottom seven in all of baseball. In fact, AT&T Park is such a pitcher-friendly park that once you park adjust these numbers, Moore, Cain and Cueto become the three worst pitchers in baseball. It’s not surprising then why the Giants are having such a disappointing season.

One measure of a good stat is whether or not it matches your perception. Therefore, while it is interesting to see Dan Straily as one of the best pitchers in baseball and Johnny Cueto as one of the worst, it is much more assuring to see Max Scherzer, Chris Sale and Clayton Kershaw as some of the very best in the sport. The numbers for relievers also reveal how dominant Kenley Jansen and Craig Kimbrel have been. This is all good evidence that XRA is doing what it is supposed to do, accurately displaying how good pitchers have actually been, independent of all other factors.

Another important characteristic of a good stat is how well it correlates from year to year. While ERA is the most simple and popular way to look at pitchers, it is not very consistent. XRA is much more consistent than ERA and FIP and also compares favorably with xFIP. However, it is not as consistent as DRA. DRA controls for so many aspects of the game that it should be expected to be the most consistent. However, being the most predictive or most consistent stat is not necessarily the goal of XRA. The real goal is to show how well the pitcher actually did, and XRA seems to do this remarkably. While not being as consistent as a stat like DRA, the level of consistency is extremely encouraging and puts it right in line with the other run estimators.

XRA is a stat that takes luck, defense, and ballpark dimensions out of the equation. When evaluating a pitcher, he shouldn’t be penalized for giving up a 350-foot pop fly for a home run in Cincinnati while being rewarded for that same pop fly being caught for an easy out in Miami. With XRA, no longer will people have to quibble about BABIP, since it is results-independent and removes all luck from consideration. A ground ball with eyes will now be treated the same whether it squirts through for a single or is tracked down for an out. Pitching ability will no longer need to be measured with an eye on the level of the defense. It takes a good offense, a good pitching staff and a good defense to make a great team, and with XRA we can finally separate all of these important factions.


Is Kershaw Really a Postseason Choker?

Dodgers superstar ace Clayton Kershaw has already cemented himself as the greatest starting pitcher of this generation and could go down as one of the best of all time. Despite all his tremendous regular-season success, an ongoing narrative has haunted him throughout most of his career, a well-known theory that Kershaw chokes in the postseason and can’t pitch in big games.

But in reality, this actually hasn’t been the case, and the fact that so many people consider Kershaw to be a choke artist speaks more to his amazing regular-season dominance than any struggles he’s had in the playoffs. Through 282 starts in the regular season, Kershaw has an outstanding 2.35 ERA and 0.998 WHIP, so anything worse than that in the postseason is going to feel like a disappointment.

The main argument defending Kershaw’s postseason woes for awhile now has been lack of sample size. As Kershaw has reached the playoffs more and more this argument has weakened a little bit but is still relevant, as his 89 total postseason innings pitched is less than half of what Kershaw pitches in a typical regular season. It’s a large enough sample size that we can make some conclusions about how Kershaw has pitched in the playoffs, but not enough that we can judge his true-talent level. We have 1892.1 innings of regular-season data to judge his true-talent level.

Let’s start with the basic statistics. In 18 games (14 starts), Kershaw is 4-7 with a 4.55 ERA and a 1.16 WHIP. At first glance these numbers seem not horrific, but very underwhelming for what we’ve come to expect from Kershaw. This ERA is a mix of some very good starts and some not so good ones that evens out to a mediocre 4.55.

But as we start delving into the advanced statistics, Kershaw doesn’t look so bad. His FIP is a very good 3.13, with his xFIP about the same at 3.17. These stats take into account the things the pitcher can mostly control — strikeouts, walks and home runs — in an attempt to gauge a pitcher’s true-talent level in the sample size given, and are on the same scale as ERA. So in a sense, Kershaw has had some bad luck in the playoffs, and while the results still haven’t been as great as his regular-season results, he has still mostly pitched like himself.

But where does this FIP come from, and why is it so much lower than his ERA? FIP takes into account strikeouts, an area in which Kershaw has actually performed better in the postseason than in the regular season. In the regular season, he has averaged 9.88 K/9, while in the postseason, he has averaged 10.72 K/9. He has also kept his walks down in the playoffs, averaging 2.73 BB/9, which is only a little bit worse than his regular season 2.37. As a result, his 21.5 K-BB% in the postseason is nearly identical to his 21.2 regular season K-BB%. So the problems he’s had in the postseason haven’t had to do with walking too many hitters or not striking out any batters. In that regard, he’s still pitched like the Clayton Kershaw we know and love. So where have his issues come from?

The answer to that is a higher average on balls in play, a higher HR/FB%, and a bad bullpen coming in to relieve him. FIP also takes into account home runs, and he has allowed more home runs in the postseason, averaging 1.01 HR/9 (which is still good, just not Kershaw good) versus an outstanding 0.58 HR/9 in the regular season. It’s really not fair to criticize him too much for this since his postseason sample size is still less than half of a regular season. In fact, that 1.01 HR/9 is actually better than his 2017 regular season HR/9 so far, which is a very uncharacteristic 1.22 in a year where he’s been neck-and-neck with Max Scherzer for the Cy Young award. Kershaw has allowed more home runs in the postseason as a result of not only a slightly higher fly ball% but also a higher HR/FB%, 10.9 versus 7.7 in the regular season. While this doesn’t mean that he’s been unlucky, it does mean that his HR/FB% is likely to regress closer to his career norms. xFIP takes this into account and the number ends up being virtually the same as his FIP.

In addition to the extra home runs, Kershaw hasn’t been as lucky on balls in play as he has in his career. In the regular season, he’s held a .269 BABIP, which for most pitchers would be thought to be unsustainable, but Kershaw’s pitched for so long now that it’s become clear that he’s just that good. He hasn’t been quite as lucky in the postseason, where he’s allowed a .295 BABIP. And it’s not like Kershaw has allowed way more hard-hit balls in the playoffs than in the regular season, although he has allowed slightly more. He has a 20.1 line-drive rate in the playoffs, which is just slightly higher but very similar to his 19.8% in the regular season. Pitchers obviously try to prevent line drives, as they often result in hits, and Kershaw has prevented line drives from being hit about as well in the playoffs as in the regular season. So that’s not the problem.

Kershaw has allowed slightly more fly balls — 40.2 FB% versus 34.3% — and this, paired with the higher HR/FB%, makes for a bad combination and more home runs. He’s still allowed ground balls at a similar rate, only slightly less, at 39.7% versus 45.9%. So has Kershaw allowed more well-hit balls in the postseason than in the regular season? Yes, but only slightly, and not enough that he should be considered a choker. The only slight increase in line drives shouldn’t result in as big a gap in BABIP as it actually does, meaning that luck has not quite been on Kershaw’s side the way it has been in the regular season. He’s struck people out like regular-season Kershaw, he’s prevented walks like regular-season Kershaw, and he’s prevented balls from being well hit only slightly less than regular-season Kershaw. That, in addition to slightly more fly balls leaving the ballpark, has resulted in a really good pitcher that maybe is not quite as good as regular-season Kershaw, but still very good, and it certainly doesn’t warrant calling him a “choke artist.”

It can also be argued that Kershaw has been overused and over-pressured to do well. He’s been so ridiculously good in the regular season that the expectations are for him to be just as good in the playoffs and to do it practically every three or four days against the best teams in baseball. Anything less and he seem like a disappointment. People often overlook the great moments he’s had in the playoffs, like when he came out of the bullpen against the Nationals to save a tight game or when he dominated the eventual World Champion Cubs in Game 2 of the 2016 NLCS. As a result of high expectations and trust in Kershaw, he has perhaps been left in games slightly longer than he maybe should have.

An occurrence that has plagued Kershaw in the postseason a few times is going deep into games and then getting hit around before his exit from the game. He’s often left with men on base, and the relievers coming in after him haven’t exactly been kind to him, allowing nine of the 14 runners he’s left on base to score. Let’s say the bullpen comes in and dominates, stranding all 14 of those runners, and his postseason ERA drops from 4.55 all the way down to 3.64.

Also remember that in the playoffs, teams are in their full strength and effort, doing everything they possibly can to try and win. These are the best teams in baseball, the teams that had everything working well enough for 162 games to make it past all the other teams and into the playoffs. The offenses Kershaw has to face in the playoffs are going to generally be better than the average offense he might face throughout the season. It is not uncommon for great pitchers to have slightly worse results in the playoffs. Madison Bumgarner, a famous “postseason hero” for the Giants, has a postseason FIP only 0.02 better than Kershaw’s and an xFIP 0.43 worse than Kershaw’s. Luck can go in very different directions for some pitchers in small sample sizes, and this is a perfect example.

Look at Pedro Martinez. In more postseason innings pitched than Kershaw, he has a significantly worse FIP/xFIP (3.75/4.31) despite an unsustainable low BABIP of .257, lower than his regular season .279. And no one thinks of him as a postseason “choker.” Greg Maddux, another all-time great, also has a worse FIP/xFIP (3.66/4.45) than Kershaw in even more innings pitched (198). And nobody considers him a postseason choker. Roger Clemens is the same deal. 3.52 FIP, 3.91 xFIP in 199 innings pitched. These pitchers are still considered all-time greats despite having postseason numbers that are arguably worse than Kershaw’s.

This really goes to show just how good Kershaw has been in the regular season. He puts up godlike numbers and then when he puts up “only” good numbers in the playoffs, it seems like he’s bad in comparison. When you look at the aforementioned fellow all-time greats, it’s clear that Kershaw is not the first great pitcher to have a little trouble in the playoffs.

So has Kershaw been as utterly dominant in the playoffs as in the regular season? No. But has he been a choke artist who gives up eight runs every time he’s put under pressure? No, not at all. He has had some rough outings in the postseason, particularly against the Cardinals, where he hasn’t been able to dominate and take control of the game quite like normal, but he has also had plenty of good moments of great pitching and when he’s left with runners on base, his bullpen has mostly let him down. All he really needs is one great World Series run to erase this ongoing narrative once and for all. No matter what, these small hiccups in the playoffs shouldn’t diminish the legendary career that Clayton Kershaw is in the midst of.