Archive for Player Analysis

Beware the Brew Crew

The Milwaukee Brewers have had a really quiet off season. Just how quiet? They only signed two players to major league contracts. For a team that needed a lot of help, two major league signings doesn’t seem like a lot. However, they did get a lot of help this off season. The other teams in the NL Central have failed to make a splash big enough to make the central a three team race again, and this is a potential opening for the Brewers to move in.

The Brewers were, and are, not expected to make a playoffs appearance during the 2014 season, but is that really true? They could. They very well could, and here’s how:

First, the three other teams who made the playoffs last season have regressed. The Cincinnati Reds have not done anything to improve. They lost their, arguably, two most important players to free agency in Bronson Arroyo and Shin-Soo Choo. The two players combined for an even six wins above replacement. Their replacements (Billy Hamilton and Tony Cingrani) have a combined WAR of 2.9, a 3.1 difference! Albeit, the two players have not been major players in 2013 having spent most of the season in the minors, but that is more reason to be concerned. Who knows how two second year major leaguers with little experience under their belt will do to replace two All-Star caliber players. Will the loss of Choo and Arroyo hurt the Reds? Of course! And Hamilton and Cingrani may not be the best replacements for a team who won one of the NL Wild Card spots in 2013.

The other team who didn’t make moves AND who won the NL Wild Card series, the Pittsburgh Pirates, is in a tougher boat. They lost several key players in Marlon ByrdJustin Morneau, and A.J. Burnett and they replaced them with, well, nothing really. The only major league signing that the Buccos pulled off was for Edinson Volquez who had an absolutely atrocious season in 2013 and is the least likely replacement for an ace. Plus, first base and right field are still questions with no viable replacements at those positions. So does this mean that the Pirates will be out of the playoffs? I don’t think that the front office will go down without a fight. They want to appease their fan base and they still have many pieces in place to win over 80 games again, but unless they upgrade the rotation, first base, and left field, they are not going anywhere.

The final team and NL Central winners are perhaps in the best shape to make the playoffs again. The St. Louis Cardinals have done enough to maintain their dominance in the central. With Jhonny Peralta and Peter Bourjos in the fold including dominant young players such as Oscar Taveras and Michael Wacha, the Cards are looking like they will win another central title. But the Brewers might have something to say.

Other than the Cardinals, the Brewers have made the most important moves to improve their ballclub for 2014. They addressed all of their issues: The rotation, first base, and a left handed relief pitcher (according to ESPN). The rotation was fixed momentously with the addition of Matt Garza. Garza, one of the most sought after starters during free agency, will help to form a powerful front three rotation. With Kyle Lohse and Yovani Gallardo leading the way and Marco Estrada and Tyler Thornburg rounding things out, the Brew Crew’s rotation is looking like it can compete with the best of them. Plus, the addition of Garza helps to address another issue. Will Smith, a lefty who was acquired in the Norichika Aoki trade, will move to the bullpen. Here, the Brewers are able to add to an already strong bullpen that features a strong back-end and now a stable and reliable left handed pitcher.

Although the Brewers never signed a first basemen to a major league deal, the ones that they were able to acquire will impact the ball club in many ways. Mark Reynolds and Lyle Overbay will help what was a weakness for the Crew last season. Their combination of power, defense, ability to platoon, and familiarity to the NL Central and other leagues will impact the Brewers as if they had signed a major league contract. Plus, the Brewers have many great players in place at other positions. Jean SeguraCarlos GomezJonathan Lucroy, and even Ryan Braun will make a formidable lineup while young players like Khris Davis and Scooter Gennett have shown that they can play at the major league level.

Overall, the Brewers are a much better team and are starting to look much better than the 2013 season. They have addressed all of their pieces while other teams in the NL Central have regressed. Although the Brew Crew may not make the playoffs, as many predict, they will cause havoc and surely improve from the 74-88 record they posted last season.


The Last Remaining Top Starting Pitcher

Ubaldo Jimenez: Check.

Suk-min Yoon: Check.

A.J. Burnett: Check.

Ervin Santana: Nope.

The first three names have all signed contracts within a week and a half, the last one has not. Ervin Santana, a top 50 free agent according to many, is still unsigned and, according to MLB Trade Rumors top 50 free agents list, the only starting pitcher unsigned. So what does that mean for Santana? Well, it means that he may garner a large contract with a large sum of money from a desperate team, or he’ll be robbed of what he’s actually worth. Steve Adams of MLB Trade Rumors predicted that Santana would receive a 75 million dollar contract over five years. Pretty good by any standards, but most likely not what he will get. Jimenez received 50 million while Matt Garza received 50 million as well only weeks ago, while Ricky Nolasco early on in the winter received a 49 million dollar contract. Of course, the annual average salary varies for each player, the highest guarantee salary is 25 million less than that predicted for Santana. So although he may still receive his projected 75 million, the likelihood of that happening looks slim. At this point in the stage, a four year deal seems logical, but I think with an annual salary of ~12 million, perhaps less. Although his career numbers and career in general don’t garner a salary like this, teams will match this price, or exceed it, in order to fill a hole.

The fact that Santana, and many other free agents, took so long to sign does not bode well with the player’s association and reflects negatively on the qualifying offer. The fact that a team is passing over a player with ties to a draft pick means A) that teams value their picks more so than ever and B) that the ability to win now is not as important as the future. Let me explain.

Option A makes sense. Many teams have depleted farm systems a la the New York Yankees and the Los Angeles Angels, so restocking their farm system and building towards the future (whether that be future trades, future post season aspirations, etc) is a viable, and necessary, option for all teams whereas option B is only for a few teams. Not every team is in a position to win now, so signing a player tied to their draft picks would be a lose-lose, but then you have the other teams who can win, and can win now. The Yankees clearly have attested to this. They have signed players tied to draft picks and thus lost those picks, but they are in an excellent position to win now, and for the future. You see, since a team loses a draft pick, they are obligated, but not obliged, to sign players to long term deals in order to make the signing worth wile. The Seattle Mariners believe this as do the aforementioned Yankees.

Thus the qualifying offer, although in place to help players which it does, can hurt teams and players alike. Teams can’t make respectable offers to players without losing their draft picks, and if they do, they tend to offer the player more money than he is worth. While the players, on the other hand, receive large paydays and security for their families, they do have to wait for a team to take a chance on him, if they even want to and lose their draft pick. And Santana perfectly reflects this. The notion that a team in need of a player (the Toronto Blue Jays for example) is not willing to offer a worthwhile deal to a player because they need the picks, while the player has to hope that what he receives is a viable, and legitimate, contract.

In conclusion: I do not like the qualifying offer. It ruins a team’s ability to sign a free agent while at the same time makes a player less valuable since his is tied to a draft pick.


Justin Verlander: Ready to Regain Righteousness

So last year in my 10-Team, 35 man roster, dynasty fantasy baseball league, I found myself in need of some starting pitching after the first two months of the season. I was last in my league in quality starts, and near the bottom in ERA and WHIP. Manny Machado was my cornerstone 3rd baseman, and he was hitting a fiery .355 for the month of May. For how good he is, I did not believe he was a batting title contender, so was interested in seeing what I could get for him.

Enter Justin Verlander. The Tigers’ ace, (at the time) had a 1.83 ERA and was 3-2 in April, and had hit a rough spot in May where he surrendered 16 ER in 12.2 innings. After going 17-8 with 239 K’s in 2012, I felt like this was a great time to buy low on the guy, while selling high on Machado. So I traded the Orioles’ phenom for the Mr. Kate Upton. Well, safe to say, that WAS NOT the trade that ended up winning me the league. From the end of May forward, Verlander posted a 3.36 ERA for the remainder of the season and was walking batters like it was the cool thing to do, posting a 3.07 BB/9 (which is AWFUL for him). He ended the season with a 3.46 ERA and only 13 wins, which were his worst totals in those categories since 2008. This isn’t an argument against Machado’s lack of offensive ability, which I will discuss at a later date. Instead, I will be telling you why Verlander’s performance last season was a fluke, and he will regain his Cy Young form in 2014.

As pitchers age, they usually lose a little oomph on their fastball. People will probably look at Verlander and assume this is the reason why he was less effective in 2013.

 

Year

Age

Fastball Velocity (average)

2010

27

95.5

2011

28

95.0

2012

29

94.7

2013

30

94.0

 

Based on the table above, you can see how he has lost some velocity on his fastball. For more detail, follow this link to view his velocity charts for 2013 and compare it to his prior years. If you notice, in his first five starts of 2013, his fastball average was hovering around the 93 mph mark, well below his average of the last four years, 95.2 mph. In those first five starts, Verlander posted an ERA of 1.83, had a K/9 of 9.38, WHIP of 1.19, and held batters to a .242 average. Even with a fastball that seems to be slowing down, Verlander has still found a way to retire batters, and more importantly, still strike them out. So the argument that his fastball is becoming “too hittable” isn’t necessarily correct.

BABIP, for those of you who don’t know, is the percentage of time that if a batter makes contact with a ball and puts it in the field of play, it will go for a hit. Generally, the league average for hitters falls somewhere between .290-.310. But there are plenty of factors that can influence BABIP, such as a player’s skill, defense behind a pitcher, and our good friend LUCK. More on that in a moment, but first, let’s establish what factors influenced Verlander’s BABIP. From 2008-2012, Verlander had an average BABIP of .282, which is below the league’s average range of .290-.310. Based on this sample size, we can assume Verlander’s skill set is above the mean for pitcher. Secondly, Defense. According to baseballreference.com, the Detroit Tigers ranked 12th out of 15 AL teams last year in errors and double plays. On a more optimistic note, they were 4th in fielding %. Those numbers indicate that they were a mediocre, at best, defensive team, which would cause Verlander’s BABIP to slightly increase toward the league mean. Lastly, we don’t have a way to measure luck, but Verlander’s 2013 BABIP was way above his recent average of .282, sitting at .316.

Point being, there were too many balls that were put in play that fell for hits considering all the conditions I stated above for Verlander.

It was not just his inflated BABIP that led to a down year in 2013 for Verlander. He posted a five-year high in BB/9, at 3.09. When you walk people and then give up hits, runners are bound to score. In 77.2 innings in June and July of last year he walked 33 batters. In the final 97.2 innings of last season and the playoffs, he only walked a combined 22 batters. He was able to regain his control in the second half that he had lost mid-way through 2013. I think the control he demonstrated toward the end of 2013 will carry over into 2014.

One more random stat to consider: Verlander’s IFH% (infield hit percentage) for his career sits at 5.9%. Last season, that stat jumped up to a recent high at 8.3%. Reasons for that stat being high could result from the inefficiencies of Miguel Cabrera at 3rd base, or inconsistent defense of Jhonny Peralta. The Tigers now have the more athletic Nick Castellanos at 3rd, and made a mid-season trade last year for Jose Iglesias. Both of those additions provide upgrades defensively for the Tigers compared to last year.

With everything that I’ve discussed, this guy is being way undervalued in fantasy drafts this year, going in the 5th or 6th rounds depending on the format. If you can grab him in the 4th over guys like Zach Greinke or Madison Bumgarner, I would do so. He still strikes people out at a high rate, posting 217 K’s last year. Also, don’t forget that he pitches for a team with one of the most potent offenses in the game. When Verlander’s BABIP regresses, his improved defense and control kicks in, he will regain his righteousness.


Platoon-Split All-Star Team

The 2013 All-Star Game has already been played, and the result was decided. The AL defeated the NL in a 3-0 effort in a game  that was filled with players of all different types. The aging veterans who want a last hurrah. The rising stars who are getting their first taste of what it is like to play among the elite in baseball. The overpaid superstars and the underpaid superstars. However, I thought it would be interesting to assemble an all-star team of players with large platoon split.

Call it an Island of misfit toys or misfit all-stars, if you’re feeling Moneyball-esque.

Catcher

Vs. RHP Jason Castro: PA’s 380, wOBA .371, wRC+ 137

Vs. LHP Derek Norris: PA’s 173, wOBA .426, wRC+ 177

Combined: PA 553, wOBA .387, wRC+ 149

Castro doesn’t actually lead all catchers against RHP. That honor belongs to Joe Mauer. However, Mauer ranks within the top three catchers against left-handed pitching, which makes him not really have a huge platoon split. Therefore I rendered him ineligible as a platoon partner. It makes sense that the Athletics would have a catcher who is so effective in hitting left-handers, because they also have John Jaso who is known to mash righties (.363 wOBA vs RHP). If there is anything an Astro fan should be happy about  — which there isn’t much — it’s the fact that Jason Castro eats right- handed pitching for lunch and he also is one of the better catchers in the league.

First Base

Vs RHP Chris Davis: PA’s 434  wOBA .473, wRC+ 203

Vs. LHP Nick Swisher: PA’s 224 wOBA .398 wRC+ 158

Combined: PA’s 658, wOBA 447 wOBA, wRC+ 187

Davis was considered the best first baseman, as he led the league in dingers and compiled a WAR of 6.8. While Davis was performing at near-immortal levels against right-handed pitching, he was also very vulnerable against left-handed pitching with wRC+ of 104 against LHP. Nick Swisher is an interesting case because he is a switch hitter, but really struggles against right-handed pitching with a wRC+ of 93. This makes me wonder if Swisher should consider going the Shane Victorino route, and drop batting lefty to focus solely on batting right-handed. We don’t know if this strategy works for everyone — it’s probably a case-by-case situation — but it’s something to keep in mind.

Second Base

Vs RHP Robinson Cano: PA’s 420, wOBA .410, wRC+ 160

Vs LHP Brian Dozier: PA’s 148, wOBA .421, wRC+ 171

Combined: PA’s 568, wOBA .408, wRC+ 161

I had a hard time picking Cano simply because while Cano is definitely better at hitting righties than lefties, he’s not that bad at hitting lefties. Last season, Cano had a wOBA of .343 and wRC+ of 114 against LHP. That’s not a bad mark, however it is a sizable enough difference to create a platoon split. On the other hand, this points out that Dozier is a little underrated, and if he is used in the right roles, he could be a very valuable player. I find this platoon an interesting dichotomy: an overpaid superstar in Cano and a cost-effective role player in Dozier.

Shortstop

Vs. RHP Ian Desmond: PA’s 507, wOBA .344, wRC+ 118

Vs. LHP Jhonny Peralta: PA’s 136, wOBA .414, wRC+ 164

Combined: PA’s 643, wOBA .344, wRC+ 126

Shortstop was by far the hardest position for which to make a platoon. The LHP side was easy with Peralta because he led all shortstops when it came to facing lefties. The problem came with the right-handed side because the guys who could hit righties well — such as Tulowitzki and Lowrie — could also hit lefties pretty well. I settled with Desmond because even though he is well balanced against LHP and RHP, he wasn’t as balanced as Tulo or Lowrie.

Third Base

Vs. RHP Adrian Beltre: PA’s 516, wOBA .370, wRC+ 129

Vs. LHP David Wright: PA’s 150, wOBA .454 wRC+ 199

Combined: PA’s 666, wOBA .397, wRC+ 143

There were a lot of good-hitting third baseman last year. Miguel Cabrera led all third baseman in hitting against right handers and left handers. Wright and Beltre are number two to Cabrera. They also both have large platoon splits. Wright can hit RHP, it’s just that the split between PA’s against RHP versus his PA’s against LHP is huge. Beltre, on the other hand, is somewhat insignificant against lefties.

Right Field

RHP Daniel Nava: PA’s 397, wOBA .392, wRC+ 146

LHP Hunter Pence: PA’s 178, wOBA .415, wRC+ 174

Combined: PA’s 575, wOBA .399, wRC+ 154

This is where things can get a little arbitrary because there are a lot of corner outfielders, and therefore a lot of corner outfielders who have platoon splits. You could sub out both outfielders for a combination of Michael Cuddyer and Giancarlo Stanton. However, I thought that it would be more fun to point out how undervalued Nava is. Nava had a breakout year in Boston, and he did so by destroying right handers. Pence actually isn’t all that bad against RHP, wRC+ of 119 against RHP, which is kind of surprising considering he’s a lefty with a long swing. Bruce Bochy should probably take more advantage of Pence’s ability to hit left handers well. I think that both players are underrated.

Center Field

Vs. RHP Shin-Soo Choo: PA’s 491 wOBA .438, wRC+ 183

Vs. LHP Carlos Gomez: PA’s 140, wOBA .421, wRC+ 171

Combined: PA’s 631, wOBA .430, wRC+ 179

Choo is easily one of the worst defensive center fielders in the game, and he probably should shift over to a corner outfield spot in Texas. A lot of people express concern over the Choo contact because of the poor defensive play combined with a massive platoon split. Choo is godly against RHP, but below average against LHP (wRC+ of 81). The three-year, $24 million contract extension that the Brewers gave Gomez looks like it was a steal. Not only did they get a guy who punished left handers, but they also got a guy who led the NL in WAR, had great defense, and even some decent pop.

Left Field

Vs. RHP Dominic Brown: PA’s 381, wOBA .366, wRC+ 133

Vs. LHP Justin Upton: PA’s 164  wOBA .422, wRC+ 174

Combined: PA’s 545, wOBA .382, wRC+ 145

There isn’t anything interesting about why I picked these two, other than the fact that I did consider Matt Holliday instead of Brown. However,  Holliday’s split wasn’t as large as Brown’s. I wouldn’t expect Dominic Brown to perform as well against righties again; he’s in for some serious regression to the mean.

If these platoons were put into practice you could probably get as good or better production than the elite hitters in baseball. This list, just like the actual all-star game roster, is diverse. You have players who are considered elite — such as Choo, Cano, Wright, and Beltre — and then the undervalued guys such as Dozier, Nava, Norris and Castro. It’s surprising that most teams don’t take more advantage of platoons since they could get elite production from two players for a fraction of the cost.


Tanner Roark’s Z-Swing%, and Related Observations

Although the Nationals had a disappointing 2013 season overall, Tanner Roark (RHP) was one of their more pleasant surprises. The Nats brought him up in August, as injuries and performance problems created openings for several pitchers in their minor league system.

While Taylor Jordan also performed well, I think it’s fair to say that Roark had the most impressive and intriguing debut for the big-league team. Roark accumulated excellent “traditional” stats, and he did so at least in part by exploiting an unusual but highly effective talent: making batters not swing at good pitches. This post explores Roark’s story, and opens up the question of how his distinctive forte, zone-swing rate, contributes to effective pitching.

To recap, Roark finished 7-1 with a 1.51 ERA over 53 2/3 innings. He allowed only 1 home run in total, or 0.17 home runs per 9 innings; and the league batted .197 against him (– “batting average against” or “BAA”). The Nationals’ ace, Stephen Strasburg, allowed 0.79 home runs per 9 innings, with a BAA of .205. Roark was comparable in BAA to Strasburg, and much, much better at preventing home runs.

Of course, Strasburg reached his figures in 183 innings of pitching as compared to Roark’s 53 innings of pitching. This is what is sometimes described as a smaller sample. But we should not discount Roark’s performance too quickly. His 53 innings involved five starts and nine relief appearances, and a total of 12 appearances with at least two innings pitched. This is considerably more than, say, one start and no relief appearances. Roark played for the Nationals for the last two months of the season. His stint in the majors last year was substantial enough, I think, to merit serious interest.

Roark’s 2013 performance was surprising in part because of his pedigree. In 2012 Roark was 6-17 as a starter in Triple-A, pitching for the Nationals. His 2012 ERA in Triple-A was 4.39 (although his FIP [Fielding Independent Pitching rating] of 3.85 was better). Providing more background, Adam Kilgore wrote in September 2013 that

Roark has never been regarded as a star or a significant prospect. In 2008, the Rangers drafted him in the 25th round. The Nationals acquired him and another minor league pitcher for Cristian Guzman at the 2010 trade deadline. Last winter, the Nationals left Roark unprotected from the Rule 5 draft for the second straight year. They invited him to major league spring training this year, and shipped him out in the first round of cuts.

(Washington Post, Nationals Journal, 9/17/2013; http://www.washingtonpost.com/blogs/nationals-journal/wp/2013/09/17/tanner-roarks-incredible-start-built-on-command-feel-for-pitching/)

Roark’s 2013 performance was also surprising because, with a fastball averaging 92.6 mph, he had good but not overwhelming velocity.

Going back to FIP and similar topics, another reason why Roark’s 2013 performance was surprising was because of some relationships between his statistics. For instance, although his 2012 Triple-A ERA (4.39) was higher than his 2012 Triple-A FIP (3.85), this relationship reversed itself last year in the majors, with Roark posting a 1.51 ERA and a 2.41 FIP. In addition, his xFIP (“expected Fielding Independent Pitching”) was 3.14, significantly higher than the FIP.

“ERA < FIP < xFIP” spreads of this size are not unheard of, but they are rare, especially when your ERA is less than 2.00. In fact, ERA < FIP < xFIP distributions of this type suggest that you are identical to Clayton Kershaw (1.83 ERA / 2.39 FIP / 2.88 xFIP) and that you have just signed a contract worth 215 million dollars!

These observations about Tanner Roark’s performance and pedigree raise several questions:

How did he perform so well in 2013?

What is going on with his ERA<FIP<xFIP distribution?

What can we say about his future performance?

Taking a quick initial look at the ERA<FIP<xFIP distribution, a “negative” delta between ERA and FIP is often attributable to the pitcher having a low Batting Average on Balls in Play (BABIP). Roark’s BABIP was indeed very low, at .243. (Kershaw’s was .251).

Also, although this might sound odd, Roark’s extremely low HR rate (0.17 per 9 innings) pushed his ERA below his FIP, even though home runs are a fielding-independent matter. Roark was fine (league average or better) on the other FIP elements — walks, K’s, HBP’s. But combining these normal-range statistics with his homer rate produces a compromise number and some information loss.

Turning to xFIP, this calculation substitutes out the pitcher’s own homer rate for the league average homer rate. As we might expect, the league average homer-rate was much higher than Roark’s, and this explains the FIP < xFIP delta, while also contributing to the delta between his ERA and his xFIP.

These observations tend to intimate that some of Roark’s statistics are not likely to repeat themselves. Before turning to the “future performance” question identified above, I want to look more at the first question of trying to understand Roark’s 2013 success. There are aspects of Roark’s pitching last year which suggest that his strong performance numbers were not an accident, and that his apparent prowess is not simply overmagnified by the small prism of his innings total.

The first statistic of interest is that Roark was seriously good at throwing pitches in the strike zone which batters did not swing at. This is the Z-Swing% statistic recorded on FanGraphs and other places. Roark’s Z-Swing rate in 2013 was 54.8% (per Baseball Info Solutions [BIS]), or 55.9% per PITCHf/x. This means that batters only swung at Roark’s pitches in the strike zone about 55% of the time.

(BIS and PITCHf/x converge around 55% for Roark’s Z-Swing%. These systems actually diverge, or report different percentages, for some other stats which are not independent of Z-Swing%. Although this is interesting, the differences do not materially affect our evaluative questions. I will cite the BIS plate discipline statistics throughout and compare them to PITCHf/x at various points below).

The complement of Z-Swing% is what I will call “Z-pass” — the phenomenon of non-swings on pitches in the strike zone. Tanner Roark’s Z-pass rate last year was 45% — batters passed on about 45% of his pitches in the strike zone.

This was a very high Z-pass rate. In fact,

  • It was the highest Z-pass rate on the Washington Nationals, by about 5 percentage points, among Nationals pitchers with at least 50 innings.
  • It also was more or less the highest Z-pass rate in all of major league baseball, again among pitchers with at least 50 innings. Roark came in first in Z-pass rate according to BIS. According to PITCHf/x, Roark was tied for sixth-best in Z-pass rate, behind Sonny Gray with a 47% Z-pass rate.

A high Z-pass rate is indicative of several good pitching qualities. Z-passes are good because they mean that batters are laying off a higher number of pitches which damage their cause and advance the pitcher’s cause. A high Z-pass rate indicates that the pitcher is accumulating strikes while maintaining an atypically lower risk of allowing a hit. (This is true if the pitcher is hitting the strike zone at a reasonable rate. More on this below). Tactically speaking, the Z-pass is the best outcome on the swing v. strike zone matrix below.

In Zone

Out of Zone

Swing

??

??

No Swing

Strike

Ball

Swings on pitches in the zone and out of the zone can lead to hits, and worse. By contrast, if we assume that non-swings in the zone lead to strikes, the Z-pass simply constitutes a good outcome for the pitcher.

How often did Roark throw strikes? In 2013 Roark hit the strike zone 47.7% (BIS) of the time. This was about 3 percentage points ahead of major league average (44.9%). 3 percentage points comes out to about one standard deviation above average. (PITCHf/x reports a higher league-wide strike-zone rate — 49.4% — and a higher strike-zone rate for Roark as well, at 53.8%. PITCHf/x appears to have a larger strike zone than BIS).

It therefore appears Roark was exploiting his elite Z-pass rate often enough for it to be useful, and indeed for him to have an advantage over hitters. Roark accumulated strikes at a good rate; and, by strongly suppressing swings at pitches in the zone, he lowered the risk of allowing a hit. It appears this dynamic was a main factor in Roark’s success in 2013. That’s part of the answer to our “How did he perform so well” question.

Another factor which stands out from Roark’s strike-zone data is that he threw first-pitch strikes 70.6% of the time. This tied for third in major-league pitchers with at least 50 innings in 2013. Consistently gaining an initial advantage over hitters, and doing so at an elite rate, was another main factor in Roark’s success.

Other discussions of Roark have cited his command, his aggression, and an improved mental approach. Going back to Adam Kilgore, he writes:

Roark’s ascension began last season, when he told himself he would not allow his temper to control him on the mound. He would not the things out of his control – fluky hits, errors, whatever – distract him. He would throw strikes. He would be confident. He would attack, above all else.

“I feel that last year is when I had my, I guess, mental turnaround,” Roark said. “That was the biggest thing for me.”

(Washington Post, Nationals Journal, 9/17/2013; http://www.washingtonpost.com/blogs/nationals-journal/wp/2013/09/17/tanner-roarks-incredible-start-built-on-command-feel-for-pitching/)

We can certainly see command at work in Roark’s low homer rate, and his low walk rate (5.4%). We can see both command and aggression at work in his first-pitch strike rate. Roark’s league-leading Z-pass rate substantiates the command/aggression understanding of his performance, and also adds to this understanding.

A pitcher who suppresses swings on pitches within the zone is presumably hitting unattractive parts of the zone, but he may also be throwing in-zone pitches which do not present to hitters as strikes. This sounds like a pitcher on whom it is difficult to make good contact. This is a third idea, beyond Z-pass rate and first-pitch strike rate. One way, however, to be averse to good contact is to be a high Z-pass pitcher.

Being a high Z-pass pitcher does not entail being a high strikeout pitcher. Roark’s strikeout rate was only one percent below major-league average (again, among pitchers with 50 innings and up). Of course, on other measures, like ERA, Roark was much better than league average. I think that connecting Z-pass rate with suppression of good contact can help us understand why.

Z-passes represent hittable pitches – pitches in the zone – which were not hittable enough to induce a swing. Poetically speaking, Z-passes involve real visual ambiguity: since they end up in the strike zone, they can’t look that bad; but they do not look good enough to induce a swing.

How well does this characterization actually apply to Roark’s pitches? On this question, we have the following from the Atlanta Braves:

“He wasn’t missing with any pitches over the plate, it seemed like,” said Braves catcher Gerald Laird. “When he was going away, he was throwing that little two-seamer back door, when he was coming in he was running that two-seamer in on your hands, and he had that little slider working.

“Tonight it seemed like he was hitting his spots and wasn’t making any mistakes. I know (Freddie Freeman) was saying he was starting it at him and running it back over. When he’s doing that it’s hard to pull the trigger.”

(http://www.washingtontimes.com/blog/nationals-watch/2013/sep/17/tanner-roark-shines-nationals-complete-doubleheade/#ixzz2prxGGOUh)

Of course, these descriptions of visual ambiguity — or of evidence which shifts within a fraction of a second — presumably apply to all or most of a high Z-pass pitcher’s offerings, not just to his pitches in the strike zone which do not elicit a swing. The image that emerges is of a player whose whole volume of pitches is tough to react to in a manner that creates good contact.

Roark was actually pretty good at inhibiting contact of any kind, especially on pitches within the strike zone. However, a look at his contact numbers does not immediately confirm this interesting and important point. As we see in the table below (from BIS by way of FanGraphs, again looking at 50+ IP), many of Roark’s contact rates were actually above league average, sometimes by more than one standard deviation.

O-Swing%

Z-Swing%

Swing%

O-Cont%

Z-Cont%

Cont%

Zone%

F-Strike%

SwStr%

Roark

34.90%

54.80%

44.30%

77.40%

92.90%

86.50%

47.70%

70.60%

6.00%

MLB (50+ IP)

31.33%

65.63%

46.74%

66.35%

86.79%

79.22%

44.92%

60.63%

9.50%

std dev

3%

3%

3%

7%

4%

4%

3%

4%

2%

Before turning to contact rates, you will have noticed that this table also gives us a look at how Roark’s Z-swing rate compared to the rest of baseball. According to BIS, Roark was 3 standard deviations above average on a positive pitching statistic which is completely independent of fielding. He was two standard deviations (56% Z-Swing%, as opposed to 63% league average) ahead according to PITCHf/x — this is still pretty good for a former 25th-round pick! Some other observations:

  • O-contact. Here Roark was much higher than average, but this may not be a bad thing, since contact outside the zone is less likely to be productive for the hitter.
  • Z-contact. Roark again was higher than average. But this somewhat unsettling number should not be digested outside of its relevant context, which is helpfully provided by Roark’s Z-swing rate. Looking at Z-contact multiplied by Z-swing yields the interesting result that Roark allowed contact on 51 percent of his strike zone pitches, as opposed to a league average of 57 percent, with a standard deviation of 3 percent.

(PITCHf/x condenses this gap, in much the same way that it condenses the gap between Roark and MLB on Z-pass. PITCHf/x reports Roark at 52.2% contact on all pitches within the zone, and MLB at 54.6%. Thus, if we switch from BIS to PITCHf/x, Roark’s contact rate goes up, and MLB’s goes down.

However, as noted above, PITCHf/x appears to be working with a larger strike zone than BIS (MLB-average Zone% of 49.3 vs. MLB-average Zone% of 44.9). This point complicates Roark’s apparent movement back towards league average. In brief, the fact that Roark’s swing rates go up — while the MLB average goes down — on larger renditions of the strike zone may be a testament to his effectiveness, rather than a knock against it.

  • SwStr (swinging strikes/total pitches). Since Roark did a good job suppressing contact within the zone, Roark’s low swinging-strike number does not seem to be an especially important piece in his overall puzzle.

The standard contact rates reported by BIS and PITCHf/x do not do a good job of communicating how well a pitcher actually prevents contact, because these contact rates only look at swings. Since you can suppress contact by suppressing swings, multiplying the contact rate by the swing rate provides a better view of how a pitcher is actually doing along this dimension. Despite a “zone-contact” rate which was higher than league average, Roark was very good to excellent at suppressing contact within the strike zone.

We are exploring a clue provided by Roark’s excellent Z-pass rate that Roark was good at inhibiting solid contact. This clue was supported by our look at Roark’s contact rates, which indicate that he was pretty good at suppressing contact flat out. The idea that Roark’s pitches were visually ambiguous enough to limit good contact receives further confirmation from his batted-ball statistics. In addition, looking at these statistics (2013, 50+ IP) will bring us around nicely to the question of how well Roark might sustain his performance in future seasons.

BABIP

GB/FB

LD%

GB%

FB%

IFFB%

HR/FB

Roark

0.243

1.95

24.30%

50.00%

25.70%

13.20%

2.60%

MLB (50+ IP)

0.289

1.436

21.1%

44.6%

34.3%

9.7%

10.2%

Std dev

0.031

0.681

2.5%

7.8%

7.7%

4.0%

3.8%

Roark’s ground-ball, fly-ball, and infield-fly rates combine to indicate a strong bias against good contact. Roark had a somewhat high line drive rate, and, admittedly, line drives are a form of good contact. For instance, I suspect it’s unusual to have a somewhat high line-drive rate and a markedly low BABIP. Roark’s line-drive rate provides one specific indication that his BABIP is due to increase. However, a low line-drive rate is not entirely at odds with the idea that a pitcher is suppressing good contact — especially if we are thinking about home runs. Since most line drives are not home runs, a slight tendency towards line drives is a small but genuine homer-prevention measure.

In this way, Roark’s line drive rate coheres with his ground-ball, fly-ball, and infield-fly rate statistics. All of these rates, and especially their combination, suggest a low-homer pitcher. Why didn’t Roark give up a lot of home runs? Well, he got a lot of grounders and infield flies, while limiting his fly balls overall, and he gave up a somewhat high proportion of line drives. It is very plausible to suppose that Roark’s extremely low HR/FB rate overshoots the anti-homer bias suggested by his other batted-ball rates. Equally, however, the other rates tell a clear enough story that a low homer rate is not at all a surprise. Roark was very good at inhibiting good contact.

How will he do in the future? A nice way to frame this question is in terms of Roark’s ERA, FIP, and xFIP numbers mentioned earlier. And, leading up to that, I think it’s helpful to assess the respective importance of two things: (1): the overall coherence of Roark’s 2013 statistics; and (2) the sample sizes in which they were achieved.

In terms of coherence, Roark’s statistics tell a consistent story:

  • Looking at Z-pass, Roark was very good at limiting swings on good pitches
  • Looking at Z-swing * Zone%, Roark was very good at limiting contact within the zone
  • Looking at his batted ball rates, Roark was very good at limiting good contact

I could be wrong about this, but I do not see relationships among Roark’s 2013 statistics which point to trouble looking ahead. These statistics tell a consistent story of effectiveness. You can focus on his low swinging-strike rate if you like, but this rate was consistent with Roark being at least one standard deviation (two sd’s according to BIS) better than average on limiting contact within the zone.

In addition, there are pockets within Roark’s portfolio where some stats are very good and others are even better, like the HR/FB rate relative to Roarks other batted-ball statistics. However, this type of overshooting is a good problem to have. To the extent that the non-harmonic components of Roark’s statistical portfolio are extremely good statistics, this relates to the issue of our expectations for future years. A version of Tanner Roark based on 2013, but without the extra anti-homer overshooting, would still be above MLB-average.

As noted above, Roark only pitched 53 innings, and that’s a much lower total than what a starting pitcher would typically accumulate over a full year. Although we intuitively regard this as a small sample, it does not follow that Roark’s performance is without predictive value. As is often pointed out on the pages of FanGraphs, statistics stabilize, or acquire predictive value, at different thresholds (http://www.fangraphs.com/library/principles/sample-size/). Generally speaking, fielding-independent stats stabilize more quickly for pitchers than fielding dependent stats; this is a helpful point in assessing the forward relevance of Roark’s 53 innings.

Some of Roark’s relevant statistics are above their stabilization thresholds. Roark allowed 153 balls in play (BIP), which puts him above the stabilization points for groundball rate and flyball rate:

70 BIP: GB rate

70 BIP: FB rate

Roark faced 204 batters, which is above the stabilization points for walks and strikeouts:

70 BF: Strikeout rate

170 BF: Walk rate

However, Roark was league-average in K’s and was “only” one standard deviation above average in walks; these numbers are not as good as Roark’s plate discipline statistics like Z-pass and suppression of contact within the zone. So it’s not clear whether Roark reached the stabilization points for key parts of his performance.

But this is more or less where I will have to leave it. Figuring out the stabilization point for Z-pass is beyond the scope of the present study. Indeed, my post has probably pushed us to near overload regarding things that we ever wanted to know about Tanner Roark! By the same token, it’s not clear that learning more about Roark’s statistical profile would shift our opinion much about his prospects for future performance. This is what I think we have to consider:

In an intuitively small sample size, Roark put up a consistent portfolio of excellent fielding-independent stats: on limiting zone-swings, limiting contact in the zone, and limiting good contact. Very broadly, the size of a sample has to be balanced with the consistency of the evidence within it. Just imagine watching a one-round boxing match in which one competitor knocks the other one down three times. This is a small sample which tells a very compelling story about the respective abilities of the boxers. Roark’s sample size is larger, of course, and his performance was not as dominant. Nonetheless, his limited 2013 season is packed with a lot of positive indicators.

Here are a few final comments about what Roark might do in the future, framed in terms of his ERA, FIP, and xFIP:

ERA

FIP

xFIP

Roark

1.51

2.41

3.14

MLB average (50+IP)

3.68

3.75

3.78

standard deviation

1.11

0.82

0.62

As we discussed above, the delta between Roark’s ERA and FIP is primarily a matter of his low BABIP and his very low homer rate. Although Roark’s BABIP will probably go up, there are signs he may be better than average at suppressing hits: he showed a tendency to induce ground balls and infield flies; the latter especially inhibit BABIP.

Roark’s very low homer rate pulls down both his ERA and his FIP. Although his .17 homers per 9 innings will almost certainly go up, there are signs he may be better than average at suppressing home runs…signs which are distinct, that is, from his one homer allowed in 53 2/3 major league innings!! Roark’s tendencies toward ground balls, infield flies, and line drives are all anti-homer measures. These tendencies flow, by hypothesis, from his ability to inhibit good contact by throwing visually ambiguous pitches.

The most eligible view by far is that Roark will regress towards league average in future years. But accepting this view should not deprive us of optimism. Roark could go back at least one standard deviation on each of the ERA-like measures and still be at league average or better than league average. That’s a good position for any pitcher. It’s a great position, albeit a paradoxical one, for a pitcher who is currently slated to compete for no better than the 5th spot in the Washington Nationals’ 2014 starting rotation!! Suffice to say I think that Roark ought to receive full consideration for the opportunities available to him.


Believing that Starlin Castro Will Rebound in 2014

Earlier today, I was looking at trends and projections for some Cubs prospects and looked up Starlin Castro.  A trend immediately struck me: his 2010 batted ball statistics are nearly identical to his 2013 peripherals.

Stat:      ISO         LD%        GB%       FB%      IFFB%

2010:   .108      19.5%      51.3%     29.2%     7.0%

2013:   .102      19.9%      50.7%     29.4%     7.6%

These two seasons are closer than any of his other seasons in batted ball numbers.  A key difference?  2010 BABIP was .346, 2013 BABIP was .290.  His career BABIP is .323.  So is it we assume some good luck in 2010 and bad luck in 2013?

It should be noted that his BB% in 2013 was his career low, and his K% was his career high mark.  So can we expect some regression in those numbers as well?

I think the answer is yes to both questions.  In 2012, his BABIP was .315.  Even if Castro could return to that level (right around his career average), he looks much better than the .245 hitter we saw in 2013.

Additionally, his K% in 2013 was 3.8% higher than his previous career high, so I tend to expect a slightly lower rate in 2014 (though his contact rate in 2013 was also the lowest in his career, so if that is a trend, it is possible the K% could stay).

I’m still a firm believer in the idea that the past management, while trying to teach Castro to be selective and patient, actually taught him to take pitches for the sake of, well, taking pitches.  This could also potentially explain the low contact rate.  The numbers indicate that he didn’t learn to distinguish balls from strikes any better, and that maybe for him, the best approach is to swing at whatever looks good.

Given the striking similarities between his rookie season in which he hit .300 and garnered national attention as an upcoming star and 2013, it’s easy to dream about a bounceback 2014 season.  Only time will tell if that’s a reality, but I believe that Cubs fans have reason to be optimistic.

(I posted this earlier at the-billy-goat.mlblogs.com.  For more Cubs news and analysis, feel free to check out the blog.)


Ervin Santana vs. Ubaldo Jimenez

While Ervin Santana and Ubaldo Jimenez certainly have their similarities, they each have different risks and benefits associated with them. They have often been connected throughout the offseason, as they have similar price tags and each is connected to draft pick compensation. They have also been linked this offseason because each is coming off an impressive season following a very bad season, and overall inconsistencies in their careers. However, the two pitchers are not incredibly similar, as one profiles more as a durable innings-eater and the other carries more upside.

The 31-year-old Ervin Santana provides many more innings than Ubaldo Jimenez, as he has eclipsed the 200-inning plateau three times in the past four seasons. Santana, however, has often outperformed his peripherals, especially this past season. In 2013, Santana posted his career-best 3.24 ERA, but his FIP was 3.93, which suggests some regression in 2014. Even looking back at the past five seasons, Santana has had an FIP under 4.00 just once. It may seem as if he has the ability to outperform his peripherals consistently, but during that same span his ERA surpassed 5.00 during two seasons most recently in 2012.

As I mentioned above, Santana’s best quality is his ability to go deep into starts consistently throughout the season. Santana is also a tremendous strike-thrower, as he walked just 2.18 batters per 9 innings, which is an improvement upon his still impressive 2.81 BB/9 for his career. Santana is also an effective groundball generator, as his groundball rate has been above 43% for the past three seasons. The real knock on Santana has been his inconsistencies throughout his career, with three seasons of an ERA above 5.00 and just four seasons of an ERA under 3.00 during his 9-year career. While Santana’s ERA was the best of his career, in 2013, his other metrics were not much better than his career norms, which suggests he hasn’t necessarily figured anything out.

The 30-year-old Ubaldo Jimenez, unlike Santana, has a reputation for struggling to go deep into games. He has not thrown 200 innings in a season since 2010 and has only done it twice in his 8-year career. His struggles to last deep into games are likely related to his high K/9 and very high BB/9. Both strikeouts and walks drive a pitcher’s pitch count up and he has never had a BB/9 lower than 3.50. As I stated above, Jimenez carries more upside with him, as his career K/9 is a full strikeout per 9 higher than Santana, but Jimenez’s peripherals are also better than his ERA. Jimenez has a career 3.78 FIP, compared to his 3.92 ERA. During his time with the Rockies, Jimenez was an outstanding groundball pitcher, but since moving to the Indians, his GB% has slipped to 38.4% in 2012 and 43.9% in 2013. Despite pitching in hitter-friendly Coors Field for the majority of his career, Jimenez’s Hr/9 has been better than Santana’s in every season of his career.

Compared to Santana, who has had an FIP under 4.00 just once in the past five seasons, Jimenez has had an FIP under 4.00 four of the last five seasons. Jimenez’s only truly bad season, in terms of FIP, was 2012 when his FIP ballooned to 5.06 and his ERA climbed to 5.40. While being able to go deep into starts is pivotal in being a reliable and consistent starter, Jimenez certainly carries the highest upside and actually most consistent performance between the two starters. Jimenez has also proven that he can pitch in a high run-scoring environment, such as Coors Field. Santana, however, has pitched the majority of his career in a pitcher-friendly park at Angels Stadium of Anaheim for every season except one.

Looking into the numbers, it is clear that Ervin Santana is the best bet of the two starters to reach 200 innings. It is also evident that Ubaldo Jimenez has the greatest potential to provide above-average production inning per inning. Neither starter is an ace or likely to become one and each comes with legitimate questions. However, in terms of which starter is better, it really depends on what a team is looking for. If they want a starter that can provide 200+ innings season after season, then Santana is by far the better option. If the team is seeking a starter that can consistently provide an ERA around or below 3.50, then Jimenez is the better option. Since each starter has a similar price tag, it is really a question of which type of starter the team is looking for. Personally, I prefer Jimenez to Santana because he has provided more consistent numbers across the board and has only had one truly bad season.


Warning! Beware of Nelson Cruz

Lately I’ve been hearing some rumors connecting the Tigers to free agent outfielder Nelson Cruz. I understand how fans have been hungry for another power bat since Prince Fielder was traded, but Nelson Cruz is not the guy you want. It’s not because of the whole PED suspension last year, or even the fact that he single-handedly dismantled the Tigers in the 2011 ALCS. No, it’s simply because he is not that valuable of an all around baseball player.

I don’t particularly enjoy writing pieces where I talk about a player’s shortcomings. At the end of the day, these guys are major leaguers and I’m still a kid who’s a fringe high school bench player who doesn’t know whether he’s a natural right-handed or left-handed hitter (I’m really bad at both). But due to the recent clamoring for Cruz, I figured it was my duty to all my readers to expose the truth about him.

The Good

Nelson Cruz is a solid power hitter. Despite having a shortened season due to a 50 game suspension, Cruz still managed to hit 27 HR in 109 games. With a respectable ISO of .240 in 2013, and a career ISO of .228, Nelson can still manage to hit for very good extra base power. His wOBA in 2013 was .359 and is .353 over the course of his career, both being good. Bottom line, he’s a good power hitter, but I never said that I’m debating that aspect of his game.

The Bad

Nelson Cruz does not have very good plate discipline. Assuming that we’re talking about the guy that’s supposedly going to be “protecting” Miguel Cabrera in the batting lineup, plate discipline does play a huge factor in this discussion. We don’t want a guy who’s a free swinger batting after a walk to the best hitter in the game who also happens to be really slow on the base paths. Last season, Cruz swung at 30.8% of pitches that were outside the strike zone (O-Swing%), which is really bad. He only made contact with 73.1% of the pitches that he swung at (Contact%), which is also bad. His BB/K was also bad, clocking in at 0.32. Bad. What have we learned so far? Basically, Nelson Cruz is an all or nothing hitter, which some fans really don’t mind. In the case that he’d be hitting behind Miguel Cabrera, I’d tend to shy away from a hitter like Cruz.

The Ugly

To an extent, all the bad I mentioned could be forgiven if Nelson Cruz wasn’t such a terrible defensive outfielder. Move him to DH you say? The Tigers have Victor Martinez and Miguel Cabrera who will both rotate time at 1B/DH, so there is no room whatsoever for another DH. Cruz would have to play everyday in RF or LF. He is 34 years old and 240lbs. His ability to chase down balls in open space is clearly declining. Sticking him in the outfield with Torii Hunter, who looked lost in the outfield for most of 2013, would be a horrible idea for a team that used the offseason to vastly improve their infield defense. The stat that I like to use for defense is UZR, but because of Nelson’s shortened season though, I’m going to use UZR/150. Last year, Cruz’s UZR/150 was -6.5, which is way below average. Considering he’s posted a negative UZR for the last three seasons, you can see that he is not very good at defense and is clearly not getting any better. It’s also worth mentioning that the Tigers would have to give up their 2014 first-round draft pick to the Texas Rangers considering Cruz turned down their qualifying offer of $14 million.

Total Value

In 2013, Cruz was worth 1.5 Wins Above Replacement. Andy Dirks 2013 WAR: 1.7. Obviously WAR is not the end-all-be-all statistic, but it does give a pretty good idea of what a player is worth when you replace him with someone who is league average at his position. In this case, the WAR of each player is practically identical, which means over the course of a season they will somehow be worth the same amount of wins to their team. Cruz will probably cost around $7-9 million in 2014, whereas Andy Dirks is already under contract for only $1.625 million. Assuming Cruz signs for $8 million and has the same WAR as 2013, the Tigers would be paying $5.33 million per win for him. Andy Dirks with his current contract and WAR? $956,000 per win. I know this might be some moneyballin’ right here, but if the goal of baseball is to buy wins, wouldn’t you rather have the wins at a cheaper cost?

Conclusion

When all aspects of the game are taken into account, you see that Nelson Cruz is a below-average baseball player with a plus power tool. The Tigers have Miguel Cabrera, Victor Martinez, Ian Kinsler, Austin Jackson, and Torii Hunter (and sometimes Alex Avila too) who will all contribute their fair share of runs this upcoming season. Not only do they not need a one-dimensional power hitter, he just doesn’t make sense for the makeup of their lineup which now features a solid balance of on-base average, power, and speed. Mix that with the huge liability that he is on defense, and you get a player that I don’t want to play for the Detroit Tigers.

 

For more information from me on the Detroit Tigers, visit www.ttowntiger.com


Another Look at Tom Glavine’s Generous Strike Zone

Jeff Sullivan recently suggested that despite his reputation Tom Glavine did not pitch to a significantly more generous strike zone. Sullivan points out Glavine did not get significantly more called strikes than other pitchers, even during the peak of his career. Sullivan’s analysis piqued my interest and made me wonder if Glavine’s reputation for getting a wider strike zone helped him succeed in ways beyond called strikes.

Glavine’s reputation alone likely influenced a batter’s behavior at the plate, encouraging batters who were behind the count to swing at questionable pitches. Batters believed if they did not swing these pitches would be called strikes for Glavine (when a batter swings at a pitch out of the zone when the batter is ahead of the count that has more to do with a pitchers stuff than the batter giving the pitcher an expanded zone). So, what would we expect from a pitcher who is getting batters to expand the strike zone? You would expect batters to make poor contact, yielding a lower BABIP. The batter would most likely swing at pitches outside the zone when the batter is behind the count.

Based on this reasoning, I hypothesize that Tom Glavine will see a greater reduction in quality of contact when he gets ahead of the count than a league-average pitcher. I’m going to look at the time span from 1991 to 2002 because that was the time span Jeff looked at and because I like palindromes.

To measure quality of contact I will be looking at BACON (batting average on contact). BACON is slightly different than BABIP because BACON includes home runs. If batters are expanding the strike zone when Glavine is ahead in the count we should see the quality of contact decrease. To measure the decrease in quality of contact, I will look at the ratio of BACON when Glavine is ahead to BACON to when Glavine is behind (the lower the number the greater improvement the pitcher experiences by getting ahead in the count). I will refer to this measure as EXP (a lower EXP shows a greater decrease in quality of contact, an EXP above 100 shows an increase in quality of contact).  The graph below compares Glavine’s EXP to the league average EXP for each season during the 11-year span.

 The league-average EXP is consistent year to year, hovering around 91, which suggests batters expand the strike zone for most pitchers when batters are behind in the count. Glavine’s EXP is not always better than the league-average EXP. In ‘94 and ‘96 Glavine was actually worse when ahead in the count than when he was behind.  This is to be expected because BACON takes a while to stabilize. Looking at Glavine’s data for a single season is subject to a fair amount of random noise because you have a relatively small sample of data. One season for Glavine gives us about 170 fair balls with Glavine ahead and 280 fair balls with Glavine behind. However, over a larger sample BACON stabilizes. At around 2,000 fair balls (more than in a single season for Glavine) BACON stabilizes. For example, when looking at the league-average EXP for a full year BACON is stable — with 3,500 fair balls with the pitcher ahead of the count and 4,600 fair balls with pitcher behind the count.

To make sure we are not just attributing skill to some random variation we need to look at a larger sample for Glavine. Over the 11 year span form 1991-2002 Glavine induced weaker contact (lower BACON) than the league average both when he was ahead of the count and behind the count. This is not surprising as we would expect a good pitcher to be better than average ahead and behind the count.  What’s interesting is Glavine has better than league-average EXP  (87 vs. 92) which suggests Glavine is better at expanding the strike zone than league-average pitchers. This comes with the caveat that while we have 3,056 fair balls when Glavine is behind the count, we only have 1,853 fair balls when Glavine is ahead — just shy of the 2000 at which the measure should stabilize.  Even so, the difference between Glavine’s EXP and the league-average EXP is very convincing.

Glavine (1991-2002)

MLB ave (1991-2002)

Ahead Behind EXP Ahead Behind EXP
BACON

0.266055

0.304319

87.42626

0.303134

0.330999

91.58153

To stabilize BACON, I increased the sample by looking at all the balls put in play. I compared balls put in play when the pitcher had two strikes to balls put in play when the pitcher had fewer than two strikes, which led to EXP2: the ratio of BACON when a pitcher has two strikes, to when he has fewer than two strikes. The table bellow shows a comparison of the quality of contact in two strike counts to non-two strike counts.

Glavine (1991-2002)

MLB ave (1991-2002)

2 Strikes Not 2 Strikes EXP2 2 Strikes Not 2 Strikes EXP2
BACON

0.275

0.302

91.22

0.3118

0.331

94.19

Even with this larger sample size Glavine’s BACON is still lower than the league average in respective counts. More importantly, his EXP2 is still better than league average (although higher than his EXP).  Pitchers in general try to induce weaker contact when they are ahead of the count, but the data shows Glavine is doing something special to induce even weaker contact.

Is Glavine getting batters to give him a wider strike zone? We cannot definitively say what is causing this pattern in the data, but we are seeing the type of numbers we would expect to see if the batter was giving him a wider strike zone.

 

All splits number are from Baseball-Reference.


The Curious Case of Jason Castro

As we look for candidates to regress in 2014, a popular choice is Houston catcher Jason Castro for it seems the Astros backstop has two targets on his back: a high strikeout rate last year of 26.5% and a high BABIP of .351. Steamer and Oliver both project a steep drop in BABIP that will drag his batting average from a solid .276 to the .250s. As Brett Talley wrote, Castro screams regression.

Or does he?

Talley points to Castro’s strikeout rate that has been topped only 61 times in the past decade, and only four times the player matched or bettered a batting average of .276. But that measure may miss the mark. No one is suggesting Castro’s strikeout rate will worsen. When it comes to batting average, the critical question, then, is whether he can come close to maintaining a high BABIP.

On that question the evidence is more promising. In the last decade, only 38 of 1,509 batters have had an infield-fly rate lower than Castro’s 1.8%. Only 47 had a line-drive rate higher than Castro’s 25.2%. Taken together, those two select groups actually have 10 matches — players who managed both a lower infield-fly rate and higher line-drive rate. Here they are along with their BABIP, batting average and strikeout rate:

Player, year, BABIP, Avg., K-rate

Joe Mauer, 2013, .383, .324, 17.5%

Joey Votto, 2011, .349, .309, 12.9%

Howie Kendrick, 2011, .349, .297, 17.3%

Matt Carpenter, 2013, .359, .318, 13.7%

Michael Young, 2007, .366, .315, 15.5%

Joey Votto, 2013, .360, .305, 19%

Adam Kennedy, 2006, .313, .273, 14.3%

Bobby Abreu, 2006, .366, .297, 20.1%

Michael Young, 2011, .367, .338, 11.3%

Chris Johnson, 2012, .354, .281, 25%

 

What might we gather from this evidence?

(1) All but one of the players topped .276.

(2) The skills involved seem somewhat repeatable: Votto and Young each appear twice and as a group they generally in their careers combined a high LD rate, low IFFB rate and a high BABIP.

(3) We wouldn’t expect a player who whiffs a quarter of the time to have a batting average as high as someone who strikes out half as much while putting up similar LD and IFFB rates. Castro is unlikely to approach the median average of this group of .307.

(4) Castro doesn’t need to approach the median average to avoid significant regression. He is more likely to hit closer to last year’s mark than he is to hit in the .250s.