What Does April Mean?

                            HOW MUCH DOES APRIL MATTER ?

April is the first hard evidence of what may be in store for the new season. But like any single month, April usually has some extreme results. Typically 40% of all teams start conspicuously well (.600 +)  or poorly (.400 minus) where by season’s end only 12% of all teams are at those outer edges of win percentage.  With 85% of the season to go, plenty of time remains for fates to change.  Or do they really alter that much post-April ?   This study focuses on that question and is based on the April records of all teams from 2000-2017  compared to their post-April results and chances for making the playoffs.  Two issues are addressed.  First, how closely have teams’ remaining matched their Aprils and second, what effect April has had on teams’ playoff chances.

                 PREDICTICE VALUE OF APRIL FOR REMAINDER OF SEASON (ROS) RESULTS

April records of all teams from 2000-2017, were divided into six win % categories: Excellent (.650+ win % ), Good (.550 – .650), Slightly Positive  (.500 – .549), Slightly Negative (.450-.499),  Weak (.350 -.449) and Poor (below .350).   Teams in each April win category were compared to their post-April and full season win/loss percentages. The percentage of teams in each April category who played at playoff level (.580), contention level (.540+) or near-contention (.500) after April were also measured as well as the percentage of teams in each April win category who made the playoffs.  Following are the results:

April W/L                    April Win % Last 5 month Full Yr .580+ last 5 .540+ Last 5 .500+ Last 5 Made Playoffs   Pct of All Pct of Playoff
Category Teams Average Win % Win % Months Months Months Teams Teams
.650+ 60 0.688 0.535    0.558 25% 53% 72% 60% 11% 23%
.550-.649 119 0.595 0.519 0.530 18% 44% 65% 43% 22% 33%
.500-.549 104 0.519 0.510 0.511 19% 37% 55% 33% 19% 22%
.450-.499 75 0.472 0.495 0.491 11% 29% 48% 20% 14% 10%
.350-.450 126 0.409 0.484 0.473 11% 25% 37% 14% 23% 11%
under.350 56 0.305 0.445 0.424 4% 7% 23% 4% 10%    1%

Each level of April performance has had better remaining performance and playoff chances than the level immediately below it. Contrasts between the top 1/3 and bottom 1/3 April teams are stark.  Two thirds of teams who’ve started well (.550 +) play  .500 + ball afterward and roughly half make the playoffs.  Only one third of teams who have been .449 or worse in April play .500+ ROS and just 9 % of these early strugglers have made the playoffs. Nearly 4 of every 5 of all playoff teams were .500+ or better in April and less than 1 in 10 playoff teams started .425 or worse. April has done a good job of quickly identifying contenders and non-contenders.

Much of that comes from April strongly relating to prior season results and current season records having solid resemblance to prior season records. So April simply confirms that many (if not most) teams are headed for the same general fate as last year.  While these are generally reliable maxims, they’re hardly infallible. Many teams April wrongly suggest things will stay the same.   Likewise April often may indicate a change is coming in the current season and it doesn’t materialize.   That will be explored next – just how often does April “fool” ?

HOW MANY TEAMS ARE APRIL “FOOLERS” ?

Roughly 3 in 8 teams had more than .100 pt change in their April v. rest of season (ROS) win percentage. For some teams at the April extremes, even huge April v. ROS differentials don’t change their basic seasonal fate. The 2003 Yankees started the year at .769 win %  but played “only” .596 ball the rest of the way and won over 100 games.  Conversely a .260 point jump in win % for the 2000 Tigers over the last 5 months still didn’t push them over .500 on the season. Still for a great number of teams the last 5 months can wash out a great deal of the good or bad April does. Hot starters fall apart and teams buried in April can rise from the ashes.

To show both the amount and type of “foolery” April provides a logical starting point is prior year records. Fully 70 % of teams with 90+ wins the prior year have .500 + Aprils v. 37% for teams with 69 minus wins. Since many teams tend to have Aprils which are “characteristic” of their prior season , measuring the true amount of “April deception” can be done in two ways: 1 – How many Aprils “characteristic” of last year give true v. false signals of another similar season ?  2 – How often do  “uncharacteristic” Aprils end up being true v. false signals of a better or worse year ?

 1st – How reliable are “characteristic” Aprils.  The following chart illustrates this

Prior Yr Wins     April Win Category       # Teams         % win  90+   % win 81+    % Playoffs

90 +                       .520 +                            83                       64 %               84 %             62 %                                   80-89                     .520 +                           71                        48 %              76 %             49 %                                         70- 79                    .499 minus                  73                           5 %              22                  7%                                              Below 70              .499 minus                   65                          3 %                6 %              6 %

As can be seen, prior year good teams with good Aprils have had even better odds than their already favorable odds of making the playoffs.   On the other side, prior year sub-.500 who have sub-.500 Aprils have seen their already thin odds get even slimmer.  Only 9 of 138 such teams have overcome the odds of a bad start.   The percentage of teams finishing above .500 is also remarkably different.   80 % of good teams with good Aprils end up plus .500 on the year, where a mere 14 % of bad teams with bad Aprils do. Conclusion: “characteristic” Aprils are highly reliable indicators of either continued contention or non-contention.    

“Highly reliable” does not mean “perfect”. The 90+ game winner/good April formula didn’t work for  defending world champions 2004 Marlins, 2013 Giants nor defending NL West champion 2008 D-Backs or 2005 Dodgers.  Nor did bad team/bad April deter the 2015 Rangers, 2011 D-Backs, and two Rockies teams (2009, 2007) plus the 2007 Cubs from rising up and making the playoffs.  But these are exceptions.

Applying these principles of “characteristic” Aprils to 2018 would bode well for the Red Sox, Yankees, Astros, Cubs and Diamondbacks. It would not for the Tigers, Rangers, Marlins, Padres, White Sox, Orioles, Royals and Reds.  Only 1 in 15 sub-.500 prior year teams with sub-.500 Aprils have made the playoffs so such odds would indicate  none of the above 8 will either.  Of course none of these clubs were expected to contend but neither were the Braves, Phillies, Pirates, and Mets who’ve had good Aprils.  Which brings up “uncharacteristic” Aprils.

2nd – How often are “uncharacteristic” Aprils false or true signals of a change in team fortunes ?

Logically when Aprils deceive, teams often return to their “true selves” (good or bad) in the last 5 months. The data largely supports that logic.  But it also supports the notion that some uncharacteristic Aprils correctly signal changes in a team’s fortunes. This time the data is parsed more finely to show where the false and true indicators of change can appear.

Prior Yr

90 + wins  + .401 – .499 April      26 % win 90+  – 77 % win 81+    29% in playoffs

90 + wins  .399 minus April        14 % win 90+  – 43 % win 81+     21 % in playoffs

70-79 wins  .600 plus April         39 % win 90 + – 72 % win 81 +    44 % in playoffs

70 -79 wins .500-.599 April        22 % win 90 +  – 51 % win 81 +    25 % in playoffs

Below 70 win  .600 + April         18 %  win 90+   – 64 % win 81 +   18 % in playoffs

Below 70 win  .500-.599 April     7 % win 90 +   – 22 % win 81 +   14 % in playoffs

 

More extreme “uncharacteristic” Aprils have greater accuracy in signaling real change. Of the 90 + winners who started below .400 in April, less than half finished above .500 and only 22 % made the playoffs. By contrast, 90 + prior year winners who had .520+ Aprils made the playoffs 62 % of the time.  So very bad Aprils from previously good teams are strong early warning signs of trouble ahead.

What about extremely strong Aprils from previously bad teams ? Of the 29 teams who had prior losing records but .600 + plus Aprils, 20 finished above .500 and 10 made the playoffs.  This includes some notable turnarounds:  2000 Cardinals, 2000 White Sox, 2006 Tigers, 2012 Orioles, 2013 Red Sox, 2015 Astros, 2015 Cubs, 2017 Rockies, 2017 D-Backs.    Of course, excitement over great Aprils by prior bad teams should be tempered by the fact that 65 % of such teams still have missed the playoffs. This may somewhat damp expectations raised by the start of the 2018 Mets.  Although the 2015-2016 Mets were playoff teams so it’s quite possible their strong April 2018 could be a legitimate sign of revival.  Teams who gained 15 or more wins over the prior year showed an average of 120 points jump in April v. prior year    win %.  The Mets have jumped 220 points.

Mildly uncharacteristic Aprils have a higher incidence of sending wrong signals. Only 22 % of the bad (“below 70”) teams with .500-.599 Aprils finished over .500. Only 23% of the 90+ winners who started .400-.499 became bad sub-.500 teams thereafter. Former 70-79 winners with good but not great Aprils (.500-.599) have boosted their playoff odds but at 20% those odds are still below average. Seven teams fit this description in  2018 – Mariners, Pirates, Phillies, A’s, Braves, Blue Jays, Giants.  Historical odds are that 1 will make the playoffs, 2 would be very optimistic.

However, one type of mildly uncharacteristic April is very noteworthy. Previously good teams who show up with modestly bad Aprils (.400-.499) have seen a big drop in playoff odds (29% for bumpy .400-.499 starts v. 62% for “characteristic” good Aprils).  That is not good news for 2018 Dodgers, Nationals or Twins who all were expected to be playoff teams again.  Which brings up the next issue – how does April influence playoff chances ?

                  WHAT IMPACT DOES APRIL HAVE ON MAKING THE PLAYOFFS  ?

As one old adage goes “you can’t win the pennant in April but you can lose it”. While there is a definite truth in this, April has the fewest games of any month, and still leaves ample time to recover. The 2001 Oakland A’s had a miserable 8 -17 April but went on to win 102 games.  That said, they were the only sub-.400 April team to win 100 + games and only one of 8 teams who started that poorly and still made the playoffs.   Another 10 such bad start teams played the rest of the season at contender levels (.540+ ) yet failed to make the playoffs and their poor Aprils were instrumental in that.

While it is clearly possible to recover from a bad start, bad Aprils leave a diminished margin of error. Two-thirds of all playoff teams start the year solidly (.530+ ), the vast majority (78%) are at least .500 + and fully 95 % of all playoff teams have avoided disastrous (sub. 400) Aprils.   Teams who’ve started miserably haven’t been able to count on fellow playoff contenders being in the same underwater boats.  They have to play serious catch-up with rivals whose yachts have begun to float away.  An “average” playoff team has a .564 April win percentage.  So a club with a 9 -15 April usually has to catch teams who’ve gone 14-10 or better.  If the 15-9 April teams plays at “only” .537 ROS and gets 88 wins, it takes a .580 ROS from the poor April starter to overcome that.  Playing at .580 + level ROS has been done by only 15% of all teams, which equates to being one of the top three teams in one’s league after May 1st.

Those 8 playoff teams who started sub-.400 averaged a .604 ROS win percentage.  Five of the eight were .580 + ROS and the lowest ROS win percentage was .572 which translates to 93 wins over a full season.    In addition, there were four teams with a .580+ ROS that missed the playoffs since their average Aprils were 9 – 14.    Despite Herculean May 1st-on efforts and being better ROS than their key divisional or wild card rivals those four teams (2004 Giants, 2005 Indians, 2011 Red Sox, 2012 Angels) lost out on the playoffs because of their inferior April records to rivals who had solid or even stellar starts.

 

Losing ground to playoff competitors due to a bad April and rivals’ typically good starts makes the task tougher. Even slightly below .500 starts mean a club usually needs to play at .560+ ROS to catch up as most playoff bound teams are already above .550 + in April. The other side is that very strong Aprils can provide a cushion to play at less than .550 ROS (the average playoff team is .581 ROS). There are even occasions where teams with less than a .520 ROS have made it in due to strong Aprils:            2016 Mets, 2015 Astros, 2014 A’s, 2006 Cardinals, and 2000 Yankees.  Ironically there are two World Series winners in that group (Cardinals and Yankees).   The Cardinals were particularly unusual as they were the only team to make the playoffs with a sub-.500 ROS record.

The lone .600 + ROS who missed the playoffs (2005 Indians) provide a classic example. They were the second best team in the AL after April, outplaying their division rival and subsequent World Series winner White Sox   84-55  to  82-56  ROS.   However, with the 17-7 April of the Sox, and Indians’ poor 9 -14 start, Sox gained a 7.5 game cushion.  The double whammy was that the Tribe’s poor April also cost them the wild card to Boston.   Had the Indians played even 12-13 instead of 9-14 they’d have ousted Boston.   Of course Cleveland’s 13-16 July didn’t help either, nor did going 1 -5 the last week of the season (including a 3 game sweep by the Sox) after the Indians had whittled the lead down to 1.5 games on Sept. 24.  But the April cushion built by the Sox allowed them to withstand an incredible Aug/Sept run by Cleveland and the Tribe was forced to play unbelievably well to stay in the hunt.   This all happened before the second wild card was introduced in 2012, and if it had applied back then Cleveland would have made it as that second wild card.   So has this second wild card now made it easier for April stumblers to recoup ?

HOW THE SECOND WILD CARD HAS CHANGED PLAYOFF ODDS

Adding a second wild card team has changed the odds in some meaningful ways as follows:                         Prior to 2012, teams had a 94 % chance of making it with 92+ wins , but only 43% with 87-91 wins did.  After 2012, teams had a 100 % chance of making it with 92+ wins, and an 83 % chance with 87 -91 wins.

Clearly 87 – 91 wins stands nearly twice the chances of making the playoffs than before when 92 wins was the benchmark to lock in a spot.  Prior to 2012 good Aprils helped but a team needed to be stoking the engine every month to reach that 92 plateau.  Only when teams hit the magic 580 ROS (94 win pace full season) did they have a near lock (96%) getting in.  Now that lock is at 560 ROS. This is a critical difference. That can make it easier for a strong April team to ease into the playoffs with a lower ROS or it can make it easier for a team that stumbles in April to recoup. In fact for those teams who started less than .450 and then made the playoffs prior to 2012, the average ROS was .604.  For the three teams that have done that since 2012, the average ROS is .574.  But that only 3 teams have made it with subpar Aprils says the new wild card system has not been a boon to bad April starters.

It’s a lot easier to have a .600+ April than a .600 + ROS.  111 teams have had such Aprils, but only 42 have done so ROS.  The new wild cards have actually made it easier for the good April teams to coast in with lower ROS records and there is no shortage of good April teams.  Since 2012, each year an average of 15 teams start the year with .520 + Aprils to compete for 10 playoff spots.  Of those teams, 56% (or 8 per year) make it in.    That leaves only 2 spots on average per year for sub-.500 April teams to compete for.

Following is a breakdown of the number of playoff teams after 2012 who’ve played at given ROS levels categorized by their April win levels

APRIL WIN LEVEL 600+ROS 580-599 ROS 560 -579 ROS 540-559 ROS 520-539 ROS 500 -519 ROS  # teams
600 Apri 4 4 6 4 2 3 23
550-599 3 4 2 2 2 0 13
500-549 4 3 4 1 1 0 13
450-499 3 3 1 1 0 0 8
400-449 0 0 1 0 0 0 1
below 400 0 0 2            0 0 0 2

As can be seen, 49 of the 60 playoff teams were .500 + in April. However, 91 % of the under .500 April teams who made the playoffs were forced to play .560 + ROS to get in.   Whereas only 64% of teams who started .550 or better were able to play under .560 ROS and still make it. So the second wild has so far given the fast start April teams a better chance to ease coast into playoffs at a lesser ROS pace rather than make it easier for the slow starters catch up.   But the same math could be applied to any month, good or bad, so to paraphrase the classic Passover question:  why is April so different from all other months ?

PSYCHOLOGICAL EFFECTS OF GOOD OR BAD APRILS

Despite the fact that April is only 15% of the overall season, when early-mid season personnel decisions are being made, April results can still have considerable impact.   April’s record can influence decisions such as: the patience a team has for a younger player with early season struggles or whether the team tries for a mid- May trade to replace an injured starter and/or considers promoting a top notch AAA player despite his arb clock issues.  When trading season starts in June, 35% – 40% of the team’s record at that point has been baked in by April’s wins and losses.  Even by the late July deadline, April still comprises 25% of the season.  If early results have affected fan attitudes and attendance, ownership may be either more or less willing to commit dollars to bigger name players at the deadline.

These factors may give April importance beyond its mathematical impact on the standings.   That teams tend to mirror their April win-loss %’s as the season progresses may be in part that April can create a sort of self-fulfilling prophesy.   The 2014 Cleveland Indians were 75-60 after April 30th, the Oakland A’s were 70-63 over the same time. Yet Oakland got the wild card by 3 games over Cleveland due to a 18-9 April v. the Indians 10-17.   Early season results were still impacting July decisions as the A’s were buyers and the Indians sellers.   To be sure, the A’s record worsened post-July and the Indians got better.  But without Jon Lester (2.35  ERA with A’s) and Jeff Samardzija (3.14) who knows how much worse it might have been for the A’s .  While ridding themselves of Justin Masterson may have helped the Indians and trading Cabrera didn’t hurt, how much better would they have been had they gotten an OFer and starting pitcher in July instead of being sellers ?

Conversely, the 2016 White Sox benefited from a 17 -8 April despite a May tailspin which left them with a 29 -27 record on June 4th and only 2 games behind in the AL Central. They then traded a very talented younger prospect named Fernando Tatis Jr.  for James Shields.  Needless to say this is a trade that has not worked well short or long term.  Despite the fact that the Sox had 3 straight losing seasons prior to 2016,  management seemed to believe that April/early May represented the success they felt the team was capable of as opposed to the more recent reality of losing.  Successful Aprils can sometimes keep wishful thinking alive for too long.

CONCLUSIONS

Clearly April has proven to be a good proxy for the team’s chances going forward that season. But April records have to be viewed in context of all evidence.  How much of a factor were injuries, over or under performances, or new offseason acquisitions?  Thoughts of dumping contracts and rebuilding, while too premature for May 1st, are still logical for poorer prior year clubs who are off to bad starts.

For clubs in the playoff hunt, April can have real impact since 3-5 more or fewer wins can make/break. Since 2013 teams with 87-88 wins have made the playoffs in 10 of 11 cases where only 2 of 14 teams with 85-86 wins have. Clearly these margins apply to other months’ results too, but as noted before, a very good or bad April can affect team decisions in June and July.   Having some breathing room afforded by a 16-10 April instead of the catch-up pressure of a 10-16 start can play into the psychology as well.

Winning the division winner is far preferable to having to win a one-game playoff as a wild card and April provides a good checkup on division rivals. Both Boston and NY look like they will fight it out all year although Toronto can’t be ignored. In the Central, the Indians’ are in a strong position with their chief rival, the Twins, are 4.5 games back already and the rest of the Central bad teams who’ve started with bad Aprils.   Houston’s good start helps especially since all of their closest chasers (Mariners, Angels, A’s) were sub-.500 teams last year, but the Angels improved through offseason acquisitions.  In the NL East, even though the Nationals are 5 games back they’re chasing teams who were all sub-.500 last year (Mets, Braves, Phillies).  Where the Dodgers, who are 8 games back, are pursuing a D-Back team that won 93 last year.  Turner’s absence hurt in April but so did the lack of offense from the rest of the team which may continue particularly since Seager is lost for the year.    Plus the bullpen woes cannot be overlooked. So April’s 12-16 record cannot be easily dismissed as an aberration.  Nor can the historical evidence of diminished playoff odds (20-25% range) of good teams who’ve had the Dodgers’ kind of April.   We shall all see soon enough.


World Series Hangover: A Different Look

Following up on a recent Jay Jaffe post, I am examining the question of whether there is a World Series hangover. Unlike that post (which was great, but answered a slightly different question than I am interested in), I compared the full season performance of World Series winners and losers relative to their true talent level. I looked at all teams that went to the World Series in 2012-2016. I only went back to 2012 because that is far back as I could find projections in my less than thorough internet search. Finally, I omitted 2017 because they have played too few games this year for my purpose. As a proxy for true talent level I used projected wins from Clay Davenport.

Why did I look at actual versus projected win totals? I did not find changes in absolute win totals informative in terms of the question I was asking. Teams change year-by-year. Changes in absolute win total could simply reflect talent level changes. By using projected wins as a baseline I hoped to control for, at least somewhat, changes in talent level. Using projected wins as a baseline also allowed me to examine whether any changes in performance across years was due to over/under performing in the World Series year versus over/under performing in the year after the World Series.

Let us get to it. Below you will find the projected win total (pWins), the actual win total, and the average projected win total (average pWin) for the World Series teams in the year they went to the World Series (WSyear) and the year afterwards (WSyear+1). Busy figure, bear with me.

Win totals versus projected win totals

My main point here is that the average projected win total is similar for the World Series year and the year after (a 1.3 win increase). The second point is to show the raw data as good practice. Next, I cared about how the teams performed compared to their projections in each year. That information can be found in the figure above, but better yet, here is a figure showing actual win total minus projected win total for the year the teams went to the World Series.

Actual win total - projected win total (WS year)

This is interesting. Teams that went to the World Series outperformed their projections by 8.2 games on average. With the exception of the 2012 Tigers all teams outperformed their projections (note that the 2017 Astros and Dodgers outperformed their projected win totals by six and seven games, respectively). The probability of 9/10 teams outperforming their projected win total is 0.010. Teams that go to the World Series outperform their talent level. What about in the year after the World Series? Below is the same figure as above with the year after the World Series added.

Actual win total - projected in total (year after World Series)

Alright then. In their post World Series season teams have, on average, performed right at their true talent level (-0.8 wins). What have we learned? Obviously the sample is small and the data for the year after the World Series trip is quite noisy. That said, within this sample, teams were projected to win a similar number of games in their World Series year as the year after. They substantially outperformed their projections in the year they went to the World Series. They then came back to earth in the year after their World Series trip.

Keeping in mind my question was regarding a year-by-year change in a team’s performance relative to their true talent level, I conclude that there is a World Series hangover of a sort. Yet, its nature is quite different than one might think. Rather than teams underperforming after going to the World Series it appears that they over-performed in the year they went to the World Series. In other words, any World Series hangover may result from our powerful friend regression.


The Effect of Batted-Ball Direction on Launch Angle

Fortunately Statcast now has a function that allows to sort for batted ball direction. This opens the chance for some new studies. Until now we just had launch angle (LA) and exit velocity (EV), however, that is not quite perfect because we already new that it is easier to pull fly balls for power. This was known intuitively for a long time https://www.fangraphs.com/fantasy/getting-to-know-fly-ball-pull-percentage-fb-pull/ but was hard to quantify until now.

One of the effects is certainly that parks are bigger in center field than they are down either line. However I also looked at EV and average distance of balls pulled, hit to center and oppo at angles of 20-35 degrees which are typical HR angles. For this article I only looked at right handed hitters, -45 to -15 was defined as pull, -15 to 15 as center center and 15 to 45 degrees as oppo.

View post on imgur.com

You can see that pulled balls yield a 343 ft distance and 92.4 EV. To center it is slightly lower (91.4/338) but to opposite field it drops dramatically to 290/86.2. From a physics standpoint that makes sense because the contact on inside pitches is supposed to be further out front so that the swing is slightly longer and thus has more time to accelerate to contact which probably means more bat-speed at impact.

wOBA supports this, while liners are relatively stable in production, the wOBA of pulled fly balls is dramatically higher. On grounders this trend is reversed and oppo grounders are better than pulled grounders.

View post on imgur.com

I also looked at the top and bottom 20 of the league in pull and oppo LA:

View post on imgur.com

You can see that pull LA has a pronounced positive effect while oppo LA even has a slightly negative effect. It might make sense to try to lift more on pulled balls and slightly try to suppress LA (“get on top”) on oppo hit balls. Not sure if this is possible with the same swing though, I think usually the guys having a high FB pull rate also have high grounder pull rates because that is the natural tendency of the swing.

So it seems to be pretty simple: pull the ball in the air and be productive.

However it isn’t quite as simple. Already before Statcast it was known that pulled balls are hit on the ground at a much higher frequency https://www.fangraphs.com/blogs/the-pros-and-cons-of-pulling-the-baseball-2/.

Launch angle supports that, pulled balls last year had an average LA of 5.6 degrees vs 13.1 for balls up the middle and 20 degrees oppo.

This makes sense and actually is something that isn’t easily combatted with the modern swing. The modern swing goes slightly up and pulled balls are hit out front. You can lift a ball like this but if you are a little too far out front the bat has risen above the plane of the pitch which means you hit the top of the ball and roll over hitting a hard topspin grounder, often into the shift.

This is especially pronounced on low pitches.

There are some hitters who have developed a tool to combat that rolling over with the uppercut swing as I have shown in this article https://www.fangraphs.com/community/finding-keys-to-elevate-the-ball-more/ by using a steeper bat angle but it is not easy to do as the league still tends to have much lower launch angles on low and especially away pitches https://www.fangraphs.com/community/effect-of-pitch-selection-on-launch-angle-and-exit-velocity/.

I broke this down a little more looking at batted ball directions and pitch locations inside the zone

View post on imgur.com

You can see that low pitches that are pulled are especially hard to lift, most extreme is that on low and away pitches but even the down and in pitch only yields a modest 6 degree LA.

I also looked at pulled balls above 10 degrees on low pitches. The leaders in that stat were in this order Stanton, Machado, Salvador Perez, Hunter Renfroe, Nelson Cruz and Mookie Betts. Those were some pretty good hitters last year, so maybe that is a skill that deserves further examination.

We all have seen Bryce Harper pull outside pitches for a homer and it does happen but generally trying to pull anything away is not a good receipt. If it works it usually is on pitches up (still yields a positive 7 degree LA to pull up and away pitches).

An adjustment that might make sense is trying to hit up and away and middle away balls to center rather than the other way. That way you could bring down the average EV of those pitches from a too-high-upper-20s average EV to a better low 20s EV, which yields a better BABIP on those pitches which tend to yield lower EVs. I elaborated in this article why mid-20s LAs are ideal but actually average LAs should be lower (between like 12 and 18 or so)
https://www.fangraphs.com/community/why-launch-angle-can-only-be-optimized-not-maximized/.

Overall an LA optimizing strategy using batted ball direction could look like this.

View post on imgur.com

So pulling the ball is good but only if you have the skill to put it in the air. Selecting the right pitches to do it certainly helps. On pitches that are low and away it still makes sense to follow the old advice to hit it were it is pitched. And for pitchers it might make sense to work the outside corner more, however that is also a fine line since you need to prevent the old Jose Bautista strategy of creeping closer to the plate and turning the outside pitch into a middle pitch. For this you need to pitch inside some to keep the hitters honest.


Swing Speed: Exit Velocity’s More Impressive Cousin

This post was co-written by John Edwards. If you’re not already familiar with his work, you can (and should!) follow him here

Launch angle and exit velocity became a big deal when MLB released them through Statcast at the start of the 2016 season. They instantly told an old story in a new way. It wasn’t surprising to see Nelson Cruz, Giancarlo Stanton, or Miguel Cabrera at the top of the leaderboards. We knew they knocked the snot out of the ball. But now we knew that they knocked the snot out of the ball in excess of 110 mph or better, and at 34 degrees or better.

Two years later and the terms are nearly ubiquitous, even speckled through broadcasts. But they’re often provided without context as colorful notes in single instances. Do we really care how fast the ball went out in the moments we’re watching, or at what angle, as long as it went out? It doesn’t tell us how the dinger or double or snagged liner happened, just that it did; and we just saw it with our own eyes.

So, what about that how? What’s contributing to a player generating that record exit velo or optimal launch angle?

Swing speed.

Swing speed could help inform us of how well a player is tuned into their timing at the plate and where they’re making contact, both of which tell more of our old story in an exciting new way than launch angle or exit velo alone. But the problem with swing speed is we don’t have that data. It’s simply unavailable: while Baseball Savant used to feature bat speed it no longer does.

Fortunately, enough data exists that we have approximations to work with. David Marshall reverse-engineered the formula Baseball Savant used in calculating swing speed – and now we can play around with those numbers!

Let’s look at who the best hitters were by bat speed last season, with a minimum of 100 batted ball events.

swsp1

Since the formula for predicted bat speed is essentially average exit velocity (AEV) accounting for pitch speed, and AEV is the majority factor in the equation, the leaders in bat speed are also among the leaders in AEV. But there are still some differences, and they’re some very important differences! The speed at which a pitch comes in affects how fast it goes out, so players facing pitchers who throw harder might register lower average exit velocities than a player with comparable bat speed facing pitchers who throw slower.

But bat speed isn’t consistent from plate appearance to plate appearance. Sometimes you check your swing, other times you let loose. But there’s an important trade-off: many of the hitters with superb bat speed strike out frequently, and hitters with low bat speed (Mallex Smith, 50.0 MPH or Billy Hamilton, 51.0 MPH) make a lot of contact without striking out. Low bat speeds allow for more contact and fewer whiffs, but high bat speeds allow for better contact at the expense of greater whiffs. As a result, bat speed is loosely correlated to contact% (R^2 of 0.09), and better correlated to contact% than exit velocity (R^2 of 0.08).

swsp5

MLB hitters are aware of this loose correlation. Since 2015, they’ve swung .4 MPH slower than average with two strikes, collectively opting for more contact and foul balls so they can stay alive longer in at-bats. But the guys who are the best at managing this are among the best in the game at generating offense, and they don’t all necessarily slow down their swings at the same rate. However, each had a wOBA with two strikes roughly 15% better than overall league average in 2017.

swsp2

The leaders here show us multiple paths to success with two strikes in regard to players slowing down or speeding up their bats. Beyond that, we get a few bands of players worth noting. Anthony Rendon and Francisco Lindor were really in a class of their own last year when it came to generating offense. They’re the only two players who were about 80% better than the average player overall, and lost less than 60 percentage points of wOBA with two strikes. Rendon only swung .2 MPH slower in those instances while Lindor swung .9 MPH slower.

That’s not to say they were the best, though. Joey Votto (2.1 MPH slower), Austin Barnes (1.0), Mike Trout (.4), Bryce Harper (.4 faster), and Rhys Hoskins (.5 faster) generated the most offense with two strikes. Collectively, they were so much better than most of their peers that they were able to absorb a bigger drop in effectiveness with two strikes in the count and still pose a considerable threat.

Whether swinging slower or faster than average with two strikes, the way these players optimized their swing speed with two strikes informs us of their approach more than their launch angles or exit velos alone. But what about the guys at the other end of the spectrum?

Giancarlo Stanton and JD Martinez had the largest differences in offense created with two strikes of anyone in the league. You can see all the data here. Per John’s own Statcast database, they each had dips in wOBA of more than 160 points when their backs were up against the wall, implying that the way they sold out for power when they were down to their final strike really didn’t work in their favor. They both swung slower than the .4 MPH average drop in those instances, and a peak at their heat maps suggests they were way more willing to hack at offerings out of the zone, too.

A lot of their peers actually acted in a similar manner, too. It turns out that 40 of the 50 players who saw the biggest drop in wOBA with two strikes slowed down their bat in those counts. They’re even more diverse of a group of players than the ones who saw the least drop. There might not be another offensive context where you’ll see Carlos Correa ranked with Lonnie Chisenhall, or Jose Altuve with Michael A. Taylor, or Josh Donaldson with Patrick Kivlehan.

Examining players in this light provides a unique perspective to some of the game’s most critical moments. Despite the variance in the quality between these players, the 2017 approximations suggest that they didn’t exhibit much of a two strike approach at all. Slowing down your bat but expanding your strike zone to chase pitches that are inherently less hittable seems like a recipe for Ks. 

If swing speed can tell us who’s optimizing their approach at the plate — or who isn’t — can it also help us predict a outbreak? We compiled hitters with at least 100 batted ball events in 2016 and 2017 (using batted ball events since our predicted bat speed equation uses exit velocity), and saw which hitters saw the most improvement from 2016 to 2017. Unsurprisingly, swinging harder resulted in much better production at the plate.

swsp3

Conversely, most hitters who declined in bat speed declined in production (except for Delino DeShields, curiously enough).

swsp4

But having a slow bat speed isn’t necessarily a bad thing, nor should all players strive to increase their bat speed. We discussed previously how bat speed and contact% are inversely related — not swinging out of your shoes every at-bats means that you have better time to react to pitches and make contact.

For hitters like DeShields, Suzuki, and Gordon, they want as much contact as possible – their maximum bat-speed isn’t comparable to guys like Gallo and Judge, so there isn’t really a way to sell out for power here. Judge and Gallo can get away with striking out so much because the few balls that they put in play frequently go yard, but if someone like Gordon adopted that approach, the increase in power wouldn’t compensate for the increased strikeout rate.

Instead, Deshields, Suzuki, and Gordon produce by making as much contact as possible and relying on their speed to beat out hits on their weak contact. By relying on their speed and balls-in-play for production, it’s beneficial for these hitters to not swing out of their shoes.

Using bat speed to predict breakouts is similar to looking at exit velocity changes to predict breakouts, but has its trade-offs: it’s better in that it accounts for differences in pitch velocities faced, but it’s worse in that our bat speed predictions are only approximations.

They’re still something, though, and they give us more of a predictive look at what goes into making a great hitter than hearing about their launch angle or exit velo in isolated instances.


How Much Does April Matter?

                                      HOW MUCH DOES APRIL MATTER ?

April provides the first hard evidence of what may be in store for the new season. But like any other single month, it usually has some conspicuously extreme results.  Typically 40% of all teams start notably well (.600 +)  or poorly (.400 minus) where by season’s end only 12% of all teams are at those outer edges of win percentage.  With 85% of the season to go, plenty of time remains for fates to change.  Or do they really change that much ?        This study focuses on the April records of all teams from 2000-2017 compared to their post-April results and odds for making the playoffs.  Two issues are addressed.  First, how closely teams’ remaining five months have corresponded to their Aprils and second, what effect April has had on teams’ playoff chances.

              PREDICTICE VALUE OF APRIL FOR REMAINDER OF SEASON (ROS) RESULTS

April records of all teams from 2000-2017, were divided into six win % categories: Excellent (.650+ win % ), Good (.550 – .650), Slightly Positive  (.500 – .549), Slightly Negative (.450-.499),  Weak (.350 -.449) and Poor (below .350).   Teams in each April win category were compared to their post-April and full season win/loss percentages. The percentage of teams in each April category who played at playoff level (.580), contention level (.540+) or near-contention (.500) after April were also measured as well as the percentage of teams in each April win category who made the playoffs.  Following are the results:

April W/L               # April Win % Last 5 month Full Yr .580+ last 5 .540+ Last 5 .500+ Last 5 Made Playoffs Pct of All Pct of Playoff
Category Teams Average Win % Win % Months Months Months Teams Teams
.650+ 60 0.688 0.535    0.558 25% 53% 72% 60% 11% 23%
.550-.649 119 0.595 0.519 0.530 18% 44% 65% 43% 22% 33%
.500-.549 104 0.519 0.510 0.511 19% 37% 55% 33% 19% 22%
.450-.499 75 0.472 0.495 0.491 11% 29% 48% 20% 14% 10%
.350-.450 126 0.409 0.484 0.473 11% 25% 37% 14% 23% 11%
under.350 56 0.305 0.445 0.424 4% 7% 23% 4% 10%    1%

 

Each level of April performance has had better last 5 months performance and playoff chances than the level immediately below it. Contrasts between the top 1/3 and bottom 1/3 April teams are quite significant.  Two thirds of teams who’ve started well (.550 +) remain near contention by playing  .500 + ball afterward and roughly half make the playoffs.  Only one third of teams with .449 or worse Aprils play .500+ thereafter and just 9 % of these early strugglers have made the playoffs. Nearly 4 of every 5 of all playoff teams were .500+ or better in April and less than 1 in 10 playoff teams started .425 or worse. April has done a good job of quickly identifying contenders and non-contenders.

Much of that is due to April records strongly relating to prior season results. Current full season records also have had solid resemblance to prior full season records. So April simply confirms that many (if not most) teams are headed for the same general fate as last year. While these are generally reliable maxims, they’re hardly infallible. Many teams do change fortunes and sometimes April wrongly suggest things will stay the same.   Likewise April often may indicate a change is coming in the current season and it doesn’t materialize.   That will be explored next – just how often does April “fool” ?

HOW MANY TEAMS ARE APRIL “FOOLERS” ?

Roughly 3 in 8 teams had more than .100 pt change in their April v. rest of season (ROS) win percentage.          For teams who start extremely well or poorly, even huge April v. ROS differentials haven’t change their season’s destinies. 2003 Yankees started the year at .769 win %  but played “only” .596 ball the rest of the way and won over 100 games.  Conversely a .260 point jump in win % for the 2000 Tigers over the last 5 months still didn’t push them over .500 on the season. But for a significant number of teams the last 5 months can wash out much of the good or bad April does. Hot starters fall apart and teams buried in April can rise from the ashes.

For determining both the amount and type of April deception the logical starting point is prior year records. Fully 70 % of teams with 90+ wins the prior year have .500 + Aprils v. 37% for teams with 69 minus wins. Since so many teams tend to have Aprils which are “characteristic” of their prior season , measuring the true amount of “April deception” can be done from two angles: 1 – How many Aprils “characteristic” of last year give true v. false signals of another similar season ?  2 – How often do “uncharacteristic” Aprils end up being true v. false signals of a better or worse year ?

 1st – How reliable are “characteristic” Aprils.  The following chart illustrates this

Prior Yr Wins April Win % # teams % win 90+ % win 81 + % in Playoffs
90 + Wins .520 + 83     64 %    84%    62%
80 -89 Wins .520 + 71     48 %    76 %    49 %
70 -79 Wins .499 minus 73       5 %    22 %      7 %
Below 70 Wins .499 minus 65       3 %    6 %      6 %

 

As can be seen, prior year good teams (90+ wins) with good Aprils have had twice the chances of making the playoffs as an average team (30%). On the other side, prior year sub-.500 teams with sub-.500 Aprils have had very little chance of making postseason. Only 9 of 138 such teams have overcome their bad starts.   The percentage of teams finishing above .500 is also remarkably different.   80 % of good teams with good Aprils end up plus .500 on the year, where a mere 14 % of bad teams with bad Aprils do. “Characteristic” Aprils are highly reliable indicators of either continued contention or non-contention.    

“Highly reliable” does not mean “perfect”. The 90+ game winner/good April formula didn’t work for  defending world champions 2004 Marlins, 2013 Giants nor defending NL West champion 2008 D-Backs or 2005 Dodgers.  Nor did bad team/bad April deter the 2015 Rangers, 2011 D-Backs, and two Rockies teams (2009, 2007) plus the 2007 Cubs from rising up and making the playoffs.  But these are exceptions.

Applying these principles of “characteristic” Aprils to 2018 would bode well for the Red Sox, Yankees, Astros, Cubs and Diamondbacks. It would not for the Tigers, Rangers, Marlins, Padres, White Sox, Orioles, Royals and Reds.  With only 1 in 15 former sub-.500 /sub-.500 April teams having made the playoffs historical odds would indicate that none of the above eight teams will either.  Of course these clubs weren’t expected to contend but neither were the Braves, Phillies, Pirates, and Mets who were also bad teams from last year.  The difference is that latter four teams have had good Aprils.  So what do their “uncharacteristic” Aprils mean ?

 How often do “uncharacteristic” Aprils send true or false signals of change ?

After a surprising start in April many teams revert to their “true selves” (good or bad) in the last 5 months. But some percentage of uncharacteristic Aprils often correctly signal changes in a team’s fortunes. This time the data is parsed more finely to show where false and true indicators of change may appear.

Prior Yr Wins April Win % % Win 90+ % Win 81+ % in Playoffs
90 + Wins .401-.499   26 %    77 %     29 %
90+   Wins .399 minus   14 %    43 %    21 %
80 -89 Wins .401-.499   20 %    43 %    25 %
80 -89 Wins .399 minus   10 %    28 %    10 %
70 -79 Wins .600 +   39 %    72 %    44 %
70 -79 Wins .500 -.599   22 %    51 %    25 %
Below 70 Wins .600 +   18 %    64 %    18 %
Below 70 Wins .500 -.599     7 %    22 %    15 %

 

“Uncharacteristic” April at the extreme ends tend to be fairly accurate in signaling real change, particularly changes for the worse.  Very bad sub-.400 Aprils by former 81+ winners have shown that most are in trouble that season.    Nearly 60 % fail to achieve even a .500 season and only 15 % make the playoffs.  Some 90+ winners (21%)  with below .400 Aprils get off the mat and still make playoffs.  But that contrasts sharply with their prior year 90+ winning brethren who start April at .520 + and have made the playoffs 62% of the time.

What about extremely strong Aprils from previously bad teams ? Prior 70-79 winners who play .600 + in April have done quite well as 44 % have made the playoffs.  One caution, however, is small sample size as only 14 teams fall into this category. For really bad prior year teams (70 wins and below)  who start .600+ most achieved .500 seasons and 1 in 5 made the playoffs. This includes some notable turnarounds:  2000 Cardinals, 2000 White Sox, 2006 Tigers, 2012 Orioles, 2013 Red Sox, 2015 Astros, 2015 Cubs, 2017 Rockies, 2017 D-Backs.    Of course, excitement over great Aprils by prior bad teams should be tempered by the fact that most still have missed the playoffs. This may apply to expectations raised by the start of the 2018 Mets.  Although the 2015-2016 Mets were playoff teams so it’s quite possible their strong April 2018 could be a legitimate sign of revival.  Teams who gained 15 or more wins over the prior year showed an average of 120 points jump in April v. prior year win %.  The Mets have jumped 220 points.

Mildly uncharacteristic Aprils send wrong signals of change more often than not. Which is logical as a 65 game winner who goes 14-13 in April has shown less transformative evidence than one who’s 18-9. Nor has a former 100 game winner with a 12-13 April shown the same reason for concern as a 7 -18 start might.   Only 22 % of “below 70” teams with .500-.599 Aprils finished over .500. Former 70-79 winners with decent .500-.599 Aprils have upped their playoff odds but only to a subpar 20%. Seven teams fit this description in  2018 – Mariners, Pirates, Phillies, A’s, Braves, Blue Jays, Giants.  History says 1 will make it, 2 would be very optimistic.

However, one type of mildly uncharacteristic April is noteworthy. While previous 90+ winners with modestly bad .400-.499 Aprils still have a high recovery rate (77 % end up over .500) they’ve had a big drop in playoff odds. Only 29% make it after such starts.  That is not good news for 2018 Dodgers, Nationals or Twins whose goal is to make the playoffs.  Which brings up the next issue – how does April influence playoff chances ?

   WHAT EFFECT DOES APRIL HAVE ON MAKING THE PLAYOFFS  ?

One old adage is “you can’t win the pennant in April but you can lose it”. While there is definite truth in this, April has the fewest games of any month, and still leaves ample time to recover. The 2001 Oakland A’s had a miserable 8 -17 April but went on to win 102 games.  That said, they were the only sub-.400 April team to win 100 + games and only one of 8 teams who started that poorly and still made the playoffs.   Another 10 such bad start teams played the rest of the season at contender levels (.540+ ) yet failed to make the playoffs and their poor Aprils were instrumental in that.

While it is clearly possible to recover from a weak start, bad Aprils leave a diminished margin of error. Two-thirds of all playoff teams start the year solidly (.530+ ), the vast majority (78%) are at least .500 + and fully 95 % of all playoff teams have avoided disastrous (sub. 400) Aprils.   Teams who’ve stumbled early can’t count on fellow playoff contenders being in the same underwater boats.  They have to play serious catch-up with rivals whose yachts have begun to float away.  An “average” playoff team has a .564 April win percentage.  So a club with a 9 -15 April is trying to catch teams who’ve gone 14-10 or better.  If the 15-9 April teams plays at “only” .537 ROS and gets 88 wins, it takes a .580 ROS from the poor April starter to overcome that.  Playing at .580 + level ROS has been done by only 15% of all teams, which equates to being one of the top three teams in one’s league after May 1st.

Those 8 playoff teams who started sub-.400 averaged a .604 ROS win percentage.  Five of the eight were .580 + ROS and the lowest ROS win percentage was .572 which translates to 93 wins over a full season.    In addition, there were four teams with a .580+ ROS that missed the playoffs.   Despite being better ROS than their key divisional or wild card rivals those four teams (2004 Giants, 2005 Indians, 2011 Red Sox, 2012 Angels) lost out on the playoffs because of their inferior April records.

The other side is that very strong Aprils can provide a cushion to play at less than .550 ROS. There are even occasions where teams with less than a .520 ROS have made it in due to strong Aprils: 2016 Mets, 2015 Astros, 2014 A’s, 2006 Cardinals, and 2000 Yankees.  Ironically there are two World Series winners in that group (Cardinals and Yankees).   The Cardinals were particularly unusual as they were the only team to make the playoffs with a sub-.500 ROS record.

The lone .600 + ROS who missed the playoffs (2005 Indians) provide a classic example. They were the second best team in the AL after April, outplaying their division rival and subsequent World Series winner White Sox   84-55  to  82-56  ROS.   However, with the 17-7 April of the Sox, and Indians’ poor 9 -14 start, Sox gained a 7.5 game cushion.  The double whammy was that the Tribe’s poor April also cost them the wild card to Boston.   Had the Indians played even 12-13 instead of 9-14 they’d have ousted Boston.   Of course Cleveland’s 13-16 July didn’t help either, nor did going 1 -5 the last week of the season (including a 3 game sweep by the Sox) after the Indians had whittled the lead down to 1.5 games on Sept. 24.  But the April cushion built by the Sox allowed them to withstand an incredible Aug/Sept run by Cleveland and the Tribe was forced to play unbelievably well to stay in the hunt.   This all happened before the second wild card was introduced in 2012, and if it had applied back then Cleveland would have made it as that second wild card.   So has this second wild card now made it easier for April stumblers to recoup ?

 

HOW THE SECOND WILD CARD HAS CHANGED PLAYOFF ODDS

Adding a second wild card team has changed the odds in some meaningful ways as the following illustrates.

% of all teams who win 92 + games and make the playoffs % of all teams who win 87 -91 games and make the playoffs % who win 82-86 and make the playoffs
2012 – 2017               100 %               83 %         8 %
2000 – 2011                 94 %               45 %           7 %

 

Clearly 87 – 91 wins has had twice the chances of making the playoffs than before. Before 2012 teams needed to be stoking the engine every month to get to 92 wins. However with the bar now lowered to 88 wins since 2012, this allows more margin for error. That margin can go two ways. It can benefit teams who stumble in April and need a lesser ROS to get in.  In fact for teams who started less than .450 and then made the playoffs prior to 2012, the average ROS was .604.  For the three teams that have done that since 2012, the average ROS is .574.   But the post-2011 lower win threshold can also help teams who come out strong in April who don’t have to tear it up in the last 5 months to get in.   So which has it helped most ?  Following is a breakdown of the percentage of playoff teams after 2012 who’ve played at given ROS levels comparing those who had .600 + Aprils v. below .600 Aprils.

  REST OF SEASON WIN %’s            
April Win % 0.625 0.605 0.591 0.581 0.575 0.569 0.556 0.549 0.537 0.529 0.519 0.509
600 + Aprils 0% 9% 22% 26% 39% 48% 61% 61% 78% 78% 87% 96%
Below 600 6% 17% 31% 54% 63% 74% 80% 86% 91% 94% 100% 100%

 

Those who started .600 or better had a much lower burden to meet ROS.   Only 26% of the 600+ April teams who made playoffs achieved .580 ROS where over half of the below .600 Aprils had to meet that burden.  While having a .600+ April is no assurance of making the playoffs, it virtually is if teams play at least decently thereafter.   Here is what has happened to the 34 teams with 600+ Aprils from 2012-2017.

.600+ Aprils # Tms 550 + ROS 520-550 ROS 500-520 ROS Under 500 ROS
In Playoffs 23    14        6       3          0
Missed PO’s 11      0        2       2          7

 

As can be seen, 600+ April and 520+ ROS has been a successful formula 20 of the 22 times. The only two to miss were 2013 Rangers (91 wins) and 2012 Bucs (90 wins), who also happened to be the only 2 of 42 teams who’ve won 90+ since 2012 and missed the postseason. Playing .520 is hardly a torrid pace as it equates to 84 wins over a full season.   Playing at .500- .520 ROS isn’t playoff caliber yet 3 teams still made it, all helped by .650+ Aprils (2016 Mets, 2015 Astros, 2014 A’s).   The 7 teams who collapsed to sub-.500 after hot Aprils clearly didn’t deserve to get in.

One final cut of the data can establish who has been helped more by adding 2 more wild cards – the April surgers or the April stragglers. .565 ball equates to 92 wins and .520 ball equates to 84 wins . Teams who play at .565 levels ROS should make the playoffs.  Teams between .520-.564 should contend.

April Start Teams # PO teams .565 ROS –Made PO .520-.564 ROS-Made PO .500 -.519 ROS- PO
.565 + April

.500 + ROS

42    33                20  -20     15 – 10      7 – 3
.500-.565 A

.500 + ROS

24    16                12 – 12     10 –   4      2 – 0
.449- April

.500+ ROS

25    11               11 – 10     11 – 1     3 – 0

 

This defines the task confronting a good team with a sub-.500 April. Every year on average there are 4 such teams who rebound with .500+ ROS at a typical year looks like. However, they have to find a way to better several of the 11 teams who’ve started .500 + in April and are still +.500 and there are only 10 playoff spots.   Their only remedy is to play .565 + ball ROS and for one team (2012 Angels) even that wasn’t enough due to a terrible 8 -15 April.     But the .565+ April starter has it easier as a .520-.564 ROS gives him a 2/3 shot of still making it where only one team of 11 who started below .500 (2016 Giants) was able to eek into playoffs with less than .565 ROS.    The April .565 starter still has a shot with a tepid .500 – .520 ROS, but those are all teams as noted before that were red-hot in April (.650+).

Due to the abundance of teams starting well or decently (in April 2018, 13 teams were .565+ and 19 were .500+) the poor starters simply have to be a lot better ROS than their rivals to shove their way through the crowd.   The second wild card has benefitted the good April starters moreso than the good teams with bad Aprils. Of course the same math could be applied to any month, good or bad, so to paraphrase the classic Passover question:  why is April so different from all other months ?

PSYCHOLOGICAL EFFECTS OF GOOD OR BAD APRILS

Despite the fact that April is only 15% of the overall season, when early-mid season personnel decisions are being made, April results can still have considerable impact.   April’s record can influence decisions such as: the patience a team has for a younger player with early season struggles or whether the team tries for a mid- May trade to replace an injured starter and/or considers promoting a top notch AAA player despite his arb clock issues.  When trading season starts in June, 35% – 40% of the team’s record at that point has been baked in by April’s wins and losses.  Even by the late July deadline, April still comprises 25% of the season.  If early results have affected fan attitudes and attendance, ownership may be either more or less willing to commit dollars to bigger name players at the deadline.  These factors may give April importance beyond its mathematical impact on the standings.   That teams tend to mirror their April win-loss %’s as the season progresses may be in part that April can create a sort of self-fulfilling prophesy.

The 2014 Cleveland Indians were 75-60 after April 30th, the Oakland A’s were 70-63 over the same time. Yet Oakland got the wild card by 3 games over Cleveland due to a 18-9 April v. the Indians 10-17.   Early season results were still impacting July decisions as the A’s were buyers and the Indians sellers.   To be sure, the A’s record worsened post-July and the Indians got better.  But without Jon Lester (2.35  ERA with A’s) and Jeff Samardzija (3.14) who knows how much worse it might have been for the A’s .  While ridding themselves of Justin Masterson may have helped the Indians and trading Cabrera didn’t hurt, how much better would they have been had they gotten an outfielder and starting pitcher in July instead of being sellers ?

Conversely, the 2016 White Sox benefited from a 17 -8 April despite a May tailspin which left them with a 29 -27 record on June 4th and only 2 games behind in the AL Central. They then traded a very talented younger prospect named Fernando Tatis Jr.  for James Shields.  Needless to say this is a trade that has not worked well short or long term.  Despite the fact that the Sox had 3 straight losing seasons prior to 2016,  management seemed to believe that April/early May represented the success they felt the team was capable of as opposed to the more recent reality of losing.  Successful Aprils can sometimes keep wishful thinking alive for too long.

CONCLUSIONS

Clearly April has proven to be a good proxy for the team’s chances going forward that season. But April records, also have to be viewed in context of all the evidence.  How much of a factor were injuries, over or under performances, or new offseason acquisitions?  Thoughts of dumping contracts and rebuilding, while too premature for May 1st, are still quite logical particularly for poorer prior year clubs who are off to bad starts.

For clubs in the playoff hunt, April can have real impact since 3-5 wins one way or another can be make/break. Since 2013 teams with 87-88 wins have made the playoffs in 10 of 11 cases where only 2 of 14 teams with 85-86 wins have. Clearly these margins apply to other months’ results too, but as noted before, a very good or bad April can affect team decisions in June and July.   Having some breathing room afforded by a 16-10 April instead of the catch-up pressure of a 10-16 start can play into the psychology as well.

Winning the division winner is far preferable to having to win a one-game playoff as a wild card and April provides a good checkup on division rivals. Both Boston and NY look like they will fight it out all year although Toronto can’t be ignored. In the Central, the Indians’ are in a strong position with their chief rival, the Twins, are 4.5 games back already and the rest of the Central bad teams who’ve started with bad Aprils.   Houston’s good start helps especially since all of their closest chasers (Mariners, Angels, A’s) were sub-.500 teams last year, but the Angels improved through offseason acquisitions.  In the NL East, even though the Nationals are 5 games back they’re chasing teams who were all sub-.500 last year (Mets, Braves, Phillies).  Where the Dodgers, who are 8 games back, are pursuing a D-Back team that won 93 last year.  Turner’s absence hurt in April but so did the lack of offense from the rest of the team which may continue particularly since Seager is lost for the year.    Plus the bullpen woes cannot be overlooked. So April’s 12-16 record cannot be easily dismissed as an aberration.  Nor can the historical evidence of diminished playoff odds (20-25% range) of good teams who’ve had the Dodgers’ kind of April.   We shall all see soon enough.


The Other Former Pirates’ Pitcher

All stats and tables from Baseball Reference. All batted ball data from Baseball Savant.

After Gerrit Cole’s magnificent start on Friday, and indeed, his string of magnificent starts to open the season, it follows that we would hear a lot about him and his resurgence (some might say his breakout). Lost in the buzz around Cole is the start-of-season performance of another former teammate of his, Francisco Liriano. Liriano has not been otherworldly, but has so far looked more like he did in 2013 than in 2017. That is to say, he’s looked quite good!*

*Before you start playing the Small Sample Size Song, I know this is a small sample size, but I wanted to write about it, so stow your tunes at the doorstep.

Here’s a summary of his 6 starts so far, including Friday’s 7 innings of 1-run ball:

April 2

IP H R ER BB SO HR GB FB LD
Liriano 6.2 4 1 1 2 3 0 8 12 4

Pretty good for a guy that ran a 5.66 ERA last season! As you can see, Liriano pitched quite well in his first start, allowing only 6 baserunners over 6.2 innings, making for a WHIP just under 1. He also allowed only 4 line drives, and pitched the Tigers to a 6-1 victory over Kansas City. A pleasant surprise to open the season for the more pessimistic among us (such as me).

April 9

IP H R ER BB SO HR GB FB LD
Liriano 6 3 2 2 3 4 1 6 10 2

Another surprising start for our friend Francisco. This time, he only went 6 innings, and allowed 2 earned runs instead of 1, but I’d take that from my projected #5 any day. His WHIP here is exactly 1, and he allowed just 2 line drives this time around. More good results from the 34-year old.


April 17

IP H R ER BB SO HR GB FB LD
Liriano 5 5 2 2 3 7 1 7 5 4

Liriano didn’t look as good this time around, but he was never supposed to be an ace in Detroit. We presumably signed him for depth, and while the deal seemed a head-scratcher at the time, Al Avila is looking pretty smart now that Daniel Norris has gone down. This start, Liriano went 5 innings and allowed 8 baserunners along with 2 runs. He had his highest strikeout total of the season, but aside from fly balls allowed, everything else was worse. The thing is, this still isn’t a bad start. It is not, by definition, a Quality Start™, but it’s still a relatively okay one, and it’s certainly still above what we expected from Liriano.


April 22

IP H R ER BB SO HR GB FB LD
Liriano 5.1 2 3 3 4 6 1 4 8 3

Another passable outing from Francisco. 2 of his 3 earned runs came on the next pitcher allowing inherited runners to score, but that doesn’t change the box score, or the fact that he allowed 6 baserunners in 5.1 innings. His line drives were down, but if you’ve been paying attention, you’ll notice that he’s allowed a home run in every start except his first. I’d hesitate to call allowing a home run three starts in a row a definite trend, but it’s certainly starting to look like one. Hopefully Franky can turn it around soon.


April 28

IP H R ER BB SO HR GB FB LD
Liriano 6.1 6 3 3 2 1 0 11 11 6

This is more like it! This qualifies as a quality start. Again, 2 of the 3 earned runs were from inherited runners scoring, and again, that doesn’t change the box score, but it’s always nicer to see a 6 in the IP column than a 5. What’s more, Frenchy did not allow a home run for the first time in almost a month. The worries begin again when you take a look at the number of baserunners allowed (8), and are slightly heightened when you see that he only struck out one batter, but on the plus side, his 2 walks are the lowest he’s allowed since his first start with the Tigers.


May 4

IP H R ER BB SO HR GB FB LD
Liriano 7 3 1 1 2 5 0 11 5 0

Finally, we come to yesterday’s start, which by every metric is the best. Liriano pitched 7 strong innings, allowing 1 earned run on 5 baserunners. What’s more, not only did he not allow a home run — he didn’t allow a single line drive! With only 2 walks again, Francisco seemed to recapture the magic he’d shown in a majority of his 2018 starts.

 


 

This article was supposed to be uplifting for Tigers fans. I went into this thinking I’d be able to write nice things about Francisco Liriano and demonstrate that while he’s no Gerrit Cole, he’s still much better than people are giving him credit for.

To my great despair, this does not seem to be the case. Liriano sports a lovely 2.97 ERA and a very respectable WHIP of 1.073. These, plus his H/9 and BB/9, are down a striking amount from 2016, and all but his BB/9 are his best since 2006(!). Unfortunately, his peripherals tell a different story. Liriano’s FIP is 4.13, which is mostly attributable to his paltry 6.4 K/9, itself a point of concern — it’s his lowest ever, by 1.1. But the batted ball data paints an even more dismal picture for Francisco’s future.

Since 2016, Liriano’s hard hit percentage has remained pretty stable, from 33 to 33.7 to 32.7 this year. That puts him in the company of such luminaries as Jeff Samardzija, Brandon Morrow, and Sean Newcomb. That’s a little harsh; this year, he’s closer to guys like Justin Verlander, Clayton Kershaw, and Masahiro Tanaka. But it’s early in the season, and I’d still say Liriano comps closer with JC Ramirez than Corey Kluber.

The more worrying data lie in Liriano’s expected outcomes. Statcast measures expected wOBA (xwOBA) based on batted ball profiles and compares it to actual wOBA. Since Statcast began tracking batted balls in 2015, Liriano’s wOBA and xwOBA have remained within 15 points of each other. This season, there is a 109 point difference. A sobering number, to say the least. Couple that with the fact that his xSLG is a frightening .513, it seems our ostensibly-resurgent pitcher has just been exceedingly lucky. I haven’t watched a Liriano start yet this year, but I have listened to a few, and I distinctly recall hearing quite a few exclamations of astonishment from Dan Dickerson directed toward our middle infield.

What originally started as a post meant to proclaim the newfound prowess of a dubious offseason acquisition ended up as a bleak prediction for his future. But we must remember, in our unexpected despair, that this is baseball, and hope spring eternal for the simple reason that we really have no idea what could happen. Nobody could have predicted Andruw Jones’ death spiral, or Rick Ankiel’s conversion from pitcher to outfielder.

And so I remain foolishly optimistic that Liriano’s success is for real. If he starts to pitch poorly, I will probably appeal to small sample size until at least July, while ignoring the massive amount of cognitive dissonance required to hold that position and still write this article. Luckily for me, I don’t care. I am a Tigers fan first, and I am duty-bound to have faith in our players until their last breath, or at least their last breath in a Detroit uniform.

My realistic prediction is that Liriano will pitch to a 4.5-5 ERA for the rest of the season, and I won’t be disappointed. My homer prediction is that he continues to showcase his recaptured abilities and pitches to a 3-3.5 ERA. I will be pleasantly surprised if that happens. If it doesn’t, well, this team was supposed to suck anyway.


The Endless Possibilities of Franchy Cordero

At the end of April, Mike Petriello wrote on the most interesting rookie you need to know more about, Padres outfielder Franchy Cordero. The way Cordero hits the ball, paired with how he runs and can defend, make him more than intriguing. However, Petriello detailed that the margin of error within Cordero’s game could turn him into just about anything — be it Keon Broxton or Aaron Judge.

The two potential comps couldn’t possibly represent further opposite ends of the spectrum. Broxton was demoted to the minors last season and Judge was a Rookie of the Year winner and MVP candidate. So where will Cordero end up shading himself within that vast spectrum? Consider those three players in their first extended stint in the Majors.

Franchy1

Can you tell who’s who?

Players A and C are Judge and Broxton in 2016. The two had largely similar plate discipline. The biggest differences came in Broxton’s reluctance to chase out of the zone, which fueled a lower strikeout total and a high amount of walks. But in between them, as player B, is Cordero. He swung more, chased nearly as much as Judge, made the least contact in the zone by a big margin, and whiffed way more than any of the three. While these are just descriptive numbers — things we can look at after the fact — it’s easy to see how Cordero approaches the batter’s box similar as these two other guys whose difference in success could fill the Grand Canyon.

The real interesting part comes in looking at their plate discipline after their first extended stint in the Majors. It gives us a sense of how each player bought into their skill set, possibly based on the success they did or didn’t have in their debuts.

Franchy2

Notice anything? Broxton went one way in the season after his debut, while Judge and Cordero have gone the other, more productive way. Broxton simply did the things you don’t want a player to do. He reached out of the zone more, made less contact doing it, and created fewer free passes for himself. Judge reached out of the zone less, made more contact, and shrunk his K-BB rate to Rick Moranis levels. It’s funny how one decision can impact so many results.

Baseball GIF-downsized_large (3)

So far, Cordero’s second extended stint in the Majors has mimicked Judge’s. He’s trending in all the same ways and generating monster power while he’s at it. Petriello noted how he’s part of the roughly one percent of hitters in the entire league to have hit a ball 115+ mph in 2018. That single data point alone is enough to project a pretty positive profile. What he’s really doing to generate that kind of exit velocity, though, is optimizing his mechanics with his contact point. It’s even more impressive when you combine it with how he hits homers. Since Statcast went live in 2015, the greatest average home run distance from any single player is Carlos Gonzalez, at 421 feet. Giancarlo Stanton is second at 420 feet. Cordero has averaged 438 feet per bomb after six home runs so far in 2018.

The Padres seem to believe that Cordero isn’t a finished product just yet, but that he’s good enough to learn on the job. The biggest truth to that is probably most easily visible in his free swinging ways. When he came up last year, he struck out in more than 44% of his plate appearances while drawing a walk in only six percent. But this year, his improved discipline at the dish has resulted in a jump in walks of four percent and an 11% decrease in whiffs. He’s not quite stepping up to Aaron Judge levels, but he’s demonstrating that he’s learned two things. One is that he can let tempting pitches out of the zone go because the contact he does make is strong enough to wait for. The other is that just because a pitch is in the zone doesn’t mean he has to swing. In this sense, it’s like a pitcher sequencing his stuff. By letting pitches go that don’t necessarily play into his strengths, Cordero is giving himself more opportunities to meet the ones that do.

If you’ve been wondering about sample sizes for all the examples above, that’s fair. Most of them are relatively small, potentially opening them to scrutiny because they don’t provide us the stability we crave when evaluating players. But that doesn’t mean they’re useless. In this context, they act in two ways: as indicators of aggression in each of Judge, Broxton, and Cordero; and what they’ve each learned once they had a chance to stay up in the Majors.

Franchy Cordero has proven to be more than intriguing, and he’s found himself in a unique situation that many clubs wouldn’t likely provide a young player. But the Padres are in a unique spot, too, and Cordero, unheralded as he may be so far, may be critical in helping to elevate them in the standings as time moves on.

Plate discipline data from FanGraphs. Heat maps from Baseball Savant; gif made with Giphy. 


The Anatomy of 2,999

There is beauty in the penultimate. While hit number 3000 will be the moment that is played at Albert Pujols’ inevitable Hall of Fame ceremony, that milestone could only be reached due to the 2,999 victorious battles waged before it. This is the story of Miguel Castro vs. Albert Pujols. The following article focuses on the complicated beauty of everything that surrounded the penultimate hit of a cherished milestone. The following piece is also showcase of how being in touch with batting analytics can and should help managers make the correct bullpen calls.

Miguel Castro is a young, below average reliever. Since his trade from the Rockies to the Orioles in 2017, Castro has posted an ERA of 3.25 and a WAR of -0.1. These numbers are far superior to the ones posted during his stint with the Rockies, but they are not anything particularly special. During his development, Castro has all but ditched the fastball, as he initially (2015) threw it 63% of the time. By 2017, when he would first duel with the aging Pujols, batters saw a fastball from Castro a mere 1.7% of the time, with even less of a fastball dish rate so far in 2018. Castro now makes his career on Changeups, Sliders and especially Sinkers. Castro threw batters a Sinker 58.8% of the time in 2017, this puts his Sinker rate at 6th among 2017 relievers. These numbers have stayed relatively the same so far in 2018, although Castro has thrown slightly less Sinkers in favor of more Changeups. As baseball writers have lamented the death of the Sinker, Castro has been one of the few pitchers that still rely heavily on the dying pitch.

The Albert Pujols of St. Louis needs no introduction, he is one of the most prolific hitters of all time, and a future Hall of Famer. The Albert Pujols of Anaheim is a different player altogether. Much has been written recently on FanGraphs about the decline of Pujols, so I will spare those details here. Instead, I want to focus on how Castro allowed hit number 2,999 to occur against a batter that had been unable to get on base in all their previous meetings. 

In 5 meetings at the plate that span from August 18th 2017 to May 3rd 2018, Pujols has hit on Miguel Castro one time. On May 3rd, Pujols hit a 96 mph sinker (Castro’s average sinker speed this year) and in doing so acquired his 2,999th hit. In all of their three previous meetings Pujols hit into an out, and on their subsequent meeting Albert was hit by an inside Changeup. So what was different about their 4th meeting? For the first and only time, Castro threw a sinker close to the center of the strike zone. In their previous 3 meetings, Castro threw Sinkers on the inside and outside of the plate, as well as mixing in Sliders that got looking strikes on multiple occasions. On Thursday night however, after a Slider that got called a ball and, just like in previous encounters, a Slider that got Albert looking, Castro threw a Sinker down the middle-right, and paid the price.

From 2016 to 2017, Pujols’ Batting Average slid across the board against every single pitch but two. One of those pitches just happens to be Miguel Castro’s specialty, the Sinker. (The other is the Curveball.)  In fact, of all the pitches that Albert sees on any given day, he has the best chance to get on base while facing a Sinker by a wide margin. In 2017, Pujols batted .338 against the Sinker, compared to .250 against the Changeup, his next highest batting average against a given pitch. Average is not the only thing Albert was better at while facing a Sinker. His stats across the board are at their highest in 2017 and now 2018 when facing the Sinker. Pujols has a higher SLG% and more HRs when facing a Sinker. He had the most doubles in 2017 against the Sinker compared to any other pitch. One of the three Triples in his entire career came against a Sinker. In short, Albert undoubtedly likes to see a pitcher that throws Sinkers.

 

Analyzing Pujols’ batting average in the strike zone with and without the data for Sinkers since June 1st 2016 shows just how effective Albert has been against the afformentiond pitch. Almost every area of the strike zone saw an increase in average when attempts at Sinkers were factored in. Of special note is the mid to upper right quadrant, where averages increased in every sector. This is the area in which Castro threw the Sinker that would create Pujols’ 2,999th hit.

To futher analyze Pujols’ batting preference for Sinkers, I also compared the heatmaps of Albert’s average against Fastballs compared to Sinkers.

Unsuprisngly, we again see a great disparity between Pujols’ performance when facing Sinkers and when facing other types of pitches.

The conclusion here is that on Thursday night Buck Showalter replaced Chris Tillman with the worst possible choice. With runners on and Pujols’ soon coming up to bat, Showalter subbed in Castro, a pitcher whose main pitch was the favorite of the upcoming batter, who then summarily hit the Sinker into play and scored runs on a breezy double. An event that would put the former St. Louis slugger one hit way from history. If Baseball Clubs would have teams of analytics people, those who could have warned Showalter before he sent out Castro, teams could make more informed decisions about who to put out in relief in high risk situations as seen on Thursday night.

  • Data was sourced from Fangraphs and BaseballSavant

Thank you for reading! This is my first piece in the whole baseball analytics realm, and chances are this thing has logical fallacies or something of the like. Any helpful comments/critcism/pointers are much appreciated.


Umpires Disproportionately Eject Non-White Players

Anthony Rendon was ejected from a game earlier this month for … not contesting the strike zone. He flipped his bat down, faced away from the umpire, and did not visibly open his mouth. He was tossed by Marty Foster, for, what crew chief Joe West described incorrectly as ‘throwing equipment.’ (The pathologization of a non-white player’s actions after the fact to justify an ejection by a white ump is the subject of an entirely different set of analyses.)

After the game, Rendon actually went on record to say that umpires, like players, should be held to specific standards and demoted if they fail to meet those standards. This statement is remarkable for a couple of reasons. One, as most Nats fans know, getting Rendon to say anything, particularly anything of substance, to the media is pretty tough. He is, to forgive the pun, a pretty close-mouthed guy. For another, he points out that umpires, like players, are now doing their jobs in the Statcast era – we know, to a pretty refined degree, how well or not well they’re performing.

Players can be subject to replays that will tell them if their hand left the bag for the fraction of a fraction of a second, such as what happened to Jose Lobaton for the last out in the 8th in 2017 NLDS Game 5 (stay salty, my friends). But a home plate umpire’s word, particularly about the strike zone, is law. I understand ball vs. strike calls not being subject to replay. Even as someone who thinks most of the league’s pace-of-play ‘innovations’ are utter nonsense, I can’t see a good system in which every pitch could be subject to review. (Though, if the manager could make it one of their challenges, that’d be a start.) Umpires, therefore, should be held to the same standards, including performance reviews, as the players whose games they call.

The other thing that makes Rendon’s statement noticeable is that he’ll be facing the same umpiring crew in the final game against the Mets of this series and is likely to face them again this season. Saying that an umpire isn’t, in effect, doing their job commensurate with how Rendon is doing his is putting a pretty wide target on his own, and his team’s, back.

But beyond this instance, Rendon’s relatively mild approach to being struck out looking was disproportionately punished. He was ejected for not doing a whole heck of a lot, a punishment that seems incredibly disproportionate to a ‘crime’ that didn’t seem to go against MLB rules, written or unwritten.

I quickly tweeted out asking for an analysis of non-white versus white players in similar circumstances, because I had a hard time picturing a white player (like, say, Kris Bryant) being tossed for the same thing. Since no analysis existed, I did my own.

My analysis of available player ejection data from 2015-2017 led to the unmistakable conclusion: Non-white players, and Latino players in particular, are tossed at rates completely disproportionate to their representation in the league.

Methodology

Here’s a spreadsheet of data I compiled, mostly using Umpire Ejection Fantasy League data. I decided to limit it to 2015-7, in part because of use of Statcast and relatively consistent replay rules.

I also came at this analysis assuming any particular non-white and Latino player was as likely as any white, non-Latino player to be ejected, and so compared player ejections with league representation percentages for particular ethnicities. However, in doing analysis on position players only – that is, excluding pitchers – I didn’t have the league representation percentages adjusted for position players.

A major limitation in my data is having to hand-assign players as being white or non-white, and Latino or non-Latino. This was done using country of origin and knowledge of US-born players, and therefore is limited by my personal knowledge, particularly for US-born players. For instance, Marcus Stroman’s mother is from Puerto Rico and he was offered the chance to pitch for Team PR in the WBC. For the purposes of this analysis, he was classified as ‘nonwhite’ and ‘Latino.’

I also don’t know how players self-identify; I’m assuming Anthony Rendon, whose family is from Mexico and who was offered the opportunity to play for Team Mexico, self-identifies as Latino, but I don’t know if he’s stated that specifically. For non-US-born players, I also classified all players born in Latin American countries as Latino, but again, that’s not the same as asking for someone’s self-identification and that’s not the same as how any particular umpire perceives any particular player. For example, Francisco Cervelli, who is Italian and Venezuelan, was classified as non-white and Latino for this analysis.

I also classified Latino players as ‘non-white’ for the purposes of this analysis. While many Latinos self-identify as white, the Racial and Gender Report Card for Major League Baseball, where I got the league demographic data, identifies them as non-white and calculates them in the total of ‘players of color.’ So I maintained this classification for the purposes of this analysis. Any mistakes are unintentional; I welcome comments with suggestions for re-categorization.

Lastly, the umpiring corps has, as far as I know, not changed dramatically year to year. It’s a notoriously narrow pipeline and one almost entirely composed of white men. Analysis showed that some umpires toss players more than others, but this hasn’t been controlled for brawls. Additionally, the numbers of players tossed is a reflection of the number of games worked, which I haven’t controlled for.

This analysis isn’t meant to ascribe ejecting non-white and Latino players to any particular bad actor within the umpiring corps but to show a pattern of behavior.

 

The data:

2015 2016 2017 Grand Total
Non-white 50 52 29 131
Latino 39 42 22 103
Non-Latino 11 10 7 28
White 50 38 44 132
Non-Latino 50 38 44 132
Grand Total 100 90 73 263

 

Non-white players being ejected accounts for almost 50 percent of total ejections, despite players of color never being more than 42.5 percent of the league. Latino player ejections account for 38 percent of ejections, despite Latinos never being more than 31.9 percent of the league. Non-white, non-Latino players (of whom most are African-American), accounted for about 11 percent of ejections, fitting with representation in the league, except that no Asian players were ejected in this time period, and Asian players made up between 1.2 and 1.9 percent of the league. So, non-white, non-Latino, non-Asian players make up about 9-10 percent of the league and 11 percent of the ejections.

2017 is, therefore, a bit of a fluke. Of total players, nonwhite and Latino players were actually not tossed any more often (relative to their representation in the league) than their white peers.

 

Percentage of Total Ejections 2017        Percentage of the League
Non-white players ejections 39.7% 42.5%
Latino players ejections 30.1% 31.9%
Non-white, non-Latino players

ejections

9.6% 10.7%
White players ejections 60.3% 57.5%

 

I then controlled for two things:  pitcher ejections and ejections by non-home plate umpires, figuring that most pitcher ejections were as a result of beaning batters (which, yep, keep tossing them), and non-HP ejections might result from ejections during brawls, arguing slide calls, or in circumstances dissimilar to Rendon’s.

 

2015 2016 2017 Grand Total
Nonwhite 31 36 26 93
Latino 23 26 19 68
Non-Latino 8 10 7 25
White 26 26 32 84
Non-Latino 26 26 32 84
Grand Total 57 62 58 177

A few things became noticeable. One, overall ejections seem to be dropping, but non-pitcher, HP umpire ejections are holding pretty steady. Two, in 2015 and 2016, non-white position players comprised the majority of ejected players. Not only were non-white position players being ejected at a rate disproportionate to their representation in the league, they were being ejected more often than their white peers.

Latino position players were also being ejected by home plate umpires at rates disproportionate to league representation in 2015 and 2016 – 40 percent of ejections in 2015, despite Latino players being 29 percent of the league, and 42 percent of ejections in 2016, despite being 28.5 percent of the league.

For 2017, non-white and Latino players were ejected slightly more frequently than representation would account for.

Percentage of Total Ejections 2017 Percentage of the League
Non-white players ejections 44.8       42.5
Latino players ejections 32.8 31.9
Non-white, non-Latino players

ejections

12.1 10.7
White players ejections 55.2 57.5

So, is 2017  a step in the right direction or a flukey year or something else? No idea, and with the 2018 season being nascent, it’s hard to say. If there have been interventions on the part of the MLB or the umpire’s union, fantastic, but those interventions have not, as far as I’m aware, been made public.

I also know we won’t know for a while about 2018 ejections because ejections aren’t all timed equally. One of the weird things about this data is that white players tend to be ejected in early months, and non-white and Latino players make up the majority (or at least a disproportionate percentage) of ejections after May.

April    May   June   July   August Sept. October Grand Total
Nonwhite 14 17 27 24 22 25 2 131
White 18 37 16 15 19 24 3 132
Grand Total 32 54 43 39 41 49 5 263
April May June July August Sept. October Grand Total
Latino 10 14 18 20 18 21 2 103
Non – Latino 22 40 25 19 23 28 3 160
Grand Total 32 54 43 39 41 49 5 263

What this means is that the early ‘eye test’ for white and non-white players being ejected at similar rates won’t bear out in later months.

What if they deserve it?

None of this has addressed a fundamental question in considering ejections: Some guys have it coming. I tried to control for this in considering repeat offenders – that is, if there are certain players who, by virtue of reputation and absent any racial dynamics, just get tossed a lot.

Of the guys who’ve been tossed more than three times, the results are … very unsurprising:

Ian Kinsler    4    
Josh Donaldson    4
Mike Napoli    4
Matt Kemp    5
Yunel Escobar    5
Bryce Harper    7

Of these repeat offenders, Escobar and Kemp are non-white, and the former is Latino. The rest are the kind of love-’em-or-love-to-hate-’em white guys you might expect to make up such a list. So again, the eye test of ‘Bryce gets tossed too,’ doesn’t bear out when you look at the number of different players tossed total.

For ‘three-peaters’ – guys tossed 3 times in the past three seasons – of the 13 tossed three times, only two, Joey Votto and Justin Turner, aren’t Latino. And for players tossed once or twice – so not for having a rep as a showboat or arguer or ‘disrespectful’, 57 Latino and 115 non-Latino players have been tossed in 3 years. So, 33 percent of ejections have been for Latino players, despite the fact that Latinos averaged at 30 percent of the league’s players during this time. For non-white players, 76 non-white and 96 white players were ejected once or twice, meaning 44 percent of players ejected once or twice weren’t white, with the league averaging 41.5 percent non-white players during this time.

In totality, 36 percent of the players being tossed are Latino, and 46.5 percent of the players being tossed are non-white, both higher than their representation in the league.

If ejections are the league’s way of dealing with argumentation at the plate, we should consider that Latino players and non-white players are already disproportionately disciplined by their fellow players – and brawls are more likely to break out between players of different ethnicities.

We should also consider why players are perceived to ‘have it coming’ to them for arguing, ‘showboating,’ or other displays of either enthusiasm or disrespect, depending on your perspective, and why Latino and non-white players are dinged for it so more than their white peers for what are likely similar behaviors.

Umpiring by largely white umpires on increasingly non-white players is a cross-cultural conversation, one that’s monitored by 40,000 fans, TV viewers, and the ever-watchful eye of Statcast. The league has a vested interest in solidifying its presence in Latin American countries and in trying to encourage African-American players – who are a decreasing percentage of players overall – to continue with the game.

I don’t pretend to know what’s in an umpire’s heart (I assume pine tar and certificates for failed eye exams). I didn’t do this analysis to say that any particular umpire is actively thinking that they should eject a non-white or Latino player because they are non-white or Latino. What I discovered in doing all of this is that there is a very clear pattern of behavior among umpires when it comes to player ejections when the Statcast era is taken in its totality. An action may cause harm – in this case, an ump being more likely to throw out a non-white player – without any specific racist intent.

Additionally, the idea that umpires are enforcing ‘respect’ (and Joe West said that was Foster’s intent in tossing Rendon – “You have to do something or he loses all respect from the players.”) on non-white and Latino players is particularly galling. If non-white and Latino players are disproportionately perceived as ‘disrespectful’ of the game for similar actions as their white peers, such as tossing a bat after a strikeout, then the issue is perceptions and not players.

This, of course, is a societal issue beyond baseball. Analyses of behavioral perception by white teachers show that they tend to ascribe disrespectful, aggressive behaviors to non-white students at higher rates than they do to white students or than black teachers do with black students. Analysis of school punishments shows that black and Latino students are suspended and expelled at much higher rate than their white peers without any evidence they’re misbehaving more. So this is not a problem unique to player-umpire dynamics, but instead is one indicative of broader structural societal dynamics.

To work on addressing this as a structural issue, the league can change how it handles ejections. A few proposals:

  • All plays over which a player is ejected are automatically reviewable, including balls vs. strikes. If an umpire ejects a player on a strike call that, on review, is revealed to be a ball, the player isn’t ejected. If a player makes contact with an umpire, they should be ejected but if players see that there is a clear and objective appeals process to an ejection, my guess is that they’re more likely to calmly walk off than explode.
  • All ejections should be reviewed as part of a rigorous rating process for umpires. Umpires who repeatedly eject players for calls that, on review, they should not have made (such as a bad ball vs. strike call) should experience some form of penalty – by being demoted, retrained, or fined.
  • Umpires’ ability to call balls vs. strikes compared with what Statcast determines is in or out of the zone should be made publicly available. If an umpire is consistently below a certain percentage of accuracy, they should be demoted or retrained.
  • Player strikeout rates should be adjusted for umpire accuracy the way player defense is adjusted for particular ballparks.
  • Diversify the umpire corps. Currently, umpires are generally older white men who feel tasked with enforcing ‘respect’ from young, increasingly non-white players. I’m not saying that simply hiring more people of color (including women of color) is a cure-all for these kinds of issues, but diverse perspectives may mean a decrease in unintended slights between players and umpires, and a general change in player-umpire dynamics.
  • Radically, I would also like strike calls to be reviewable. They would cost a manager a challenge if incorrect like any other play. If a manager challenges – and is correct in challenging – strike calls repeatedly, then the umpire, and not the player or manager, should be held at fault.

If this sounds like we can replace a home plate umpire with Statcast for calling balls vs. strikes, then I’m for it. As the cliche goes, I didn’t watch the game for the umpiring, and if a computer can do what the umpires are doing in a fashion that doesn’t disproportionately penalize players of color, then I don’t see a downside.

I co-host Resting Pitch Face, a bi-weekly baseball podcast with a Nationals bias. I can be reached on Twitter at @sydrpfp.

 


Finding Keys to Elevate the Ball More

Everyone is looking for keys to get players to elevate the ball. One important point is certainly the so called attack angle. The attack angle is the angle of which the bat attacks the ball (uppercut, level or down). Baseball used to teach swinging down but now you actually want a small uppercut. Players use different cues to achieve that. Common cues are for example leaning slightly back to the catcher and work up with the front elbow.

Up in the zone elevating is pretty easy. The league average launch angle (LA) in the upper third of the zone is 20 degrees. Even Christian Yelich averages 15 degrees in the upper part oft the zone. In a prior analysis I also found out that LA in the upper part of the zone has little influence on wOBA, the 20 lowest average LA guys in the upper third actually had a slightly better wOBA than the 20 highest LA guys (.402 vs .393). 170 out of 182 hitters last year averaged 10-plus degrees.

That is very different low in the zone. The league average LA in the lower third was just 5 degrees and over 30 guys actually had a negative LA. Here the wOBA for the high LA guys is 80 points higher than the low LA guys. The difference is made low in the zone.

View post on imgur.com

So the key for the low LA guys is definitely still to lift the low pitch. So how can this be achieved? You definitely need to swing up and you also need to avoid rolling over and hitting a grounder to pull field which is what the sinker-ballers try to achieve.

One theory is that on low pitches you tilt the shoulders more down and hit with the bat pointing more to the ground. The cue is that for high pitches the bat turns more like a merry go round and on low pitches more like a ferris wheel.

This Ferris Wheel like path makes sure that the bat comes through more straight through from below rather than going across the ball which leads to rolling over.

Mike Trout is so good at this that he is able to sometimes even hit down and in pitches  to dead center for a bomb while most have to pull that ball. Jeff wrote a nice article about this: https://www.fangraphs.com/blogs/jabo-mike-trout-has-a-new-trick/

Of course this Ferris Wheel path also has his disadvantages, for example Trout used to be very bad on high pitches the first 4 years of his career. Still he got away with that because most pitchers would only pitch up like once per at bat and not live up in the zone so Trout would just take but ideally a batter would flatten out the bat up in the zone and swing steeper down which Trout actually did last year causing him to improve up.

But the traditional level bat, level shoulders cue is definitely hurting on low pitches and made the sinker so popular. Now that more guys learn the new swing path the sinker doesn’t work as well anymore but there are still hitters who struggle down (like Hosmer and Yelich).

The pitch up is getting more popular but it can not suppress launch angle. The high pitch lifts itself, when a pitcher pitches up he needs to compensate for the higher LA by more pop ups, lower EV and more Ks.

It is a good sign that Hosmer now thinks about swinging up more but if he wants to increase his LA he either needs to stop swinging at pitches in the lower third and target pitches up or change the rotation axis of his bat to more vertical on top of his attack angle because if you swing up but across the ball on low pitches all you do is hitting your grounders with more topspin.

I measured the vertical angle of some good and bad low ball hitters. On the left of the picture you have Yelich and Hosmer and the other pics are Ortiz, Trout and Votto who are all excellent low ball hitters. All pitches I chose were about knee high and on the inside of the plate because that affects the bat angle.

What you can see is that Hosmer and Yelich have an angle in the mid 20s while the other three are in the low to mid 40s.

View post on imgur.com

That is important because on low pitches the flat barrel will naturally rotate to the left causing top spin similar to a tennis top spin while the steep barrel will rotate up and on the line to CF. https://www.youtube.com/watch?v=MJHTQncT-AA

So to learn to elevate getting a positive attack angle by leaning slightly back and keeping the head over the rear hip during the turn and working the front elbow slightly up definitely is important, but you also need to match the rotation axis of the bat around its long axis with the height of the pitch. The old cue of not dropping the back shoulder and hit with a level bat has its merits on pitches above the waist where a too steep bat angle is indeed bad (see young Trout) but on pitches mid-thigh to knee height this cue is very destructive. In the upper third of the zone the bat angle will be relatively flat ,but in the lower third you need to drop the back shoulder and tilt the rotation axis oft he bat down to around 40-45 degrees.

That means changing the swing isn’t that easy, you have to account for several things. It can be done but it is some work, will we see Hosmer and Yelich making all those adjustments? If they don’t make it they could also adjust less and try to just avoid hitting the low pitch but of course, that would eventually give the pitchers an opening to exploit.

So far there is no improvement for Hosmer. It is early but his GB rate is 58%. He either needs to stay away from the low pitches and target pitches up (and away in his case) or make more changes to his swing.