Gregory Polanco’s Power Struggle

Baseball has spoiled us. For the past two years baseball teams, spectators, and analysts have been in awe of the young talent in the game. Last year,  the likes of Yasiel Puig and Jose Fernandez came out of nowhere and dominated the league. The year before, Mike Trout and Bryce Harper came up  and were beyond exceptional baseball players.

This year has been a little bit dry in terms of top prospects coming up and turning into the best players in the league. George Springer has been an exciting player to watch but not dominant like Trout, Puig or Fernandez. There’s a case to be made for Jose Abreu, but he was an international signing. Abreu wasn’t developed in the White Sox’ farm system for multiple years. Instead, Abreu was major league ready upon signing his contract.

The other top prospect whose performance hasn’t been captivating — at least during his first month in the big leagues — is Gregory Polanco.

Polanco is considered an extremely toolsy young player. Most scouting reports agree that Polanco has a pretty good glove — profiling as a corner outfielder — and he has a good arm.  Polanco’s bat has been more of a projection than a reality. In the minors it was easy to see that he had some talent. In his minor league career he put up an .842 OPS. There were still some rough edges around Polanco’s game as his ability to hit for power fluctuated during his time in the minors.

Season Team PA ISO SLG
2010 Pirates (R) 200 0.085 0.287
2011 Pirates (R) 203 0.124 0.361
2011 Pirates (A-) 10 0.000 0.100
2012 Pirates (A) 485 0.197 0.522
2013 Pirate (A+) 241 0.161 0.472
2013 Pirates (AA) 286 0.144 0.407
2013 Pirates (AAA) 9 0.000 0.222
2014 Pirate (AAA) 274 0.194 0.540

There were times in Polanco’s minor-league career in which he hit for decent power, but he definitely bounced back and forth between having above-average power and power outages. In his 2014 Pittsburgh Pirates Top Prospects post, Marc Hulet described some of Polanco’s difficulties:

“At the plate, he flashes the ability to hit for both average and power but he’s still learning to identify and handle breaking balls.”

Polanco has been in the big leagues for about 40 games. In his first couple of plate appearance he looked like a pretty dynamic toolsy outfielder. However, as he has accrued more plate appearances, it has become apparent that he is still an unfinished product. Polanco has hit .247/.324/.352. which has been good for a WAR of 0.2. This is a pretty small sample size of only 183 PA’s, however Polanco has been having difficulty hitting for power. Right now Polanco is getting on base a decent amount, but his lack of power offsets the value of his ability to get on base.

Pitch Type Count AB K BB HBP 1B 2B 3B HR BAA SLG ISO BABIP
Fourseam 230 46 6 8 0 11 2  0  0 .283 .326 .044 .325
Sinker 207 37 6 8 0 6  0  0 0 .162 .162 .000 .194
Change 57 13 3 0 0 6  0  0  0 .462 .462 .000 .600
Slider 111 32 9 2 0 5  0 0 3 .250 .531 .281 .250
Curve 56 12 5 1 0 1 0  0 1 .167 .417 .250 .167
Cutter 66 14 5 1  0 3  0 0 1 .286 .500 .214 .375
Split 26 9 2 0  0 1  0 0 0 .111 .111 .000 .143
Slow Curve 1 0 0 0 0  0  0 0 0 .000 .000 .000 .000

Pitchers have mostly fed Polanco fastballs. In his first 40 games, Polanco has seen a fastball around 60% of the time, and that’s the pitch against which he seems to be able to have the most success. Polanco also doesn’t hit for a lot of power when he does make contact with a fastball (.044 ISO against fastballs). Despite that, Polanco was known for not being able to handle breaking balls — in the minors — so far he has actually been able to take advantage of some curveballs, cutters and sliders. All five of the home runs he has hit have come off of either a slider, a curveball or a cutter.  The story seems to be that he can catch up to the hard stuff, but doing something meaningful with it is a different story.

We’re dealing with a small sample size,  so we don’t know whether Polanco has a propensity for hitting sliders, cutters, and curveballs a long ways. Other off-speed/breaking pitches — such as sinkers and splitters — have been difficult for Polanco to hit.

2014 O-Swing% Z-Swing% Swing% O-Contact% Z-Contact% Contact% Zone% F-Strike% SwStr%
League 30.6% 65.7% 46.4% 66.0% 87.4% 79.6% 45.1% 60.1% 9.2%
Polanco 28.3% 59.3% 41.4% 53.3% 88.3% 74.4% 42.1% 54.2% 10.0%

Polanco is still showing good knowledge of the strike zone as he’s drawing a lot of walks this season. Despite his proclivity for drawing walks, there are some issues with his plate discipline stats. These mainly have to do with his ability to make contact. While Polanco does swing at lower number of pitches outside of the zone  — with an O-Swing%  of 28% — he only makes contact with those pitches 53% of the time.

The only good part of Polanco’s offensive performance has been his ability to draw walks. However, if you’re not making enough contact and not hitting for power, that doesn’t make you a productive major-league hitter.

Polanco has not necessarily been a great hitter in his first couple of months in the majors. I’m sure some people expected Polanco to step in and be one of the best hitters in the game. The truth is that this game is incredibly hard, and there are certainly growing pains when it comes to young players. Polanco has to figure out how to hit for more power to be a useful major leaguer. He hasn’t done that.

That doesn’t mean that Polanco won’t do it; we are dealing with a small sample size of just 183 PA’s, and it’s completely plausible that he makes the adjustment necessary to fix his problems. Polanco showed that he could adjust quickly.  That’s why he rose so quickly through the minors; he was able to adjust to the new challenges each level presented him. Demoting Polanco is completely pointless as he hasn’t been bad, and he has nothing left to prove in the minors.

Polanco now needs to prove himself in the majors, and that might just take some time.


Collins Working the Lineup

Over the course of 162 games, there’s only so much influence a manager of any baseball team could have over their outcome. After 105 games the Mets actual record is 3 wins shy of their projected record of 53-52, making this a .500 team. Several factors contribute to this discrepancy like losing your ace pitcher to injury, scrambling for a closer to begin the season, developing a major league catcher, adapting to a new hitting coaches philosophy, and setting the most productive lineup possible just to name a few. What Terry Collins has done with this team to this point can only be admired, but help has arrived and changes must be made to maximize team production.

The move of Curtis Granderson from the cleanup to leadoff role proved to be successful as the team surged from June’s end through July. Daniel Murphy and Curtis Granderson’s slash line numbers are almost identical, batting average is the only big difference which Daniel Murphy leads Granderson by about.060 AVG points and make him a more ideal leadoff hitter. Curtis Granderson hit 6 home runs from the leadoff spot which minimized his RBI potential which essentially is the reason Sandy Alderson signed him. In moving Daniel Murphy into the leadoff spot, the Mets actually increase their leadoff OBP while putting Curtis Granderson into a role where his RBI opportunities increase dramatically.

Daniel Murphy’s SLG% is nearly that of Curtis Granderson with half as many HRs, meaning that Daniel Murphy is doing a better job of getting into scoring position than our current leadoff hitter. The only 2 reasons the Mets have kept Murphy out of the leadoff spot in the past were lack of speed on the basepaths and low OBP. Now Daniel leads our starting players in SB showing he has some speed and base running ability and his OBP is amongst the team leaders. David Wright being the best hitter on the team (despite struggles in 2014) deserves the 2nd spot in the order. His power has declined this season, however his OBP is still respectable and he should remain in a table-setting role followed by Granderson. Lucas Duda has earned his cleanup role as he’s hit over .280 in the past couple of months with at least 5 HRs per month. He is driving the ball to all fields and should be a key contributor to driving in runs once our table-setters do their jobs.

The top 4 lineup spots should be configured as follows:

1  2B Daniel Murphy        (.293/.340/.412) 28 2B, 7HR, 11SB

2  3B David Wright           (.278/.339/.401) 24 2B, 8HR, 5SB

3  RF Curtis Granderson  (.232/.339/.415) 18 2B, 15HR, 8SB

4  1B Lucas Duda               (.259/.356/.500) 22 2B, 18HR, 3SB

For the next spot in the lineup, this player has had a tale of 2 seasons. Travis d’Arnaud has adjusted quickly since his demotion to AAA on June 6th. Since being recalled on June 24th, d’Arnaud has a slash line of (.302/.337/.646). He has lengthened our lineup and has earned the spot of the 5 hitter.

5  C Travis d’Arnaud

Before June 6th demotion    (.180/.271/.320) 3 2B, 3HR

Since June 24th Promotion (.302/.337/.646) 7 2B, 4HR

Season Stats                            (.232/.298/.379) 10 2B, 7HR

Right after Travis d’Arnaud in the Mets order is when they begin to look thin offensively. Having early success in the season but struggling as of recent is Juan Lagares, the defensive wizard and minor league doubles machine. This kid showed an advanced approach to lead off the year and is capable of making the bottom of our order a productive one. He isn’t seeing the ball well like he was in the first half, but we need to remember he is in his first full season in the bigs and known primarily for his route to catch baseballs and cannon for an arm, any offense is a plus.

6  CF Juan Lagares (.271/.306/.375) 16 2B, 2HR, 2SB

7  RF Chris Young/Eric Young/Kirk Nieuwenhuis/Bobby Abreu/denDekker

Our right field position is a question mark. I’m not saying the Mets haven’t produced anything from the position, but they don’t have an everyday right fielder which is a need to be addressed in the off-season or via trade before Thursday’s deadline. Though not one player has stepped up and taken over this position, I still believe they have produced more than my “ideal” 8 hitter, Ruben Tejada. In every championship team there is that one scrappy player that is on the squad solely for defensive prowess. Through the course of the season I have seen many different Ruben Tejadas. I’ve seen the defensive shortstop, the slap hitter, the kid in way over his head, and the wanna-be slugger with warning track power. This player is undoubtedly our 8 hitter and those who look too dependently on his OBP must take into consideration how many times he has walked for the sole reason that the worst hitting pitching staff is just 4 pitches away.

Ruben has been intentionally walked 10 times, twice as much as any player on the Mets. Ruben Tejada hasn’t defended the way he has in the past which quieted his lack of offense. In a New York setting, he shouldn’t start and the Mets executives know that. Ruben is a bridge to the future, an inexpensive filler until we land in a position of contention where an offensive producer is necessary at the position. Until then we have a shortstop with a strong arm and instincts but lacks the speed to get too many balls up the middle or steal a base when we need him to. He has no power and is offensively irrelevant as his slash line below shows. A shortstop with any tools is an upgrade here.

8  SS Ruben Tejada (.226/.351/.281) 9 2B, 2HR, 1SB


xHitting (Part 4): 2014 Fantasy Edition!

Welcome to the fourth installment of xHitting!  As always, reader comments and feedback are super encouraged and appreciated.  (Links to parts one, two, and three)

Briefly recapping the method, the gist is to estimate the expected rate of each individual hit type based on a player’s underlying peripherals, and in turn recover all the needed components to compute expected versions of wOBA, OPS, etc.  The only real change to the model since last time is that I now utilize a “hybrid” predicted home run rate, that averages between actual and (raw) predicted home run rate, with the weight given to actual HR rate increasing in the number of plate appearances.  (This is explained in part three, for those curious.)

Perhaps the more exciting change, though, is that this time I actually have results for an ongoing season, which potentially can help for fantasy purposes.  (Not that most readers need my help necessarily.)  Related to fantasy usage, there were a few requests to see a full spreadsheet of past results (2010-2013 seasons), which I have posted here.  Again feel free to take it or leave it at your leisure.

Note: I collected most of these data at the All-Star Break, so numbers may be a few weeks behind, but they’re still mostly true.  Also, for time considerations I only fetched 2014 stats for qualified leaders.  This even leaves out a few big names, but I couldn’t justify time to fetch every player.

So far, I’ve typically posted the biggest “over-” and “under”-achievers for a given season.  And I suppose I’ll continue that tradition today.  But while these lists are useful for highlighting which players seem most likely to regress, it overlooks another main use of the model, which is to assess the realness of a player’s apparent “breakout” or “decline;” at least in-sample.  (In some cases, the model may think that a player’s breakout is entirely justified, given peripherals, while others it may view more skeptically.)  Thus, today I’ll also post a second list, of players who seem to have taken a pronounced step forward/step back this season, and what the model thinks of their season-to-date performance.

Okay, time for results!  I’ll start with the list of “over-” and “underachievers.”

2014 Underachievers (1st half) 2014 Overachievers (1st half)
Name wOBA xWOBA Diff Name wOBA xWOBA Diff
Jean Segura 0.256 0.305 -0.049 Casey McGehee 0.345 0.277 0.068
Chris Davis 0.306 0.353 -0.047 Yasiel Puig 0.398 0.340 0.058
Mark Teixeira 0.352 0.397 -0.045 Matt Adams 0.376 0.324 0.052
Gerardo Parra 0.289 0.327 -0.038 Mike Trout 0.428 0.381 0.047
Brian McCann 0.298 0.330 -0.032 Marcell Ozuna 0.343 0.300 0.043
Torii Hunter 0.323 0.355 -0.032 Lonnie Chisenhall 0.396 0.359 0.037
Joe Mauer 0.308 0.340 -0.032 Scooter Gennett 0.355 0.320 0.035
Jimmy Rollins 0.320 0.352 -0.032 Marlon Byrd 0.344 0.309 0.035
Brian Roberts 0.304 0.334 -0.030 Giancarlo Stanton 0.397 0.363 0.034
Buster Posey 0.326 0.352 -0.026 Hunter Pence 0.359 0.325 0.034

A general pattern I notice is that, having worked with this model for a while now, there do seem to be players that give the model some trouble and have a disproportionate tendency to appear on this list from year to year.  A few of these players appear on this list… more on that later.

Partly for that reason, I wouldn’t necessarily say to “buy low” the guys on the left, nor “sell high” the guys on the right; although you can if you want.  I won’t address every player, but I have some scattered comments:

  • For readers who prefer OPS, .020 wOBA translates to about .050 OPS, on the margin.
  • .397 predicted for Teixeira?  Not sure where that came from…
  • Poor Segura.  All things considered, I think nobody deserves a big second half more than he does.
  • Whatever happened to Casey McGehee’s power?  The guy once hit 23 home runs in a season, but now has ISO of .073, with surprisingly low fly ball distance.
  • Although Chisenhall’s breakout is not as impressive if you take out what the model thinks is luck, it’s still a pretty impressive improvement.
  • Chris Davis is sort of the reverse of Chisenhall.  Adding back in what the model thinks has been bad luck, he’s still way down from what he did last year, but not nearly as disappointing as he probably has been to many owners thus far.

As mentioned, certain players do seem to be able to over/underperform the model somewhat consistently; the same way we think some pitchers are usually better or worse than their FIP.  With now 4.5 years of data to work with, however, I think I can make educated guesses about which players systematically deviate from the model predictions.  I’ll term this deviation the “player fixed effect.”

(Requiring at least 1000 PA from 2010 through 2014 first half)

Model loves too much Model loves too little
Name Player FE
estimate (wOBA)
Name Player FE
estimate (wOBA)
Brian Roberts -0.033 Wilson Betemit 0.032
Todd Helton -0.026 Brandon Moss 0.032
Jean Segura -0.026 Ryan Sweeney 0.028
Jose Lopez -0.025 Mike Trout 0.027
Mark Teixeira -0.025 Peter Bourjos 0.026
Russell Martin -0.024 Matt Carpenter 0.025
Darwin Barney -0.023 Brandon Belt 0.025
Chris Getz -0.023 Melky Cabrera 0.025
Jimmy Rollins -0.021 Carlos Ruiz 0.024
Jason Bay -0.020 Chris Johnson 0.024

Comments:

  • Again, .020 wOBA is equivalent to about .050 OPS, on the margin.
  • Taking out their apparent fixed effect, Teixeira is only underperforming his xWOBA by about .020, and Brian Roberts is actually doing about par.
  • On the reverse side, Mike Trout’s “adjusted” xWOBA jumps up to .408, where really it probably doesn’t surprise us that he’s outperforming even that, since he’s Mike Trout.  And although Giancarlo Stanton misses the Top 10 cutoff above, his apparent fixed effect of +.022 would be 11th; so his “adjusted” xWOBA is more like .385.
  • Yasiel Puig (.058) would also be on the list of “positive fixed effects” if we relaxed the PA requirement (he has 826 during this time).  And Matt Adams (~.040) might also be well on his way to that list; although he has fewer plate appearances still than Puig.
  • I don’t really have good explanations/know any common themes for players with negative fixed effects.  Maybe readers can help?
  • For Trout, home runs are pretty clearly the area where the model underestimates him.  In any given season (2010-2014), he hits about twice as many HR as the model thinks he should in the “raw” prediction.
  • And Trout’s not the only “HR rate defier,” either; just the most salient.  In general, the model has never done as well with home runs as it does with singles, doubles, and triples.  It seems there are other important determinants of home run hitting that really should be in the model, but currently are not.  Intuitively, I sort of would like velocity and angle of the ball off the bat, but so far have not found a good data source to actually include these.  (Maybe that will change in the coming years as MLBAM releases “Hit F/X” style data?)  Until then, reader suggestions are also super welcome here.

And now, finally, for the other usage: here’s a partial list of players who have taken either a pronounced step forward or back this season, relative to established norms.

2014 “Decliners” 2014 “Improvers”
Name Career wOBA 2014 wOBA 2014 xWOBA Name Career wOBA 2014 wOBA 2014 xWOBA
Nick Swisher 0.352 0.285 0.305 Michael Brantley 0.324 0.394 0.404
Joe Mauer 0.373 0.308 0.340 Lonnie Chisenhall 0.328 0.396 0.359
Allen Craig 0.350 0.289 0.309 Seth Smith* 0.334 0.389 0.356
Billy Butler 0.352 0.300 0.309 Victor Martinez 0.362 0.416 0.422
Evan Longoria 0.365 0.315 0.323 Jonathan Lucroy 0.342 0.383 0.354
Domonic Brown 0.315 0.267 0.267 Anthony Rizzo 0.342 0.382 0.382
Chris Davis 0.351 0.306 0.353 Nelson Cruz 0.356 0.393 0.380
Matt Holliday* 0.385 0.342 0.318 Jose Altuve 0.319 0.356 0.325
Jean Segura 0.299 0.256 0.305 Brian Dozier 0.311 0.344 0.362
David Wright 0.377 0.335 0.305 Kyle Seager 0.334 0.367 0.344
Buster Posey 0.366 0.326 0.352 Dee Gordon 0.297 0.329 0.318
Shin-Soo Choo 0.369 0.333 0.346 Alcides Escobar 0.284 0.312 0.300
Dustin Pedroia 0.356 0.325 0.337 Casey McGehee 0.321 0.345 0.277
Jed Lowrie 0.327 0.297 0.305
Jay Bruce 0.343 0.315 0.326

* – To avoid inflation from Coors Field, for these players I’ve taken the total from 2011-13 seasons only

Comments:

  • At least in-sample, Brantley’s breakout seems to be pretty much entirely justified.  Of course this doesn’t mean that he won’t regress somewhat, but if I were to guess, I’m a little more optimistic than ZiPS and Steamer (which currently project .341 and .333 RoS, respectively).  Similar deal for some others.
  • “Yikes” for Billy Butler and Domonic Brown, whose declines this season seem (at least in-sample) to be entirely justified.
  • I’m not sure why the model dislikes Casey McGehee so much.  Obviously his fly ball distance (mentioned earlier) isn’t doing him any favors, and his .369 first-half BABIP is probably unsustainable.  Still, .277 xWOBA?  Seems harsh.

As with any fantasy advice, don’t take any of this too literally…  Take it or leave it as you see fit.

Lastly, although I hyped this piece from a fantasy perspective, the overall goal remains that I would love to see more work done to de-luck hitter stats, the way people do so often for pitchers.  (FIP for pitchers, and xWOBA or xWRC+ for hitters! Is the dream.)

Reader thoughts on how to improve the model, or requests for players not already mentioned?


Projecting the Mariners

At the time of writing, the Seattle Mariners are 54-50 and 0.5 games back in the race for the second wild card spot. With rumors flying as to the upcoming trade deadline, the benefit of selling prospects to improve their odds varies heavily based on how view their current roster. Dave Cameron rang the warning bell for all the of the second wild card contenders, pointing out that these teams are vying for a one game playoff in Anaheim. Not such a great prize.

But. The Mariners haven’t made the playoffs in 13 years and the prospect of a wild card spot has made this most exciting campaign in a decade. So how good are the Mariners? At 54-50, the Mariners have won at a clip of .519, but FanGraphs’ projection models paint a very different picture of the team. FanGraphs’ playoff odds page believes the Mariners are a .503 team going forward. Not so hot.

But. The Mariners’ season-to-date-stats projection from FanGraphs is a lot more favorable, suggesting the true talent level of M’s, based on CoolStandings’s version of Pythagenpat adjusted for remaining schedule, is .557. To put that in context, it’s the 4th best in baseball, behind the third-place Angels (.572) and 5th place Dodgers (.549). The Mariners, by this measure, are the both the 3rd best team in the AL West and the American League.

The CoolStandings model, like that of Baseball Prospectus, relies on base runs and a modified Bill James Pythagorean W-L. As best I understand it, this means that the Mariners, by base runs, have been ‘unlucky’ to the tune of the gap between their projected rest of season W% and their current W%, for a loss of 0.038 W%. 3.8% of the M’s 104 games already played comes out to 4 wins, so the Mariners’ ‘true talent level’ record would be 58-46. That’s still a good margin worse than the Angels, but it’s also well above all the other wild card contenders. By this model, the M’s would be expected to win 86.1 games this year, beating out the next-best Blue Jays by almost 2 wins.

Conversely, by the FanGraphs Zips-Steamer projection system, the M’s should win 83 games and come a game or so short of the Blue Jays. By these numbers, the M’s still finish the as the best team not to make the playoffs.

There is a huge gap between these projections, coming out to about 8 wins over a full season and 5 over the roughly two-thirds of the season already played. The rest of this article is an amateur attempt to account for that gap, and to assess how good the Mariners might actually be.

The CoolStandings model relies on past performance and therefore absorbs some mathematical ‘luck.’ For example, the Mariners’ runs allowed this season has to have been impacted by a league high strand-rate. For that reason, the figures are probably overly optimistic of the M’s chances.

But that isn’t to say the M’s look like a .503 ballclub either. To account for the difference between the M’s .519 W% and their .503 Zips-Steamer talent level, I looked at the ZiPS projections for the rest of the season, but in ignoring Steamer, this is only a rough guess at how the projection could be too pessimistic .016% of 104 games is about 2 wins, so that’s the extent of the disagreement between reality and luck against the projections. For the remainder of the season, that .016% is worth about one win.

For example, Michael Saunders has been worth 1.7 fWAR this year in 219 PAs, a pace of approximately 4.2 fWAR over 550 PAs, but Zips says he’ll be worth 0.7 fWAR over 161 PAs the rest of this year, a clip of 2.39 fWAR/550. If we say the M’s have played two-thirds of their season already, that says that Saunders should’ve been worth 1.6 fWAR this year, and has overperformed to the tune of .1 of the M’s wins.

Looking a little deeper, we can see that ZiPS doesn’t punish Saunders playing time projection very much despite his injury trouble, in that it sees him basically playing full time the rest of the way. If we take the ZiPS projection and put it over his 219 PAs of 2014 service time, the picture is a little clearer: ZiPS says Saunders should’ve been worth 0.95 fWAR this year. So Saunders has already given the M’s almost a full win more than he should’ve, says ZiPS.

Other contributors to the M’s supposed over-performance include Felix and Ackley.

Ackley’s defense has been fine so far, per UZR, but ZiPS says it should be bad. So despite underperforming at the plate by 9% per wRC+, ZiPS says Ackley’s glove has already given the M’s a full half-win more than can be expected.

A topic of some discussion, Ackley’s fielding is hard to assess. To the naked eye he’s looked ok, not terrible, and UZR seems to agree. There was his spectacular catch the other day, and there’s also his mediocre arm, which possibly has been taxed at a below-average rate, but this is all speculation. But if we trust Ackley’s to date figures and performance, we can give the Mariners a third of a win back over their remaining schedule.

And then there’s Felix, who has already contributed a remarkable 5.5 fWAR. ZiPS penalizes Felix quite a bit, expecting him to regress quite a bit. If Felix continues pitching at his current level, the M’s again would be expected to win another half game more than ZiPS suggests. Felix both has reached new highs with his changeup and has benefited quite a bit from Zunino’s pitch framing, as have the M’s, and Zunino’s framing is neither accounted for by the projection systems nor his season-to-date fWAR.

Then there’s Chris Young. Young has been worth 0.6 fWAR and 2.7 rWAR. Young alone can account for the gap between FanGraphs’ and ZiPS’ perception of the Mariners and their current performance. A lot has been written about Young’s season, and going back to 2009, the last season ZiPS looks at for their projections, Chris Young’s 2014 is the single greatest overperformance of ERA against FIP. Young probably hasn’t been as good as his ERA nor nearly as bad as FIP, but I can’t speculate at his true talent.

Between Young, Zunino’s framing, Ackley’s defense, Saunders’ somewhat expected improvement, and Felix’s dominance, the Mariners seem like an especially tough team to project. For a counter example, the Blue Jays are expected to win 84.3 games by ZiPS-Steamer and 84.9 by Cool Standings, versus the Mariners 3 game swing between the projections. Clearly ZiPS-Steamer is the more reliable model and clearly it’s missing a significant piece of the picture. But were I a betting man, I’d certainly bet the M’s finish better than 83 wins.

What these numbers suggest is that by both the models the Mariners are close enough to be competitive for the wild card, and that acquiring marginal talents like Marlon Byrd or a DHing Matt Kemp (not going to happen) could have a real impact on the team’s chances. By both models, going all in for Ben Zobrist at SS and a right-handed OF might not be be such a bad idea, nor would be so ludicrous to pursue David Price. That said, a bad trade is a bad trade no matter the context of standings, and the M’s suffer from an overpay in any event.

What the holes in the ZiPS projections say, however, is that maybe the M’s recent slide isn’t especially important, and that while this probably isn’t the 4th best squad in baseball, it’s still probably a good team, and a team to be excited about. Because even if the M’s fail to make another roster move, they should be a competitor.


Looking at Attendance after Aces are Dealt

As baseball season and the summer months heat up, so too do the trade rumors. Almost every year, baseball media and fans postulate and prognosticate who might be traded before the annual trading deadline.

This year, the big fish on the market is Rays left-hander David Price. With only one year left on his contract, it is unlikely the Rays can afford to keep the former Cy Young Award Winner. But with the team winning eight in a row and 19 of their last 24, trading their ace doesn’t seem like a sure deal anymore. Most recent reports say the Rays management will wait until the absolute last minute to make a decision on if, where, and for whom the popular lefty will be traded.

With the Rays’ status with regards to popularity and market, some of the talk in regards to trading David Price has wound into the realm of attendance. The Rays are currently last in the Major Leagues in attendance, and some are concerned attendance could drop even lower if they traded their best pitcher. There are those who think Rays fans would consider the trade a message from ownership to wait until next year. And if that’s the message, why not wait until next year to buy a ticket?

To estimate how Rays attendance might react to a possible trade of David Price, I looked at 12 prior trades of ace pitchers over the last 37 years. Via Baseball-Reference.com, I looked at attendance before and after each trade. I also looked at winning percentage before and after.

My goal is to see if two maxims hold true:

  1. Attendance goes up when teams win and goes down when teams lose.
  2. A team that trades its best pitcher will have a worse record after the trade.

Hence, if attendance is attached to winning and ace pitchers are attached to winning, attendance should drop after ace pitchers are traded.

Is this really the case? Or is attendance in some cities more sensitive to major trades than others?

Let’s begin by looking at the granddaddy of superstar pitcher trades: the Tom Seaver trade. On June 15, 1977, after a slight tiff with ownership, the Mets shipped the franchise’s first ace to the Reds for Steve Henderson, Pete Flynn, Pat Zachary, and Dan Norman. The Mets were bad before but worse after and attendance followed suit.

Twelve years later, in 1989, two aces were traded during the season. On May 25th, the Mariners moved ace Mark Langston to the Expos for a bevy of prospects headlined by future ace Randy Johnson. Mariners fans reduced their attendance by nearly the same amount Mets fans did in 1977. Although playing .500 baseball prior to the trade, the Mariners winning percentage dropped significantly after the trade.

Two months after the Langston trade, the Minnesota Twins traded 1988 Cy Young Award winner Frank Viola to the Mets for Rick Aguilera, Kevin Tapani, and three other pitchers. The Twins were two games under .500 at the time of the trade, and then played .500 after the trade. Despite their slight improvement, attendance dropped 12.95% after the Viola trade.

We fast-forward to 1998 and another Mariners trade. During the 1998 season, the Mariners dealt the aforementioned Johnson to the Astros for Freddy Garcia, Carlos Guillen, and John Halama. While Johnson immediately did well in Houston, the Mariners played better after his departure, going 28-25 after the trade. Like the 1988 Twins, however, the positive play did not lead to an increase in attendance, as the average per game attendance went down after the trade.

Our next trade is the Bartolo Colon trade in 2002. On June 27, 2002, the Indians shipped Colon and Tim Drew to the Expos for Cliff Lee, Grady Sizemore, Brandon Phillips, and Lee Stevens. The Indians played .467 baseball before the trade and a lesser .447 clip following the deal. Attendance, however, jumped after the trade, up 10.04% over the team’s final 45 games.

We look at Cleveland again in 2008, when the Indians moved CC Sabathia to the Milwaukee Brewers for Michael Brantley, Matt LaPorta, and three other players. After trading Sabathia, the Brewers vastly improved their record, finishing the season 44-30. Attendance also went up after the Sabathia trade, from 25,964 to 27,766 per game, an increase of 6.94%.

The 2009 season saw the trade of three high profile pitchers. Two were legitimate aces, and the other a former ace that might give us insight to a Rays attendance prediction.

The first major pitcher trade in 2009 again involved the Indians. On July 29th, the Tribe shipped Cliff Lee and Ben Francisco to Philadelphia for Jason Knapp, Carlos Carrasco, Jason Donald and Lou Marson. Unlike the Colon or Sabathia trades, following the Lee trade, the Indians winning percentage and attendance per game both decreased.

Two days after the Indians traded Lee, the San Diego Padres moved right-hander Jake Peavy to the Chicago White Sox for Clayton Richard and three other players. Like the Twins in 1989 and the Mariners in 1998, the Padres played better after moving their ace, finishing the remaining 59 games with a 34-25 record. Unfortunately, also like the ’89 Twins and ’98 Mariners, less fans came out to see their now-winning team.

Our final pitcher trade of 2009 occurred on August 29th, when the Rays moved former ace Scott Kazmir to the Angels for Sean Rodriguez, Alex Torres, and Matthew Sweeney. Kazmir was no longer the Rays ace in 2009, handling over the title to James Shields and the up-and-coming David Price. But Kazmir still had name value in the Tampa Bay area, despite his decreased effectiveness.

After trading Kazmir, the Rays stumbled to a 15-20 finish. They went from being 4.5 games out of the wildcard to finishing 11 games out of the playoffs. Per game attendance following the Kazmir trade also dropped considerably, from 24,169 per game to 19,574 per game. This attendance decrease of 19.01% is the biggest drop of any of our surveyed trades.

The next year, two of our most frequent subjects collided when the Mariners traded Cliff Lee. After signing with Seattle in the offseason, Lee was sent to the Rangers for the stretch run. After the trade, the Mariners, who had played .400 baseball prior to trading Lee, finished the season with a .350 winning percentage and saw attendance drop 4.99% over the last 39 home games.

In 2012, the Brewers were on the dealing side when they sent Zack Grienke to the Angels for Jean Segura and two other players. While the Brewers were 10 games under .500 before the trade, they reversed fortune after the deal, going 39-25, a .609 clip. Attendance also increased after moving Grienke, albeit by 124 fans per game, or only 0.3%.

In our final trade, we look at the Chicago Cubs. Prior to trading Matt Garza on July 22, 2013, the Cubs were 10 games under .500 and averaging exactly 33,000 fans per game. After trading Garza, the Cubs dropped to 30 games under .500 and lost 919 fans per game in the seats, a 2.78% decrease.

There are many other trades and fanbases I could have looked at (the Ubaldo Jimmenez trade in 2011 comes to mind), but this small sample set gives a wide spectrum of possible outcomes resulting from trading an ace pitcher. From what we looked at, we found:

  • 50% of the data set decreased in both record and attendance
  • 25% increased in record and decreased in attendance
  • 16% increased in both record and attendance after trading their ace
  • 8% decreased in record but increased in attendance

The Indians are particularly interesting, seeing a different outcomes each time they traded an ace. The Mariners saw an attendance drop after both the Langston and Johnson trades but played better after trading Johnson and worse after moving Langston. Perhaps Langston had a bigger effect on the team in 1989 than Johnson did in 1998.

So what would happen if the Rays traded David Price? Given their current winning streak and the attendance sensitivity seen after the Kazmir trade, my initial estimate would have them in the same category as the 1989 Twins, 2009 Padres, and 1998 Mariners – an improved winning percentages but lower attendance. An better record post-trade might not be difficult considering the beginning of the Rays season was a disaster marred by injured players who are slowly returning (Alex Cobb, Jeremy Hellickson, David DeJesus, and possibly Wil Myers).

But with the Rays struggling to fill seats, moving fan favorite David Price might be a bad public relations move. From the studies I have done, games David Price has pitched in have drawn 6% more than average. That could be because Joe Maddon sometimes aligns the rotation so Price faces prime opponents such as the Yankees and Red Sox, teams that traditionally draw well at Tropicana Field. But some of Price’s “bump” could be the allure of seeing one of the best pitchers in the American League.

My estimate is the Rays would suffer an initial attendance drop if they traded David Price. Games against the Red Sox and Yankees (especially Jeter’s last series in Tampa Bay) will continue to do well. Bobbleheads and other promotions will also do well (expect a good turnout for the Don Zimmer sno-globe). And if the team plays well enough to contend, attendance may recover, but even then, the Rays won’t average over 20,000 per game.

Then again, doubtful they would draw 20K on average even with David Price in the rotation.


Using Triple-A Stats to Predict Future Performance

Over the last couple of weeks, I’ve been looking into how a players’ stats, age, and prospect status can be used to predict whether he’ll ever play in the majors. I used a methodology that I named KATOH (after Yankees prospect Gosuke Katoh), which consists of running a probit regression analysis. In a nutshell, a probit regression tells us how a variety of inputs can predict the probability of an event that has two possible outcomes — such as whether or not a player will make it to the majors. While KATOH technically predicts the likelihood that a player will reach the majors, I’d argue it can also serve as a decent proxy for major league success. If something makes a player more likely to make the majors, there’s a good chance it also makes him more likely to succeed there. This hypothesis may be less true for players at the Triple-A level since such a high proportion of these players make it to the majors, but I still think it provides some insight. To address this issue, In the future, I plan to engineer an alternative methodology that takes into account how a player performs in the majors, rather than his just getting there.

For hitters in Low-A and High-A, age, strikeout rate, ISO, BABIP, and whether or not he was deemed a top 100 prospect by Baseball America all played a role in forecasting future success. And walk rate, while not predictive for players in A-ball, added a little bit to the model for Double-A hitters. Today, I’ll look into what KATOH has to say about players in Triple-A leagues. Due to varying offensive environments in different years and leagues, all players’ stats were adjusted to reflect his league’s average for that year. I also only considered what happened during or after the sample season. So if a former big leaguer spends the full season in Triple-A, he’s only considered to have “made it to the majors” if he resurfaces again. For those interested, here’s the R output based on all players with at least 400 plate appearances in a season in Triple-A from 1995-2011.

AAA Output

This output looks pretty similar to what we saw for Double-A hitters, including the “I(Age^2)” coefficient, which adds a bit of nuance into how a players’ age can predict his future success. But in this version, there’s also an interaction between ISO and age. Basically, this says that the ability to hit for power is much more important for older players than younger players at the Triple-A league level.

Rplot

By clicking here, you can see what KATOH spits out for all players who logged at least 250 PA’s in Triple-A as of July 7th. . I also included a few interesting players who missed the 250 PA cut off, including Mookie Betts, Rob Refsnyder, Ramon Flores, and Kris Bryant. Here’s an excerpt of the top players from Triple-A this year. Joc Pederson tops the charts with an impressive 99.91% probability. Many of these players have already played in the majors, so these values can be interpreted as the odds that said player will play in the majors in the future.

Player Organization Age MLB Probability
Joc Pederson LAD 22 100%
Gregory Polanco PIT 22 100%
Kris Bryant CHC 22 100%
Mookie Betts BOS 21 100%
Arismendy Alcantara CHC 22 100%
Oscar Taveras STL 22 99%
Stephen Piscotty STL 23 98%
Steven Souza WSN 25 98%
Javier Baez CHC 21 98%
Maikel Franco PHI 21 97%
Taylor Lindsey LAA 22 97%
Domingo Santana HOU 21 97%
Enrique Hernandez HOU 22 96%
Chris Taylor SEA 23 95%
Jake Marisnick MIA 23 95%
Mikie Mahtook TBR 24 94%
Rob Refsnyder NYY 23 94%
Alfredo Marte ARI 25 93%
Carlos Sanchez CHW 22 93%
Nick Franklin SEA 23 93%
Ramon Flores NYY 22 92%
Ronald Torreyes HOU 21 92%
Joe Panik SFG 23 91%
Tyler Saladino CHW 24 91%
Giovanny Urshela CLE 22 90%

Now that I’ve gone through all levels of full-season ball, I’ll start at the bottom and cycle through the short-season leagues. These samples will be pretty small, but perhaps not completely useless now that those players have a few weeks’ worth of games under their belts. At the very least, it will be interesting to see what KATOH’s able to tell us about batters so far away from the big leagues, even if it’s a little premature to ask KATOH about 2014’s players.


World’s Best Pitcher Faces World’s Best Hitter

If you’re an East Coaster  — like myself — and you stayed up late enough to watch the Angels-Mariners game on Saturday night, you were in for treat. The game, based on the pitching matchup alone, would have been an exciting one to watch: Garrett Richards versus Felix Hernandez. There’s a fair argument that Hernandez is the best pitcher in baseball right now. The same could be said about Clayton Kershaw, but if you’re talking about pitchers who have been healthy for the whole season, then King Felix is your guy.

Richards has been no slouch either, turning himself into the bona-fide ace in Anaheim, pitching to a ERA/FIP/xFIP line of 2.55/2.69/3.21, which is good for a WAR of 3.2. This game previewed as more of pitcher’s duel than a slugger’s fest.

However, this game didn’t just feature the game’s best pitcher.  It also featured arguably the game’s best hitter: Mike Trout. So, how does the world’s best pitcher pitch to the world’s best hitter?

First Pitch

trout pitch 1

This is in the fourth inning, and it’s Trout’s second time facing Hernandez.  Hernandez starts out with a fastball. Sucre set up for a pitch low and away, and Hernandez misses with a 92 MPH up and in. A slight mistake, but definitely not a costly one. The count is 1-0.

Second Pitch

Like any great pitcher, Hernandez fixes the mistake he made on his first pitch. Sucre sets up for the same location, and Hernandez throws a 92 MPH fastball low and away. Hernandez nails it right on the money. The pitch was well executed, and Trout even thought about going around on it. Trout doesn’t go around, but that doesn’t matter because the pitch is a called strike. Hernandez evens up the count at 1-1.

Third Pitch

third

Clearly,  the Mariners had a plan: pitch Trout low and away. This is the third time that Sucre set up in that location, however this time Hernandez decided to flash his signature changeup. This hard changeup  probably would have bounced in the dirt had Trout not fouled it off. It seems as though Hernandez plans on going after Trout by  first establishing the fastball in a particular location, and then attack with the off-speed stuff. The count is in Hernandez’s favor at 1-2.

Fourth Pitch

Hernandez comes after Trout with another changeup, and Sucre sets up in the same location as the last pitch. Hernandez — like any pitcher — is clearly going for the punch out. Hernandez is trying to execute the same pitch, while hoping for better results. Unfortunately he misses inside for a ball. Trout has worked the count to 2-2.

Fifth Pitch

When you throw a pitch two times in a row, you run the risk of becoming predictable. Hernandez decides to break his streak of throwing changeups, and goes for the hard stuff. Sucre sets up in the same location that he always does, however Hernandez floats 93 MPH fastball up and away for a ball. The count is full. The best thing about a full count is that you know that one of three things will happen: strikeout, walk or the batter makes contact.

Sixth Pitch

Well, something did happen. Trout made contact, but it was nothing  meaningful,  as he fouled off a 94 MPH fastball from Hernandez. Nothing is really new from the Hernandez-Sucre side. Sucre sets up in the same location he has been setting up for the entire at bat, and Hernandez probably would have hit his spot had Trout not fouled it off. The count is still 3-2.

Seventh Pitch

troutdouble

Hernandez comes back with a 94 MPH fastball, hoping to hit the low and away location that Sucre sets up. However, Hernandez leaves the pitch up a little bit just enough to slightly miss the location that Sucre had set up. Trout is one of the best hitters in the game, and the best hitters in the game can take advantage of a pitcher’s slight mistake. Trout almost takes this pitch yard , as it bounces off the wall for a double. This double ended up being nothing significant, as Hernandez managed to work his way out of the inning without Trout scoring.

Crisis averted for Hernandez.

Trout happened to come out successful in that at-bat as he got himself a pretty big hit against Hernandez, however later on in the game Hernandez managed to strike out Trout.

Hernandez finally gets Trout on his signature nasty changeup that breaks on the inside part of the plate. If you look at Mike Trout’s heatmap,  you know that going inside on him is risky business. Dave Cameron even wrote a whole article about it. Luckily, Hernandez’s changeup is good enough that he can get away with pitching Trout inside.

One of the best hitters in the game faced off against one of the game’s best pitchers. You could see why these two are the best at what they do. They recognize mistakes, capitalize on those mistakes, and correct their own mistakes. There are a lot of at-bats each year in baseball, and to someone who knows nothing about baseball, this might look ordinary.  However,  if you look closely enough, you can see the intricacies of a particular at-bat. That’s when you start to realize how complex baseball is.


Changes ZiPS Believes In

Mitchel Lichtman’s projection pieces on hitters and pitchers for the rest of the season were discussed quite a lot last month starting with this.  It is hard when you are rooting for a team, and subsequently its players, not to buy in when someone is doing well or poorly.  So let’s look at the heartless projecting system ZiPS to see if it is actually buying into some of the performances of 2014 so far.

To do this I pulled the 2014 pre-season wOBA projections and compared them to the ZiPS (RoS), rest of season, projections.  If you take the RoS wOBA minus what ZiPS was expecting prior to 2014 you should be able to see which players are now expected to hit significantly better or worse the rest of the way.  Here are the top/bottom-five players:

 photo ZIPSros_zpsebe79a2a.jpg

The bottom five, with the exception of Colvin, have been very disappointing and their respective teams would love even the RoS numbers at this point.  The projection still believes Brown can be an above average offensive player despite his putrid play to this point of 2014, but it is starting to look like Raburn’s age might be catching up to him and Gyorko’s rookie year might have been a mirage.  Schierholtz makes less sense, but he has been so bad that ZIPS can’t ignore it, and he was never a great player to begin with.

Others names of note that are projected to finish the year worse may not be surprising.  Raul Ibanez looks done with eyes and statistics, Jean Segura’s lack of plate discipline has really caught up to him, and Brian McCann may not be aging particularly well despite being a lefty with power in the Yankees’ home park.

There are a lot of players on the positive side, and you can see that the nominal and percent wOBA changes are larger for the improvement group too.  There are 31 players with RoS wOBA at least 5% above their pre-season projection while only 17 projected to be 5% or more worse than expected.  Does this mean that ZiPS is actually an optimist?

The Padres believe in Seth Smith as well, having recently signed him to extension.  He is a righty masher, though they only rarely let him face same-handed pitching.  Victor Martinez is 35 years old and decided to have a renaissance, and may end up with his best hitting season ever.  Baseball is weird.  I’m not sure what to make of Steve Pearce.  He has been around since 2007 without ever accumulating more than 200 PAs, but this season he finally has and the Orioles are making out like bandits.  The other two are what you expect on such a list, young players taking a step forward.  JD Martinez was who I was thinking about when I started this.  I have seen him play several times recently, and he seems to put together a quality plate appearance every time up. Mesoraco, like Martinez, is 26 and has had a huge power spike along with a lot more strike outs to the point where he seems like a different player altogether.

Two Cleveland Indians just missed the top five improvers: Michael Brantley and Lonnie Chisenhall seem to have finally taken a step forward too.  There were two notable Brewers as well.  ZiPS seems to have finally decided to believe in Carlos Gomez and Jonathan Lucroy.

Yes, believing in projections sometimes means we need to temper our enthusiasm when a player we like breaks out or be patient with someone slumping.  It can also be a good way to see when players are truly locking into higher levels of play.  For the older players here it is likely that they will come back to the pre-season projections again next year because Victor Martinez is probably not going to turn into a much better hitter year after year at this age, but for the younger guys we may be starting to see who is taking a step forward.


Using Double-A Stats to Predict Future Performance

Over the last couple of weeks, I’ve been looking into how a players’ stats, age, and prospect status can be used to predict whether he’ll ever play in the majors. I used a methodology that I named KATOH (after Yankees prospect Gosuke Katoh), which consists of running a probit regression analysis. In a nutshell, a probit regression tells us how a variety of inputs can predict the probability of an event that has two possible outcomes — such as whether or not a player will make it to the majors. While KATOH technically predicts the likelihood that a player will reach the majors, I’d argue it can also serve as a decent proxy for major league success. If something makes a player more likely to make the majors, there’s a good chance it also makes him more likely to succeed there.

Things that were predictive for players in low-A and high-A included age, strikeout rate, ISO, BABIP, and whether or not he was deemed a top 100 prospect by Baseball America in the pre-season. However, a player’s walk rate was not significant in predicting a player’s ascension to the majors. Today, I’ll look into what KATOH has to say about players in double-A leagues. For those interested, here’s the R output based on all players with at least 400 plate appearances in a season in double-A from 1995-2010. Due to varying offensive environments in different years and leagues, all players’ stats were adjusted to reflect his league’s average for that year.

AA Output

Unlike in the A-ball iterations of KATOH, a player’s double-A walk rate is predictive — albeit only slightly — of whether or not he’ll make it to the show. While walk rate is statistically significant, it still matters much less than the other stats: it takes 3 or 4 percentage points on a player’s walk rate to match what 1 percentage point of strikeout rate does to a player’s MLB probability.

This version is also different in that there are a couple of significant interaction terms, signified by the last two coefficients in the above output. The “I(Age^2)” term adds a little bit of nuance into how a players’ age can predict his future success. While the “ISO:BA.Top.100.Prospect” term basically says that if you’re a top 100 prospect, hitting for power is slightly less important than it would be otherwise. Hitting for power and making Baseball America’s top 100 list both make a player much more likely to make it to the majors, but if he does both, he’s a tad less likely to make it than his power output and prospect status would suggest independently. Put another way, a few top 100 prospects hit for power in double-A, but never cracked the majors — such as Jason Stokes (.241 ISO), Nick Weglarz (.204 ISO) and Eric Duncan (.173 ISO). But virtually all of the low-power guys made it, including Elvis Andrus (.073 ISO), Luis Castillo (.076 ISO), and Carl Crawford (.078). For non-top 100 guys, many more punchless hitters topped out in double-A and triple-A.

By clicking here, you can see what KATOH spits out for all current prospects who logged at least 250 PA’s in double-A as of July 7th, as well as a few that fell short of the cutoff — most notably Joey Gallo, Kevin Plawecki, and Robert Refsnyder. Topping the list is Mookie Betts with a probability of 99.95%, and of course the prophesy was fulfilled when the Red Sox called up the 21-year-old last month. Here’s an excerpt of the top players from double-A this year:

Player Organization Age MLB Probability
Mookie Betts BOS 21 100%
Francisco Lindor CLE 20 100%
Gary Sanchez NYY 21 99%
Austin Hedges SDP 21 99%
Alen Hanson PIT 21 99%
Jorge Bonifacio KCR 21 98%
Blake Swihart BOS 22 98%
Kris Bryant CHC 22 93%
Ketel Marte SEA 20 91%
Rangel Ravelo CHW 22 90%
Robert Refsnyder NYY 23 86%
Jake Lamb ARI 23 85%
Jake Hager TBR 21 84%
Darnell Sweeney LAD 23 83%
Joey Gallo TEX 20 82%
Preston Tucker HOU 23 81%
Scott Schebler LAD 23 79%
Kevin Plawecki NYM 23 79%
Cheslor Cuthbert KCR 21 78%
Kyle Kubitza ATL 23 77%
Michael Taylor WSN 23 76%
Christian Walker BAL 23 76%
Ryan Brett TBR 22 75%

Keep an eye out for the next installment, which will dive into what KATOH says about hitters at the triple-A level.

Statistics courtesy of FanGraphs, Baseball-Reference, and The Baseball Cube; Pre-season prospect lists courtesy of Baseball America.


Do Rookie Hitters Decline in the Second Half?

Do rookies perform worse after the All-Star break?

My claim over this statement is nonexistent, while the original thought of its occurrence was brought to my attention by Adam Aizer on the CBS Fantasy Baseball Podcast.

My judgment dissuaded, I thought that it would be worth the effort to look into the validity of the statement.

From the perspective of an offensive player, rookies infrequently make enough of an impact in the size of leagues (i.e. 10-team and 12-team leagues) that pedestrian Fantasy Baseball players occupy. For those sizes of leagues that the aforementioned owners participate in, a rookie hitter that is worth owning is either an elite prospect or a player that has preformed beyond their true talent level. As a result, the former is rare, while it would make sense for the latter to regress to their true talent level and is more common than the former. The idea that rookie hitters decline throughout the year is just a misevaluation of the player’s true talent level.

To put another way, it is the same logic that comes into play with a recent event: the Home Run Derby. Players that participate in the Home Run Derby are players that have exceptional first halves, which are often beyond their true talent level. These players often perform worse in the second half than they did in the first half, not because they participated in the monotonous and dated event that has become the Home Run Derby, but because, just like the rookies who perform worse in the second half of the season than the first, they have regressed toward their true talent level; when the rookies regress, they have just regressed to the point where they are not ownable.

The research looks at all player seasons between 1988 and 2013 where a batter was in their first season, had 250 plate appearances in the first half of the season, and had 250 plate appearances in the second half of the season.

Screen Shot 2014-07-20 at 8.48.48 PM

The rookie second half decline and the post Home Run Derby slump intuitively make sense, but intuition does not always bear truth. Through cognitive ease we rationalize that “Swinging that hard for that long throws off your timing”; “A rookie is too young to be able to make it through the long hot summer.”

Because most fantasy leagues are small, the only reason that the common rookie was on our teams to begin with is because they had to play beyond their ability in the first half of the season. The rookie who is on our team right now, unless he is a reputable prospect, is probably a safe bet to decline. But as a whole, we can see that there is no decline in rookie performance based on first half/second half splits.

Our desire to perceive a decline is just our desire to hold onto our ability as talent evaluators. We know that Yangervis Solarte is a great player, and the only reason he hasn’t been able to sustain his performance is because he is rookie that can’t play out the season: common baseball logic. In actuality, Solarte was not as good as some originally thought, and his true talent was never good enough to be on a 10 or 12 team league.

Summary:

Rookie hitters, as a generalization, are not good enough to play in 10 or 12 team leagues, and, as a generalization, those that do play in ten team leagues regress to their true talent level, which is not valuable enough to be ownable.

Devin Jordan is obsessed with statistical analysis, non-fiction literature, and electronic music. If you enjoyed reading him, follow him on Twitter @devinjjordan.