Archive for Player Analysis

xFantasy, Part IV: “Projecting” Breakouts and Busts in 2017

Back in December, I introduced “xFantasy” through a series of entries here at the FanGraphs Community blog. At its inception, xFantasy was a system based on xStats that integrated hitters’ xAVG, xOBP, and xISO in order to predict expected fantasy production (HR, R, RBI, SB, AVG). The underlying models are put together into an embedded “Triple Slash Converter” in Part 2. Part 3 compares the predictive value of xFantasy (and therefore xStats) vs. Steamer and historic stats, ultimately finding that for players under 26, xStats are indeed MORE predictive than Steamer!

To quote myself from the first piece, Andrew Perpetua over at the main blog has developed a great set of data using his binning strategy, which has been explained and updated this offseason, including some additional work since then to include park factors and weather factors. He produces xBABIP, xBACON, and xOBA numbers based on Statcast’s exit velocity/launch angle data, along with the resulting ‘expected’ versions of the typical slash-line stats, xAVG/xOBP/xSLG. Recently, Andrew has published a set of “2017 estimates” that takes the past two years of Statcast data and weights them appropriately to come up with the best estimate for a player’s xStats moving forward. After a bit of back and forth on Twitter with Andrew discussing how exactly these numbers get weighted, I think they are looking really good. I’m now adopting these numbers as the basis for xFantasy from this point on.

There are a few key takeaways from xFantasy so far that will tell us where to go next:

  1. xFantasy is not *truly* a projection. We don’t have minor-league data. We don’t have data from before 2015. At this point, xFantasy for 2017 is a weighted average of player performance from 2015-2016, so keep in mind that things like injuries or down years might have tanked a player’s xStats.
  2. More data is always better than less data. Steamer projections do a better job with established players than xFantasy does, likely due to having more info about past performance.
  3. Players under 26 have short track records, and xFantasy beats Steamer in projecting them going forward! For young players, or players that have undergone some significant, recent transformation at the MLB level, xFantasy could give us better info than traditional projections.

So what’s it mean? At this time, I will echo Andrew’s repeated recommendations that you should *not* use xFantasy as your projection system of choice in 2017. On average, Steamer will do better (at least for now…I think 2017 could be the year where we finally have enough Statcast data to put up a challenge). But xFantasy could be very useful in helping you to identify players (on a case-by-case basis) with short track records that might deserve a bump up or down from the projections spit out by the traditional systems.

For now, I’ve identified 10 (five up, five down) hitters aged 26 and under heading into 2017 that might deserve a second look based on xFantasy. Included below is each player’s xFantasy line and Steamer-projected 2017 line, both scaled to 600 PA, along with the 5×5 $ values, and at the far right, the difference between the two.

While the Billy Butler/Danny Valencia debacle was definitely the most interesting thing going on with the A’s late in 2016, Ryon Healy was a pretty good story himself. He came seemingly out of nowhere to hit over .300 with 13 HR in 283 second-half PAs, playing his way into a spot as the everyday 3B and likely No. 3 hitter for the 2017 A’s. xStats says you should believe it, with a .324 xAVG and 30 xHR. Steamer hasn’t bought into the average/power yet, but the relatively low ~20% K rate looks real.

Trevor Story was the best player in baseball for a couple of weeks this past year, and it seems to me that the late-season injury has made people forget that. xFantasy didn’t forget, though, and even with the huge K-rate, is seeing a .281 xAVG with 39 HR and 12 SB. Based on this line, I’m slotting Story comfortably into the same tier of SS’s as Correa, Seager, and Lindor for 2017. Downgrade in weekly H2H leagues where the away games can kill him a bit.

Gary Sanchez and Trea Turner have been well covered by Andrew here and here. I’ll just add that even though both are expected to regress from their lofty 2016 performances, xFantasy backs up the idea that they’ll both still be among the best players in baseball. Steamer is missing the boat on both guys.

I personally had a love/hate relationship with Tyler Naquin in 2016, who bounced on and off my roster in the “Beat Paul Sporer” NFBC league and always seemed to hit well when he was on the wire, and never when he was on my team. He’s been a trendy topic this offseason among people still using “Sabermetrics 1.0” to point at his BABIP and say he’ll be terrible in ’17. Statcast says he actually hit well enough to earn a .370 BABIP! Combine that with what seems to be a developing power profile and something like 15 SBs and you’ll have a nice little player for your fantasy squad. Just hope Cleveland plays him!

On the downside, we have quite a few players that have been trendy ‘sleeper’ picks in the lead-up to 2017 drafts so far. Javier Baez, even if he manages to find playing time in a crowded Cubs infield, just hasn’t hit the ball well enough to overcome the poor plate discipline. Mitch Haniger hit .229 in limited time (123 PA) but Statcast says he hit even worse than that — let’s hope it’s just a sample-size thing, because a .213 xAVG won’t cut it if you’re only getting 20 HR from him.

Yasiel Puig has been in the major leagues longer than many of these guys, so at this point maybe we should just believe Steamer, but I figured it would be worth including him here because it’s an interesting case to study. He hit .255 and .263 in 2015 and 2016 respectively, and that wasn’t bad luck according to Statcast, with a .249 xAVG in that time. Steamer still buys a bounceback to his pre-2015 ways with a .284 projection. I’m actually leaning toward Steamer here, because I believe that Puig’s stats have been heavily influenced by his various leg injuries over the past two years. Maybe I should see repeated injuries and use that to project future injuries, but in this case I’m going to give a 26-year-old the benefit of the doubt and say that a healthy Puig should match this Steamer projection in 2017.

Two more 24-year-olds close us out:  Max Kepler was very, very good in July and very, very bad after that, en route to an xFantasy line that doesn’t believe in the power, and *does* believe in the very poor BABIP and AVG. Staying away from that garbage pile, and moving on to another…A.J. Reed! He was supposed to be the chosen one last year, and instead he gave us his best 2014 Melvin Upton impression…without the speed. His playing-time picture is even more unclear than Baez’s, and even if he plays, Statcast tells me he has some work to do.

And finally, for an honorable mention of a player that’s new on the scene, but too old to qualify, I have to bring up Ryan Schimpf:

Woah.

Next time…

I closed out Part 3 by promising xFantasy for pitchers was coming, and it is! Using a model based on scFIP, xOBA, and xBACON, xFantasy for pitchers v1.0 now exists. There’s still work to be done in order to determine how useful it actually is, though!

As I said last time, it’s been fun doing this exploration of rudimentary projections using xFantasy and xStats. Hopefully others find it interesting; hit me up in the comments and let me know anything you might have noticed, or if you have any suggestions.


The Least Interesting Player of 2016

Baseball is great! We all love baseball. That’s why we’re here. We love everything about it, but we especially love the players who stick out. You know, the ones who’ve done something we’ve never seen before, or the ones that make us think, “Wow, I didn’t know that could happen.” It’s fun to look at players who are especially good — or, let’s face it, especially bad — at some aspect of this game. They’re the most interesting part of this game we love.

But not everyone can be interesting. Some players are just plain uninteresting! Like this guy.
http://gfycat.com/TinyWeakBonobo
OMG taking a pitch? That’s boring. You’re boring everybody. Quit boring everyone!

https://gfycat.com/GargantuanCreamyAmberpenshell
You caught a routine fly ball? YAWN! Wake me when something interesting happens.

But it’s hopeless; nothing interesting will ever happen with Stephen Piscotty. I’m sure the two GIFs above have convinced you that he was the least interesting player in baseball last year. But, on the off-chance that you have some lingering doubts, we can quantify it. I’ve made a custom leaderboard of various statistics for all qualified batters in 2016. For each of these statistics, I computed the z-score and the square of the z-score. In this way, we can boil down how interesting each player was to one number — the sum of the squared z-scores. The idea is that if a player was interesting in even one of these statistics, they’d have a high number there. Here are the results:

Click through for an interactive version

I don’t need to tell you who the guy on the far right is. On the flip side, though, there are two data points on the left that stick out. The slightly higher of the two is Marcell Ozuna, with an interest score of 1.627. The one on the very far left is Stephen Piscotty, with an interest score of 0.997. That’s right — if you sum the squares of his z-scores, you don’t even get to 1! This is as boring and average as baseball players get.

Where the real fun begins, though, is when you start making scatter plots of these statistics against each other. I’ve made an interactive version where you can play around with making these yourself, but here are a few highlights:


AVG vs. SLG


IFFB% vs. OPS


ISO vs. wRC+

Pretty boring, right? But wait, there’s more! Let’s investigate a little further what went into his interest score. Remember how we summed his squared z-scores and got a value below 1? Well, let’s look at the individual components that went into that sum.

The Most Boring Table Ever
Statistic Squared z-score
LD% 0.108
GB% 0.002
PA 0.296
G 0.220
OPS 0.001
BB% 0.057
SLG 4.888e-05
WAR 0.007
BABIP 0.141
K% 0.103
IFFB% 0.0004
ISO 5.313e-05
FB% 0.007
wOBA 0.022
AVG 1.69e-29
wRC+ 0.025
OBP 0.006

Yes, you’re reading that right — where he stood out the most was in games played and plate appearances. Yay, we got to see that much more boring! Also, I think it is especially apt that his AVG was EXACTLY league average.

All right, time to step back and be serious for a second. As Brian Kenny is always reminding us, there is great value in being a league-average hitter. Piscotty was worth 2.8 WAR last year, just his second year in the league. He’s already a very valuable contributor to a very good team. Maybe it’s time we started noticing guys who do everything just as well as everyone else, and value their contributions too?

(Nah, I’m going to go back and pore over Barry Bonds’s early-2000s stats for the next few hours.)

All the code used to generate the data and visualizations for this post can be found on my GitHub.


dSCORE: Pitcher Evaluation by Stuff

Confession: fantasy baseball is life.

Second confession: the chance that I actually turn out to be a sabermetrician is <1%.

That being said, driven purely by competition and a need to have a leg up on the established vets in a 20-team, hyper-deep fantasy league, I had an idea to see if I could build a set of formulas that attempted to quantify a pitcher’s “true-talent level” by the performance of each pitch in his arsenal. Along with one of my buddies in the league who happens to be (much) better at numbers than yours truly, dSCORE was born.

dSCORE (“Dominance Score”) is designed as a luck-independent analysis (similar to FIP) — showing a pitcher might be overperforming/underperforming based on the quality of the pitches he throws. It analyzes each pitch at a pitcher’s disposal using outcome metrics (K-BB%, Hard/Soft%, contact metrics, swinging strikes, weighted pitch values), with each metric weighted by importance to success. For relievers, missing bats, limiting hard contact, and one to two premium pitches are better indicators of success; starting pitchers with a better overall arsenal plus contact and baserunner management tend to have more success. We designed dSCORE as a way to make early identification of possible high-leverage relievers or closers, as well as stripping out as much luck as possible to view a pitcher from as pure a talent point of view as possible.

We’ve finalized our evaluations of MLB relievers, so I’ll be going over those below. I’ll post our findings on starting pitchers as soon as we finish up that part — but you’ll be able to see the work in process in this Google Sheets link that also shows the finalized rankings for relievers.

Top Performing RP by Arsenal, 2016
Rank Name Team dSCORE
1 Aroldis Chapman Yankees 87
2 Andrew Miller Indians 86
3 Edwin Diaz Mariners 82
4 Carl Edwards Jr. Cubs 78
5 Dellin Betances Yankees 63
6 Ken Giles Astros 63
7 Zach Britton Orioles 61
8 Danny Duffy Royals 61
9 Kenley Jansen Dodgers 61
10 Seung Hwan Oh Cardinals 58
11 Luis Avilan Dodgers 57
12 Kelvin Herrera Royals 57
13 Pedro Strop Cubs 57
14 Grant Dayton Dodgers 52
15 Kyle Barraclough Marlins 50
16 Hector Neris Phillies 49
17 Christopher Devenski Astros 48
18 Boone Logan White Sox 46
19 Matt Bush Rangers 46
20 Luke Gregerson Astros 45
21 Roberto Osuna Blue Jays 44
22 Shawn Kelley Mariners 44
22 Alex Colome Rays 44
24 Bruce Rondon Tigers 43
25 Nate Jones White Sox 43

Any reliever list that’s headed up by Chapman and Miller should be on the right track. Danny Duffy shows up, even though he spent most of the summer in the starting rotation. I guess that shows just how good he was even in a starting role!

We had built the alpha version of this algorithm right as guys like Edwin Diaz and Carl Edwards Jr. were starting to get national helium as breakout talents. Even in our alpha version, they made the top 10, which was about as much of a proof-of-concept as could be asked for. Other possible impact guys identified include Grant Dayton (#14), Matt Bush (#19), Josh Smoker (#26), Dario Alvarez (#28), Michael Feliz (#29) and Pedro Baez (#30).

Since I led with the results, here’s how we got them. For relievers, we took these stats:

Set 1: K-BB%

Set 2: Hard%, Soft%

Set 3: Contact%, O-Contact%, Z-Contact%, SwStk%

Set 4: vPitch,

Set 5: wPitch Set 6: Pitch-X and Pitch-Z (where “Pitch” includes FA, FT, SL, CU, CH, FS for all of the above)

…and threw them in a weighting blender. I’ve already touched on the fact that relievers operate on a different set of ideal success indicators than starters, so for relievers we resolved on weights of 25% for Set 1, 10% for Set 2, 25% for Set 3, 10% for Set 4, 20% for set 5 and 10% for Set 6. Sum up the final weighted values, and you get each pitcher’s dSCORE. Before we weighted each arsenal, though, we compared each metric to the league mean, and gave it a numerical value based on how it stacked up to that mean. The higher the value, the better that pitch performed.

What the algorithm rolls out is an interesting, somewhat top-heavy curve that would be nice to paste in here if I could get media to upload, but I seem to be rather poor at life, so that didn’t happen — BUT it’s on the Sum tab in the link above. Adjusting the weightings obviously skews the results and therefore introduces a touch of bias, but it also has some interesting side effects when searching for players that are heavily affected by certain outcomes (e.g. someone that misses bats but the rest of the package is iffy). One last oddity/weakness we noticed was that pitchers with multiple plus-to-elite pitches got a boost in our rating system. The reason that could be an issue is guys like Kenley Jansen, who rely on a single dominant pitch, can get buried more than they deserve.


What to Do With Justin Upton?

Justin Upton is still only 29.

It can be easy to forget about a guy who hasn’t come close to a peak that was over five years ago, but few can maintain the level of excellence that was Justin Upton’s 2011 season.  The hype built from a year like that is huge. The 2005 No. 1 overall pick posting a six-win season at 23 with 31 HR and 21 steals. It’s pretty exciting.

Guys who have enough power to do this are generally pretty talented.

Flash forward to 2016.

Fresh off signing a six-year, $132-million contract, Upton posted a 77 wRC+ along with a .235/.289/.381 slash line in the first half of the season. He struck out in nearly a third of his plate appearances and held a walk rate below his career average.

Most importantly, though, when the Tigers paid Upton big money, they paid him to hit dingers and knock the ball around the yard for extra bases. So someone like him hitting nine homers with a .146 ISO over 350 plate appearances is worrisome.

Yet at the end of the season, Upton ended up with an overall wRC+ of 105, and an ISO of .219.

For qualifying batters, Upton held the crown for the highest second-half ISO increase (.172) while having the fourth-highest ISO of the second half (.318). Meanwhile, he held a second-half wRC+ of 142.

Now, I do understand that he had 86 fewer plate appearances in the second half (356 vs. 270), so it is reasonable to take the ISO and wRC+ increases with a grain of salt. But Upton slugged 22 homers in 270 trips, good for the fourth-most in the second half, behind walking flame Brian Dozier, Khris “I hit dingers through the marine layer” Davis, and Jedd Gyorko (??!!??!).

I don’t know if people expect Upton to start breaking down or expected him to start breaking down but he still continues to crush the ball. For what it’s worth, on average he hits the ball as hard as Paul Goldschmidt (92.3 MPH) and barrels up balls at the same rate as Kris Bryant (7.7% Brls/PA) for an expected ISO of .235 (thanks to Billy Stampfl’s eISO equation). Just for fun, he set his max exit velocity at 114 MPH on his last homer of the year.

Upton is projected by Steamer for a .346 wOBA and 116 wRC+ this season, good for a 2.1 WAR. Given his ability to crush baseballs and his age, I still think Upton has a good chance of surpassing his projections. He’s showed that he has big power, but his first half is weighing his projections down.

The Tigers’ plans going forward are banking on whether or not they can put themselves in a playoff position during the first half before they think about any sort of fire sale. They’re projected to be in the thick of the wild-card race, so they may run a repeat of 2016 and push through to see how close they come. But they can’t continue this way to pay an old core through the next three to four years. In cases like Miguel Cabrera and Victor Martinez, they don’t have much choice but to eat those contracts until they run up.

If things don’t go as planned, the first thing they can do is to ship off J.D. Martinez as a rental and start to rebuild a mostly barren farm system. Martinez is due to become a free agent at the end of the year and there is little chance that the Tigers will offer him the lucrative contract extension that he most likely wants. Ian Kinsler could go next to whomever may need a second baseman and is willing to accept his age. If they really wanted, the Tigers could also see if they could send off Justin Verlander (given that they eat a sizable chunk of his contract).

But the best move for the Tigers could come in the form of trading a resurgent Justin Upton, who can prove that his second-half numbers were no fluke and that the 29-year-old can maintain solid power, as he has throughout his career. It’s tough to find a home for Upton, but the Yankees might be the ones willing to take on his contract, as they’ll be done with CC Sabathia’s monster contract at the end of this year, and Brett Gardner’s contract in 2018. A team like the Yankees might prefer to take Upton and his $22-million AAV rather than test the market for sluggers like J.D. Martinez (also 29) and have to possibly pay more. The Yankees might be willing to take on some of the money and go with a safer outfield bet in Upton rather than having to wait for Aaron Judge or Clint Frazier to become steady contributors. This Yankees team looks like they’re trying to win now, given that they just signed Matt Holliday and Aroldis Chapman, so they might be willing to part with a few prospects at the deadline.

Left field is a weak position right now, and a contender could be looking for a power bat to provide 2-3 wins a year. Should Justin Upton carry his second-half resurgence into 2017, his bat could be too good to pass up, and the Tigers could move his contract and get something going towards a rebuild.


An Attempt to Quantify Quality At-Bats

Several of my childhood baseball coaches believed in the idea of “quality at-bats.” It’s a somewhat subjective statistic that rewards a hitter for doing something beneficial regardless of how obvious it is. This would include actions such as getting on base, as well as less noticeably beneficial things like making an out but forcing the pitcher to throw a lot of pitches. There is some evidence that major league coaches use quality at-bats and, through my experience working for the Florida Gators, I noticed that some college coaches like using it too. However, how it is used varies from coach to coach and it is a stat that is rarely talked about in the online community. Since there doesn’t seem to be a consensus of what a quality at-bat is, I decided to define a quality at-bat as an at-bat that results in at least one of any of the following:

  1. Hit
  2. Walk
  3. Hit by pitch
  4. Reach on error
  5. Sac bunt
  6. Sac fly
  7. Pitcher throws at least six pitches
  8. Batter “barrels” the ball.

There is some room for debate on a few of these parameters (e.g. if six pitches is enough, whether or not sacrifices should be included, etc.). However, in my experience this is roughly in line with what most coaches use, and I think it does a good job of determining whether or not a hitter has a “quality” at-bat. In my analysis I was excited to be able to include the new Statcast statistic, barrels. I have seen coaches subjectively reward a hitter with a quality at-bat for hitting the ball hard, but barrels gives us an exact definition of a well-hit ball based on a combination of exit velocity and launch angle.

The first player I used to test this definition was Billy Hamilton. Hamilton is a player that has always interested me, partially because stealing bases is entertaining, but also because there has always been speculation about whether or not he will ever be able to develop into an average hitter. I also find him interesting because his career has consisted of one awful offensive season sandwiched between two less horrible but still sub-par offensive seasons. His wRC+ in 2014 was 79, in 2015 it was an unsightly 53, and in 2016 it was back up to 78. I thought that his quality at-bat percentages might be able to give us a clue as to whether or not he could become a better hitter. By pulling Baseball Savant data from Bill Petti’s amazing baseballr package, I counted all of Billy Hamilton’s quality at-bats in each of his three MLB seasons. I then divided those quality at-bat totals by his total plate appearances to get his quality at-bat percentages:

2014:  41.75%

2015:  42.28%

2016:  47.52%

It is never ideal to make sweeping conclusions about statistics — especially new ones that are not widely used or understood — without putting them in context. However, at the very least, I think it is a good sign that Billy Hamilton has experienced an upward trend in his quality at-bat percentages. Based on my definition, these results show that he is making more effective use of his at-bats and that he is continuing to develop as a hitter.

To put Hamilton’s scores in some context, I calculated the quality at-bat percentages for several other players and provided them below. I have not had a chance to run every player as of yet, but I think this chart can give you a feel of where Billy Hamilton stands compared to other players. It is also interesting to point out Jason Heyward’s large drop-off in quality at-bat percentage. This is yet another indicator of how poor his 2016 season was. Additionally, and not surprisingly, Joey Votto and Mike Trout have, relatively, very high quality at-bat percentages, while Adeiny Hechavarria (a player who had a wRC+ just north of 50 last season) had a quality at-bat percentage well below that of even Billy Hamilton.

 

                                                      Quality at-bat percentages
Year Billy Hamilton Mike Trout Jason Heyward Joey Votto Adeiny Hechavarria
2014 41.75% 56% 47% 56% 41%
2015 42.28% 55% 48% 56% 42%
2016 47.52% 58% 40% 59% 39%

 

There is more research that needs to be done here in order to make more intelligent conclusions. I would like to run more players through my statistic, including minor leaguers, to see just how well quality at-bats can be used in evaluating talent, development, and predicting future success. I believe that quality at-bats are something that could be relevant in many of the same ways as quality starts. Neither of these statistics inform you of the nuances that make a player great (or not so great), but they do give you an idea of a player’s reliability in having a passable performance. I believe that with further analysis into quality at-bat percentages using the definition I created, we may be able to learn more about how hitters make use of each and every at-bat.


Examining the Tendencies of the Rockies’ Rotation

Don’t you just love how talking about one topic in baseball can bring you to a completely separate topic than the one you were discussing? For instance, my friend and I were discussing possible landing spots for Mark Trumbo (before he decided to head back to Baltimore). One team that came up was the Colorado Rockies and how they shouldn’t have signed Ian Desmond and should’ve gone with Trumbo instead. This led to talking about the Rockies’ rotation and the fact that it wouldn’t matter what sluggers they had if the rotation was — for lack of better words — “trash.” This led me to think what I’m sure many of you are wondering: How is the Rockies’ starting rotation?

Now, we can look at ERA, FIP, and whatever advanced metric you prefer until we’re blue in the face. But what I wanted to focus on is what type of pitchers they bring into Coors Field, mainly in regard to batted-ball statistics. I want to see if the front office prefers to bring in ground-ball pitchers to combat the altitude and ballpark factors of the stadium. I also want to take a look at the pitch mix of their starting five to see if that has a hand in how their rotation is selected.

One would imagine that a pitcher with a good mix of ground balls and fly balls would be preferred in a starting rotation. Too many ground balls and you have a better chance of giving up more hits. Too many fly balls and you risk the opportunity for more home runs. Like the library on FanGraphs says, “If you allow 10 ground balls, you can’t control if zero, three, or nine go for hits, but you did control the fact that none are leaving the park.” Considering a park with the altitude and home-run factor of Coors Field, you would expect a rotation of primarily ground-ball pitchers to lessen the chance of a home run.

Let’s look at Tyler Chatwood and Chad Bettis first. Chatwood and Bettis have very similar stats across the board in addition to being the only two that are above-average ground-ball pitchers. While their HR/FB% are close and below league-average, where they both differ are the home and away splits. While Chatwood seems to get lit up at home, Bettis goes the opposite direction and actually has more fly balls go for home runs when he isn’t starting in Colorado.

Now let’s look at Jorge de la Rosa. Jorge has the worst HR/FB% of any starter on the team, by far. In fact, he was ranked 20th overall in 2016 for HR/FB%. Another stat that Jorge is last in for the starting rotation? Fastball usage, and by a considerable margin. For all MLB starting pitchers with a minimum of 60 IP, he ranks fifth-last in fastball usage in 2016. Maybe this is why the Rockies prefer to stick with fastball-type pitchers. Since 2011, the Rockies have used 21 different starting pitchers. Of those 21, 13 (62%) have been above the league average in fastball usage. In the four years that Jorge has been used as a starter, he’s sat at the bottom of the list three times (he was ranked eighth-last in 2013).

Something else I found noteworthy in the chart is that all five starters have higher fly-ball rates when pitching away as opposed to at home. While the difference for Tyler Anderson is very minuscule (0.2%), the fact that all five fall under this criteria makes it seem more than coincidental. Could they be pitching differently at home than they are when they’re away? Let’s take a historical look.

According to Baseball-Reference, this is the list of the most common Colorado Rockies starting pitchers from 2011 – 2016. The list gives us 30 total pitcher-seasons and 21 unique pitchers. Out of the 30 pitchers listed, 21 (70%) have a lower fly-ball rate at home than they do when pitching away. Additionally, 23 (76%) have a higher ground-ball rate at Coors as opposed to any other stadium. This leads me to believe that Rockies pitchers are conditioned to pitch differently when they are at home versus when they are away. This would make sense, since Coors has the highest park factor in all of baseball and anyone from a fair-weather fan to a front-office executive understands that keeping the ball on the ground in that park is best.

The last question we have to ask is, “Is this change effective?” The short answer is, not really. As seen, 14 out of the 30 (46%) pitchers have a higher HR/FB% when pitching away, while 15 out of the 30 (50%) pitchers have a higher HR/FB% when pitching at home (Eddie Butler in 2015 is the odd man out at an even 0.00%). The good news is that four out of the five latest seasons have the Rockies’ starting rotation having a lower HR/FB% than the league average for starting pitchers. The bad news is that all five seasons were losing seasons.


Hierarchical Clustering For Fun and Profit

Player comps! We all love them, and why not. It’s fun to hear how Kevin Maitan swings like a young Miguel Cabrera or how Hunter Pence runs like a rotary telephone thrown into a running clothes dryer. They’re fun and helpful, because if there’s a player we’ve never seen before, it gives us some idea of what they’re like.

When it comes to creating comps, there’s more than just the eye test. Chris Mitchell provides Mahalanobis comps for prospects, and Dave recently did something interesting to make a hydra-comp for Tim Raines. We’re going to proceed with my favorite method of unsupervised learning: hierarchical clustering.

Why hierarchical clustering? Well, for one thing, it just looks really cool:

That right there is a dendrogram showing a clustering of all player-seasons since the year 2000. “Leaf” nodes on the left side of the diagram represent the seasons, and the closer together, the more similar they are. To create such a thing you first need to define “features” — essentially the points of comparison we use when comparing players. For this, I’ve just used basic statistics any casual baseball fan knows: AVG, HR, K, BB, and SB. We could use something more advanced, but I don’t see the point — at least this way the results will be somewhat interpretable to anyone. Plus, these stats — while imperfect — give us the gist of a player’s game: how well they get on base, how well they hit for power, how well they control the strike zone, etc.

Now hierarchical clustering sounds complicated — and it is — but once we’ve made a custom leaderboard here at FanGraphs, we can cluster the data and display it in about 10 lines of Python code.

import pandas as pd
from scipy.cluster.hierarchy import linkage, dendrogram
# Read csv
df = pd.read_csv(r'leaders.csv')
# Keep only relevant columns
data_numeric = df[['AVG','HR','SO','BB','SB']]
# Create the linkage array and dendrogram
w2 = linkage(data_numeric,method='ward')
labels = tuple(df.apply(lambda x: '{0} {1}'.format(x[0], x[1]),axis=1))
d = dendrogram(w2,orientation='right',color_threshold = 300)

Let’s use this to create some player comps, shall we? First let’s dive in and see which player-seasons are most similar to Mike Trout’s 2016:

2016 Mike Trout Comps
Season Name AVG HR SO BB SB
2001 Bobby Abreu .289 31 137 106 36
2003 Bobby Abreu .300 20 126 109 22
2004 Bobby Abreu .301 30 116 127 40
2005 Bobby Abreu .286 24 134 117 31
2006 Bobby Abreu .297 15 138 124 30
2013 Shin-Soo Choo .285 21 133 112 20
2013 Mike Trout .323 27 136 110 33
2016 Mike Trout .315 29 137 116 30

Remember Bobby Abreu? He’s on the Hall of Fame ballot next year, and I’m not even sure he’ll get 5% of the vote. But man, take defense out of the equation, and he was Mike Trout before Mike Trout. The numbers are stunningly similar and a sharp reminder of just how unappreciated a career he had. Also Shin-Soo Choo is here.

So Abreu is on the short list of most underrated players this century, but for my money there is someone even more underrated, and it certainly pops out from this clustering. Take a look at the dendrogram above — do you see that thin gold-colored cluster? In there are some of the greatest offensive performances of the past 20 years. Barry Bonds’s peak is in there, along with Albert Pujols’s best seasons, and some Todd Helton seasons. But let’s see if any of these names jump out at you:

First of all, holy hell, Barry Bonds. Look at how far separated his 2001, 2002 and 2004 seasons are from anyone else’s, including these other great performances. But I digress — if you’re like me, this is the name that caught your eye:

Brian Giles’s Gold Seasons
Season Name AVG HR SO BB SB
2000 Brian Giles .315 35 69 114 6
2001 Brian Giles .309 37 67 90 13
2002 Brian Giles .298 38 74 135 15
2003 Brian Giles .299 20 58 105 4
2005 Brian Giles .301 15 64 119 13
2006 Brian Giles .263 14 60 104 9
2008 Brian Giles .306 12 52 87 2

Brian Giles had seven seasons that, according to this method at least, are among the very best this century. He had an elite combination of power, batting eye, and a little bit of speed that is very rarely seen. Yet he didn’t receive a single Hall of Fame vote, for various reasons (short career, small markets, crowded ballot, PED whispers, etc.) He’s my vote for most underrated player of the 2000s.

This is just one application of hierarchical clustering. I’m sure you can think of many more, and you can easily do it with the code above. Give it a shot if you’re bored one offseason day and looking for something to write about.


Searching For Overvalued Pitchers

A little while ago, I created a post here about finding undervalued pitchers by looking at improvements between the first and second halves of the season. I had created a linear regression model for the predictions using data from 2002 to 2015, but when trying to use the same model to find overvalued pitchers, it didn’t exactly work as expected (I use the word “work” loosely here — in all likelihood, my predictions will fail as badly as the new Fantastic Four movie). It did find pitchers who suffered massive setbacks, but the majority of those were primarily due to increased — and probably unsustainable — home-run rates.

For example, Matt Andriese had an extremely successful first half of 2016. He put up a 2.77 ERA in 65 innings, backed up by a 2.85 FIP. But those numbers were much like my ex-girlfriend: pretty on the surface, but uglier once you get to what’s underneath. He struck out a lower percentage of batters than the average pitcher during that time while giving up more hard contact. The biggest sign, though, was his deflated home-run rate. He allowed just 0.28 home runs per nine innings, with only 3.2 percent of his fly balls going over the fence. This righted itself in the second half, where his HR/9 increased to 2.15 and his HR/FB to 17.4 percent. On the other hand, he improved his strikeout and walk rates, actually leading to a drop in his xFIP from 4.04 to 3.92 from the first half of the season to the second.

So then what should we expect from Andriese in 2017? The model I created predicts a 5.56 ERA from Andriese, leaning toward his 6.03 ERA from the second half of last season. While it’s unlikely he will allow fewer than 0.3 home runs per nine innings next year, it’s equally as unlikely that he’ll allow over 2 — after all, no qualified pitcher did so over the course of the 2016 season. Andriese’s full-season FIP of 3.78 actually closely aligned with his xFIP of 3.98, so it’s fair to guess that his home-run rates will level out and his ERA in the coming year will be in that range. That would signify an improvement from his 2016 season, rather than his decline predicted from the model.

So, instead of using the model, I took a simpler approach. Here are the players with at least 50 IP in each half of the 2016 season whose xFIP increased the most from the first half to the second:

xFIP Splits
Name First Half xFIP Second Half xFIP Increase
Tanner Roark 3.64 4.83 1.19
Drew Smyly 4.07 5.10 1.03
Hector Santiago 5.05 5.94 .89
Aaron Sanchez 3.41 4.29 .88
James Shields 4.82 5.70 .88
David Price 3.12 3.98 .86

For the purposes of this article, I’ll ignore Santiago and Shields since it’s unlikely that either of them will be relevant in 2017. That leaves four other pitchers whose skills declined dramatically over the course of the season and who you might want to avoid in your drafts.

Tanner Roark

Believe it or not, Roark’s already 30 years old. He’s actually had pretty decent success in his four years in the majors, with a 3.01 career ERA in over 573 innings. On the flip side, over that same time he has a 3.73 FIP, 3.96 xFIP and 4.06 SIERA. That’s not to say he’s a bad pitcher — just perhaps not as good as his ERA would have you believe. The same can’t be said for his second half of 2016. Despite actually bringing his ERA down from 3.01 to 2.60, his already-inflated FIP and xFIP numbers got even worse. His strikeout rate declined by 2.5 percent while his walk rate rose by about the same amount, leading to just a dismal 1.87 K/BB in the second half. His HR/9 nearly doubled as well, but not due to a substantial increase in his HR/FB rate — rather, his fly-ball rate rose from 26 to 37.6 percent, more in line with his pre-2016 career average of 33.9 percent. Why, then, was he able to continue to be successful? A .230 BABIP and a 86 percent strand rate offer an answer. Don’t expect another sub-3 ERA season from Roark — instead, look more toward his Steamer projection of 4.15.

Drew Smyly

For many last year, Smyly was a popular target. He was a high-strikeout guy who was able to limit walks and generate infield flies, prompting Mike Petriello to write this ringing endorsement for him. In his 114 1/3 innings for Tampa Bay before 2016, Smyly had maintained a 2.52 ERA and was among the best at generating strikeouts. But it all went wrong last year. As Tristan Cockcroft points out, Smyly’s season was marked by a first half of bad luck and a second half of deteriorated skills but better luck. His first-half 5.47 ERA was likely undeserved, as he continued getting strikeouts and limiting walks, but was plagued by a .313 BABIP, 63.2 percent strand rate and a 15.0 HR/FB rate, which corresponded to a 4.45 FIP and 4.07 xFIP. His ERA dropped to 4.08 in the second half, but nearly all of his peripheral stats worsened. A move to Seattle won’t fix all his problems, as Safeco Field was actually more hitter-friendly than Tropicana Field in 2016. The sky is the limit for Smyly, but there’s reason to be cautious. It’s possible he bounces back, but this could be who he is now.

Aaron Sanchez

This guy is good, don’t get me wrong. It took a while for some people to catch on, but I was always on his bandwag…all right, so I was one of the guys who didn’t buy in right away. That’s why I don’t do this for a living. Anyway, seeing his name on this list surprised me. After some digging though, it turns out that in my ignorance, I may have been onto something. In 2015, in Sanchez’s trial run as a starter, he was all right. A 3.55 ERA hid a 5.21 FIP and 4.64 xFIP before he got injured and was subsequently moved to the bullpen. When he returned on July 25, he was a completely different pitcher. This time, while he may not actually have deserved his 2.39 ERA, a 3.10 FIP and 3.33 xFIP showed he had made some kind of improvement. Or had he? After all, he only threw 26 innings in the second half of last season. And while there was undoubtedly a huge improvement for him in strikeout and walk rates, something else caught my attention. Take a look at Sanchez’s batted-ball type percentages from 2015:

Pretty clearly, Sanchez improved his batted-ball profile after becoming a reliever. His 2015 second-half ground-ball percentage of 67.6 percent would be the greatest of all of the 1281 qualified pitcher-seasons since 2002, when the statistic started being tracked. His fly-ball percentage of 18.3 percent, while not as extreme, would still rank as the ninth-lowest since 2002. That begs the question: would he be able to sustain those rates when he moved back to the rotation? The answer, as it always is with historically extreme rates, was no:

Both of his rates came crashing back to historically-accurate norms pretty much right away, and they continued to trend in the wrong direction as the season progressed. This, consequently, caused Sanchez’s xFIP to skyrocket. His strikeout and walk rates got worse from the first half of the 2016 season to the second, but only slightly. What really moved his xFIP was his fly-ball rate, which soared (pun intended — maybe I should do this for a living) from 21 percent to 31.8 percent. It’s difficult to say where Sanchez will go from here — after all, this was his first full season as a starter. If he can keep his fly-ball rate at last year’s 25.1 percent — which ranked fourth-lowest among qualified starters — he could still be a pretty decent starting pitcher, even with regression to a league-average HR/FB rate. What’d be even more impressive, though, is if he could keep his batted-ball rates at his numbers from the first half of 2016, which were among the league’s best. Perhaps with a full season under his belt, Sanchez may now have the stamina and endurance to achieve this feat. If he does, look out. If he doesn’t, you’re looking at an average guy.

David Price

Now that I’ve written nearly an entire article’s worth about one guy, let’s talk about another player from the AL East. Price, for much of his career, has been among the elite at the position. Before last season, the only time he had had an ERA above 3.50 was his first season as a starter back in 2009. Every year of his career, he’s been an above-average strikeout guy, but he topped even his own lofty standards when he struck out 27.1 percent of the batters he faced in the first half of 2016. He was unable to sustain that rate, and in the second half of the season he managed to strike out just 20.3 percent of batters, which would have been his lowest full-season rate since 2009. So what changed? Actually, it might have been the first half that was the fluke. Price allowed a 74.2 percent contact rate in the first half, contrasted with a 79.1 percent rate in the second. Those numbers don’t necessarily mean much on their own, but the difference is easy to spot when looking at his career rates:

Price’s whiff rate was higher than ever in the first half of 2016, but it’s tough to figure out why. Per Brooks Baseball, Price was generating swings and misses on his changeup at a career-best rate in the first half, but I couldn’t find any obvious changes to his velocity or movement on the pitch or any other. It’s fair to wonder, then, if his second-half numbers are what we should expect from Price at this point in his career, since his contact rates during that time were much more sustainable. He probably won’t be as bad as his 2016 3.99 ERA, but I wouldn’t be shocked to see it end up above 3.50 for the second year in a row.

Of course, this is not a comprehensive way to find overvalued pitchers. It’s a crude approach, but one that’s meant to highlight guys who fell off in the second half, as they’re the ones more likely to carry over those declined skills into 2017. That being said, xFIP obviously isn’t perfect, and these players all showed that they were capable of posting above-average results over half a season. Take a risk on them if you want, but be warned that they may not be worth the price.


The 2017 Phillies Can Change Baseball Forever

The GM of the Philadelphia Phillies has been accumulating the players to potentially pull off the greatest singleseason heist in the history of baseball.

How will they do this, you might ask?

By utilizing the 3-3-3 rotation.

I will explain why recent rotation alterations by the 1993 Athletics and 2012 Colorado Rockies were not successful. Then I will show how the Phillies version of the 3-3-3 will change the baseball world. But first, let me explain the 3-3-3 rotation and its benefits.

The classic 3-3-3 rotation uses three groups of three pitchers each, pitching once every three games.

Game 1 – Innings 1-3 (Pitcher#1) Innings 4-6 (Pitcher #2)  Innings 7-9 (Pitcher #3)

Game 2 -Innings 1-3 (Pitcher #4)Innings 4-6 (Pitcher #5)Innings 7-9 (Pitcher #6)

Game 3  – Innings 1-3 (Pitcher #7) Innings 4-6 (Pitcher #8) Innings 7-9 (Pitcher #9)

Ideally, each pitcher will throw three innings or 30-50 pitches per appearance. By the end of the season each pitcher will pitch about 162 innings over 54 appearances.

This rotation will help pitchers succeed by:

1) Allowing hitters only one plate appearance against each pitcher

2) Eliminating fatigue by keeping pitch counts down

The more opportunities a hitter has against a pitcher, the better success he has. Dave Fleming of Bill James Online provided statistical evidence from 2008 supporting this fact:

 PA  BA OBP SLG OPS

1st PA in G 108606 .255 .328 .398 .727

2nd PA in G 44505 .270 .334 .431 .765

3rd PA + in G 34520 .282 .346 .453 .800

Notice how every hitting statistic increases with each at-bat. To make a few comparisons, Eduardo Nunez was an All-Star last year, and his OPS was .758. All-Star Xander Bogaerts had an OPS of .802. So if you leave a pitcher in past the third AB (generally 7th or 8th inning) you’re facing a lineup full of 2016 Xander Bogaertses. Not exactly a winning formula.

A similar pattern was echoed in pitch counts:

PA BA OBP SLG OPS

Pitch 1-25 87685 .261 .333 .410 .743

Pitch 25-50 39383 .257 .326 .400 .726

Pitch 51-75 31791 .270 .333 .429 .763

Pitch 76-100 24261 .277 .344 .450 .795

The fact that pitches 1-25 were less effective than 25-50 is due to lineup construction. The rest of the numbers clearly show that pitchers are exponentially worse after the 50th pitch.

In this post, I will explain:

1) Why the 3-3-3 rotation did not work for La Russa in 1993

2) Why the Rockies’ alternative rotation wasn’t accepted in 2012

3) The benefits the 3-3-3 rotation will provide the Phillies in 2017 and beyond

Before we begin, there a few concepts we must accept:

1) Baseball is not archaic; it is ever-changing

2) Categorizing pitchers as only “starters”, “relievers” or “closers” is limiting to the pitchers’ value and abilities. We have to look beyond these inadequate labels. I will use these terms in this article, but attempt to focus on these underlying meanings:

a) Starter – Pitcher trained to throw 5+ innings

b) Reliever – Pitcher trained to throw 1-2 innings

c) Closer – Pitcher with experience throwing the last inning

3) There is no one system that produces winners or losers. You must utilize your personnel to the best of their abilities and limitations

Why the 3-3-3 rotation did not work in 1993

1) The Athletics did not have the personnel to execute the strategy

2) The experiment lasted one week

First, the Athletics had one of the worst pitching staffs in the league in 1993. They were in last place when they implemented the 3-3-3 rotation and had lost nine of their last 12 games. Here is a list of their ERAs in ascending order:

Name                      Training      ERA    Synopsis

Bobby Witt                 SP           4.21     97 ERA +

Goose Gossage          RP           4.53    Age-41 season

Todd Van Poppel      SP           5.04     21-year-old rookie

Ron Darling               SP           5.16       79 ERA+

Bob Welch                  SP           5.29     Age-36 season

Mike Mohler          RP / SP     5.60     Started 9 of 42 appearances

Kelly Downs           RP / SP     5.64     Started 12 of his 42 appearances

Shawn Hillegas      RP / SP      6.97    Started 11 of 18 appearances

John Briscoe             RP            8.03    Started 2 games in 139 IP in career

Only Bobby Witt and Goose Gossage had an ERA under 5.04. Witt was by far their best pitcher and his 97 ERA+ shows he was below average.

The second reason it did not work is the experiment only lasted one week. The public and media backlash from the switch to this rotation was so great that La Russa was forced to abandon the experiment after one week. One week! I don’t care what you do in baseball, if it only lasts one week, then you didn’t give it a real chance. Buster Posey hit .118 in his first week in the MLB in 2009, but the Giants wisely kept him around for 2010.

Why the Rockies’ alternative rotation did not work in 2012

1) They did not have the right personnel

First, let’s describe the specifics of the Rockies’ new rotation. It was a four-man rotation of Jeff Francis, Jeremy Guthrie and rookies Drew Pomeranz and Christian Friedrich. In each start, these four pitchers were given a strict 75-pitch limit. Three rotating pitchers called “piggybacks” would then relieve them.

Game 1 – Francis (75 pitches) Piggyback #1 Reliever #1 Closer #1

Game 2 – Guthrie (75) Piggyback #2 Reliever #2 Closer #1

Game 3 – Pomeranz (75) Piggyback #3 Reliever #3 Closer #1

Game 4 – Friedrich (75) Piggyback #1 Reliever #1/2 Closer #1

Similar to the 1993 A’s, the Rockies made their switch out of desperation. When implemented on June 20th, the Rockies were 18 games below .500 and in a 6-15 slump, on pace to lose over 100 games. Here is a look at the top six Rockies pitcher stats by the end of the year, with ERAs in ascending order:

Name                       Training         ERA       ERA+     IP

Jhoulys Chacin           SP               4.43        105         69

Drew Pomeranz         SP               4.93         94         96.2

Alex White               SP/RP           5.51          84          98

Jeff Francis                 SP               5.58          83          113

Christian Freidrich    SP               6.17          75           84.2

Jeremy Guthrie          SP               6.35          73          90.2

Only one of these starters was even an average pitcher. Three of the four rotation mates were at least 27% worse than the average pitcher in 2012. The issue with the 1993 A’s and the 2012 Rockies are they made these moves in the middle of last-place seasons. They were desperate to change what were the worst pitching staffs in the league. No team heading for a last-place finish is going to respond well to a complete overhaul of the staff in the middle of the summer.

The good news for this particular experiment, however, is that the Rockies pitching staff performed much better after the change was made. In the first 21 games that it was implemented, the starting pitchers improved from a league-worst 6.28 ERA to a league-worst 5.22 ERA. That’s more than an entire one-run improvement! Still the league worst (control your laughter), but that’s a major improvement.

I believe that gives us hope that an alternative and better rotation can be found in the correct circumstances. With the right rotation mates and the correct distribution of pitch counts, I believe there is room for improvement. The key is to train and implement the rotation before the season begins. No pitcher is going to be motivated to try a new system if it is implemented in the middle of a terrible season. It has to be the game plan to begin with, and everyone must be on board. Below you will see why the Phillies have the perfect staff for a 3-3-3 rotation. I have used the 3-3-3 rotation as my basis, but implemented some changes inspired by the 2012 Rockies to ensure success.

How the 3-3-3 Rotation will benefit the Phillies

1) Utilizing the perfect personnel

2) Peak value from assets

3) Health (Physical and Mental)

Personnel

The Phillies have eight middle-of-the-rotation MLB-ready starters who have demonstrated the ability to get MLB hitters out for multiple innings per appearance. The Phillies have five quality relievers who have demonstrated the ability to get MLB hitters out for one inning+ per appearance. Let’s take a look at the 2016 Phillies stats in order of ascending ERAs:

Name             Training    MLB IP 2016    ERA 2016      MLB service

Asher                 SP                27.2                    2.28              0.061 years

Neris                 RP                 80.1                   2.58               1.104 years

Benoit            RP / CP           48                      2.81                Final Year

Neshek          RP / CP            47                     3.06                Final Year

Eickhoff             SP                 197.1                  3.65                1.045 years

Hellickson       SP                 189                     3.71                Final Year

Ramos             RP                 40                       3.83               0.101 years

Buchholz         SP               139.1             Career 3.96          Final Year

Velasquez        SP                131                       4.12                1.086 years

Nola                  SP                 111                      4.78                 1.076 years

Gomez          RP/ CP           68.2             4.85 w/ 37 SV       Final Year

Eflin                   SP               63.1                     5.54                  0.111 years

Thompson        SP               53.2                     5.70                 0.058 years

Asher, Eickhoff and Hellickson were MLB starters with ERAs under 3.71 last year. Buchholz has the ability to be a front-line starter coupled with a career 3.96 ERA. Velasquez and Nola showed great promise despite rather average ERAs in the 4s. Velasquez sported a 10.6 K/9 ratio while Nola’s curveball has the best horizontal movement in the Majors (9.3 inches, beating out Gerrit Cole). The only two pitchers who disappointed were Eflin and Thompson, two young starters getting their first crack at the majors. Let’s count on them performing better next year.

The best reason why this personnel is perfect is because all of the trained starters have generally similar projections. From a projection and performance standpoint, all of these pitchers are middle- to back-of-the-rotation guys with upside. Nola and Velasquez are projected #2/#3 guys while Eflin, Thompson, Asher and Eickhoff are #3 to back-of-the-rotation guys (Though Eickhoff did have an impressive year in 2016). There is no Kershaw or Verlander or Bumgarner or Cueto who are expected to dominate and throw eight innings every start.

By only allowing them up to 50 pitches and one time through the lineup, the numbers listed in the introduction illustrate that the 3-3-3 rotation puts these players in the best possible position to succeed. Since the numbers are now in their favor, pitchers will have a refined focus and confidence. They can make a structured game plan on how they’re going to attack each hitter. This will limit extended innings under duress and ultimately build confidence in the minds of these young pitchers.

You may ask, Kevin, the Phillies aren’t going to contend in 2017. Why go through such a drastic change to get marginally better?

The answer is using the 2017 season as a stage for their assets to increase in value.

Asset Valuation

The Phillies are not in line for a winning season in 2017. They most likely won’t win 80 games in 2018. But 2019 is their year. That amazing 2018-2019 class of Kershaw, Donaldson, Machado, Harper, Pollock, LeMahieu, Keuchel, Harvey, Wainwright, Corbin, Smyly and Shelby Miller will be theirs for the taking, as the only money they have tied up is to Odubel Herrera. Even the 2017-2018 class of Arrieta, Cobb, Darvish, Duffy, Pineda, Tanaka (option), and Cueto (option) could insert an ace or #2 into their staff.

That is why they need to act now. They must increase their pitchers’ values now and acquire better assets with 2019 in mind. The free-agent market will be booming from 2017-2019, thus lowering trade-market value of any player after this year’s deadline. Instead of trading away prospects to get the guys they need, teams will simply open their pocketbooks. Now is the time to trade these middle-of-the-rotation guys away. Especially because they are not all in the 2019 plans.

“Utility Pitchers”

What is the most overpriced asset on the market right now? Relief pitching. More specifically, pitchers who can pitch multiple innings in relief in tough situations. See: Andrew Miller, Kenley Jansen, and Aroldis Chapman. By utilizing the 3-3-3 method, you are training your starters to pitch multiple innings in different scenarios and relieve in later innings. The 3-3-3 method trains your pitchers to achieve the greatest possible value by becoming what I like to call “utility pitchers.”

What makes players like Ben Zobrist, a .266 career hitter, and Ian Desmond, a .267 hitter, worth $60-70 million? They are utility players. Teams these days love utility players and are willing to pay big money for them. They are more valuable now than they have been in all of history. The same can be said for utility pitchers.

If you have ever been to the Arizona Fall League, it is used as a stage for the game’s top prospects. Starting pitchers generally pitch three innings, and relief pitchers will pitch 1-2 innings each for the remainder of the game. They do this to give teams’ top minor-league players exposure to higher competition with an added benefit of raising prospect value in the eyes of other teams. By sending their players to compete with top minor-league competition for all scouts to see, a good showing will raise potential trade interest. For example, this year the Giants sent a young catcher named Aramis Garcia, a former second-round pick. Garcia doesn’t fit into the Giants MLB plans with a player like Buster Posey entrenched at catcher until 2022, but they used him as one of their eight player selections anyway. I can surmise they did this to boost his stock for potential trade scenarios. The Phillies do not have all their current pitchers in their 2018-2019 MLB plans, so why not show them off to other teams?

By using the 3-3-3 method in the MLB as a stage for their abundance of young pitching talent, their pitchers will:

1) Get experience against the top talent in the world

2) Potentially increase their trade value

3) Limit innings to 130 – 160 IP

4) Give young pitching the best chance to succeed at the MLB level

5) Keep their innings down and arms fresh

The Phillies 2017 3-3-3 rotation, which you will notice is a quasi version of the 3-3-3 that I referenced above, would look like this:

1st Group – Hellickson (3) Asher (3) Eflin (2) Neris (1)

2nd group –  Nola (3) Eickhoff (3) Thompson (2) Gomez (1)

3rd Group –  Velasquez (3) Buchholz (3) Benoit (1) Ramos (1) Neshek (1)

Why this particular grouping?

1. Ability to sell three of what we call “closers” at the deadline. They can also switch Benoit and Ramos to the closer role on any particular day, giving Klentak five pitchers with closing experience to sell.

2. Give Eflin and Thompson only 2 IP per appearance because of their struggles last year. This should increase their confidence by decreasing their perceived pressure.

3. Since the Phillies signed two relievers to one-year deals in the offseason, it is apparent that Klentak wants to sell them off at the deadline. This is why I chose the quasi 3-3-3 system.

Imagine Klentak’s bargaining power at the deadline if he has even three of these newly trained utility pitchers pitching well, especially if one is a guy like Asher, Eflin, or Thompson? He could promise 5+ years of control of a utility pitcher who can be a traditional starter or a multi-inning reliever out of the bullpen.

Some people will read this and think that this would be a “demotion” or “devaluation” from being a “starter.” This is not true. All of these pitchers made it to the MLB as what you would call “starters.” They have excelled at pitching 6+ innings per game. This experiment would simply add value to all of them. Just as playing Ben Zobrist at LF, RF and SS doesn’t take away his ability to play 2B.

Most relief pitchers don’t get drafted as closers or relief pitchers. They are given chances at various roles and stick with whichever role suits their strengths best. Look at Chapman and Andrew Miller. Look at Joe Blanton! Terrible pitcher as a labeled “starter” but excelled in a set-up role for the Dodgers last year. General managers won’t trade for a guy for a postseason run if he hasn’t proven that he is going to be a solid contributor in the specific role they need for their team. So by using 2017 as a value-booster, you train all of your pitchers for multiple roles so you can have the leverage to trade any of your guys to any team. Every postseason team needs pitching. The 3-3-3 rotation will give Klentak unlimited options to acquire talent that will help the 2019 team be successful. GMs are most vulnerable at the deadline, and it is time to take full advantage.

Some people might argue that bringing up all of these pitchers at once would be a waste of MLB service time. But what is more important to a GM who has multiple pitchers with middle-of-the rotation ceilings? An option year or service time? This experiment is exactly that, an experiment. It is a trial run for one half of a season to ramp up current asset valuations to acquire a lot of quality pieces for the future. Since all of these pitchers are already on the 40-man roster, sending them to the minors would waste an option year anyway. So why not give this a try? The worst thing you could lose is half a season of MLB service time on a few guys who have served less then 20% of one year in their career.

HEALTH

In an arm-health study by Dr. James R. Andrews the following chart is comprised:

Ages 14 and under – 66+ Pitches (4 days rest) 51-65 (3) 36-50 (2) 21-35 (1) 1-20 (0)

Ages 15 and over – 76+ Pitches (4 days rest) 61-75(3) 46-60 (2) 31-45 (1) 1-30 (0)

These pitchers are prized assets. Millions of dollars coupled with thousands of hours of prep, coaching and playing time are used per arm. Why don’t we take better care of these players?

As a kid, your parents told you to eat your vegetables, sleep eight hours a night and stay in school while getting 60 minutes of exercise a day. But as we grow older we continually skip our vegetables, sleep five or six hours a night, forget to keep our brains active, and rarely exercise. We feel that we can still function this way, but more importantly, we feel we have to function this way. This is because we put too many responsibilities on ourselves at the expense of our own well-being. I’m arguing that we are giving these pitchers too many responsibilities, at a detriment to their peak physical health. Why? Because traditional baseball knowledge tells us that a five-man starting staff is the right way to go in 2017. But look back at history: there used to be one-man, two-man, three-man and even four-man rotations. Those proved to be unsuccessful. I am saying that the five-man rotation isn’t working either. It’s time to make a change.

What if we treated these valuable multi-million-dollar arms with the care that we take with our Little League arms? I propose a hopeful plan of three innings finished for each starter, but an absolute maximum of 36-50 pitches no matter what. These pitchers will then receive two days of rest for every 36-50 pitches, thus receiving the care a child under 14 would receive (see chart above). It is impossible to argue that this wouldn’t be a healthier system than the one we have now. Finally, let’s shift back to trade value. If Klentak is making deals on July 31 and a playoff contender is asking him how his players can help them win a championship, health is another big concern! If he can say that his pitchers have been put on a stricter regimen than any other team in the league, and that his players’ arms are healthier and more fresh than any other team in July in the history of baseball, that is going to increase his bargaining power. Remember, keeping players healthy, putting them in the best position to succeed and increasing trade value all are focused on the 2019 season. Klentak’s initial plan has always been focused on the 2019 season. And this plan will add tremendous benefit to that goal.

Conclusion

Now I am not saying that every team should utilize this strategy. I am not saying this is the future of baseball for eternity. I am saying that with the Phillies assets, at the perfect time in their development, this will be a great strategy to use. A Double-A or Triple-A prospect is worth much less than an MLB-proven prospect. A pitcher who can relieve, start and spot-start is worth more than just a conventional “starter” or “reliever.” More utility is always better than less utility. Healthier arms are better than overused arms.

I am saying the Phillies should give this a try for half of a season in which they won’t win more than 80 games. There is nothing to lose. And hey, if everything goes to plan, maybe this starts a revolution. If not, then they seamlessly revert to a five-man rotation in August. The goal of business is to buy low and sell high, looking for the most reward for the least amount of risk. This is about as high-reward as you can get in a sub-.500 season with about as little risk as I can imagine.

A new idea is always crazy before it makes sense. In the 1920s and 30s it was a rule that star pitchers had to throw 10-20 relief appearances in addition to their normal starting roles. In the 1880s, catching a ball on one bounce was an out. It even used to be legal for a first baseman to grab a runner by the belt so he couldn’t steal second! It is time for a new discussion about the modern-day pitching staff. It is time for rebuilding teams to try new things to get an edge on the competition. It is time for the game of baseball to go through yet another change. We owe it to the fans, to the players, and to the history of our beloved game. We owe it to ourselves to put our reputations on the line for the greater good of baseball.


Searching For Undervalued Pitchers

When looking to the future, there are countless ways to try and find undervalued pitchers.

One such way is to look at which pitchers’ FIPs outperformed their ERAs last year. This is a good approach, but it isn’t enough. For one, there will be players who consistently underachieve on their metrics, like the ever-teasing Michael Pineda. He sits second on the 2016 leaderboard in ERA-FIP, but his ERA is more than half a point greater than his FIP for his career and over a point greater each of the past two seasons.

The other problem with this approach is that FIP has become mainstream enough that everyone will be doing this same thing. Players who outperformed their FIP will be be common targets on draft day, driving up their prices and eliminating any sleeper potential that they had. This, too, is the downside of projections and other easily accessible data.

A different approach is then needed. In that spirit, I decided to create a linear regression model to predict a subsequent year’s ERA based on the difference in first- and second-half splits from the previous year, as well as that year’s ERA. This would help find the players who improved the most from the beginning of the year to the end, and perhaps players who are likely to carry over those improvements into the next season.

The model was generated using data from 2002 to 2015 obtained from FanGraphs’ splits leaderboard, with only pitchers with at least 50 IP in each half-season being considered so as to remove potential outliers. Non-significant variables were removed, and a final model was created. The resulting model was then used with 2016 data to predict ERA in 2017. The following graph shows those predictions, after being rescaled, plotted against 2016 ERA:

For the most part, the predictions line up with their 2016 counterparts. The labeled data points, though, are the ones I want to focus on. Based on this model, each of them are expected to see their ERA drop significantly from last season to this one and could help provide value in the latter rounds of drafts.

Jeff Samardzija
2016 ERA: 3.81
2017 Projected ERA: 3.40

The Shark has had a rough career. Since becoming a starter in 2012, he’s only had one season in which he’s beaten last year’s mark of a 3.81 ERA. He’s played for four different teams in those five years, he’s on the wrong side of 30 and his name is at the same spot on the pronunciation scale as Jedd Gyorko’s. But he does have a few things going for him. He’s struck out over 20 percent of the batters he’s faced in all but one year since 2011, and he’s pitching in a park where home runs go to die. His average fastball velocity is holding steady above 94mph and it was only two years ago where he had a sub-3 ERA with the estimators to back it up. He’s proven he can put up solid numbers, so the predicted improvement isn’t unreasonable. He had a 3.66 FIP in the second half of 2016 that exactly matched his ERA, a substantial drop from his first half numbers. The biggest contributors were his strikeout rate, which rose from 18.9 to 21.9 percent, and his HR/9, which dropped from 1.15 to 0.94. There’s no reason to think the rates are unsustainable either — his HR/FB dropped to a near-league average (in a normal year) 10.8 percent, and his strikeout rate improved almost directly with an increase in his O-Swing%:

Samardzija was able to get batters to swing at pitches out of the zone more frequently as the season went on, and consequently was able to produce more strikeouts. Steamer projects him for a 3.66 ERA, which isn’t all that far off this model’s prediction. If he can bring his strikeout rate back to what it used to be, and AT&T Park does its job, Samardzija could provide some sneaky value in 2017.

Ivan Nova
2016 ERA: 4.17
2017 Projected ERA: 3.72

Moving to the NL seemingly agreed with the former Yankees second-round pick. After posting an unsightly 4.90 ERA in 21 games (only 15 starts) in pinstripes, he turned his season around in Pittsburgh with a 3.06 ERA and 2.62 FIP in his final 11. Switching leagues undoubtedly helped, but there are more reasons behind his improvement. For one thing, he increased his strikeout rate while decreasing his walk rate — just doing those two would be reason to expect a lower ERA. Perhaps more significant, though, is that he halved his HR/9. Much of this is due to a change in scenery — his HR/FB dropped from 21.3 percent before his trade to just 7.8 percent afterward. Of course, he can’t be expected to repeat his performance. He walked just three batters in 64 2/3 innings, good for a 1.1 percent walk rate and a 17.33 K/BB. While Nova is probably better than Phil Hughes, it’s unlikely that even he can replicate that kind of walk rate. Look for Nova to improve on his ERA from last year, but don’t expect him to be as good as his second half. He’ll fall somewhere in the middle, but even that will be more than useful.

Wily Peralta
2016 ERA: 4.86
2017 Projected ERA: 4.35

Don’t look now (unless you promise to come back), but Peralta had a 2.92 ERA in the second half of 2016. Part of this was admittedly due to an inflated 81.7 percent strand rate, but even accounting for that, he managed a 3.75 FIP and 3.59 xFIP during that stretch. His success can be due largely in part to his increase in strikeout percentage, which jumped from 13.6 percent to 20.8 percent. It’s difficult to determine the exact reason behind this, but one explanation might be his increase in velocity. At the start of the year, his fastball was only averaging under 95 mph, a continuation of his 2015 trend and a disgrace to fireballers everywhere. By August, he was closing in on 97 mph, and presumably striking out batters as a consequence. Here’s his velocity by month since 2014, via BrooksBaseball:

Not only did Peralta see an increase in his strikeout rate, but his walk rate improved as well from 8.7 percent to 6.5 percent, which is the lowest to reasonably expect given his career numbers. His WHIP dropped from 1.88 to 1.15, his HR/9 from 1.64 to 1.02 and his wOBA against from .421 to .295 — seemingly everything improved except his age, but I’ll give him a pass on that account. The secret behind his success? His ability to limit hard-hit balls and induce soft contact. Take a look at the trends for each type of contact rate:

In case that doesn’t do it for you, here’s his Statcast exit velocity broken down by game date, via Baseball Savant (with a linear regression line added for those last few skeptics who aren’t convinced):


Peralta’s not an ace, but he has the potential to help out teams this season. Monitor his velocity during spring training, and buy him for a discount on draft day.

Clay Buchholz
2016 ERA: 4.78
2017 Projected ERA: 4.02

Of all pitchers who threw at least 50 innings in each half of the season, Buchholz improved his FIP the most — his first-half FIP was 6.02, so he gave himself quite an advantage, but he still brought it down to 3.74 following the Midsummer Classic. He’s already proven himself to be a capable pitcher, with four sub-3.50 ERA seasons in his past seven seasons, and now he goes to Philadelphia, where pitchers go to be reborn (see: Hellickson, Jeremy). Also, he moves from the AL East to its NL counterpart. Besides going up against a pitcher instead of a designated hitter, he will be facing the likes of the Braves and Marlins instead of the Blue Jays and Yankees.

Despite the difficulty of his former division, Buchholz still managed to improve as the 2016 season wore on. He marginally increased his strikeout and walk rates, doubling his K-BB% to a still-mediocre 9.3 percent in the second half of the season. While that’s not exactly comforting, it’s worth noting that his walk rate in the first half the season was higher than anything he’s put up since 2008, so it’s not likely to approach that number anytime soon. Furthermore, he was able to bring down his bloated HR/FB rate, despite the league’s general struggle to do so. In the first half of the season, 15.9 percent of Buchholz’s fly balls resulted in home runs, which would have been higher than any single season in his career. In the second half that number improved to 5.1 percent, which was much more reasonable given his average rate of 6.5 percent over the previous three seasons. Steamer projects him for a 4.07 ERA, but it’s not difficult to envision a scenario where he does better than even that.

With all that being said, not all of the pitchers on this list are going to live up to their projections. No model is perfect, and none of these guys have exactly had exemplary careers. But they all showed significant improvement over the course of last year, and that’s a strong indication for what to expect from them in 2017.