Another Look at Mauer’s Power

Again looking at leaderboards, I noticed that only one of the top 10 hitters by batting average has fewer than 10 home runs; that would be your 2009 American League MVP, Joe Mauer.

Mauer is hitting .324 through 127 games and, barring a disaster to end the season, will finish as a 5-WAR player.  It’s a far cry from his 8-win 2009 season (which no one could reasonably expect him to repeat), and just looking at his lines across the two seasons, it’s not hard to see where the difference is.

Read the rest of this entry »


A Better MLB Playoff System

Since 1994, when Major League Baseball converted to three divisions and expanded its playoff system to four teams per league, the format has been a mixed blessing.  Good teams that didn’t win their division would qualify for the playoffs as a wild card team, and the expansion of two more playoff teams per league meant that more fans could root for their teams to make the playoffs.  Gone were the days of a make-or-break pennant race, as the second-best team in the division didn’t make the playoffs. 

While I am not arguing that this system is better than what was done in the past, our current format is not the most optimal way MLB should have the playoffs structured, and I propose an alternative.  First, I would like to review what I feel is wrong with the current system:

Problem #1: Teams are not incentivized to get the top seed.

The reward of having the league’s best record means little if a team knows they’re making the playoffs.  For example, let’s say it is the last day of the regular season and a team is either tied for the division lead or for the best record, but if it loses the last game it will be the wild card team or the second seed.  What incentive is there to win that game?  At the risk of overusing them, a manager is unlikely to use an ace on two- or three-days rest, or his closer for two or three innings.  In other words, there is no urgency for teams as to where they’re seeded in the playoffs. Regardless of who wins the division, both teams should make the playoffs.

Problem #2:  Home field advantage is not much of an advantage.

Since 1995, home playoff teams have won two-thirds of NFL games, 65% of NBA games, but only 54.6% of MLB games.  (Incidentally, home teams won 54.1% of regular season MLB games during the same period.)

Problem #3:  Wild card teams have performed just as well – if not better than – the division winners in the playoffs.

Nine of the thirty pennant winners have been wild card teams.  Given that wild card teams do not have home field advantage in either the division series or the LCS, this shows that the current system does not put wild card teams at much of a disadvantage.

Simply put, the current “balanced” playoff system was easy to implement  and simple on the schedule makers – the higher seed gets home field advantage over the lower seed.  While there has been talk of different ways of unbalancing this, not much has changed, although there is talk of changing the Division Series to a best-of-seven series.

My proposal, which improves upon the issues mentioned above and makes for more exciting pennant races and playoff games, is as follows:

I propose that MLB adds a second wild card team to both leagues, and that both wild card teams in each league play a one-game play-in game the day after the regular season ends to determine the fourth seed.  The top-seeded team in each league – based on the regular season record – would then play the four seed in a best-of-five Division Series.

This change eliminates the problems listed above.  Teams would now have an incentive both to 1) try for the top seed and 2) avoid being a wild card.  Top seeds are given a big advantage in this scenario because they will face the winner of the playoff game that will most likely have used their ace in either that game or a pivotal last game of the season, and wouldn’t be able to use him again until possibly Game 3 of a five-game Division Series.  This comes at a great detriment to the wild card teams and more than makes up for the “advantage” of home field for the top seed.

Taking this year’s AL East race as an example, if both the Yankees and Rays are tied going into the last game of the season knowing they’re both making the playoffs, there is little incentive for either manager to push their players that last day.  But if either team knew that the second-place team would, say, face Boston in a one-game playoff, then this would greatly change managerial decisions that last day of the season.

Given that both proposed one-game playoffs would only last one day, scheduling the playoffs will not make it much of a burden on the other playoff games.  As we saw with the Chicago-Detroit game last year, one-game playoffs are exciting to watch and would be a great opening to the MLB playoff season.


The Leadoff Walk

We’ve all heard a broadcaster comment on the impending doom of a leadoff walk and yet they fail to seem to apply the same sort of fateful outcome for a single. I thought it would be interesting to find the outcomes of each of the ways a player can leadoff an inning by getting on first base and see if it affects whether or not the runner goes on to score. I took the retrosheet data sine 1952 (but not including this year) that I have as a MySQL database and created a quick python script to determine these results. I took it further and examined if the breakdown were any different in late game situations, as I’m always hearing “You never want to walk the leadoff batter but especially late in close ball games”. I was also curious if even in general more solitary runs get manufactured once a leadoff runner gets on base in late game situations.

Total times batter lead off an inning by getting to first: 508312
Total times runner scored: 192150

So a leadoff batter who starts on first base scores 37.80% percent of the time, here is the breakdown via the means they get aboard

Any inning

Single      325455 Scored 122662   37.69%
Walk        150570 Scored  57189   37.98%
HBP          11865 Scored   4600   38.77%
Error        19260 Scored   7270   37.74%
Strikeout     1007 Scored    375   37.24%
Catcher's Int. 155 Scored     54   34.84%
Totals      508312 Scored 192150   37.80%

So it appears as though it’s not much of a statistically significant difference between the walk and the single. The HBP numbers seems to be a bit of an outlier, I’m wondering if that is just sample size or if such an outcome rattles the pitcher to the point of that much more runs being produced.

Lets now examine the breakdown based upon the stage of the game.

6th inning or earlier

Single      217421 Scored  83243 38.29%
Walk        100587 Scored  38798 38.57%
HBP           7879 Scored   3070 38.96%
Error        12778 Scored   4880 38.19%
Strikeout      645 Scored    244 37.83%
Catcher's Int. 107 Scored     36 33.64%
Totals      339417 Scored 130271 38.38%

7th inning or later

Single     108034 Scored 39419 36.49%
Walk        49983 Scored 18391 36.79%
HBP          3986 Scored  1530 38.38%
Error        6482 Scored  2390 36.97%
Strikeout     362 Scored   131 36.19%
Catcher's Int. 48 Scored    18 37.50%
Totals     168895 Scored 61879 36.64%

Interesting how 1.74% more leadoff runners reaching first score in the earlier innings.  Is this a comment on the failure of manufacturing runs or pitching being different in the later stages of the game?  Perhaps a deeper look based upon “close game situations” is in order for that.


Can Pat White Play Baseball?

The Kansas City Royals last week flashed back to the past by signing NFL quarterback Pat White in a move that conjures up memories of the old Kansas City Royals Baseball Academy which I wrote about here in June.

Judging by the internet commentary on this move, it is being seen as comical or a case of pure desperation by either White or the Royals and is being dismissed entirely by baseball and football fans alike.

For those unfamiliar with White, he was a star college quarterback for West Virginia for several years and noted for his unique combination of running and passing ability. White is rather short for an NFL quarterback and lacked the great arm strength desired by NFL scouts. Many thought he would be best suited as a wide receiver and might be converted by a team like fellow college quarterbacks Antwaan Randle-El  or Patrick Crayton and many others have done successfully.

The Miami Dolphins thought differently and drafted him in the second round with plans to use him in their much-hyped “Wildcat” formation which was wildly successful in its first year of operation.

A year later, the Dolphins infatuation with White has apparently waned and he was released as an NFL camp casuality in his second year. This posed a problem for White because a team that might want him as a WR cannot place him on their practice squad because he has too much NFL service time and no teams are interested in making him a backup quarterback right at the beginning of the season.

White has long been pursued by major league baseball teams, being drafted by the California Angels in the fourth round out of high school in 2004. He was subsequently drafted again by the Angels,  Reds and Yankees in the last part of the draft.

As a high schooler, White projected as a Carl Crawford-type outfielder with the speed to play center field. He hit .487 his senior year of high school and might have been taken in the second round if not for his quarterbacking abilities.

Now, the question is can Pat White play baseball at age 24? Most seem to think not. It’s a question of Nature or Nurture. And most seem to come down on the Nurture side of things.

Are baseball players born or developed? Has Pat White’s window of opportunity closed because he hasn’t swung the bat for so long or is his brain arranged in such a way that he was born to hit a baseball?

There is reason to think it is possible that White could be playing in the major leagues in a few years. Consider the case of Ron LeFlore.

LeFlore did not grow up playing baseball or any sport for that matter. He was incarcerated at age 19 and began playing sports because he noticed that the prison athletes received extra free time from the guards to play their sports. He began by playing basketball and then was invited to play softball, ultimately graduating to the prison baseball team where he began playing at age 23.

In a community service visit to the prison, Tigers manager Billy Martin was cornered by the inmates who urged him to give LeFlore a tryout. Martin promised he would. Upon his release from prison, LeFlore unexpectedly took Martin up on that promise. Martin didn’t really remember LeFlore but made good on his pledge and ultimately signed him to a contract over the objections of Tigers ownership after a prodigious batting practice display. It was real-life shock and awe.

And how much did LeFlore’s late start set him back? He was signed at age 25 and hit .273 in a 73 at-bat stint in Class A. The next year, he hit .339 in 423 at-bats in Class A and was promoted to AAA and the Tigers shortly afterward. Just one year after signing and three years after beginning playing the sport, LeFlore hit .260 in the major leagues and eventually became a .300 hitter and stole as many as 97 bases in a season.

Or consider Rick Ankiel who went back to hitting at age 25 when he could no longer throw a strike and eventually hit 25 home runs for the St. Louis Cardinals at age 28.

Or consider the way Deion Sanders and Bo Jackson tore through the minor leagues despite never really being all that serous about baseball.

There is also Josh Hamilton. You know him, right?

As talented as White may be, my guess is that he does not have the baseball talents of a Ron LeFlore. I suspect that White is not committed enough to baseball and will bolt for his first chance in the NFL or professional football. I will not be at all surprised if White manages to hit Class A pitching rather well, though.

I’ll put White’s odds at being a major league baseball player at 1 in 15. However, if White was truly destined to be a All-Star major-league baseball player and was another Carl Crawford, he’ll be playing center field for the Royals in three years. No joke.

It’s been done before — more than once.


2010 Pitchf/x Summit Recap

A few weeks ago, Sportvision hosted the 3rd Annual Pitchf/x Summit.  Sportvision is the company behind the Pitchf/x system and has initiated Fieldf/x, which I’ll get into in a minute.  The goal of the summit was to share some of the research being done in baseball analysis, while also serving to explain the possibilities that exist with the new system.  Without further ado, here were the presentations:

Using Velocity Components to Evaluate Pitch Effectiveness (Matt Lentzner/Mike Fast): The purpose of this study was to change the reference point by which Pitchf/x data are measured.  Often, fastballs show more movement than breaking balls, but without the proper frame of reference, it means nothing.  Mike and Matt were able to demonstrate how to determine the horizontal and vertical velocities with respect to the batter’s eye and make the Pitchf/x data more meaningful.

Pitchf/x Application in Player Development and Evaluation (Dr. Glenn “Butch” Schoenhals): Dr. Schoenhals has a Pitchf/x system set up at his instructional school, which allows pupils (including some major leaguers) to see the their pitches broken down immediately and make adjustments.  In conjunction with three cameras set up around the pitcher, the Pitchf/x data provide benefit to both pitchers and instructors in learning/teaching how to pitch.

Okajima’s Mystery Pitch (Matt Lentzner): Hideki Okajima throws a pitch roughly 20% of the time that had previously been classified as a curveball, more specifically a “rainbow curveball.”  Actually, it didn’t really fit any of the known pitch types.  Using his research on pitch types and arm slots (“The Pitching Peanut”), we see that this pitch has almost no break, is faster than a curveball but slower than a slider, and falls at the exact center of the peanut.  His explanation: Okajima is the Boston pitcher who is actually throwing the gyroball, not his more famous teammate Daisuke Matsuzaka.

Leaving the No-Spin Zone (Alan Nathan): Dr. Nathan showed his experiments that relate the spin of the baseball just before and just after it is hit. The result? The two are almost totally independent of each other! I couldn’t believe that, but Dr. Nathan made a lot of sense.  This was a high-grade physics lesson, crashed into about 20 minutes.  He explained why balls tend to curve toward the foul lines; he showed that the bat actually “grips” the ball for a few nanoseconds or so before the ball explodes off the bat, which contrasts the earlier model of the ball “rolling” off the bat.  Really, really cool.

Fieldf/x System Overview (Vidya Elangovan): And the main event began.  Fieldf/x is a new tracking system that utilizes cameras attached to the light standards in baseball stadiums (for now, just AT&T Park) to track the movement of every person on the field 15 TIMES A SECOND.  As soon as I heard that, my mind started going crazy and I don’t think I paid attention for about 5 minutes.  The only issue at the time is that the system does not include the ball (but it will).  All ball events currently have to be added by someone watching the video.  The following presentations showed some of the things you can actually do with the data, and it’s fairly obvious that these data, particularly when connected to batted ball data through the Hitf/x database, are about to revolutionize how baseball players are evaluated.

Infield Defense with Fieldf/x (John Walsh): Actually the first presentation, thanks to being in Italy, (tough life), but it really would have been more helpful after the overview.  Either way, a lot of cool stuff.  First thing he said was that in tracking the different players, he noticed that an average centerfielder runs 8 miles per game, which stunned me and kept my attention.  Thanks to these new data, we can also see the effects of shifts and also what players away from the ball are doing while teammates are attempting to make plays.  Other questions John poses: can we see infielders cheating in a certain direction as the pitchers throws the ball? Do infielders lean in a certain direction before the pitch? Based on his initial investigations, he saw that third basemen step toward the line as the pitch is delivered and shortstops step directly at home plate.  Weird, but potentially important, and just a peak into what can be obtained.

From Raw Data to Analytical Database (Peter Jensen): As a baseball nerd and a programming dork, this was really cool.  Peter Jensen took the 400,000 lines of code that results from each game and wrote a macro to display what actually happened in the game in an Excel worksheet.  The simulation relates the position of each player as well as an approximation of where the ball is throughout the play.  His solution with regards to the reorganization of the data was very impressive for a first run, and it is absolutely vital to make the data useful for analysis.

Using Fieldf/x to Assess Fielders’ Routes to Fly Balls (Dave Allen): These next three were absolutely incredible to me (and I’m sure the last three would have joined them had I had the time to stay).  By using the data to reconstruct fielders’ routes to the ball, Allen surmises that the Fieldf/x data can be used to determine the speed of an outfielder as they pursue a ball, the starting points of each fielder at the time of the pitch (and hit), and how efficient each player is in getting to the ball (measuring the distance traveled against the shortest distance to the ball).  To me, this is something that teams can use to help players they already have by addressing alignment issues or noticing what is happening during the different points of pursuit.  Are outfielders getting good reads/jumps on the ball?  Are they running in straight lines or weaving?  Simply put, the data can confirm for us (and also measure exactly and more efficiently) what our eyes (and scouts’ eyes) have seen.

Measuring Base Running with Fieldf/x (Mike Fast): Mike’s presentation examined the different portions of base running and what the data can be used for.  Mike was able to track each base runner’s path around the bases, even what they were doing on pitches that weren’t hit (during which we would typically say “nothing happened”).  Obviously, with all of these data, there’s a lot happening.  Also, by knowing the position of the player at each moment in time, we can track both his speed and acceleration as rounds the bases; very valuable information for measuring “baseball speed.”

Fieldf/x of Probabilities: Converting Time and Distance into Outs (Jeremy Greenhouse): The coolest of the presentations.  As soon as he said the words “probability model,” I was sold.  Jeremy first examined stolen base attempts (in the thirteen games of data released, he only found four) and tried to determine the different component times of the stolen base attempt.  Some things he brought up that were interesting: “Pop” times, or the time it takes a catcher to catch the ball and get it to second base, was between 2.0 and 2.2 seconds for all attempts, which suggests that a lot of stolen bases are taken off pitchers, not catchers.  The ability to get a good lead is now measurable, as well as the jump a runner gets on the pitcher.

Jeremy also developed a model to determine the probability that a player makes a play on a ball hit near him.  The model was based on where the player is, where the ball would come down, and how long it would take the ball to get there.  From there, the player’s probability can change based on his jump, route, speed, and what I called “catching ability,” or the ability to actually make a play on the ball when in the vicinity.  It was shocking to see some of the plays made where players started out with low (less than 10 percent) chances of catching the ball, but by getting a good jump and running (quickly) in a straight line toward the ball, their probability would increase each 1/15 of a second.  He then showed the video of these plays and we were able to see the spectacular catches made by really good outfielders.  This also applies to outfielders who start with a low probability to make the catch, but increase it as they, for example, chase a ball into the gap, close quickly on it, but don’t catch it.  The ability to increase the probability of a catch is very valuable and that knowledge would be immensely valuable to teams.  Lastly, he also showed how bad outfielders can turn outs into hits by reading the ball poorly, getting bad jumps, and being indecisive.  Super cool, and as soon as the presentations are made available online (which hopefully will be soon), I will link to some of them, but especially some of these graphs.

Unfortunately, I missed the following presentations, so I will just show the abstracts presented in the program.

Where Fielders Field: Spatial and Time Considerations (Matt Thomas): Continued application of close-range photogrammetry through high-resolution digital photography to baseball is revealing hitherto unseen patterns of fielding in the game. Matt examines these patterns and where data permit, factors time into this examination. After reviewing general trends he notes specific achievements and then speculates on whether any of this freshly quantified insight tells us what makes for good (and not so good) fielding.

Scoutf/x (Max Marchi): This presentation evaluates players’ tools with Pitchf/x, Hitf/x, and Fieldf/x.

True Defensive Range (TDR): Getting out of the Zone (Greg Rybarczyk): Greg intends to display detailed tracking of the 25 batted balls in the released data that were hit in the air to the outfield. Presented data will include the relative positions of the outfielders and the ball from the time the ball leaves the bat until the time it is retrieved by the fielder. Using the essential elements of this data (fielder starting position, ball hang time and landing point), he outlines the fundamentals of a new outfield defensive metric, called ‘True Defensive Range’ or TDR, which should provide more accurate player defensive ratings with a smaller required sample size than current metrics. Full realization of this metric will require establishment of baseline values using the full data set. TDR for infielders will employ a similar method, but it will not be covered during this presentation.

The Future of Sportvision’s Data Collection (Greg Moore): Greg will talk about several bits of baseball data that Sportvision might collect in the future, and he will discuss how the data can be used in conjunction with Pitchf/x, Hitf/x, and Fieldf/x. Greg will also conclude the 2010 Pitchf/x Summit with closing remarks.

Obviously, there was a lot of cool stuff presented.  As mentioned, only 13 games worth of data were released to the analysts and most of the presentations were about determining what could be done with the data.  But with enough work and research, it will not only change the way teams and analysts evaluate players, but also will give teams another tool with which to teach their players and improve the guys they already have on the roster.  We’ll also know exactly what skills are important in each aspect of the game (base running, fielding, etc.), and as we learn these things we’ll discover other things we want to know.  I’d love to know what you guys think of all this and I’ll try to answer any questions you have about what can and can’t be measured and how we’ll use it in the future.

UPDATE: After I wrote this mess, I discovered this, much cleaner, detailed, mess, written by Baseball Prospectus writer Ben Lindbergh.  I’ll link to it down here because I want you to read what I wrote instead of Ben’s running diary.  Sorry, Ben.

This article was originally published at Knuckleballs, written by Dan Hennessey.


Matt Kemp’s Struggles: Fastballs

Matt Kemp has been struggling this season, and even if you account for the low BABIP (.303) compared to his xBABIP (.335), he is still striking out at a higher rate this season (28.1%) compared to last (22.9%). What has also confounded me is thatm even though he has a higher strikeout rate, he is also setting a career high in walk rate as well (8.1%). Usually, drawing walks and getting struck out are thought of as tradeoffs, opposite ends of the “patience scale.” Last season, Kemp hit .362/.429/.616 against LHP and .278/.329/.453 against RHP, but this season, he is down to .303/.341/.443 against LHP and .240/.306/.445 against RHP. What happened?

There are two questions I’d like to investigate: 1) Is Kemp swinging at more strikes in 2010 compared to 2009 and how? and 2) Is Kemp making less contact in 2010 compared to 2009 and how?

To answer these two questions, I’d like to look at Kemp’s swinging strike percentages (swinging strikes per pitch) and contact percentages (contact made per pitch) against all fastballs (four-seamers, two-seamers, cutters, and splitters). Checking to see any differences between 2009 and 2010 should lend some insight into Kemp’s offensive struggles this season.

I ran several regressions to model surfaces of Kemp’s swinging strike percentages and contact percentages as well as his swing zones. First up, let’s take a look at Matt Kemp’s SwStr% against RHP fastballs:

The red contour lines tell us that Kemp chooses to swing 50% of the time when a ball is thrown within the contour line. This is what I call Kemp’s swing zone, so the red circles refer to this. Further examples and explanations of these swing zones can be seen here. What the swing zones tell us here is that Kemp is swinging less at RHP fastballs in 2010, but is whiffing at a much higher rate as well. He is also missing more RHP fastballs down the middle as compared to before.

Let’s check Kemp’s Contact% against RHP fastballs to see if his swinging strikes are affecting his ability to make contact:

The red contour lines are the same as in the previous two graphs. Clearly, Kemp is making a lot less contact off RHP fastballs, and this tells me that he is putting the ball into play less. The previous two show Kemp swinging and missing more, while these two show Kemp making less contact, particularly on high inside fastballs. Let’s take a look at how Kemp has been doing against LHP fastballs, first at his SwStr%:

Here are his swinging strike plots, Kemp has actually started to swing more on LHP fastballs down and out of the zone (the red contour lines dip in 2010), so his swinging strike rate there is up. But he is also missing a lot more LHP fastballs this year that come down the middle over the plate, ideal pitches for the right-hander to hit out of the park. This is particularly concerning when you consider that Kemp’s wFB/C (runs above average per 100 fastballs) was at 1.64 last season, while that number is down to 0.38 this season. A major part of that drop must have to do with Kemp whiffing on fastballs down the middle that he used to hit.

Finally, let’s look at Kemp’s Contact% against LHP fastballs:

Looking at his contact plots, we see similar colors in where he makes the most contact (making contact 80% of pitches in those areas). But we notice a huge shift in where the epicenter of that hotspot is. Last year, Kemp made contact off a lot of LHP fastballs down the middle of the plate, but this year, the epicenter of that contact hotspot has shifted a full foot up from the direct middle of the zone to the top of the zone. We can infer that Kemp is making less contact off the sweet spot of his bat, and making more high fastball contact that usually result in pop outs. This is problematic and adds further evidence that Kemp is simply missing fastballs down the middle as well as chasing high fastballs.

In general, what I present here is what Dodgers’ fans already know: Kemp is swinging and missing a lot. But I hope that I was able to demonstrate clearly how Kemp has been struggling against fastballs, showing where he is whiffing on them and where he is making less contact.

An article over at Memories Of Kevin Malone convinced me that perhaps Kemp’s whiffing behavior this season (along with swinging less and drawing more walks) could have been caused by Kemp’s change in swinging mechanics. Finally, if you visit my blog at Think Blue Crew, you can read a longer post about Kemp’s offensive struggles against breaking balls as well.

A variation of this article was originally posted at Think Blue Crew, a blog dedicated to data visualization of baseball, basketball, and football statistics. Check it out for more f/x visualizations like this.


Tribe’s Ongoing Draft Difficulties

Successful small-market teams contend by acquiring above-average talent on two levels: (a) development, either through international free agency or the amateur draft, and (b) trades. 

Success in one area, and not the other, can create a small window of opportunity to win, but for a team to compete for multiple seasons it is important to find talent on both avenues.  

Many fans remember John Hart as the architect of those great Cleveland teams in the late 90’s, but few realize how fortunate he was to step into that situation. Much of the groundwork for the Indians renaissance can be traced back to former General Manager Hank Peters.

During his four-year tenure as GM, Peters drafted Manny Ramirez, Charles Nagy, Jim Thome, Brian Giles, David Bell, Chad Ogea, and Paul Byrd.  He also did well when he acquired Sandy Alomar Jr. and Carlos Baerga in a trade with San Diego for Joe Carter

The foundation was set for Hart to succeed.

Hart, as he should, receives a lot of the credit for those teams.  He was one of the first general managers to sign young players, not quite at their peak, to long term contracts. He acquired a borderline future Hall of Famer in Kenny Lofton for Willie Blair and Eddie Taubensee – both would become more journeyman than established player.

John Hart did a lot of things right but what he could not do was mimic Peters’ draft success. 

Hart’s tenure as General Manager lasted ten years, 1992 – 2001, and in that time he managed to draft and sign only three impact players – CC Sabathia, Sean Casey, and Richie Sexson. Both Casey and Sexson were eventually traded for spare parts.

Using WAR (Wins Above Replacement player) as a statistical guideline the top ten players drafted by Hart are:

Name Year Drafted Round Career WAR
C.C. Sabathia 1998 1 38
Sean Casey 1995 2 15.8
Richie Sexson 1993 24 14.4
Russell Branyan 1994 7 8.5
Steve Kline 1993 8 7.9
Luke Scott 2001 9 7.3
David Riske 1996 56 6.9
Ryan Church 2000 14 6.9
Paul Shuey 1992 1 6.8
Jon Nunnally 1992 3 5.3

While all those players carved out careers of varying success, only Sabathia ranks in the top 1000 players in career WAR.  Hart’s predecessor and current GM, Mark Shapiro, was not as lucky and stepped into a situation much more difficult.

 Hart’s final two drafts epitomized the team’s long standing draft failures and would eventually signal the beginning of Cleveland’s first rebuilding process in nearly a decade. 

In 2000 and 2001, the Indians owned 3 of the top 55 picks and 4 of the top 43 picks, respectively.  The only player chosen to mount any type of tangible career was Toronto long reliever/spot starter Brian Tallet.

As with Hart, the foundation for Shapiro was set – but this one signaled the beginning of a rebuilding period and the end of era. 

In November of 2001, Shapiro was promoted to General Manager of a team with high-priced, aging veterans and a farm system wrought with failure.  Shapiro sought to rebuild the organization from the ground up – in four seasons – the Hank Peters way.

He went about stocking the farm system by trading Bartolo Colon for Cliff Lee, Brandon Phillips, and Grady Sizemore.  Shapiro then drafted college phenom Jeremy Guthrie and signed him to a club record $3 million bonus.  He followed up the 2002 draft by selecting Michael Aubrey, a polished college hitter, in the first round the next season.   Shapiro set about reestablishing Cleveland baseball.

On paper, his way parallels that of Peters.  One problem: while he has had success in trading for prospects – Shin-Soo Choo, Asdrubal Cabrera, and Carlos Santana among others – his draft picks have not panned out.  Guthrie bounced between Buffalo and Cleveland, the bullpen and the rotation.  Aubrey spent more time in the chiropractor’s office then on the field and the rest of prospects failed to make a longstanding impact.

A small-market team has to have the ability to replace players as they enter arbitration years or they will never be able to consistently compete – which would explain the two winning seasons the Indians have had since 2002. 

Shapiro and assistant GM Chris Antonetti have changed draft philosophies in recent years and have begun taking players no longer consider “safe” choices, now focusing on “toolsy” players.  In the last three years the team has added much more promising prospects like Alex White, Lonnie Chisenhall, and 2010 second rounder LeVar Washington. 

Much like in 1992 and again in 2002, the front office and the team have both entered into another period of transition.  Chris Antonetti will spearhead another rebuilding effort in hopes of creating something fans have not experienced in so many years – a consistently competitive team.  It, of course, would be easier if he steps in a position much like Hank Peters created and not what John Hart left. 

Hank Peters’ time highlighted how building a successful franchise is built on strong talent development and smart trades.  His time in Cleveland is a perfect example on intermixing strong draft results and dealing higher priced veteran players for prospects.  In four years he was able to set the foundation, along with Hart’s subsequent tweaking, for the Tribe’s revival.


Mark Reynolds’ Whiffs by Pitch Type

Mark Reynolds is perhaps one of the more interesting power hitters heading into his prime this season. He has led the entire league in strikeouts since 2008, holding the all-time record for most strikeouts in a season with 223 K’s last season.

This year, he leads the league once again in strikeouts, as well as perennial leader in swinging strike percentage. He has whiffed on 17.3% of all pitches this season, second place being Ryan Howard at 14.4%. Interestingly, Reynolds does not actually swing at everything a la Jeff Francoeur (60.7% swing percentage) and is barely in the top 50 in percentage of pitches he swings at with 46.8%. This makes it even more amazing that Reynolds leads the league in strikeouts and swinging strike percentage regularly without even taking that many swings. That’s a lot of whiffing going on, and I do suppose that the rare times he does connect the bat to the ball, he hits it hard.

I wanted to know more about Mark Reynolds’ swinging strike percentages to see how he fares against certain pitch types by handedness. Of the five main pitch types, fastballs, sliders, cutters, curveballs, and changeups, Mark Reynolds has seen cutters less than 200 times since his debut, 139 cutters from right-handed pitchers and 41 cutters from left-handed pitchers. He has seen at least 200 pitches for the other pitch types for right-handed pitchers or left-handed pitchers. Ignoring cutters due to small sample size, I will take a look at Reynolds’ swinging strike percentages against four-seam fastballs, sliders, curveballs, and changeups.

Let’s take a look at Mark Reynolds’ swinging strike percentages against four-seam fastballs split by RHP and LHP (1435 pitches from RHP, 468 pitches from LHP):

Four-seam fastballs

Here, it looks like Reynolds falls victim to high fastballs from both right-handers and left-handers. For Reynolds, he whiffs on the outside fastball from LHP stick out as well as the low and inside fastball from RHP.

Here’s a look at Reynolds against sliders (1542 from RHP, 224 from LHP):

Sliders

This is interesting. Reynolds strikes out far more against right-handed pitchers than against left-handed pitchers, but he tends to swing at (and miss) sliders coming from LHP more than he does from RHP. LHP sliders come low and inside while RHP sliders go low and outside, but even LHP sliders coming in from low and outside are swung at and missed by Reynolds.

Curveballs against Reynolds are a whole different story (567 from RHP, 228 from LHP):

Curveballs

Here, Reynolds clearly struggles at connecting on curveballs from right-handed pitchers, some in the strikezone and most low and outside the strikezone. Curveballs from LHP also get Reynolds to whiff sometimes on the inside part of the plate as well as the lower part.

Finally, here’s a look at Reynolds against changeups, which look like his greatest weakness when it comes to missing pitches (430 from RHP, 338 from LHP):

Changeups

This is very telling. The splits against changeups are very different, as Reynolds whiffs on over 50% of changeups from right-handers that are located on the edge of the strikezone at the bottom. This is much different from LHP changeups, where any spot doesn’t look to cross over 30% whiff rate, except the lower righthand corner of the zone. What’s also crazy about this is that when you look at Reynolds against changeups in general, he misses at around 20% of nearly all changeups low outside and nearly all areas within the strikezone as well.

From these plots, there are characteristics of Reynolds’ swinging strikes that are similar to conventional thought and common knowledge, such as chasing high fastballs or low breaking balls. But the key to exploiting Reynolds’ weakness at missing the ball when swinging is definitely throwing timely changeups, especially from right-handed pitchers, while it seems that Reynolds is less prone to whiff against LHP curveballs the most.

This article was originally posted at Think Blue Crew, a blog dedicated to data visualization of baseball, basketball, and football statistics. Check it out for more f/x visualizations like this.


Cesar Izturis’ Inexplicable Continued Employment

As far as value goes, being a +5.8 WAR player is generally considered quite an accomplishment. After all, only three players have accrued that much value this season (Josh Hamilton, Cliff Lee, and Roy Halladay) and the list of those who topped the mark last year is a roll call of stars.

Why, then, is it with a contemptuous sneer that I note that Cesar Izturis is a +5.8 WAR player? Because that’s his career mark. As shocking (or not, if you happen to be an Orioles fan) as this may be, four months of 2010 Josh Hamilton has been more valuable than an entire decade of Cesar Izturis.

And because direct numerical comparisons are always an exhilarating exercise, why don’t we look at the career WAR marks of some other shortstops? Luminaries who have surpassed the “Izturis Line” include Bobby Crosby (in 1,342 fewer plate appearances), Alexei Ramirez (in a whopping 2,637 fewer PA), the significantly-less-than-immortal Damian Jackson (in 1,676 fewer PA), and Pokey Reese (in 1,047 fewer PA). These are but a few members of the list of unspectacular shortstops who were more valuable in fewer plate appearances than Cesar.

In light of this staggering performance, I think the two main points to investigate are why he has been so awful (given his rep as a serviceable option) and what has compelled MLB teams to run him out there on a daily basis since just after George W. Bush took office the first time.

Considering that during every one of my visits to Camden Yards this year I’ve played a mental game of “Is Cesar’s ISO above or below 0.040?” (it currently sits at a tied-for-MLB-worst 0.038), it seems like his prodigious lack of power is a good place to start. Despite posting a very reasonable strikeout rate (9.7%), an OK walk rate (4.9%), and a mediocre but not crippling BABIP (.281) in his career, Izturis boasts a meager .276 lifetime wOBA, a direct result of an equally abysmal .068 career ISO. Never in a full season has he produced an ISO above .100, although he did give the Blue Jays a (comparatively) robust .119 ISO in 46 games in his rookie year in 2001. All of this ineptitude translates to an insane -169.9 batting runs over 4,185 PA spanning the past decade.

But, you argue, he’s not known for his bat. It isn’t fair, hypothetical you argues, to judge him without looking at the superior defense he provides. This segues nicely into my second question, which is why MLB teams continue to pay him to play every day.

As nearly as I can tell, Izturis’ rep among baseball media and traditional fans is that he’s a superb fielder and a decent overall player. After all, you can certainly do worse than a Gold Glove winner at short, right? Unfortunately for anyone who espouses this belief (I’m looking at you, Gary Thorne and Jim Palmer), it just isn’t true.

No matter how much you want to complain about UZR or DRS as defensive metrics, I’m willing to believe that over the course of 8,300+ career innings they paint a fairly accurate picture of Izturis’ true talent level defensively. UZR has him at +6.7 per 150 games at SS (+45.6 runs total among all positions) while DRS has him at 48 career runs saved, so the metrics agree. At that +6-8 runs per season level, Cesar has been above-average but certainly not elite with the glove.

Is this a case of MLB teams overvaluing defensive contributions at the expense of offense? After all, even the two-year, $5 million deal the Orioles gave him prior to the ’09 season has been more than the Venezuelan has been worth (+0.8 WAR during his time with the O’s). Perhaps teams let their scouting departments and Izturis’ hardware (’04 Gold Glove winner) cloud their judgment, or perhaps there just aren’t any other near-replacement-level shortstops out there. Although I’m skeptical about that last possibility.

And yet, for a guy who has hit 15 career home runs (or as many as Jose Bautista has hit since Independence Day), topped the +0.2 WAR mark just thrice in his career, and averaged a paltry +0.4 WAR since 2004, Izturis is certainly doing pretty well for himself. After all, he gets to start every day for a major league team.

Nathan Biemiller is a junior at Franklin and Marshall College who writes regularly for nothing but his college newspaper. If you would like to offer him a place to write consistently (gratis!) or if you just have questions or comments, you can e-mail him at nbiemill@fandm.edu.


xBABIP Experiment: Mark Kotsay

I am tired of Mark Kotsay. I am tired of his automatic 4-3 ground outs. I am tired of his lazy fly balls to left center. I am tired of his .190 batting average with runners in scoring position. I am tired of his .688 OPS. But most of all, I am tired of people in the White Sox organization defending Mark Kotsay. From Ozzie Guillen to Hawk Harrelson and Chris Rongey, the excuses are coming from every corner of the organization. And as an objective White Sox fan, the constant excuses are getting tiring. Luck or no luck, Mark Kotsay is a bad baseball player, that much is for certain.

Kotsay does nothing well and he contributes nothing on the field to this White Sox team, as shown by Mark Kotsay’s -0.6 WAR, good for the fourth worst in all of Major League Baseball amongst players who have at least 280 plate appearances. This shouldn’t come as a surprise, as of this moment, Mark Kotsay is hitting .228. Yet you have Ozzie Guillen saying things like, “Personally, the numbers out there for Kotsay [are not what] he deserves.” Followed by…“You can ask his teammates, you can ask [hitting coach] Greg Walker. He should have better numbers than what he has.”

You can ask any average White Sox fan or anybody in the White Sox organization and they will say that Kotsay has been unlucky or he “deserves” better. However, just how much better? Fortunately for us, the great people in the sabermetric community have come up with something that tries to battle this thing called luck. I think everybody knows of BABIP by now, but there is something better, something more contextual: xBABIP (Expected Batting Average on Balls In Play) . The concept is simple, take the mean batting average of line drives, ground balls, and fly balls that are not home runs, then create a BABIP based on those averages.

So what if Kotsay wasn’t lucky or unlucky? What if this was a perfect world where the average always happens? For the record, Mark Kotsay is a good player for this little experiment since his career offensive numbers are about as average as you can get. Currently, Mark Kotsay’s xBABIP is .269. This is more or less based on his line drive rate of 15.9%. His actual BABIP is .239. So as you see, he has been pretty unfortunate as that’s a 30 point disparity. Now let’s take this a step further, let’s say that .269 xBABIP is his actual BABIP. Mark Kotsay has hit 222 balls into the field of play (this does not include home runs), if he gets hits 26.9% of the time on those 222 balls, he would have 60 hits. Add his 7 home runs to those 60 hits and you have 67 hits in 258 at bats, which comes out to a .259 Batting Average. What about his On Base Percentage? Taking those 67 hits while adding his 30 walks divided by his 286 plate appearances, we would get an OBP of .336. So far so good right? Looks like Mark Kotsay would be a decent ballplayer if it wasn’t for those “hang wiffums” right?

Hold on just a second here, we can also apply this to his Slugging Percentage. We can play the rate game, which is a dangerous game to play, but we’ll do it anyways. Of the 53  hits Mark Kotsay has put in play, 38 have been singles, 13 have been doubles, and 2 have been triples. So from this, we can see that 71% of Kotsay’s non-home run hits have been singles, 25% have been doubles, and 4% have been triples. As I said before, this is a dangerous game to play, almost a fallacy, but since Kotsay does have an appropriate sample size here, it might be safer than usual. So taking these new numbers to his 60 expected hits, his new hit figures are 43 singles, 14 doubles, and 2 triples. This would result in a Slugging Percentage of .414. By adding Kotsay’s expected OBP and SLG together we come up with a .750 Expected On Base Plus Slugging or OPS, which is just about average.

Alright, so how do these new expected rates affect Mark Kotsay’s value? If we calculated an expected wOBA from these newly calculated values, Mark Kotsay would have a .329 Expected wOBA,  just about average. We can then calculate this into a run value to produce a new expected WAR. In this case, Kotsay would have produced -0.99 batting runs (without ballpark adjustment) in comparison to the average replacement player, much better than his previous rate of -6.2. So in this case, Kotsay’s WAR goes from -0.6 to -.08. A half win difference can go a long way at times.

So what does this tell us? Well first off, it says that Kotsay is a very average hitter in a luck-isolated world and average hitters should not be DHing 1/3 of the games for a team that already has issues scoring runs.  It also tells us is that Mark Kotsay has no place on this current White Sox team. He is a replacement level player who is only capable of DHing and playing 1B and that’s even if he hit like the “deserved” to hit. With Mark Teahen coming back and Brent Lillibridge already on the team, this team could be incredibly versatile. Isn’t that what Ozzie Guillen wanted? Isn’t that why Ozzie said no to Jim Thome, who is clubbing the ball for the rival Twins and is also a great clubhouse guy? This love affair with Mark Kotsay has gone too far. He is in fact costing this team on the offensive side of the ball. I would have no problem if Kotsay stays on this team as a pinch hitter and starts maybe once a week; he’s apparently a good guy in the clubhouse (as is his wife, I imagine). But the fact that this replacement level player has played 3/4 of this team’s games is disturbing. With the way that this situation has been tended to, you’d think the White Sox’ new slogan would be something along the lines of “White Sox Baseball: Here to Make Friends, Not to Win”.