Visualizing Major League Baseball During the Aughts

2010 marks the end of the “aught” decade for Major League Baseball.  I thought I would take the opportunity to analyze the last 10 years by visualizing team data.  I used Tableau Public to create the visualization and pulled team data from ESPN.com (on-field statistics) and USA Today (team payroll).

The data is visualized through three dashboards.  The first visualizes the relationship between run differential (RunDiff) and OPS differential (OPSDiff) as well as the cost per win for teams.  The second visualization is in table form and can be sorted and filtered along a number of dimensions.  The final visualization looks at expected wins and actual wins.

Read the rest of this entry »


Why I Can’t Ignore Stats

If some of you have been active in following your Hall of Fame voters, you probably read this post on Jon Heyman discussing his ballot. He spent the majority of this piece stating why he didn’t vote for Bert Blyleven, and then he explained why he voted for Jack Morris instead. I promise this is not intended to be a “Vote Blyleven, not Morris!” post, because I’m more interested in something else. Heyman claims that Morris had a bigger impact in his games than Blyleven. Well then, what happens if I never experienced this impact?

Read the rest of this entry »


A Guaranteed HOF Snub

On January 5, the Hall of Fame class of 2011 will be announced. It appears that Bert Blyleven will finally get the call after 14 years on the ballot. Roberto Alomar is likely to receive the necessary votes as well. There will be a long list of deserving candidates left out this year. After the announcement, there will be no shortage of analysis of the snubbery. But we know for a fact that, even before the votes are tallied, one deserving candidate will not be inducted.

Read the rest of this entry »


A Pitch F/X Look at Cliff Lee

Lee has a tremendous variety of movement in his pitches. He has three pitches that tail away from righties (fourseam, twoseam, changeup) and a nasty curveball with a ton of movement. For most pitchers this would be plenty; but Cliff Lee is not like most pitchers. He also packs a cutter with as much horizontal movement as some sliders.

We can see this with the following graph, which is from the catcher’s perspective (same with all following graphs):

CLeepitchmovementandlocation

CU=curveball, FC=cutter, FF=fourseam, FT=twoseam, CH=changeup. The black box represents the strikezone and has the average pitch locations for each pitch.

Looking at a pitcher’s entire repertoire like this is useful, but it can be more interesting to look at pitches individually when it comes to pitchers like Lee.

FFdensRHw_strikezone

FFdensLH_w_strikezone




Against righties his location is pretty varied with the fourseam. He mainly locates the pitch middle-away, but often goes up and in too. Against lefties, he consistently pounds the outer half.

Pitch Type Count Selection% Swing% Swing-Miss% HR% GB% LD% FB%
vs·RHB
FF 352 13.8 50.9 12.3 0.9 28.9 25.0 46.1
vs LHB
FF 305 36.4 47.2 13.2 0.7 45.6 12.3 42.1

FTdensRHw_strikezone FTdensLHw_strikezone


Against righties he primarily throws the twoseam pitch up and away, which explains why he has a high flyball rate on a pitch typically associated with groundballs. Against lefties the pitch is pretty much thrown low and over the middle of the plate.

Pitch Type Count Selection% Swing% Swing-Miss% HR% GB% LD% FB%
vs·RHB
FT 1174 46.2 48.2 14.5 0.5 31.0 21.0 48.1
vs·LHB
FT 241 28.8 46.1 11.7 0.0 59.6 27.7 12.8

FCdensRHw_strike

FCdensLHw_strike


Against righties the pitch is a real weapon; the cutter results in many whiffs and a solid amount of groundballs. Against lefties the pitch isn’t as remarkable, but still solid. His location against lefties with the cutter is very similar to his location with his fourseamer against lefties.

Pitch Type Count Selection% Swing% Swing-Miss% HR% GB% LD% FB%
vs·RHB
FC 510 20.1 54.9 20.4 0.6 47.0 23.0 30.0
vs·LHB
FC 185 22.1 49.7 17.4 1.1 41.9 18.6 39.5

CUdensRHw_strike

CUdensLHw_strike


His location against righties and lefties is pretty much the same, though he does backdoor the pitch occasionally to righties. He pretty much only throws his curve late in counts for strikeouts.

Pitch Type Count Selection% Swing% Swing-Miss% HR% GB% LD% FB%
vs·RHB
CU 170 6.7 44.1 37.3 0.0 76.0 12.0 12.0
vs·LHB
CU 49 5.8 36.7 38.9 0.0 20.0 20.0 60.0

CLeedensCH

Pitch Type Count Selection% Swing% Swing-Miss% HR% GB% LD% FB%
CH 293 11.5 58.7 29.7 0.3 42.5 17.5 40.0

Only one graph here because he only threw 20 changeups to lefties the entire year, so I’m just going to ignore those. According to Fangraphs pitch run values, his changeup was his most effective pitch this year. And you can see why; he was great and locating the pitch down and away.

*all data and tables are from Joe Lefkowitz’ site.

*This article was originally posted on www.pendingpinstripes.net


Graphical wOBA by Count

I am a big fan of graphs and baseball. Fangraphs made me excited because putting complex data into reasonably easy to understand graphs helps open up sabermetrics to more fans. I’m a big fan of statistical analysis, but after a while, a table full of numbers just starts running together and stops making sense. That’s what makes graphs such an effective tool.

I’ve dabbled in graphs myself. When people were creating the WAR graphs to compare hall of famers, I made a sample graph showing cumulative WAR by age on Tom Tango’s Book Blog:

(click for a larger image)

Read the rest of this entry »


Another Season Gone By Without Realignment

For the first time ever in the divisional era, 1 division has managed to run the table against the other two. Barring the Royals taking 3 of 4 from Tampa over the weekend, there won’t be a single team in the American League who will have put together a winning record against the AL East.

Since 2007, only 8 teams in the American league have been able to put together a winning record against the AL East, (LAA 3 times, OAK twice, TEX, DET and SEA once). This season, as things currently stand, the Central and West divisions have combined to go 149 – 198 (.429%) winning percentage against the AL East, a record comparable to the Cleveland Indians or the Washington Nationals.

What’s worse, is that the AL East will end again with the team that has either the worst, or second worst record in the American League (assuming Buck Showalter doesn’t phone it in this weekend), but looking deeper into their divisional performance, how bad exactly is the AL East’s punching bag?

(Record, Winning Percentage, % Diff from total)
2010 Orioles: Non-AL East Record (39-47*, .453, +.054), 4 games at home v. Detroit left
2009 Orioles: Non-AL East Record (40-50, .444, +.049)
2008 Orioles: Non-AL East Record (46-43, .516, +.095)

AVERAGE % Diff = +.066, or 10.7 Wins

And the 4th place Jays?

2010 Blue Jays: Non-AL East Record (43-43*, .500, -.019), 4 games in Minnesota left
2009 Blue Jays: Non-AL East Record (49-41, .544, +.081)
2008 Blue Jays: Non-AL East Record (49-41, .544, +.013)

AVERAGE % Diff = +.025, or 4 Wins

This means, on average, even the worst that the AL East has to offer, averages almost 5% points higher against non-divisional foes, or roughly 7 wins better across a 162 game season. How much are those 7 wins worth? If you’re Tampa, obviously zero since the only way to fill the ballpark there is to give away 20,000 tickets, but to a team like Seattle or Toronto (as highlighted above), the 2009 Jays managed 75 wins at 23,162 fans a game, whereas the 2008 version of 86 wins averaged 29,626. Obviously its a very rudimentary look at attendance and I’m ignoring plenty of factors, but the fact is and has always been that outside of Tampa Bay, people will inevitably jump on the bandwagons and go to see winning teams win ballgames. The point I’d like to be making here is the financial impact of regional divisional slotting, and if you take the leap with me, obviously the analysis will have a more profound impact.

For comparison purposes, lets look at the AL CENTRAL 4th and 5th place finishers

2010 Kansas City: Non-AL Central Record (36-50*, .418, +.007), 4 games at home v. Tampa left
2009 Kansas City: Non-AL Central Record (33-57, .367, -.034)
2008 Detroit: Non-AL Central Record (47-43*, .522, +.065), 13-5 in interleague games

2010 Cleveland: Non-AL Central Record (33-55, .389, -.039)
2009 Cleveland: Non-AL Central Record (33-55, .389, -.012)
2008 Kansas City: Non-AL Central Record (44-46*, .489, +.025). 13-5 in interleague games

AVERAGE % Diff = .002, or +.4 Wins

The 2008 figures are both skewed in the positive directions due to AL Central success during interleague play, getting to play the even weaker NL West in the majority of their IL games, and zero games against that division’s winner (84-78 Dodgers), so discounting those games, the result is:

AVERAGE % Diff = -.015 or -2.7 Wins

Does the AL East really cost a team 7-10 Wins a season like it has the Orioles? Probably not, there are plenty of other factors that go into a teams eventual W-L, but traditional Strength of Schedule metrics that value the winning % of your opponents, or Runs Scored/Runs Allowed Differentials often don’t capture the simple facts that your record and team stats will be positively influenced by facing the Royals 18 times a season. Who knows, maybe the 2010 Blue Jays, who lead the majors in slugging and home runs, could’ve competed this season in another division where its young pitching staff wasn’t exposed to the three highest scoring teams in the league (NY Boston Tampa) for 54 games? Under this current alignment, we can never be too sure.

On the positive, at least the divisions regionally make sense in their current form, unlike football. I’m sorry, putting Miami in the AFC EAST, Baltimore in the AFC NORTH, and Indianapolis is in the AFC SOUTH makes about as much sense as the BCS national championship formula.


Gallardo Must Throw More Strikes

Of 132 starting pitchers that have thrown at least 100 innings this season, Brewers’ right-hander Yovani Gallardo has the third lowest O-Swing% in the majors at 24.8%.  That means opposing hitters swing at 24.8% of his pitches that are outside the strike zone.

Even Nick Blackburn and Brian Bannister get more swings at pitches outside the zone.

Common sense suggests the better a pitcher’s repertoire, the more hitters will chase bad pitches.  Therefore, we would expect Yovani Gallardo to have below-average stuff and a lower strikeout rate because he gets fewer hitters to chase bad pitches.

That is, we would expect that to be the case if we merely looked at Gallardo’s O-Swing% in a vacuum.

As it turns out, Gallardo gets very few hitters to swing at pitches outside the zone because hitters rarely swing in general with Gallardo on the mound.  His 40.8% swing rate is the fourth lowest in the majors (min. 100 innings).  Not only that, but hitters also do not make contact often with his pitches when they do swing – as evidenced by his 79.6% contact percentage and 8.3% swinging-strike percentage.

We can infer from these numbers that hitters flat-out do not see Gallardo’s pitches very well from the mound.  Hitters do not swing often – and when they do, they do not make much contact.

In short, Gallardo has well above-average stuff.  His 92-94 MPH fastball is a plus-pitch, and his spike curve can be deadly when he is not spiking it before it reaches the plate.  Moreover, Gallardo has developed a slider, which is arguably becoming his best pitch.

So, how can Yovani Gallardo transform himself from a top-tier #2 pitcher to a full-fledged ace?

The answer appears to be simple.  Throw more strikes.

Of the starting pitchers that have thrown at least 100 innings in 2010, Yovani Gallardo throws the 7th fewest balls in the strike zone. (Livan Hernandez is predictably number one).  That high-percentage of balls outside the strike zone would be acceptable, but as we have established earlier, Gallardo does not induce many swings outside the zone.

That obviously leads to higher walk rates, higher pitch counts, and lower innings totals.  Those are all things that must change for Yovani to be a true ace in Major League Baseball.

Many of you are likely thinking: “Of course Gallardo should throw more strikes. That is an obvious statement.”

Not necessarily.

Some pitchers, such as Livan Hernandez and Jamie Moyer, live on throwing balls outside the strike zone.  They do not have good enough “stuff” to live by throwing strikes.  They have contact percentages north of 92% on balls inside the zone, so they bait hitters and get them to swing at poor pitches.  Both of them have literally made a career avoiding the strike zone

Gallardo is not that type of pitcher.  It seems opposing hitters have decided their best chance to reach base is to actually not swing at all, merely hoping the upcoming pitch is a ball – which it is 57.4% of the time.  That accounts for the high pitch counts, the low walk totals, and the low inning totals.

The only worry about Gallardo throwing more strikes is that more balls would then be put in play, which may not be a positive outcome with Milwaukee’s below-average defense.  It could effectively be argued that Gallardo is better suited to stick to walks and strikeouts – though the Brewers’ front office should work to solidify the defense this winter.

Throwing more strikes would lower the pitch counts, lower the walk rates, and increase the number of innings pitched without sacrificing production on the mound.  Opposing hitters do not make much contact whether or not Gallardo throws the ball in the strike zone, so he may as well cut the walk rate and work in the strike zone much more often.

His development of a slider should aid that mission.  That spike curveball cannot be thrown consistently for strikes, but his new slider (sometimes cutter) can be thrown for a strike on any count.

Perhaps that is the missing piece that can help Yovani Gallardo transform himself into a bona fide ace for the Brewers.  Or perhaps the right-hander simply needs to make a conscious effort to not nibble.

Whatever the case may be, everyone certainly knows the Brewers organization could certainly use an ace.  Milwaukee’s success in 2011 may hinge on whether or not Gallardo is able to take the next step in his development – which seems to be consistently throwing more strikes in every start.


Wainwright Throws Fewer Fastballs, Increases Effectiveness

Adam Wainwright’s strikeout rates keep increasing. In over 400 innings in AA and AAA during his age 21-23 seasons, his strikeout rate was 7.8 per nine innings. When he was elevated to the majors in 2006 as a relief pitcher his strikeout rate took an expected jump to 8.64 K/9. At the time he was throwing his curveball 25.9% of his pitches. A return to starting the following year led to a decrease in both his curveball use (18.6% in 2007 and 17.9% in 2008) and his strikeout rate (6.06 K/9 in 2007 and 6.20 K/9 in 2008).

In 2009, Wainwright made a change in his pitch selection, reverting back to the curveball percentages from his bullpen tenure. The increase in curveball use (24.0% in 2009) increased his strikeouts per nine to 8.19 and turned him from an above-average pitcher (3.90 and 3.78 FIP in 2007 and 2008, respectively) to a Cy Young contender (3.11 FIP). The increased use of the curveball in 2009 also increased its effectiveness, doubling to 2.71 wCB/C. The effectiveness on his slider tripled. Unfortunately, his fastball decreased in effectiveness, going from essentially average to -.75 wFB/C.

In 2010, Wainwright has taken his curveball use to another level, increasing to 28.5% of his pitches and his strikeout rate to 8.26 K/9 and lowering his FIP to 2.86. He has not sacrificed control, lowering his walk rate to 2.21 BB/9. His curveball and slider, which may be more of a cutter, have been slightly less effective, but still very useful pitches. The significant change has occurred in the effectiveness of his fastball. Wainwright has decreased the number of fastballs thrown to a career low 46.5% of pitches. With this decrease has come a drastic increase in the effectiveness of the fastball without changing the velocity, moving to 1.00 wFB/C from last year’s total of -0.75 wFB/C.

Also of note, increasing his strikeouts has not affected his efficiency, with 15.7 P/IP in 2007, 14.8 P/IP in 2008, 15.5 P/IP in 2009 and a career low 14.6 P/IP in 2010.

Wainwright has remained effective this season throwing the third highest percentage of curveballs of any pitcher. (Only Wandy Rodriguez and Gio Gonzalez have thrown a greater percentage of curveballs this year.) When you have the curve he has, you can’t blame him. The consequence, whether inteneded or not, is a sea change in the effectiveness of his fastball.  Another Cy Young-caliber season at a bargain price for the Cardinals.


Carsten Groundball Sabathia

Earlier in the year I observed that CC was getting significantly more groundballs than earlier in his career. At this time we can see that he has maintained this new approach throughout the year (via fangraphs):

Read the rest of this entry »


Making Cent$ of Home Field Advantage

Over the last couple of weeks there has been a lot of debate about the value of home field advantage in baseball. The discussion was crystallized most recently when the Yankees rested key bullpen members in their first place showdown against the Rays, but it has really be going on since the advent of the Wild Card.

Yesterday, we wondered about the value of finishing first from an accomplishment perspective, but ultimately, that is a very intangible way of looking at the question. At SI.com, Joe Seehan looked at home field advantage from a competitive standpoint and came to the conclusion that it really wasn’t an advantage at all. According to Sheehan’s research, the number one seed has advanced to the World Series in only eight of 24 chances since 1998, when the current playoff format was established. What’s more, over that span, the home team has only gone 45-39 in all post season series, according to Sheehan. In other words, there really isn’t a home field advantage in baseball during the postseason.

There is at least one more vantage point from which to consider this question, and it could very well be the most important: economics.

In the post season, gate revenue (i.e., attendance) is divided between the players and hosting team using the following format:

  • Players: 60% of gate receipts from first three games of LDS and first four games of LCS and World Series; no contribution from other games.
  • Home team: 40%* of gate receipts from first three games of LDS and first four games of LCS and World Series; 100% of gate receipts from all other games.

*A small percentage (approximately 1.5%) of LDS gate receipts goes to the umpires, while 15% of LCS and World Series gate receipts go to MLB.

On the face of it, there seems to be an economic advantage to having home field. But, is it real, and if so, how significant is it?

Read the rest of this entry »