Dave Cameron recently had an interesting discussion of the greatness of Rickey Henderson here at FanGraphs. The post left a lot of angles to uncover, some of which require manipulation of data beyond my present capabilities. With that said, one of the astonishing things was that Rickey was caught stealing 42 times one year, nearly negating 130 successful steals (at least from a conventional sabermetrics standpoint).
So what became interesting to me was what might happen if you were to look at who the most effective stealers were of all time. A truly deep look, complete with changing WPA calculations for each situation, would be impossible. (Down 2 in the 8th trying to steal 3rd with 2 out would be a disaster if failed, while getting caught stealing 2nd with 2 outs up 10 runs would be trivial, except for the beaning that might follow next game). And in some sense, each team is different. It might be less advantageous to steal with Barry Bonds at the plate, instead of a line drive hitter such as Barry Larkin, so the calculation isn’t perfect.
But I thought I would take a stab at it. We take the breakeven rate in Moneyball of 70% steals required just to break even. Each unsuccessful steal is, of course, a needless out. Each successful steal is therefore the equivalent of 3/7ths an out avoided (if we keep the sanctity of the 70% breakeven rate).
A total of 7 successful steals is as good as 3 avoided outs, but 3 unsuccessful steals is still worth 3 needless outs. So someone that steals 70 bases and is caught 30 times is no more valuable from a steals standpoint than someone that attempts 0 steals. He may add value on the basepaths in other ways, but we are strictly looking at steals here.
With that said, I took a look at the past 50 years (starting with 1959), and took every season where someone had more than 50 steals. As for why I chose 1959 — well, beyond being a round number, it should be noted that before 1912 data on times caught stealing was unavailable, and even through the 20’s the data was highly unreliable. It just so happened that the 30’s and 40’s and 50’s was not a running era, so we can say the modern running game began 50 years ago.
So who is the most effective stealer of all time by season?
It wasn’t Rickey.
Vince Coleman, in his second year, had 107 steals and was only caught 14 times. He could have been cuaght 30 extra times and still been ahead of the game. What is amazing, though, is in spite of that massive edge, he was probably worth something in the arena of 5.9 extra runs by my (conservative) calculation.
I figured that, over 26.5 outs (close to the true average, since, while home teams sometimes only need 24 outs, there is the longtail risk of 12+ inning games where each teams need 36+ outs), that if you score 4.9 runs per game, each out saved is worth just under .2 runs. (If you score 5 runs and need 25 outs to do it, each out would be exactly .2 runs. I realize that, due to leverage reasons, you could safely increase this worth, it’s just not something I felt comfortable calculating with limited resources). Then taking a slightly modified runs-wins formula (just about using 10 runs = 1 win), I can say that Vince Coleman’s steals are worth .6 wins conservatively. Alternate calculations factoring in leverage could double this number, so the best stealing base season of all times, was only worth .6-1.2 wins.
What were the top 5 seasons after Vince overall?
Well, glad you asked! I can send the whole spreadsheet to all who ask, but I thought I’d post here first.
- Vince Coleman 1986: 107 Steals, 14 CS
- Maury Wills 1962: 104 Steals, 13 CS
- Rickey Henderson 1983: 108 Steals, 19 CS
- Rickey Henderson 1988: 93 Steals, 13 CS
- Vince Coleman 1987: 109 Steals, 22 CS
- Tim Raines 1983: 90 Steals, 14 CS
Who were the worst offenders on the basepaths among prolific modern stealers?
Steve Sax in 1983 got caught 30 times in 86 attempts. That’s good for -1.1 runs. Bill North and Alex Sanchez rounded out the leadfooted three.
Overall, though, prolific stealers were generally net positive. Only 8 out of 213 seasons on this list had negative expected value.
The first conclusion I draw is that prolific basestealers, on the whole, are a net positive, even if it is a small positive. Obviously, there exist on the individual level suboptimal situations where runners still take off, but the large majority of prolific stealing seasons had high success rates. A highly negative rate of steals, of course, will be curtailed by coaches in due time before they could make this list.
Now, one more point. What is a true breakeven point? In the Moneyball Era, with AL teams in 2001 averaging 4.9 runs a game, 70% might be accurate. But in a dead ball era, a caught stealing might well have been more okay, because Albert Pujols wasn’t around to hit you home from first.
It then dawned on me that you could sort of model (crudely but still) a good breakeven model for steals based on runs scored per game.
4.9 runs per game = 70% breakeven
3.6 runs per game = 60% breakeven rate
and so on…
The values sort of fly apart at the extremes, but if your team is scoring 8 runs a game, it sort of would only make sense to steal if you had Usain Bolt standing at first, since, odds are the next guy might well get on base as well.
So in the spreadsheet I am sharing, you can change the average runs per game scored. If you move the average to 4.4 runs per game, Vince Coleman in 86 is still the best season of all time, but some other seasons creep up in value. (The breakeven at 4.4 runs per game with my crude calculator is 66%.)
Anyway, for anyone who wants the spreadsheet, post in the comments with a way of getting it to you — or, if FanGraphs is willing to let me upload it, I would be happy to do so.
Contextualized, Vince Coleman’s 1985, when only 4.07 runs per game were scored, makes his efforts extra impressive. It is a shame that he wasn’t great with the bat, and something of a buffoon to boot, because once he got on the bases, he legitimately did alter the outcome of games.