Modeling One-Run and Extra-Inning Games

When the 2021 regular season concluded, there was the following exchange in the “Hey Bill” section of the Bill James Online baseball community, with Bill’s response starting at “Answered:”

Hey Bill! 

 Is it possible to calculate an expected number of 1-run games for a team in a season? The reason I ask is that the Mets played in 66 1-run games this year, 40.7% of their games. That seems like a whopping big number . . . but is it? 

Thanks 

Kevin

Asked by: kgh

Answered: 10/4/2021

It’s a very large number, but I wouldn’t know how to calculate an expected number. I don’t even know what the variables would be. I suppose one-run games are slightly more common among teams which are near .500, and obviously they would be significantly more common in a low-run environment than in a high-run environment.

Inspired by this interaction, I built a dataset to answer those questions and a few more that popped up along the way. Let’s start with the easiest one:

The 2021 Mets played 66 one-run games, or 40.7% of their contests. Is that a whopping big number?

Yes, that is a big number, but not “whopping” big.

The Mets did play 66 one-run games, with 13 of those in extra-innings and 53 in “regulation.” They played 18 total extra-inning games. This gave them a total of 71 games that were decided by one run or in extra-innings. Several teams listed below played more one-run games than the Mets did in 2021.

Baseline

The historic average of all games that are one-run or go into extra-innings is 32.05%, or 52 per 162-game season.

Dataset

I took the game logs for every team/season from 1900 to 2020 and put them in run differential categories. I called extra-inning games category 0. Double-digit run differentials went into category 10. The remaining games were all put in the slots corresponding to the run differentials. For this study, we’re only concerned with the games in slots 0 and 1. What I didn’t do is categorize the extra-inning games in a slot corresponding to the final outcome. In this case, we’re trying to project the number of one-run games, so I thought it would be more instructive to look at game states at the “end of regulation.” Once a game goes to extra-innings, especially with the zombie runners introduced in 2020, it might end up with a differential of more than one run. It seems to me the conditions for producing an extra-inning game are the same as for one-run games.

What is the record for most one-run games?

The 1971 Astros and the 1993 Royals both played 58 one-run games (not including extra-inning games). That is also the record for percentage of one-run games.

1.	Astros 1971, 35.8% (58 games, 25-33 record)
1.	Royals 1993, 35.8% (58 G, 30-28)
3.	Reds 1946, 35.7% (55 G, 22-33)
4.	Dodgers 1968, 35.2% (57 G, 27-30)
4.	Phillies 1968, 35.2% (57 G, 31-26)
4.	Pirates 1918, 35.2% (44 G, 21-23)

Which teams won the most one-run games?

1.	White Sox 1972, 35-13, .729 WPct
2.	Reds 1940, 34-11, .756 (won World Series)
3.	Cleveland 1917, 33-17, .660
3.	Yankees 1968, 33-22, .600
5.	Amazing Mets 1969, 32-17, .653 (won World Series)
5.	Astros 1979, 32-24, .571

The year of the pitcher, 1968, is well-represented here and elsewhere in this study, showing the strong influence of a low run environment on tight games.

Which teams lost the most one-run games?

1.	Yankees 1966, 13-35, .271 WPct
2.	Giants 1946, 20-34, .370
3.	Reds 1907, 10-33, .233
3.	Reds 1946, 22-33, .400
3.	Astros 1971, 25-33, .431

What is the record for extra-inning games played?

These are all extra-inning games, whatever the run differential ends up being.

1.	Red Sox 1943, 19.1% (29 games, 15-14 record)
2.	Browns 1943, 18.4% (28 G, 10-18)
3.	White Sox 1922, 18.2% (28 G, 16-12)
3.	Tigers 1957, 18.2% (28 G, 13-15)
5.	Cleveland 1943, 17.7% (27 G, 13-14)

Which teams won the most extra-inning games?

1.	Cleveland 1949, 18-1, .947 WPct
1.	Pirates 1959, 18-1, .947
1.	Expos 1988, 18-7, .720
4.	Braves 1999, 17-5, .773 (World Series runner-up)
4.	Diamondbacks 2013, 17-8, .680

You probably recognize the ’59 Pirates. Roy Face set the pitcher’s winning percentage record for the Pirates that season, going 18-1. The team only went 60-75 in games that ended in 9 or fewer innings. You might expect regression the following season, but of course, the Pirates were World Champs in 1960, winning more than 70% of their extra-inning games (12-5).

Which teams lost the most extra-inning games?

1.	Browns 1943, 10-18, .357 WPct
2.	Phillies 1924, 7-17, .292
2.	Pirates 1984, 9-17, .346
2.	Reds 1942, 9-17, .346
2.	Dodgers 1967, 10-17, .370

What is the record for most combined one-run and extra-inning games played?

1.	White Sox 1968, 47.5% (77 games, 33-44)
2.	Reds 1946, 47.5% (73 G, 30-43)
3.	Astros 1971, 46.9% (76 G, 33-43)
4.	Reds 1916, 46.4% (71 G, 29-42)
5.	Marlins 2014, 45.1% (73 G, 38-35)

Which teams won the most combined one-run and extra-inning games?

1.	Orioles 1970, 45-19, .703 WPct (won World Series)
2.	Yankees 1943, 44-23, .657 (won World Series)
3.	Reds 1940, 42-19, .689 (won World Series)
3.	Mets 1969, 42-23, .646 (won World Series)
3.	Orioles 1974, 42-23, .646 (</theme>)
3.	Giants 1978, 42-29, .592

Which teams lost the most combined one-run and extra-inning games?

1.	White Sox 1968, 33-44, .429 WPct
2.	Astros 1975, 16-43, .271
2.	Reds 1946, 30-43, .411
2.	Astros 1971, 33-43, .434
5.	Pirates 1986, 19-42, .311

With the lists out of the way, let’s move on to the meatier question.

Is it possible to calculate an expected number of 1-run games for a team in a season?

We can do a fairly decent job, depending on your definition of “fairly decent.” Many of you have better math and analysis chops than me, so you might be able to get closer.

I took all 2,542 teams and put them into groups based on both winning percentage and run environment, as proposed by Bill James. Following Bill’s lead is often a good idea.

Model 1

I put teams in five roughly equal groups according to their winning percentages and labeled them “terrible,” “poor,” “average,” “good,” and “great.” Don’t get hung up on whether or not some of the “great” teams are really “good” or the “poor” teams were actually “terrible.” That’s not the point. They’re just labels, so we can easily name and distinguish the groups. I used an odd number of groups to ensure there was a central segment that we could call “average,” hovering around .500, to test Bill’s assertion that teams around .500 would be expected to play more one-run games.

Here are the winning percentage ranges:

Model 1 Winning Percentages
Group# TeamsMin W% (Ws per 162)Max W% (Ws 162)
Terrible500.235 (38).426 (69)
Poor507.427 (69).480 (78)
Average524.481 (78).526 (85)
Good503.527 (85).571 (92)
Great508.572 (93).763 (124)

I calculated the runs per 9 innings (R9) for every team/season. This is the total runs scored by the team and their opponents. One could use separate run environments for home and road games. I opted not to do so in this case, thinking the benefit was not worth the effort or complication. As you’ll see, the impact of the run environment on one-run games is clear-cut. Then I put teams in five roughly equal groups: “lowest” run environment, “low,” “average,” “high,” and “highest.” I wanted to match the number of groups used for the winning percentage side of the analysis.

Here are the run environment ranges:

Model 1 Run Environments
Group# TeamsMin R9Max R9
Lowest5075.86*7.96
Low5047.978.58
Average5208.599.12
High5069.139.77
Highest5059.7813.91**
*1908 Brooklyn **1930 Phillies

Given those two variables, I arranged all the teams into a two-dimensional matrix (think “magic quadrant” but 5 x 5), like “Terrible/Lowest” and “Great/Highest.” Groups had a minimum of 86 teams (Terrible/Low and Good/Highest) and a maximum of 130 (Average/Average). Then I took the median percentage of extra-inning and one-run games (Xand1) for each group and sorted from most to least.

And how did that work out? Pretty well, but not the way I thought or hoped (probably naively) it would. The sorted list ended up in exact order by R9 — the lowest R9 group had the most Xand1 games, then the low R9 group, the average, the high, then the highest. Within those groupings, the WPct groups varied quite a bit. In the lowest R9 group, the WPct group followed the pattern I expected going in: average, poor, good, terrible, great. The average WPct group wasn’t in the top for any other R9 group and was actually last for the high R9 and highest R9 groups. That made it clear that the primary factor is the run environment. I hoped it would allow some formula like 3 x R9 + 1 x WPct.

To salvage something from that, I simply took the median percentage of games for each group and used that rate for each team to estimate their Xand1 games. Doing that resulted in a net error for the entire dataset of 30 games fewer than actual, with an average gross error of 5 games in either direction.

Of the 2,542 team seasons in the dataset, this model is exactly right on 163 of them (6.4%) and within plus or minus 5 games on 1,587 (62.4%).

Model 2

Seeing that the run environment influence was stronger than the winning percentage, I built a second model using only that. Using the same groupings (lowest to highest R9) as before, I mapped the Xand1 percentages. This yielded a straightish line:

Thinking that more groups might be better, I divided the teams into R9 deciles. This gave the same basic results, ordered from lowest R9 to highest and giving a mostly straight line again:

Straight lines I could deal with. Applying some fundamental algebra, I arrived at the formula:

Expected Xand1 percentage = 0.523 – (.0228 * R9)

Applying this formula to the dataset gave very similar results with a net error for all teams of 15 games fewer than actual and a gross error per team of plus or minus 5. This model hits the exact number of Xand1 games for 173 teams (6.8%) and is within 5 games for 1,563 (61.5%).

Given how close the results are and the simplicity of a single-variable formula, I prefer this second model.

How To Improve the Models

This is an easy one: learn more modeling techniques. I’m sure you could build a linear regression model or something and arrive at better results with less manual work. I’m retiring from my day job next year and hope to spend time improving my skills. Maybe I’ll return to this data as I build my technical skills.

How do our models do at projecting the 2021 Mets?

The 2021 Mets scored 636 runs and allowed 668 in 162 games. They pitched 1,379.1 innings or 4,138 outs. They made 4,173 outs (1,391 IP for opponents). That yields 8.473 R9. Plugging that into our Model 2 formula: 0.523 – (0.0228 x 8.473) = .3295 = 33% of 162 = 53 Xand1 games.

They fit into the Poor/Low category for Model 1. Looking that up in our table, they’d be expected to have 33.9% of their games tied or within 1 run at the end of regulation, which is 55 games.

The Mets actually played 71 such games in 2021. That’s a pretty big error. The original poster, Kevin, was definitely on to something. The Mets did play an unusual number of one-run and extra-inning games. By Model 2, that’s an error of 18 games, tied for 18th most all-time. By Model 2, the error is 16 games, which is tied for 36th most.

Do teams that play a lot of one-run and extra-inning games perform better in those games?

You hear announcers say this sort of thing all the time — “This team plays a lot of close games, so they’re used to the pressure and perform better than their opponents.” Is that true?

In a word, no.

Let’s look at the 99th percentile of teams that played the most Xand1 games — 25 teams that collectively played 44.1% of such games. Those teams went 826-862 in Xand1 games for a .489 winning percentage, versus 1042-1095, or .488, in other games. They won a total of 5 games more in Xand1 games than you would expect, given their record in other games, or 0.2 games per season.

Let’s then expand that to the 90th percentile (plus ties) — 262 seasons. They had a total of 40% of their games in the Xand1 category. They went 8167-8164 in Xand1 games, a .50009 winning percentage, and 12129-12347 in all other games (.4955%). That almost sounds like something. It equates to 76 games over their performance in other games, or 0.29 games per season.

So it seems like there’s nothing there. Teams that play a lot of close games do not appear to gain some sort of experience that allows them to perform better than expected in those games.

Thanks for reading!

More of my work can be found on my blog.





4 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments
Joe Wilkeymember
2 years ago

Interesting article! I wonder if you did something a little more robust with the run environment variable, your model would be more accurate. You could either do total R/9 (off+def) or separate them out (Runs Scored/9 and Runs Allowed/9), then include the league run environment, either as a separate variable or by indexing each team to the league average (either as a ratio like OPS+ or by using z-scores). I would think a team that is extreme relative to the league in either direction would play significantly fewer one run games. Additionally, lower league run scoring environments would likely yield more one run games, since the floor of zero runs is closer to the league average.

pbarstonmember
2 years ago

Love the simplicity of this, good insight. I’d be curious how team strikeout rates and home run rates affect this stuff.

Congrats on retirement!

David Gerthmember
2 years ago

An alternative to this is to take historical Vegas odds and reverse engineer what the implied probability of a 1 score game is.