Archive for Research

Attacking the First-Pitch Strike

I have a few bugbears when I watch a baseball game. I feel most of these are derived from the fact that I have never been taught how to play the game or faced something being thrown at me faster than 50 mph. A lot of these are behavioral or unwritten rules, but some leak into how the game is played.

The thing that I don’t understand the most is the first pitch called strike. Below are videos of Nolan ArenadoMike Trout, and George Springer all taking first pitch called strikes. Read the rest of this entry »


Juiced Ball Economics

Look, another article about the juiced ball! I know, I know. At this point, the narrative of the juiced ball has been written from many angles. (For anyone reading this who is somehow not familiar with that narrative, Jay Jaffe’s piece from June is replete with graphs, tables, and external links.)

As fans, this narrative is interesting, as it contextualizes the product we see on the field. It feels like there are more home runs — hey, there are more home runs! It’s human nature to wonder why. Baseball writers have tough jobs, and they need something to write about. Many baseball writers have tried to tackle the “why” to satisfy both reader curiosity and their editor’s article quota.

While this narrative is interesting for fans, for teams, it’s of massive importance financially. Teams have payroll budgets, and presumably, some of those teams will have room in those budgets for free agents this offseason. Teams also have players under contract who are eligible for arbitration, on whom the team must place valuations in advance of submitting a figure to arbitrators or deciding to non-tender the player. In short, teams have financial decisions to make, and they rely on all the information at their disposal to make them. Wasted dollars represent opportunity cost more than anything, in a league where advantages are razor thin and random variance plays such a key role in success or failure from year to year. Read the rest of this entry »


How the Astros Can Fix Noah Syndergaard

With Noah Syndergaard trade rumors whirling around the baseball world, it is fair to ponder how a team who would acquire him could improve upon his 2019 results. To be clear, this incarnation of Syndergaard is still an excellent pitcher, but he still leaves us wanting more; Syndergaard has the physical profile and repertoire of a pitcher that teams dream about in their search for their next front-line starter. So how can he improve upon his already great season?

Look no further than his right-handed colleague, Gerrit Cole. When Cole was trade to the Astros, the baseball community expected that the Houston would be able to optimize Cole’s raw stuff to build one of the best pitchers in baseball. We expected the Astros would ditch his mediocre sinker and trade it for more breaking balls, Cole’s bread and butter. Sure enough, that was the case, and Cole has become one of the most dominant pitchers in the league:

Let’s look under the hood at Syndergaard’s 2019 season. Here is a zone plot of all of his pitches this year: Read the rest of this entry »


Justin Verlander and Juiced Baseballs

The road from Opening Day to the All-Star break produced an abundance of entertainment and storylines. Max Scherzer was unhittable, Lance Lynn led AL pitchers in fWAR, and Christian Yelich and Cody Bellinger found themselves in an MVP race. However, no headline drummed up as much controversy as the increase in home runs resulting from juiced baseballs. To provide some context, the home run to fly ball ratio has risen 2.4% since last season. Additionally, Buster Olney provides us with a different but still shocking angle:

Even if you deny the introduction of juiced baseballs, the increase in home runs is undeniable. 

Presumably, if league-wide home run rates inflate, individuals should experience a similar increase. Of course, there is luck and small sample sizes involved, but increases should be relatively constant across the board. However, there has been one extreme outlier who will be the main focus of this work.

Justin Verlander was the ace of a world champion team, has won a Cy Young Award and an MVP Award, and has posted 67.8 career fWAR. Recently, in an interview with ESPN, Verlander expressed his frustration and accused MLB of juicing the baseballs. “I find it really hard to believe that Major League Baseball owns Rawlings and just coincidentally the balls become juiced,” he said. Surprisingly, very few, if any, pitchers have been more affected by the uptick in home runs as much as Verlander, who gave up 26 in the first half. Read the rest of this entry »


Stealing First May Not Be a Crazy Idea

Earlier this year, Major League Baseball reached an agreement with the Atlantic League that allowed MLB to test any new playing rules in Atlantic League games. Five new rules already went into effect in the first half of the season, and a day after the Atlantic League All-Star Game, in addition to testing the electronic strike zone using TrackMan radar, MLB announced four new rules that would go into effect in the second half of the Atlantic League season. One of the four seems to have received stronger criticism on the Internet than others, and that is the idea of stealing first base.

According to this new rule, if a pitch is not caught in flight, the batter has the right to “steal” first base. In essence, this is an extended version of the dropped-third-strike rule that has long been engrained in the baseball rule books. Of course, a dropped third strike has many criteria: it must occur with two strikes, the pitch must be a strike, and unless it occurs with two outs, there cannot be a runner occupying first base. With this new rule, however, batters now have the right to go for first base in any count, regardless the pitch is a ball or strike, and in any base situations. And recently, we witnessed the first steal of first in professional baseball history:

On the second pitch in the bottom of the sixth inning, Alejandro Chacin of the Lancaster Barnstormers threw a wild pitch, which allowed Southern Maryland Blue Crabs outfielder Tony Thomas to steal first base. From the video, we can clearly tell that the players were still adjusting to this new rule, as neither the catcher nor the batter reacted at first. The fact that catcher Anderson De La Rosa took time to react was probably the main reason that allowed Thomas to reach first without a throw. Interestingly, the term “stealing first” might actually be a misnomer, as the play was scored as a fielder’s choice and counted as an 0-for-1, according to Somerset Patriots southpaw Rick Teasley. Read the rest of this entry »


Is the Baseball Actually Juiced?

Home runs are on the rise. We all know this. The number of homers per game is at an all-time high in 2019, and has increased by about 36% just since 2015:

Home Run Rate
Year HR/game
2015 1.01
2016 1.16
2017 1.26
2018 1.15
2019 1.37

What we do not know is exactly why.

Commissioner Manfred recently suggested that the current baseballs have less drag through the air, caused by the more perfect “centering of the pill” (the innermost part of the ball). It has basically become an operational fact that there is something going on with the baseballs. Manfred’s explanation implies that the flight of the baseball is the key difference.

To look at this closer, I considered the distance traveled by balls in the air as a function of the exit velocity and launch angle at contact. If the average distance on similarly struck balls has increased over time, it would suggest that the ball itself is more aerodynamically efficient.

Pitch-by-pitch data for the 2015-2019 seasons was collected from Baseball Savant via the Statcast Search page. Two random forest models were built for each year, one using all fly balls and one using home runs. To account for a possible difference in flight due to the warm air in the summer months, only data through June of each year was used. (At the end of the season, the analysis can be applied to the full data set). In both cases, the distance the ball traveled is the response variable and the exit velocity and launch angle are the explanatory variables. The models are applied to a test data set of various exit velocity/launch angle combinations. Read the rest of this entry »


Ballpark Attendance and Starting Pitchers

When I am thinking about buying a ticket to a baseball game, often my first question is “Who’s pitching?” I have always felt that the most enjoyable type of game is one in which a great starter is on the mound. Is this feeling common among fans or do they buy tickets regardless of the starting pitcher?

To answer this question, I trained random forest models to predict attendance for games based on situational factors (not including the starting pitcher). Then I considered how the quality of starting pitchers relates to whether the models overestimate or underestimate the attendance. If the models consistently underestimate attendance when star pitchers are on the mound, it would suggest more tickets are sold because of the starter.

Data

Information about each game was collected from Retrosheet’s game logs. In accordance with Retrosheet’s terms of use, please note the following statement: “The information used here was obtained free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at retrosheet.org.” Pitcher performance data was gathered from FanGraphs. In addition, the people.csv data set found here was used to match player ids from Retrosheet to FanGraphs. Read the rest of this entry »


The Arm of Marcell Ozuna and the Outfield Arm Runs Saved Statistic

Sunday night baseball is such a great thing. Yes, I may fall asleep around the sixth or seventh inning, but I tend to fall asleep to baseball, which is nice. It’s such a summer feeling for me to have the window open, the summer breeze blowing in, and talk of baseball in the background. On June 9, the Cubs were aggressive early on the base paths. At one point, with Kyle Schwarber on first base and Kris Bryant at the plate, Bryant hit what would typically be a routine single to left-center. ​Now, with the Cardinals and the Cubs fighting it out for the top spot in the division, you saw an aggressive approach by Schwarber. Did mastermind Joe Maddon have that all planned and ready? Did he tell his team to run on Marcell Ozuna? Well, if so, maybe he (or his team of data scientists and analysts) was evaluating the rARM statistic.

Part 1: The Stat – Throwing Arm Runs Saved

A player’s total Throwing Arm Runs Saved is then the sum of our three halves: flyballs Runs Saved + groundballs Runs Saved + Miscellaneous Kills Runs Saved.

– The Fielding Bible Read the rest of this entry »


Does Switching Leagues Affect Pitching Performance?

When Major League pitchers change leagues, strange things begin to happen.

Jordan Zimmermann, an ace by any measure when he played for the Washington Nationals, became a free agent after the 2015 season before signing with the Detroit Tigers — only to quickly learn that he was not so invincible.

Wei-Yin Chen’s four strong seasons with the Baltimore Orioles gave him free agent swagger — and a hefty asking price — that led him to a spot on the Marlins roster just before the 2016 season. And then he suddenly became very human.

As is often the case, the stories that develop around pitchers as they transition from one league to the next are fed by assumptions, a bit of baseball history, certainly a little bias, and what can only be called the gut instincts of the fan.

Statistics, of course, aren’t infected by ESPN punditry and don’t care what color a uniform is. They are emotionless, sober friends of reason.

The known stats of Zimmermann and Chen — before I got involved — tell you part of the story. A deeper analysis, however, upends the conventional wisdom. Read the rest of this entry »


Fun Numbers Through the First Two Months

SNELL THE GLOVE

Since 2002, when the stat began being recorded, the top three swinging strike percentages for pitchers through June 1st of a season (minimum 60 innings) are…

1. Blake Snell (2019) – 19.1%
2. Max Scherzer (2018) – 17.5%
3. Curt Schilling (2002) – 16.8%

To say that Snell has been in fine form the first two months of the season would be a massive understatement, as he’s in historic form according to this metric. The reigning AL Cy Young winner’s surface numbers may not look as incredible as they did last year, but his talent is still off the charts.

RYU READY TO ROCK?

When facing lefties, there is an offense that ranks first in ISO, second in wOBA, third in wRC+, and possesses the eighth-lowest strikeout rate this season. It’s not the Twins, who have caught the serious attention of the baseball world. It’s not the Astros, who have had quite the reputation of being death to lefties for the past few years. It’s also not the other teams (the Dodgers, Mariners, Braves, Cardinals, etc.) you would normally suspect…

Yep, you didn’t guess it!

It’s the Arizona Diamondbacks. Granted, two months of data may still be leaning on the short side of sample sizes when it comes to team trends. Their BABIP indicates at least some bit of good fortune, and they currently sit with the third-lowest walk rate. However, none of this mattered against Hyun-Jin Ryu in his last outing on June 4. The Dodgers southpaw continued his stellar campaign by firing seven scoreless innings against the D-backs, allowing just three hits to move to 9-1 in 12 starts. Ryu now holds a 1.35 ERA, 2.6 fWAR, and has allowed just five walks to 71 strikeouts in 80 innings.

It’s worth noting that the matchup took place at Chase Field, where they’ve been conveniently keeping the roof open for night games, which in turn mitigates the effects of the humidor system they installed before the 2018 season. That said, Ryu is doing something truly remarkable and seems capable of silencing just about any lineup in MLB. Read the rest of this entry »