(Originally published on May 16, 2022)
This is part of a series of posts on sabermetrics and the mathematics of baseball. You can find more here.
Yesterday, (May 15, 2022) the Cincinnati Reds no hit the Pittsburgh Pirates, but lost the game 1-0. We know that no hitters are quite rare --- only about 1 in 1000 MLB games from 1998 - 2019 were no hitters --- but losing a no hit game has only happened a handful of times in MLB history.
How does this happen? Yesterday, the Pirates shut out the Reds and managed to score their lone run in the bottom of the 8th, thanks to back-to-back-to-back one out walks to load up the bases, followed by an RBI fielder's choice (that was nearly an inning-ending double play). It isn't too difficult to come up with other potential pathways for a team to score without a hit: there could be a hit-by-pitch and a steal of second base, followed by a throwing error on a routine ground ball. Or, we might see a batter reach on a dropped third strike, advanced to second by a balk, to third by a wild pitch, and scored by a sacrifice fly.
But how often does it actually happen that a team goes an entire inning without recording a hit, yet manages to score a run? Using play-by-play data from the 2021 season, obtained free of charge from and copyrighted by Retrosheet, it seems that this is actually quite rare!
Innings | Run(s) scored | No hits recorded | Run(s) scored and no hits recorded |
|
---|---|---|---|---|
Totals | 42790 | 11190 | 19220 | 83 |
% of innings | 100% | 26.2% | 44.9% | 0.19% |
In just over a quarter of innings, the batting team scored one or more runs. But in almost 45% of innings they failed to record a hit. Certainly hitting and scoring runs should be dependent on one another, but just as a sanity check, if they were independent, we might expect to see an inning in which a run scores without a hit about 12% of the time, or nearly twice per game. Instead, it happened in about 1 in every 500 innings!
There are lots of other finer information we could squeeze from this data without too much additional effort. One thing would be to look at each individual team's breakdown of innings in which they scored a run or were held hitless (or both). Also note that there may be extra inning games included, where a runner started the inning on second base! I would be interested to go back and check how often the designated runner came around to score without a hit, and what we get if these cases were excluded.
From the data above, we might make the assumption that in an average MLB (half) inning, the probability of a run being scored is
We can also answer the question: how likely is a no hit team to score a run? Using our 2021 data, and assuming the team is no hit over a full 9 inning game, we can compute the probability of scoring in at least one of those 9 innings by
How close is this to reality? In 25 of the 316 official MLB no hitters since 1876, the no hit team has scored a run. That's more like 8%, which suggests to me that the no hit team may have been more likely to score in the run environment of previous years than in 2021. Indeed, if we restrict our attention to the 72 official no hitters between 1998 and today, only once (Ervin Santana's in 2011) has the no hit team scored a run, or about 1.4% of the time. Yesterday's game doesn't count as an official no hitter, because the Pirates didn't need to bat in the bottom of the 9th.
What about the exact scenario that happened with the Reds and Pirates? How likely is it that any given game has
This was a fun mini-project to help me get the hang of the (very) basics of working with Retrosheet data. I used only the bevent tool, available for download on their site to process the 2021 season data for each of the 30 teams. It was a bit cumbersome to run the same program 30 separate times, then manipulate each file of results in a spreadsheet 30 separate times, then compile it all together to get the league-wide averages. I'm sure I could find a faster and better way to do this, and I'd need to in order to calculate similar stats for previous seasons.
Thinking about how often teams bat in an inning without a hit seems especially timely, given all the talk lately around pace-of-play and keeping fans engaged with the game. While a no hitter is certainly an exciting feat, hits are also exciting; there's a crack of the bat, frantic searching (or camera panning) for the ball, sprinting for extra bases, and outfield throws attempting to deny them. 45% is a sizeable fraction of innings to end without a single hit! I would expect (though I certainly haven't checked) that in previous seasons where the league batting average was higher, a smaller fraction of innings ended with no hits. Whether this made for a more exciting game or not, I'm not sure, but I'd be curious to confirm how this has changed over the years.