Abstract
Abstract
This paper introduces a new metric for player evaluation in Twenty20 cricket. The proposed metric of “expected run differential” measures the proposed additional runs that a player contributes to his team when compared to a standard player. Of course, the definition of a standard player depends on their role and therefore the metric is useful for comparing players that belong to the same positional cohort. We provide methodology to investigate both career performances and current form. Our metrics do not correlate highly with conventional measures such as batting average, strike rate, bowling average, economy rate and the Reliance ICC ratings. Consequently, our analyses of individual players based on results from international competitions provide some insights that differ from widely held beliefs. We supplement our analysis of player evaluation by investigating those players who may be overpaid or underpaid in the Indian Premier League.
Introduction
Player evaluation is the Holy Grail of analytics in professional team sports. Teams are constantly attempting to improve their lineups through player selection, trades and drafts taking into account relevant financial constraints. A salary cap is one financial constraint that is present in many professional sports leagues. If a team spends excessively on one player, then there is less money remaining for his teammates.
In sports of a “continuous” nature (e.g. basketball, hockey, soccer), player evaluation is a challenging problem due to player interactions and the subtleties of “off-the-ball” movements. Nevertheless, a wealth of simple statistics are available for comparing players in these sports. For example, points scored, rebounds, assists and steals are common statistics that provide insight on aspects of player performance in basketball. More complex statistics are also available, and we refer the reader to Oliver (2004) for basketball, Gramacy, Taddy and Jensen (2013) for hockey and McHale, Scarf and Folker (2012) for soccer.
In sports of a “discrete” nature (e.g. baseball) where there are short bursts of activity and players have well-defined and measurable tasks that do not depend greatly on interactions with other players, there is more hope for accurate and comprehensive player evaluation. There has been much written about baseball analytics where Bill James is recognized as a pioneer in the subject area of
Cricket is another sport which may be characterized as a discrete game and it shares many similarities with baseball. Both sports have
There are various formats of cricket where the governing authority for the sport is the International Cricket Council (ICC). This paper is concerned with player evaluation in the version of cricket known as Twenty20 cricket (or T20 cricket). Twenty20 is a recent form of
In Twenty20 cricket, there are two common statistics that are used for the evaluation of batting performance. However, before defining the statistics it is important to remind the reader that there are two ways in which batting ceases during the first innings. Batting is terminated when the batting team has lost 10
Although various authors have attempted to introduce more sophisticated cricket statistics (e.g. Croucher, 2000; Beaudoin & Swartz, 2003; van Staden, 2009), it is fair to say that these approaches have not gained traction. We also mention the Reliance ICC Player Rankings (www.relianceiccrankings.com) which are a compilation of measurements based on a moving average and whose interpretation is not straightforward. Despite the prevalence and the official nature of the rankings, the precise details of the calculations may be proprietary as they do not appear to be available.
In this paper, we propose a method of player evaluation in Twenty20 cricket from the point of view of
In Twenty20 cricket, a team wins a match when the runs scored while batting exceed the runs conceded while bowling. Therefore, it is run differential that is the key measure of team performance. It follows that an individual player can be evaluated by considering his team’s run differential based on his inclusion and exclusion in the lineup. Clearly, run differential cannot be calculated from match results in a meaningful way since conditions change from match to match. For example, in comparing two matches (one with a specified player present and the other when he is absent), other players may also change as well as the opposition. Our approach to player evaluation is based on simulation methodology where matches are replicated. Through simulation, we can obtain long run properties (i.e. expectations) involving run differential. By concentrating on what is really important (i.e. expected run differential), we believe that our approach addresses the essential problem of interest in player evaluation.
In Section 2, we provide an overview of the simulator developed by Davis, Perera and Swartz (2015) which is the backbone of our analysis and is used in the estimation of expected run differential.
In Section 3, we analyze player performance where players are divided into the following broad categories: pure batsmen, bowlers and all-rounders. Our analyses lead to ratings, and the ratings have a clear interpretation. For example, if one player has an expected run differential that is two runs greater than another player, we know exactly what this means. We observe that some of our results are in conflict with the Reliance ICC ratings. In cases like these, it provides opportunities for teams to implement positive changes that are in opposition to commonly held beliefs. This is the “moneyball” aspect of our paper. We extend our analyses further by looking at salary data in the IPL where we indicate the possibility of players being both overpaid or underpaid. We conclude with a short discussion in Section 4.
Overview of simulation methodology
We now provide an overview of the simulator developed by Davis, Perera and Swartz (2015) which we use for the estimation of expected run differential. There are 8 broadly defined outcomes that can occur when a batsman faces a bowled ball. These batting outcomes are listed below:
In the list (1) of possible batting outcomes, we exclude
According to the enumeration of the batting outcomes in (1), Davis, Perera and Swartz (2015) suggested the statistical model:
The estimation of the multinomial parameters in (2) is a high-dimensional and complex problem. The complexity is partly due to the sparsity of the data; there are many match situations (i.e. combinations of overs and wickets) where batsmen do not have batting outcomes. For example, bowlers typically bat near the end of the batting order and do not face situations when zero wickets have been taken.
To facilitate the estimation of the multinomial parameters
Given the estimation of the parameters in (3) (see Davis, Perera and Swartz 2015), an algorithm for simulating first innings runs against an average bowler is available. One simply generates multinomial batting outcomes in (1) according to the laws of cricket. For example, when either 10 wickets are accumulated or the number of overs reaches 20, the first innings is terminated. Davis, Perera and Swartz (2015) also provide modifications for batsmen facing specific bowlers (instead of average bowlers), they account for the home field advantage and they provide adjustments for second innings simulation. In summary, with such a simulator, we are able to replicate matches, and estimate the expected runs scored when Team A (lineup specified) plays against Team B (lineup specified). Davis, Perera and Swartz (2015) demonstrate that the simulator generates realistic Twenty20matches.
Recall that our objective in player evaluation is the development of a metric that measures player contribution in terms of run differential relative to baseline players. We restrict our attention to first innings performances since the second innings involves a target score whereby players alter their standard strategies. Accordingly, we define
Since success in cricket depends on both scoring runs and preventing runs, we introduce analogous measures for bowling. Accordingly, we define
Summarizing, (4) measures the batting contribution of a batting lineup
There are two remaining details required in the evaluation of (4) and (5). We need to define the typical batting lineup
We note that there is considerable flexibility in the proposed approach. Whereas (6) provides the number of runs that player
Pure batsmen do not bowl. It follows that their overall performance is based entirely on batting and the metric of interest (6) for a pure batsman
When assessing pure batsmen, it is important to compare apples with apples. Therefore, in the calculation of (7), we always insert a pure batsman
Table 1 provides the performance metric (7) for the 50 batsmen in our dataset who have faced at least 250 balls. These are primarily well-established batsmen with a long history in Twenty20 cricket. Wicketkeepers in Table 1 are marked with an asterisk; it may be reasonable to assess them separately from the other pure batsmen since wicketkeepers contribute in a meaningful way that goes beyond batting.
Ahmed Shehzad is the best pure batsman with E (
There are no pure batsmen who are much worse than E (
The E (
Surprisingly, the term “bowler” is not well-defined. The intention is that a player designated as a bowler is one who specializes in bowling and is not “good” at batting. We are going to make the term precise and define a bowler as a player who bowls and whose average batting position is 8, 9, 10 or 11. Since a bowler bats late in the lineup, he does not bat often and his expected differential for runs scored
As any cricket fan knows, the taking of wickets is something that distinguishes bowlers and is highly valued. We wish to emphasize that wicket taking is an important component of our metric (6). A bowler
Table 2 provides the performance metric (6) for the 60 bowlers in our dataset who have bowled at least 250 balls. These are primarily well-established bowlers with a long history in Twenty20 cricket. When comparing Table 2 to Table 1, we observe that the bowlers at the top of the list contribute more to their team than do the top batsmen. This may be relevant to the IPL auctions where teams should perhaps spend more money on top bowlers than on top batsmen. We also note that Chris Mpofu has a very poor expected run differential E (
Interestingly, among the top five bowlers according to the October 2014 ICC rankings, only Sachithra Senanayake and Samuel Badree place highly in terms of E (
Table 2 also suggests that there is little difference between fast and spin bowlers in terms of E (
All-rounders
As with bowlers, the term “all-rounder” is not well-defined although it is intended to convey that a player excels at both batting and bowling. We define an all-rounder as a player who bowls and whose average career batting position is 7 or earlier in the lineup. The calculation of (6) involves simulations where the all-rounder of interest is inserted into position 5 of the batting order. For bowling, four overs are uniformly selected from the 20 overs in the innings and these are the overs that are bowled the all-rounder.
Table 3 provides the performance metric (6) for the 25 all-rounders in our dataset who have faced at least 250 balls and who have bowled at least 250 balls. These are primarily well-established all-rounders with a long history in Twenty20 cricket.
Among the all-rounders, there are some players who have expectionally good batting components of their E (
Additional analyses
In Tables 1, 2 and 3, we calculated the expected run differential metric (6) for pure batsmen, bowlers and all-rounders, respectively. It is interesting to see how the new measures for batting (4) and for bowling (5) compare to standard performance measures.
In Table 4, we provide correlations involving the new measures against the traditional batting average, strike rate, bowling average and economy rate. The correlations are stratified over the three classes of players. We observe that all metrics have similar correlations, neither strong nor weak. If we take E (
Up until now, our analyses have focused on career performances. However, in some situations such as team selection, it is current form which is of greater importance. Davis, Perera and Swartz (2015) provide methodology for determining current form. The approach is implemented by providing more weight to recent match performances. To see that the distinction between career performance and current form is meaningful, Table 5 reports the baseline characteristics for AB de Villiers, Mohammad Hafeez and Umar Gul based on both career performance and current form (up to August 2014). AB de Villiers, a pure batsman, has better recent form than his average career performance where he is now scoring roughly one more run per over than his career average. Much of de Villiers improvement may be attributed to added power as he is now scoring 4’s and 6’s with more regularity. On the other hand, Umar Gul, a bowler, is experiencing a decline in performance in recent matches compared to his career values, allowing 1.66 additional runs per over. We observe that the current form of Mohammad Hafeez is in keeping with his average career performance in both batting and bowling.
More generally, it is interesting to investigate how current form compares with career performances across all players. We look at the correlation between E (
With the availability of batting and bowling characteristics representing current form as in Table 5, we carry out further simulations to obtain the expected run differential metric (6) based on current form. It is interesting to compare our metric (6) with the Reliance ICC ratings which also reflect current form. The Reliance ICC ratings are taken from October 5, 2014.
In Fig. 1, we provide a scatterplot of our metric (6) based on current form against the Reliance ICC rating for the 50 pure batsmen in our dataset who have faced at least 250 balls. There is a moderate correlation (
In Fig. 2, we provide a scatterplot of our metric (6) based on current form against the Reliance ICC rating for the 60 bowlers in our dataset who have bowled at least 250 balls. As in Fig. 1, we obtained a moderate correlation (
In Fig. 3, we provide a scatterplot of our metric (6) based on current form against the Reliance ICC rating for the 25 all-rounders in our dataset who have faced at least 250 balls and have bowled at least 250 balls. In this case, the correlation between our metric and the Reliance ICC all-rounder ratings was
Another investigation with “moneyball” in mind concerns salary. We are interested in how the expected run differential measure (which measures true contribution) compares against perceived worth expressed as salary. To make this investigation, we have collected salary data from the 2012–2014 IPL seasons.
Figures 4, 5, and 6 provide scatterplots of most recent IPL salaries against our metric (6) based on current form for the 21 pure batsmen, 26 bowlers, and 18 all-rounders from our dataset who played in the IPL during the period. In each case, there is no detectible correlation between a player’s performance by the E (
For comparison purposes, Fig. 7 provides scatterplots of the most recent IPL salaries against the Reliance ICC ratings. The three plots correspond to batsmen, bowlers and all-rounders. The correlations here seem a little stronger than in Figs. 4, 5 and 6. If we believe that expected run differential E (
We extend our analyses in two further directions. First, we ask whether it is a good idea to use only first innings data for the estimation of batting characteristics
We therefore repeat our analysis of career performance by including second innings data. Perhaps it is the case that second innings conditions average out in terms of cautious and aggressive situations. In Fig. 8, we provide a scatterplot of the E (
Our final analysis compares our expected run differential metric E (
Traditional performance measures in Twenty20 cricket may not be seen as “fair”. For example, it is easier to score runs for an opening batsman than a batsman who bats in position 7. This paper overcomes these types of difficulties and develops performance measures that focus on expected run scoring differential relative to baseline players. Although there is no gold standard for measuring performance statistics, we take it as axiomatic that expected run differential is the correct metric in Twenty20 cricket. The reason is that the rules of the game are such that a team defeats its opponent if they score more runs. With an emphasis on what is really important in winning matches, the metrics introduce a “moneyball” philosophy to Twenty20 cricket. The metrics are also flexible in the sense that baseline players can be modified and subsets of players can be simultaneously evaluated.
We have observed that the magnitude of E (
Whereas our performance analysis takes both batting and bowling into account, there exists the possibility for future refinements. For example, fielding is an important component of cricket and it would be useful to quantify fielding contributions in terms of expected run differential. Also, how can one measure a wicketkeeper’s contribution beyond his batting performances?
Another avenue for future research involves data collection. Currently, we use only Twenty20 international matches in forming player characteristics. Is there a way of combining information that comes from other competitions such as the IPL and the Big Bash?
Acknowledgments
Tim Swartz has been partially supported by grants from the Natural Sciences and Engineering Research Council of Canada. The authors thank the two Editors Philip Maymin and Eugene Shen, and three anonymous reviewers whose comments have helped improve the manuscript.
