Inside Cricket’s Newest Predictive Model
The 2018/19 Big Bash League is nearly upon us, and with it a host of betting options short form cricket offers become available to the public. Cricket betting has become increasingly popular to the sports betting audience since the emergence of the exhilarating Twenty20 Cricket format with exciting leagues in Australia, India, South Africa and the Caribbean, just to name a few.
Stats Insider is the leader for speculative sports analytics content in Australia. The machine learning models, powered by thousands of data points, predict the probability — or the mathematical percentage likelihood — of a sporting result or outcome. Always gamble responsibly.
With sports bettors always trying to find an edge over the sportsbooks — and each other — the Stats Insider Big Bash League projection model has been rebuilt following substantial research, improving on last year’s iterations by combining micro-level data with performance factors for each individual batsman and bowler, as well as looking at in-game Match Situation variables. This is used to simulate the outcome of every delivery of every match.
Based on ball-by-ball, batter vs bowler data, this data gets pretty granular on the specific factors that impact a team’s likelihood of winning a match.
Predictive models have long been used by sports nerds, daily fantasy and long form fantasy players. It is also highly applicable to team recruitment and selection decisions, and on Stats Insider, is also displayed in a way to help the readers identify value against bookmakers.
For all of the purposes above, some say it is one of — if not the single — most advanced cricket models available, here’s why:
Stats Insider’s Big Bash Model
INDIVIDUAL PLAYER PERFORMANCE
The model looks at every ball delivered in the Australian Twenty20 domestic competition over the last four seasons. It takes into account each players’ recent form, as well as performances from overseas Twenty20 competitions, to determine the likelihood of the result of each and every delivery, ie; from being a dot ball (no run scored), to hitting a six (6 runs scored), to taking a wicket (batsman out).
For example: by tracking every ball Australian batsman Chris Lynn has faced in the last four Big Bash seasons, the model can now predict his batting performances — down to individual deliveries — in this season’s competition. For new Big Bash players such as England’s Joe Root, the model looks at his overseas and international Twenty20 performances to determine his potential output in this season’s competition.
MATCH SITUATION
While every ball in the Big Bash is important, not all Twenty20 deliveries are created equal. Factors such as which over the batting team is in, how many wickets they are down, and the required run rate for a team batting second, all play a part is what is likely to happen for any given delivery.
For example, as you might expect, the data shows the probability of a batsman hitting a four is at its highest during the powerplay when the field is up and in close. The probability then drops off as the powerplay ends, before a progressive rise in likelihood as the inning progresses. This is all taken into account by the new model as it forecasts the outcome of a game.
IN-GAME PERFORMANCE
One common commentators’ adage is that a quick wicket or two will build up pressure on the batting team — the suggestion being that a new batsman (or two) would be slower to score than the incumbent runners, as well as having a higher chance of losing their wicket than a batsman who has been in for a significant amount of time.
The data agrees to an extent. The probability of a dot ball is lower for a new batsman, suggesting a wicket will cut the scoring rate, but the probability of a wicket barely changes — in fact, in T20, a batsman who’s been in longer is more likely to get out. This is called the “in-game performance factor”:
LOCATION (GROUNDS)
Part of the charm of the Australian summer is the yearly tour of the nation’s best cricket venues, whether it be the old scoreboard and shorter square boundary of the picturesque Adelaide Oval, or the famous cauldron of the mighty MCG. Each ground has its own unique identity. Stats Insider’s model utilises the ground size and boundary distance to project their influence on the outcome of a delivery.
PREDICTIONS
The proprietary algorithm combines all of that micro-level data to calculate the projected likelihood of every possible outcome of each given delivery at any specific point in that match.
For example; our model is now able to calculate the likelihood of Glenn Maxwell hitting Andrew Tye for six in the 20th over of the match, given the precise state of the match being simulated.
Treating every delivery as a single contest between bat and ball, batsman versus bowler (taking into account the match situation as well as various other factors), the model uses a Monte Carlo approach to simulate every game 10,000 times in order to understand the range of possible outcomes, and how likely each of those outcomes is to occur. If a team wins 4,000 of the simulations, that team is projected to be a 40% chance to win the upcoming game.
Based on the market prices, this may or may not be a good value bet. For example, if both teams are marketed equally at $1.88 head-to-head, this implies the bookmaker projections are 50/50 for either side to win. If the model’s projection has one team at 40% and the other at 60% probability of victory, this shows a clear “edge” to one side of the market.
HOW TO USE THIS: What is ’The Edge?’
This model — or the outcomes from it — will be published on Stats Insider leading up to the Big Bash on 19 December. If you’re keen to have a look, you can play around with the BBL Tournament Simulator.
Find out more about what we do at StatsInsider.com.au.