Bayesian approaches for critical velocity modelling of data from intermittent efforts

Abstract

In sports science, critical power and related critical velocity models have been widely investigated, and are being increasingly applied to field-based team sports. A challenge associated with these models is that laboratory experiments that yield accurate measurements of maximal sustainable velocity are expensive. Alternatively, inexpensive field data (from training and matches) are being used to fit such models. However, the intermittent nature of efforts in field-based sports implies that the dependent variable concerning maximum sustainable velocity is reliably calibrated only for short time durations. This paper develops methods where field data based on short time durations is combined with prior knowledge to fit the three-parameter critical velocity model. This is accomplished in a Bayesian framework for which Markov chain methods are required for model fitting and inference.

Keywords

Critical power exercise physiology global positioning system team sport wearables

Introduction

Critical power and critical velocity models are widely researched models in sports science. Critical power models investigate the relationship between the maximum intensity (power) that an athlete can sustain in a given time duration versus the time duration. Critical velocity models investigate the relationship between the maximum speed that an athlete can sustain in a given time duration versus the time duration. Knowledge of critical power and critical velocity relationships is useful for optimizing athletic training programs and pacing strategies. Critical velocity and critical power models have been studied across various sports including cycling, running, rowing and swimming.¹ Critical power models have also been identified as potential tools in the detection of doping in sport² and have been extended to whole-body exercise.³

The two-parameter critical power model was first introduced by Monod and Scherrer⁴ who referred to it as the CP model. The CP model has since been utilized in the context of critical velocity where it is known as the CV-2 model. The original CP model has been generalized in various directions to improve model fit. For example, Jones and Vanhatalo⁵ consider activities where exercise intensity varies. In these activities, CP models have been introduced that allow for the reconstitution of power during lower intensity periods. A limitation of the CV-2 model (or any CV model with an asymptote) is that it yields an infinite maximal sustainable velocity as the time duration $t \to 0$ . Another limitation of the CV-2 model is that it implies that athletes can maintain a maximal sustainable velocity for infinite periods of time. Our focus will mainly concern the CV-3 model proposed by Morton⁶ which provides a more realistic relationship between maximum sustainable speed and time duration.

In this work, we are explicit about the statistical assumptions that arise in the context of critical velocity models. In particular, we explore the types of data that arise and how this impacts the reliability of calibration. Statistical independence and correlation are investigated, as well as the distributional assumptions associated with measurement error. All of these topics impact the estimation of parameters in critical velocity models. Incorrect model assumptions can lead to biased parameter estimates. Statistical insight has proven valuable to problems arising in sports science and sports analytics.⁷

Careful consideration of the features and limitations of critical velocity data leads to modelling opportunities. A primary contribution of this paper is the development of the CV-3 model in a Bayesian setting. In this case, the Bayesian framework is applied by synthesizing prior knowledge from laboratory settings with ‘inexpensive’ data obtained from training sessions. By inexpensive, we mean that the data are conveniently obtained from athletes during training sessions through the inconspicuous use of wearable devices. Because field-based sports tend to feature intermittent efforts (e.g., repeated sprints interspersed with walking and jogging), it is recognized that maximal speeds when extracted from training data are only reliable for short time durations. Consequently, in our proposed method, only short time duration data are utilized, and the information shortfall is supplemented by prior information gathered from gold-standard laboratory-based testing. Ideally, such data would be obtained from the same group of athletes; however, in the absence of such data, data collected from an independent sample of athletes can be used instead.

Therefore, the main focus of the paper concerns situations where no laboratory measurements are available and critical velocity parameters for an athlete cannot be estimated. Yet, by introducing unintrusive training data, reliable critical velocity parameter estimates may be obtained. Our careful consideration of model assumptions (including the correlations between observations) is taken into account. Another contribution of the paper is the development of a framework to study the adequacy of the ‘maximal moving average’ formula⁸ as an estimator of maximal sustainable velocity. In Appendix A, we describe how this problem can be addressed via simulation studies where the design of the training session impacts the adequacy of the estimation procedure. We confirm the intuition that the maximal moving average formula does not work well for prolonged durations.

In a sports science application, Peng et al.⁹ have proposed Bayesian methods and elicited priors in the context of the impulse-response model. Clarke and Skiba¹⁰ emphasize the pedagogical importance of performance modelling in exercise physiology; discussion is provided in the context of critical power models and the impulse-response model.

In the ‘Data in critical velocity models’ section, we discuss how data are typically collected in the context of fitting critical velocity models. We specifically highlight the pitfalls of measuring maximal sustainable velocities for prolonged durations using wearable devices. In the ‘Model development’ section, the CV-2 and CV-3 models are presented from a statistical perspective. The assumptions related to the models are clearly specified, and the impact and relevance of these assumptions are discussed. In ‘A Bayesian version of the CV-3 model’ section, Bayesian formulations of the CV-3 model are presented. The motivation is that prior knowledge based on laboratory testing can be combined with field data to provide adequate estimates of the CV-3 parameters. Computational and inferential aspects are discussed. The methods are illustrated via an example in the ‘Example’ section. The ‘Discussion’ section provides a short discussion and outlines future research opportunities.

Data in critical velocity models

Critical velocity data are collected in various formats. Gold standard data which are considered the most reliable measurements are obtained in laboratory settings. Such data are naturally time consuming and expensive. And of course, the reliability of such data is dependent on athletes providing full effort.

Laboratory data are collected using various schemes including treadmill tests and shuttle run tests. For example, in a shuttle run test, two physical endpoints are constructed $m$ metres apart, and an athlete is requested to run continuously and alternatively, from one endpoint to the other within $s$ seconds. The athlete is asked to continue the exercise as long as possible, and upon exhaustion, completes $x$ traversals (i.e. from one end to the other). The response variable $y_{t} = m / s$ is the maximum speed (measured in metres per second) that the athlete sustains during the time duration $t = x s$ s. For example, suppose that the endpoints are spaced $m = 35$ m apart. Suppose further that the athlete runs between endpoints in $s = 5.0$ s, and accomplishes this $x = 10$ times prior to collapsing. Then the athlete will have run for a duration of $t = x s = 10 (5.0) = 50$ s, maintaining a maximal sustainable velocity of $y_{t} = y_{50} = m / s = 35 / 5.0 = 7.0$ m/s.

Due to fatigue, it is intuitive that $y_{t}$ ought to be modelled as a decreasing function of $t$ . It is also clear that an athlete cannot be asked to repeat the exercise for different values of $t$ without recovery periods. Therefore, the collection of $y_{t}$ for different values of $t$ may take place over multiple sessions. This contributes to the expense of gold standard data collection. Laboratory testing also distracts from training, and may not be welcomed by athletes and coaches.

A less expensive data collection method involves the use of wearable sensing systems during training and matches. Prominent sensing systems are based on GPS (global positioning system) technology for which there are various commercial providers. For a given athlete, a ‘wearable’ sensing device provides positioning data from which velocities are obtained. Typically, GPS sensors capture data at a sample rate of 10–20 Hz. Let

v_{1}, v_{2}, \dots, v_{N}

(1)

denote the instantaneous velocity data during a training session where the observations are dependent. For illustration, and without loss of generality,

v_{i}

in Equation (1) refers to the instantaneous speed of the athlete in metres per second at second

i

of training, and the training session consists of

N

seconds. For example, in a 1.5-h training session,

N = 5400

. Therefore, sensing data may be considered ‘big data’, and are inexpensive in the sense that athletes simply carry out normal activities. Wearable devices can provide much more information in addition to instantaneous speed measurements. For example, location data, acceleration data and physiological measurements may also be recorded through wearables. A review of various wearable devices including an assessment of measurement accuracy is given by Lutz et al.¹¹ Since measurement errors may be present in the recording of instantaneous speeds, pre-processing is recommended in which a maximum velocity threshold is determined for the raw data in Equation (1). The threshold may be set by looking at the data, identifying outliers and assessing what may be reasonable upper bound for a given athlete. Pre-processing techniques for GPS data are discussed in detail by Abbruzzo, Ferrante and de Cantis.¹²

In the context of critical velocity models, an immediate question concerns the relationship between the field measurements $v_{i}$ in Equation (1) to maximal sustainable velocities. A common approach defines a response variable $y_{t}$ referred to as the maximal moving-average⁸ as

y_{t} = max {\sum_{i = 1}^{t} v_{i} / t, \sum_{i = 2}^{t + 1} v_{i} / t, \dots, \sum_{i = N - t + 1}^{N} v_{i} / t}

(2)

where again for illustration,

t = 1, \dots, N

is the time duration measured in seconds. Each component in Equation (2) is an average speed taken over a period of

t

seconds. What is not entirely clear is why

y_{t}

in Equation (2) is thought to be a good approximation of maximal sustainable speed for time duration

t

. For large values of

t

, one could realistically imagine an athlete slowing down and taking a rest for portions of every component interval; in this case

y_{t}

in Equation (2) underestimates maximal sustainable velocity. On the other hand, for small values of

t

, one might expect that there is some period of time duration

t

where an athlete is running at a nearly constant speed, nearly as fast as possible, and the average over the period reasonably captures maximal sustainable velocity. For example, coaches could design sessions with specific drills that are expected to generate maximal sustainable velocities. This provides the motivation for the proposed methods that follow. Specifically, we will assume that training (field) data

y_{t}

in Equation (2) collected through wearable devices provides reliable information on maximal sustainable speeds for short time durations

t

. Again, we emphasize that, unlike laboratory data, field data are easy to collect.

In Appendix A, we use point processes and simulation studies to explore the adequacy of the maximal moving average Equation (2) as an approximation to maximal sustainable velocity. This appears to be the first investigation of the adequacy of this widely used approximation. As anticipated, the formula is not a good approximation for prolonged durations. An additional message from Appendix A is that the adequacy of the maximal moving average formula deteriorates with activities (matches or training) having frequent rest intervals.

Model development

Since the CV-2 model⁴ provides the foundation for all critical power and critical velocity models, we begin by examining modelling issues with respect to this simple model. We then transition to the CV-3 model⁶ which provides a more realistic description of the relationship between maximum sustainable velocity and time duration. The CV-3 model is examined in ‘A Bayesian version of the CV-3 model’ section, where short-duration field data are supplemented with prior knowledge under the Bayesian framework.

The CV-2 model

The simplest of the critical velocity models is a decreasing rational function referred to as the CV-2 model. Using the notation presented in the ‘Data in critical velocity models’ section, we write the CV-2 model for a particular athlete as

y_{t} = θ_{1} + \frac{θ_{2}}{t} + ϵ_{t} .

(3)

Using conventional statistical notation, the two unknown parameters in Equation (3) are the quantities of interest and are expressed using the Greek symbols

θ_{1} > 0

and

θ_{2} > 0

, respectively. The variable

y_{t}

is the observed response, the time duration

t

is the observed covariate, and the unobserved

ϵ_{t}

term denotes random error. For probabilistic assessment of the model, distributional assumptions concerning

ϵ_{t}

need to be introduced. In the above formulation,

t

is fixed (i.e. not random) and

y_{t}

inherits randomness from the error term. Burnley and Jones¹³ provide a detailed discussion of the physiological aspects of the model (3). Using laboratory data, Patoz et al.¹⁴ consider inferential aspects associated with the CV-2 model. In particular, they investigate the bias associated with various weighted least squares fitting procedures.

In the sports science literature, $θ_{1}$ is typically referred to as critical velocity and is written as $C V$ . In the critical power setting, $θ_{1}$ is referred to as critical power and is written as $C P$ . Mathematically, $θ_{1}$ is the value of the limiting asymptote corresponding to large time durations $t$ . Immediately, we see a theoretical failing in the CV-2 model (3), since it is physiologically impossible for athletes to sustain a threshold of performance for unbounded periods of time. For moderate time durations, the critical velocity model may be both practical and adequate.

In the sports science literature, the parameter $θ_{2}$ is typically written as $D^{'}$ in the critical velocity setting and $W^{'}$ in the critical power setting. In the critical velocity setting, the expected value of $ϵ_{t}$ is zero, i.e. $E (ϵ_{t}) = 0$ , and we can therefore express the parameter $θ_{2}$ as a product of time and speed via $θ_{2} = t (E (y_{t}) - θ_{1})$ . It follows that $θ_{2}$ has an interpretation as the expected distance that can be travelled in excess of the distance travelled at critical velocity for any time duration $t$ . We also note that in the sports science literature, the recognition and the assumptions concerning $ϵ_{t}$ are generally not made explicit. Typically, the $ϵ_{t}$ are assumed to be independent, normally distributed with mean zero and common variance. However, as previously discussed, with data arising from training sessions, there is dependence on the response variable (2). Such assumptions are important as they impact statistical inference involving the parameters $θ_{1}$ and $θ_{2}$ .

An immediate limitation of the fitted CV-2 model ${\hat{y}}_{t} = {\hat{θ}}_{t} + {\hat{θ}}_{2} / t$ is that the estimated instantaneous maximal speed of the athlete $lim_{t \to 0} {\hat{y}}_{t}$ is infinite. Importantly, this limitation is not just a mathematical curiosity corresponding to $t = 0$ . If we take a typical estimate $({\hat{θ}}_{1}, {\hat{θ}}_{2}) =$ (4 m/s, 200 m) obtained from Dicks et al.,¹⁵ we observe that even for time durations as large as $t = 20$ seconds, the estimated sustainable velocity for the athlete would be 14.0 m/s corresponding to an impossibly fast 50.4 km/h.

The CV-3 model

Again, we prefer to use conventional notation for statistical models where the response variable is on the left hand side of the equation and model parameters are denoted using Greek symbols. Doing so and re-parametrizing, Morton’s three-parameter CV-3 model⁶ can be expressed as

y_{t} = θ_{1} + θ_{2} / (t + θ_{3}) + ϵ_{t} .

(4)

The parametrization (4) provides a simplification of what one typically sees in the sports science literature and aids in the interpretation of parameters. For example, the

θ_{3}

parameter provides a leftward shift of the CV-2 model (3) by

θ_{3}

units in the variable

t

. Moreover, as

θ_{3} \to 0

, the CV-3 model reduces to the CV-2 model. In this parametrization,

θ_{1} > 0

maintains the same interpretation as in the CV-2 model and

θ_{2}

determines the curvature. Although the CV-3 model (4) is not linear in the parameters

θ_{1}

θ_{2}

and

θ_{3}

and consequently do not benefit from linear model theory, the parametrization has various appealing properties. First, as logic involving maximal sustainable speeds would suggest, it can be shown that

{\hat{y}}_{t} = {\hat{θ}}_{1} + {\hat{θ}}_{2} / (t + {\hat{θ}}_{3})

is a decreasing function of

t

. Second, the instantaneous maximal speed

{\hat{y}}_{0} = {\hat{θ}}_{1} + {\hat{θ}}_{2} / {\hat{θ}}_{3}

is necessarily finite, and unlike the CV-2 model, the CV-3 model provides sensible maximal sustainable speeds for short time durations

t

. In the parametrization specified in equation (4), we impose the restrictions

θ_{1} > 0

θ_{2} > 0

and

θ_{3} > 0

In Figure 1, we demonstrate the role of the parameters $θ_{1}$ , $θ_{2}$ and $θ_{3}$ in Equation (4) by plotting various CV-3 models. We see that $θ_{1}$ alters the vertical displacement, whereas $θ_{2}$ and $θ_{3}$ impact the curvature.

Figure 1.

Plots of the expected CV-3 model given by $E (y_{t}) = θ_{1} + θ_{2} / (t + θ_{3})$ . The bold line corresponds to realistic settings of the parameters, $θ_{1} = 6.0$ , $θ_{2} = 400.0$ and $θ_{3} = 200.0$ . The thin lines correspond to a change in a single parameter as indicated by the main captions.

A Bayesian version of the CV-3 model

In Bayesian frameworks, classical statistical models are augmented with prior knowledge of parameters through the use of prior distributions. The synthesis of the two components results in a posterior distribution which gives the complete probabilistic description of the parameters.

Initial model

Let $[A ∣ B]$ generically denote the density function corresponding to $A$ given $B$ . Then based on Equation (4), and the introduction of the parameter $σ$ which characterizes variability in the data, the Bayesian paradigm expresses the posterior density of $(θ_{1}, θ_{2}, θ_{3}, σ)$ as

\begin{aligned} [ & θ_{1}, θ_{2}, θ_{3}, σ | y_{1}, \dots, y_{n}] \propto [y_{1}, \dots, y_{n} ∣ θ_{1}, θ_{2}, θ_{3}, σ] \\ [θ_{1}, θ_{2}, θ_{3}, σ] \end{aligned}

(5)

where

y_{1}, \dots, y_{n}

are field data as specified in Equation (2). Note that

n

is the number of maximal velocity observations that have been included in the model. The variable

n

is chosen sufficiently small such that

y_{t}

accurately measures maximal sustainable velocity for time duration

t

t = 1, \dots, n

. The term

[y_{1}, \dots, y_{n} ∣ θ_{1}, θ_{2}, θ_{3}, σ]

is referred to as the likelihood, where the precise form of the likelihood is determined by the distributional assumptions assigned to the

ϵ_{t}

terms in Equation (4). The term

[θ_{1}, θ_{2}, θ_{3}, σ]

specifies the prior distribution concerning the parameters

θ_{1}

θ_{2}

θ_{3}

and

σ

We begin with some strong but standard assumptions involving the initial model. First, we assume that the error terms $ϵ_{1}, \dots, ϵ_{n}$ are independent with $ϵ_{t} \sim Normal [0, σ^{2}]$ . The consequence of the distributional assumption is that the field data $\underline{y} = (y_{1}, \dots, y_{n})^{T}$ are conditionally independent and that $y_{t} \sim Normal [θ_{1} + θ_{2} / (t + θ_{3}), σ^{2}]$ . We further assume that $\underline{θ} = (θ_{1}, θ_{2}, θ_{3})^{T}$ is statistically independent of $σ$ , where we assign Jeffreys reference prior $[σ] \propto 1 / σ$ . We assign the multivariate normal distribution $\underline{θ} \sim {Normal}_{3} [\underline{μ}, Σ]$ where $\underline{μ} = (μ_{1}$ , $μ_{2}, μ_{3})^{T}$ and $Σ$ are specified hyperparameters. Instead, it would also be possible to consider a truncated multivariate normal distribution where the constraints $θ_{1} > 0$ , $θ_{2} > 0$ and $θ_{3} > 0$ are imposed. Under these assumptions, the posterior density (5) reduces to

\begin{aligned} [θ_{1}, θ_{2}, θ_{3}, σ | \underline{y}] & \propto \exp {\sum_{t = 1}^{n} - \frac{1}{2 σ^{2}} (y_{t} - θ_{1} - θ_{2} / (t + θ_{3}))^{2}} \\ \exp {- \frac{1}{2} (\underline{θ} - \underline{μ})^{T} Σ^{- 1} (\underline{θ} - \underline{μ})} σ^{- (n + 1)} . \end{aligned}

(6)

There is a remaining detail that needs to be overcome in the specification of the posterior density (6). That is, we require the specification of the mean vector

\underline{μ}

and the variance-covariance matrix

Σ

, which convey prior opinion concerning the parameter vector

θ

. Imagine that we have estimates

({\hat{θ}}_{11}, {\hat{θ}}_{12}, {\hat{θ}}_{13})

({\hat{θ}}_{21}, {\hat{θ}}_{22}, {\hat{θ}}_{23})

, …,

({\hat{θ}}_{m 1}, {\hat{θ}}_{m 2}, {\hat{θ}}_{m 3})

for

m

athletes whose estimates were obtained from gold standard laboratory testing. Recall again that such estimates are expensive but are thought to be reliable. It would be sensible if the athlete in question resembled this population of athletes. For example, if the athlete was a high-level rower, you would prefer laboratory data involving rowers of the same sex, comparable competitive levels and similar ages. In this case, all that one needs to do is average the estimates to obtain

\underline{μ}

, i.e.

μ_{j} = (1 / m) \sum_{i = 1}^{m} {\hat{θ}}_{i j}

, for

j = 1, 2, 3

. Similarly,

Σ = (σ_{i j})

is specified by calculating sample variances and sample covariances;

σ_{i j} = (1 / m) \sum_{k = 1}^{m} ({\hat{θ}}_{k i} - μ_{i}) ({\hat{θ}}_{k j} - μ_{j})

. Of course, other approaches for the specification of the hyperparameters

\underline{μ}

and

Σ

are possible.

A more complex model

We now weaken some of the assumptions introduced in the initial model.

We first recognize that the observations $y_{1}, \dots, y_{n}$ obtained through training sessions are not likely to be conditionally independent. Referring to Equation (2), it is likely that the interval $v_{k}, v_{k + 1}, \dots, v_{k + t - 1}$ which yields $y_{t}$ overlaps with the interval which yields $y_{t - 1}$ . Consequently, the field data $y_{t}$ and $y_{t + Δ}$ are more likely to be similar when the time difference $Δ$ is small.

This insight may be modelled by introducing greater structure on the likelihood term in Equation (5). Specifically, we model $[y_{1}, \dots, y_{n} ∣ θ_{1}, θ_{2}, θ_{3}]$ as a multivariate ${Normal}_{n} [\underline{λ}, W]$ density where $λ_{t} = θ_{1} + θ_{2} / (t + θ_{3})$ and $W$ is defined as a first order autoregressive covariance matrix whose $(i, j)$ th element is $σ^{2} ρ^{∣ i - j ∣}$ and the correlation between the $i$ th term $y_{i}$ and the $j$ th term $y_{j}$ is $ρ^{∣ i - j ∣}$ . We refer to $ρ \in (0, 1)$ as the correlation parameter. The consequence is that field data $y_{t}$ and $y_{t + Δ}$ are more positively correlated when the time difference $Δ$ is small. The proposed approach has the benefit of ‘smoothing’ the data $y_{1}, \dots, y_{n}$ . This idea has been used in the analysis of substitution times in soccer.¹⁶ The additional parameter $ρ$ is assumed independent of the remaining parameters and we assign $ρ \sim Uniform (0, 1)$ .

The net effect of the additional modelling is that the posterior now takes the form

\begin{aligned} [θ_{1}, θ_{2}, θ_{3}, σ, ρ | \underline{y}] \propto [\underline{y} ∣ θ_{1}, θ_{2}, θ_{3}, σ, ρ] [θ_{1}, θ_{2}, θ_{3}, σ, ρ] \\ \propto [\underline{y} ∣ θ_{1}, θ_{2}, θ_{3}, σ, ρ] [θ_{1}, θ_{2}, θ_{3}] [σ] [ρ] \\ \propto \exp {- \frac{1}{2} (\underline{y} - \underline{λ})^{T} W^{- 1} (\underline{y} - \underline{λ}) - \frac{1}{2} (\underline{θ} - \underline{μ})^{T} Σ^{- 1} (\underline{θ} - \underline{μ})} \\ σ^{- (n + 1)} \end{aligned}

(7)

where we note that

\underline{λ}

is a function of

(θ_{1}, θ_{2}, θ_{3})

and the matrix

W

is a function of

(σ, ρ)

Computation and Inference

Based on the synthesis of data and prior knowledge, the posterior distributions (6) and (7) fully describe the uncertainty in the parameters. However, as written, these densities are complex, are not recognizable as familiar parametric distributions and are not readily interpretable. Therefore, it would be instructive to obtain posterior summaries of the parameters of interest $θ_{1}$ , $θ_{2}$ and $θ_{3}$ . Examples of posterior summaries are posterior means and posterior standard deviations which themselves are not analytically tractable quantities. The remedy is to generate the parameters from the posterior distributions using Markov chain Monte Carlo (MCMC) methods.

Here, MCMC is carried out using JAGS software.¹⁷ A feature of JAGS is that the experimenter does not have to derive the full conditional distributions required to implement Gibbs sampling (a basic version of MCMC). Rather, the experimenter needs only to provide the data, and specify both the sampling and prior distributions. With this information, MCMC variate generation is carried out by JAGS in the background. Upon practical convergence of the MCMC algorithm, the experimenter is provided with posterior samples from the posterior distribution. The samples are then used for inferential purposes. For example, a user may average the samples of a parameter to approximate its posterior mean. It is also possible to create density plots based on the samples to approximate posterior density functions.

Example

It is good to provide an overview of our problem. Imagine an athlete for whom we want estimates of critical velocity parameters, but there is no laboratory data. Perhaps the best that can be done is to simply reason that their parameters are similar to the average values of comparable athletes. Our belief is that we can do better than that. We can take data from unintrusive training sessions and use the reliable short duration measurements to modify the average values. The method for doing this is not arbitrary – it is the well-established Bayesian approach.

The test case that we consider is based on field data arising as in Equation (1) and converted to $y_{t}$ measurements as specified in Equation (2). In this case, the field data corresponds to a single 90 min soccer match by an anonymous forward in the Chinese Super League (CSL) during the 2019 season. The data were acquired via cameras and optical recognition software by the provider TRACAB.

The measurements $y_{t}$ are recorded in Table 1 and correspond to time durations $t = 1, 2, \dots, 15$ s. In soccer matches, players do not typically run full out for 15 consecutive seconds. The sport involves frequent starts, changes of direction, and periods of rest. Therefore, we expect the measurements in Table 1 to be less reliable for larger values of $t$ . Note that $y_{1} = 9.41$ m/s nearly represents maximal speed and corresponds to 33.9 km/h. For reference, the maximum speed attained by Usain Bolt over a 20-m section at the IAAF World Challenge in Zagreb 2011 was 43.7 km/h.¹⁸ We utilize short time durations $y_{1}, \dots, y_{5}$ as our measurements in fitting CV-3 models. Note that the $y_{t}$ are decreasing, although according to the construction in Equation (2), this does not mathematically need to be the case. Therefore, with limited data (i.e. $n = 5$ data points), we are attempting to fit useful 3-parameter models. What makes this possible is the Bayesian construction where we supplement field data with informative prior knowledge. And, we repeat, a main feature of the approach is that the field data are obtained inexpensively.

Table 1.

Values $y_{t}$ from Equation (2) measured in m/s for a forward during a CSL match.

$t$	$y_{t}$	$t$	$y_{t}$	$t$	$y_{t}$
1	9.41	6	7.17	11	5.62
2	8.92	7	6.69	12	5.42
3	8.50	8	6.37	13	5.38
4	8.08	9	6.12	14	5.34
5	7.67	10	5.88	15	5.28

The next step involved in the implementation of the Bayesian analyses requires the specification of the hyperparameters $\underline{μ}$ and $Σ$ which are common to both models (6) and (7). Referring to Morton⁶, we can reparametrize the fitted parameters corresponding to the six long distance runners in Table 8 (middle three columns). Then following the discussion at the end the ‘Initial model’ section, we obtain hyperparameters

(μ_{1}, μ_{2}, μ_{3}) = (5.82, 396.00, 198.21)^{T}

(8)

and

\begin{aligned} Σ = (\begin{matrix} 0.11 & - 22.86 & - 8.85 \\ - 22.86 & 18891.60 & 13208.13 \\ - 8.85 & 13208.13 & 12833.37 \end{matrix}) . \end{aligned}

(9)

From Equation (8), the prior estimate of critical velocity

μ_{1} = 5.82

m/s corresponds to 21.0 km/h. This value likely overestimates the player’s critical velocity and highlights the importance of having good prior information. Using Equation (8), we also obtain the prior estimate

μ_{1} + μ_{2} / μ_{3} = 7.81

m/s of instantaneous velocity which corresponds to 28.1 km/h. Using the relationship between covariances and correlation, we calculate from Equation (9) that there are meaningful prior correlations between the parameters. Specifically, we obtain

Corr (θ_{1}, θ_{2}) = - 0.50

Corr (θ_{1}, θ_{3}) = - 0.24

and

Corr (θ_{2}, θ_{3}) = 0.85

. Recognizing that reliable prior information is important and that the long distance runners studied by Morton⁶ may differ from soccer players, we introduce a conservative procedure and inflate

Σ

in Equation (9) using

k Σ

where

k > 0

is a tuning parameter. An inflated value of

Σ

introduces greater variability in the prior distribution and consequently puts more inferential emphasis on the data. For illustration, we set

k = 1.5

With observations $y_{1}, \dots, y_{5}$ from Table 1 and prior knowledge as expressed in Equations (8) and (9), we fit three models; the classical model $y_{t} \sim Normal [θ_{1} + θ_{2} / (t + θ_{3}), σ^{2}]$ using maximum likelihood estimation, the initial Bayesian model (6) and the enhanced Bayesian model (7). For the latter two models, computation was carried out in JAGS based on 20,000 iterations where practical convergence of the chain was quickly achieved. From the MCMC output, 5000 iterations were discarded as burn-in, and 15,000 iterations were used for inference. The MCMC iterations required less than 1.0 s of computation on a laptop computer. Note that a feature of simulation output is that it facilitates estimation of functionals. For example, the instantaneous velocity $MaxV = θ_{1} + θ_{2} / θ_{3}$ is a functional of interest as it is a function of the parameters $θ_{1}$ , $θ_{2}$ and $θ_{3}$ . The fact that $MaxV$ is relatively complex and does not have an apparent closed-form estimator is unimportant. In an MCMC setting, we can readily estimate the instantaneous velocity $MaxV$ via the generated output of the primary parameters $θ_{1}$ , $θ_{2}$ and $θ_{3}$ .

The estimated parameters are provided in Table 2. From Table 2, we immediately observe that maximum likelihood estimation (MLE) based on only five observations provides unrealistic CV-3 estimates. For example, the estimate of $θ_{1}$ is negative. Therefore, it is apparent that field data on its own with only five data points cannot be used to estimate CV-3 parameters. It seems that auxiliary information is necessary for this application. Accordingly, we supplement the field data with prior knowledge in a Bayesian framework leading to the estimates corresponding to Models (6) and (7). An important observation is the field data impact estimation. Although the parameter estimates in Models (6) and (7) are in keeping with prior knowledge, there are differences between the posterior estimates and the prior means given in Equation (8). We also observe that Models (6) and (7) provide comparable and believable values. For example, instantaneous maximal velocities of 8.39 m/s and 8.27 m/s correspond to 30.2 and 29.8 km/h, respectively. Relative to the magnitude of the data in Table (1), the estimates of $σ$ in Models (6) and (7) appear large. This may be explained by the discrepancy between prior knowledge and the data, a difference that is reduced with more reliable prior information as specified in Equations (8) and (9). When comparing the estimates involving Models (6) and (7), it is difficult to ascertain the truth. However, the correlation $ρ = 0.66$ in Model (7) provides strong evidence that the observations $y_{1}, \dots, y_{5}$ are dependent. This suggests that the enhanced Model (7) is the preferred model. The JAGS code corresponding to Model (7) in Table 2 is provided in Appendix B.

Table 2.

Estimates of CV-3 model parameters (and standard errors) based on maximum likelihood estimation (MLE), the initial Bayesian model (6) and the enhanced Bayesian model (7) when $n = 5$ observations $y_{t}$ are utilized. The parameter $MaxV = θ_{1} + θ_{2} / θ_{3}$ denotes maximum instantaneous velocity.

Parameter	MLE	Model (6)	Model (7)
$θ_{1}$	−3.26	5.92 (000.41)	5.90 (000.38)
$θ_{2}$	320.38	365.28 (126.82)	376.79 (137.45)
$θ_{3}$	24.29	154.84 (068.14)	175.96 (089.32)
MaxV	9.93	8.39 (000.41)	8.27 (0.68)
$σ$	0.01	0.84 (000.42)	0.99 (0.64)
$ρ$			0.66 (0.25)

An important talking point throughout this investigation is that field data do not provide reliable measurements $y_{t}$ for larger time durations $t$ . To check the sensitivity of the estimates in Table 2, we repeat the analyses based on a larger sample size of $n = 15$ observations $y_{1}, \dots, y_{15}$ . Table 3 provides the estimation results with the larger sample size. We now observe that CV-3 estimates are unreasonable in each of MLE, Models (6) and (7) contexts. For example, the critical velocity $θ_{1}$ is too low under MLE, $θ_{2}$ and $θ_{3}$ are inconsistently large in Model (6), and MaxV is impossibly large for Model (7).

Table 3.

Estimates of CV-3 model parameters (and standard errors) based on MLE, the intial Bayesian model (6) and the enhanced Bayesian model (7) when $n = 15$ observations $y_{t}$ are utilized. The parameter $MaxV = θ_{1} + θ_{2} / θ_{3}$ denotes maximum instantaneous velocity.

Parameter	MLE	Model (6)	Model (7)
$θ_{1}$	1.04	5.55 (000.36)	5.98 (00.36)
$θ_{2}$	106.32	500.70 (169.01)	158.25 (62.53)
$θ_{3}$	11.49	316.57 (112.38)	15.94 (04.20)
MaxV	10.30	7.14 (000.30)	15.61 (01.42)
$σ$	0.19	1.48 (000.30)	4.39 (03.43)
$ρ$			1.00 (00.00)

A drawback of the illustration in this section is that long distance runners (upon which the prior is based) most likely have different critical velocity characteristics than the soccer player of interest. In practice, it is important that the prior describes a realistic probabilistic description of the athlete of interest. We have tried to mitigate the shortcomings of our dataset by using a more diffuse prior (i.e. using $k Σ$ instead of $Σ$ with $k = 1.5$ ). In practice, how might a user select $k$ ? As a tuning parameter, one might consider various choices of $k > 0$ and compare various posterior estimates as reported in Table 2. If the parameter estimates are non-sensical and too precise (i.e. small standard errors), this suggests that the prior is dominating the data and that $k$ needs to be larger. However, if $k$ becomes too large, then the inferences are essentially based only on the data $y_{1}, \dots, y_{n}$ , and with small $n$ , it is difficult to reliably fit a three-parameter model. If one was really confident in the prior, then choosing $k < 1$ may be appropriate; this would have the effect of moving posterior estimates closer to prior estimates.

In our Bayesian formulation, the specification of reliable prior information provides a key role, and we have outlined an approach using parameter estimates from other athletes. Of course, prior information may arise from other sources including expert knowledge, and this knowledge may be incorporated into the prior distributions.

Discussion

The main contribution of this paper is the estimation of CV-3 parameters with inexpensive field data. The challenge is that field data are unreliable for larger time durations $t$ . Therefore, only minimal field data are used (based on smaller time durations) where the data are supplemented with prior information.

Moving forward, how might one improve estimation? An obvious strategy involves the improvement of prior knowledge, and this may be accomplished by setting hyperparameters based on athletes who are more similar to the athlete in question. Another avenue is through improved models. We have provided careful investigation of model assumptions including the conditional independence of the data $y_{t}$ . One may consider alternative models that have been proposed in the critical power literature. Another possibility involves the design of training sessions where athletes are required to move at maximal speeds for longer time durations. This would permit the inclusion of data $y_{t}$ for larger values of $t$ .

In the context of field data, we have also investigated the use of the ubiquitous maximal moving average Equation (2) as an approximation to maximal sustainable velocity (see Appendix A). We have observed that the formula may not be ideal for larger time durations. There is also the suggestion that modified training schedules may help improve the approximation.

Footnotes

Acknowledgements

The authors thank four anonymous reviewers whose comments helped improve the paper. The authors thank Daniel Stenz who provided tracking data used in the construction of .

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.

Funding

Clarke and Swartz have been partially supported by the Natural Sciences and Engineering Research Council of Canada. The work is also partially funded through a Canadian Statistical Sciences Institute Collaborative Research Team (CANSSI CRT).

Ethical approval

Approval for the use of data for research was obtained from the Simon Fraser University Research Ethics Board.

ORCID iDs

Dave C Clarke

Tim B Swartz

Appendix A

In the context of field data, we investigate the adequacy of the widely used maximal moving average Equation (2) given by

y_{t} = max {\sum_{i = 1}^{t} v_{i} / t, \sum_{i = 2}^{t + 1} v_{i} / t, \dots, \sum_{i = N - t + 1}^{N} v_{i} / t}

as an approximation to maximal sustainable velocity for time duration

t

. In Equation (2),

v_{i}

refers to the instantaneous speed of the athlete measured in metres per second at second

i

of training. In a training session of duration

N

seconds, we therefore have measurements

v_{1}, v_{2}, \dots, v_{N}

We begin the investigation under the assumption that the CV-3 model (4) is an accurate description of maximal sustainable velocity. Therefore, we define the expected maximal sustainable velocity for time duration $t$ as (10)

E (y_{t}) = θ_{1} + θ_{2} / (t + θ_{3}) .

Our approach will compare realizations of Equation (2) via simulation experiments with the benchmark expected maximal sustainable velocities (10). If there is agreement, it is suggestive that the maximal moving average Equation (2) is a good proxy for maximal sustainable velocity.

Our realizations from Equation (2) arise from a behavioural model based on a point process intended to resemble a training session. In this behavioural model, we assume that athletes exert maximal effort (i.e. velocity) for tasks – training sessions may be designed accordingly. Further, we assume that athletes approach each task so that nearly constant effort is applied throughout the task. In this framework, we have $m$ start-stop intervals for the tasks where $m$ is odd, $s_{0} = 1$ , $s_{m + 1} = N$ and the times $s_{1} < s_{2} < \dots < s_{m}$ are generated according to a point process. This leads to active task periods

(s_{0}, s_{1}), (s_{2}, s_{3}), (s_{4}, s_{5}), \dots, (s_{m - 1}, s_{m})

and rest periods

(s_{1}, s_{2}), (s_{3}, s_{4}), (s_{5}, s_{6}), \dots, (s_{m}, s_{m + 1}) .

Suppose now that time

i \in (s_{j}, s_{j + 1})

. According to the CV-3 sampling model (4) with independence of

y_{1}, \dots, y_{n}

, and following the behavioural assumptions, we randomly generate the instantaneous speed

v_{i}

according to (11)

\begin{aligned} v_{i} \sim {\begin{matrix} 0 & j odd (rest period) \\ Normal (θ_{1} + θ_{2} / (t_{i} + θ_{3}), σ^{2}) & j even \end{matrix} \end{aligned}

where

t_{i} = s_{j + 1} - s_{j}

The generated $v_{i}$ ’s in (11) then permit the calculation of the maximal moving average $y_{t}$ given by Equation (2). However, we are interested in the general properties of the maximal moving average formula. Therefore, we repeat the simulation procedure $M$ times. Specifically,

(1) generate $s_{1}, s_{2}, \dots s_{m}$ $\to$ generate $v_{1}, \dots, v_{N}$ $\to$ calculate $y_{t}^{(1)}$

(2) generate $s_{1}, s_{2}, \dots s_{m}$ $\to$ generate $v_{1}, \dots, v_{N}$ $\to$ calculate $y_{t}^{(2)}$

$⋮$

(M) generate $s_{1}, s_{2}, \dots s_{m}$ $\to$ generate $v_{1}, \dots, v_{N}$ $\to$ calculate $y_{t}^{(M)}$

The iterative procedure allows us to calculate the average (12)

{\bar{y}}_{t} = \frac{1}{M} \sum_{k = 1}^{M} y_{t}^{(k)}

for

t = 1, 2, \dots, N

The simulation procedure attempts to set realistic values. For the CV-3 model, we set $θ_{1} = 6.0$ , $θ_{2} = 400.0$ and $θ_{3} = 200.0$ (see equation (8)). We also consider two settings of the CV-3 parameter $σ = 0.1$ and $σ = 0.5$ . Note that $σ$ takes into account variability in measurements due to the athlete and also variability in the measurement device. We let $N = 5400$ which corresponds to a 1.5 h training session and we let $m = 181$ and $m = 21$ which leads to training sessions with 90 and 10 tasks, respectively.

The point process is the component of the simulation for which different settings may be chosen. We illustrate with a very simple point process. In particular, we generate $m$ independent points according to the discrete uniform distribution on $(1, N)$ . We then reorder the points such that $s_{1} < s_{2} < \dots < s_{m}$ . Although one particular set of points $(s_{1}, s_{2}, \dots, s_{m})$ may lead to unrealistic start-stop times, the average pattern over $M = 100$ iterations is intended to be reasonable. Of course, one may consider generating alternative point processes according to particular training sessions. For example, one may introduce constraints in the point-process such that there is a minimum rest period between tasks. Part of our intention in this exploratory analysis is the development of a framework for which the maximal moving average formula may be investigated.

Again, we are interested in the comparison of the simulated ${\bar{y}}_{t}$ values in Equation (12) according to the behavioural model and the expected sustainable velocity values $E (y_{t})$ given by Equation (10). Figure 2 provides plots of $E (y_{t}) - {\bar{y}}_{t}$ versus $t$ under the four settings $(tasks, σ) = (90, 0.5)$ , $(90, 0.1)$ , $(10, 0.5)$ and $(10, 0.1)$ . The simulations required approximately 13 h of computation on a laptop computer. When the differences are tightly scattered about zero, this is evidence that the maximal moving average Equation (2) provides accurate estimation of maximal sustainable velocity. When $σ$ is large (i.e. $σ = 0.5$ ), we observe that ${\bar{y}}_{t}$ tends to overestimate the intended estimand $E (y_{t})$ by roughly 1.0 m/s when $t$ is small. This may occur when we have a measuring device that is inaccurate. The best approximations occur when there are few tasks (i.e. 10) and $σ$ is small (i.e. $σ = 0.1$ ). In this case (the bottom right hand plot), the maximal moving average seems to be good for reasonably wide intervals of $t$ , say $1 < t < 500$ . As anticipated, it is clear in all four cases that the approximation is inadequate for large values of $t$ and can be biased by roughly 2.0 m/s. The explanation is that athletes tend to have periods of rest that reduce the ${\bar{y}}_{t}$ values. Consequently, increased numbers of periods of rest tend to negatively affect the approximation. A message here is that training sessions ought to be designed to elicit good approximations.

Finally, the reader may notice that whereas the simulation study with $(task, σ) = (10, 0.1)$ provides reasonable approximations for the interval $t \in (1, 500)$ s, our example in Table 1 used duration intervals $t = 1, \dots, 5$ s. The reason for the large discrepancy is that the proposed behavioural model in Appendix A was based on a training session with the prescribed point process. In the example, data was collected from an actual match where players rarely sprint continuously for extended periods.

Appendix B

Below we provide the JAGS code used in fitting the preferred model (7) associated with the data and output described in Table 2. The simpler models described in this paper are easily deduced from the code corresponding to model (7).

library(R2jags) # download JAGS at https://sourceforge.net/projects/mcmc-jags/files/library(MASS)

############ enter the data in Table 1 ############

time <- 1:5

yt <- c(9.41, 8.92,8.50, 8.08,7.67)

############ produce model (7) estimates given in Table 2 ############

# define the parameters and the data:

n = length(yt)

data = list("yt" =yt,"t"=time, "n" = n) # identify data

# model definition:

cat(paste("model{

yt[1:n] ~ dmnorm(mu[1:n],s[,])

for (i in 1:n) {

for (j in 1:n){

s1[i,j] <- (1/tau)*(pow(pp,abs(i-j)))

}

mu[i] <- theta[1]*t[i]/(t[i]+theta[3])

+ theta[2]/(t[i]+theta[3]) + theta[1]*theta[3]/(t[i]+theta[3])

}

s[1:n,1:n] <- inverse(s1[,])

theta[1:3] ~ dmnorm(thetamu[1:3],thetasigma1[,])

k = 1.5 # tuning paramter in \DIFadd{the `Example'}\DIFdel{S} \DIFadd{s}ection

# prior means and covariances from equations (8), (9) in \DIFadd{the `Example'}\DIFdel{S} \DIFadd{s}ection

thetamu[1] <- 5.815

thetamu[2] <- 396

thetamu[3] <- 198.212

thetasigma[1,1] <- 0.11163

thetasigma[1,2] <- -22.862

thetasigma[1,3] <- -8.846309

thetasigma[2,1] <- -22.862

thetasigma[2,2] <- 18891.6

thetasigma[2,3] <- 13208.13

thetasigma[3,1] <- -8.846309

thetasigma[3,2] <- 13208.13

thetasigma[3,3] <- 12833.37

thetasigma1[1:3,1:3] <- inverse(k*thetasigma[,]) # JAGS uses the inverse of the variance

pp ~ dunif(0,1)

tau ~ dgamma(1.0E-3, 1.0E-3) # JAGS doesn't allow improper priors

sigma <- 1/sqrt(tau)

Smax = theta[1]+theta[2]/theta[3]

}",sep = ""), file="CV3Model2.txt")

# random initial values:

inits <- function () {list(theta = mvrnorm(1,c(5.815,396,198.212),

matrix(c(0.11163,-22.862,-8.846309,-22.862,18891.6,13208.13,-8.846309,13208.13,12833.37),

nrow = 3)), pp = runif(1, min = 0, max = 1),tu = 1)}

# generate sample from posterior:

jags.m <- jags.model(file = "CV3Model2.txt",data = data, inits=inits, n.chains=3,n.adapt = 1000)

post_sample = coda.samples(jags.m,

c("theta","sigma","Smax","pp"), # which parameters to be monitored

n.iter = 20000)

summary(window(post_sample, start=16001,end = 20000))

\end{verbatim}

References

Poole

Burnley

Vanhatalo

Rossiter

Jones

. Critical Power: An Important Fatigue Threshold in Exercise Physiology. Med Sci Sports Exerc 2016; 48(11): 2320–2334.

Puchowicz

Mizelman

Yogev

Koehle

Townsend

Clarke

. The Critical Power Model As a Potential Tool for Anti-doping. Front Physiol 2018; 9: Article 643. DOI: 10.3389/fphys.2018.00643.

Moritani

Nagata

Devries

Muro

. Critical Power As a Measure of Physical Work Capacity and Anaerobic Threshold. Ergonomics 1981; 24: 339–350.

Monod

Scherrer

. The Work Capacity of a Synergic Muscular Group. Ergonomics 1965; 8: 329–338.

Jones

Vanhatalo

. The ‘critical Power’ Concept: Applications to Sports Performance with a Focus on Intermittent High-intensity Exercise. Sports Med 2017; 47(Supplement 1): 65–78.

Morton

. A 3-parameter Critical Power Model. Ergonomics 1996; 39: 611–619.

Albert

Glickman

Swartz

Koning

, Editors. Handbook of Statistical Methods and Analyses in Sports. Boca Raton: Chapman & Hall/CRC Handbooks of Modern Statistical Methods, 2017.

Lord

Blazevich

Abbiss

Drinkwater

Ma’ayah

. Comparing Maximal Mean and Critical Speed and Metabolic Powers in Elite and Sub-elite Soccer. Int J Sports Med 2020; 41(4): 219–226.

Peng

Brodie

Swartz

Clarke

. Bayesian inference for the impulse-response model of athletic training and performance. 2021.

10.

Clarke

Skiba

. Rationale and Resources for Teaching the Mathematical Modeling of Athletic Training and Performance. Adv Physiol Educ 2013; 37(2): 134–152.

11.

Lutz

Memmert

Raabe

Dornberger

Donath

. Wearables for Integrative Performance and Tactic Analyses: Opportunities, Challenges, and Future Directions. Int J Environ Res Public Health 2020; 17(1): 59.

12.

Abbruzzo

Ferrante

De Cantis

. A Pre-processing and Network Analysis of GPS Tracking Data. Spatial Econom Analy 2019; 16(2): 217–240.

13.

Burnley

Jones

. Power-duration Relationship: Physiology, Fatigue and the Limits of Human Performance. Eur J Sports Sci 2018; 18(1): 1–12.

14.

Patoz

Spicher

Pedrani

Malatesta

Borrani

. Critical Speed Estimated by Statistically Appropriate Fitting Procedures. Eur J Appl Physiol 2021; 121(7): 2027–2038. DOI: 10.1007/s00421-021-04675-8.

15.

Dicks

Joe

Hackney

Pettitt

. Validity of Critical Velocity Concept for Weighted Sprinting Performance. Int J Exerc Sci 2018; 11: 900–909.

16.

Silva

Swartz

. Analysis of Substitution Times in Soccer. J Quant Anal Sports 2016; 12(3): 113–122.

17.

Plummer

. JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling. Proceedings of the 3rd International Workshop on Distributed Statistical Computing, Vienna, Austria, 2003.

18.

Coh

Hebert-Losier

Stuhec

Babic

. Kinematics of Usain Bolt’s Maximal Sprint Velocity. Kinesiology 2018; 50: 100–101.