Abstract
Biomechanical and orthopaedic studies frequently encounter complex datasets that encompass both circular and linear variables. In most cases (i) the circular and linear variables are considered in isolation with dependency between variables neglected and (ii) the cyclicity of the circular variables is disregarded resulting in erroneous decision making. Given the inherent characteristics of circular variables, it is imperative to adopt methods that integrate directional statistics to achieve precise modelling. This paper is motivated by the modelling of biomechanical data, that is, the fracture displacements, that is used as a measure in external fixator comparisons. We focus on a dataset, based on an Ilizarov ring fixator, comprising of six variables. A modelling framework applicable to the six-dimensional joint distribution of circular-linear data based on vine copulas is proposed. The pair-copula decomposition concept of vine copulas represents the dependence structure as a combination of circular-linear, circular-circular and linear-linear pairs modelled by their respective copulas. This framework allows us to assess the dependencies in the joint distribution as well as account for the cyclicity of the circular variables. Thus, a new approach for accurate modelling of mechanical behaviour for Ilizarov ring fixators and other data of this nature is imparted.
Keywords
Introduction
External fixators are medical instruments used to immobilise fractures or heavy damage to the bone structure. Healing of a fracture is influenced by the amount of strain at the fracture site. 1 The strain is related to the amount of motion of the fracture under loading. The motion of the fracture relies on the combination of rings, bars, pins and wires. Different configurations of external fixators can be compared by measuring the displacement of the fracture. Fracture healing is inevitably influenced by the complex interplay of biology and biomechanics, that is, inter-fragmentary motion and inter-fragmentary biomechanics. The stiffness of a construct of an external fixator is determined by the configuration of the hardware. The construct’s configuration may lead to different inter-fragmentary motion and it is, therefore, important to obtain the most appropriate construct for optimal healing. Iobst et al. 2 provided an overview of the various circular external fixators based on current choices available within the field, accompanied by a comprehensive comparative assessment of seven prevalent hexapod circular external fixator systems. Pertinent attributes pertaining to the hardware, software components, and educational potentials of each system are meticulously delineated. Noteworthy is the systematic refinement of information procured from system manufacturers, subject to rigorous and unbiased editorial scrutiny to align with the structural requirements of this review, devoid of any extraneous subjective commentary or recommendations. The authors of the review contend that the development of varied systems has had a positive impact on the advancement of this technical domain.
In the literature various studies can be found which compare configurations of constructs to determine strain at a fracture site and thus the most appropriate configuration for optimal healing.3–7 To the best of the authors’ knowledge, all studies in this area follow a similar statistical procedure: the assumption of independence among the variables and all variables are considered linear variables. There are two main shortfalls of this approach: (i) disregard of the dependence structure between the variables which may impair the accuracy of the model and (ii) the cyclicity of the rotational variables is neglected and mistreated as a linear variable which may produce misleading results. In numerous studies, the analysis of variance (ANOVA) test is frequently employed to facilitate comparisons among various fixator frames, as supported by research conducted by authors such as Corona et al. 6 and Watts et al., 7 among others. These investigations are grounded in the analysis of clinical data stemming from patient cohorts. In contrast, this study diverges from this approach by utilising a virtual model to generate simulated data, which emulates the behaviour of the fixator frame under scrutiny.
In the biomechanical domain, studies including the use of directional statistical techniques have been limited to the univariate and bivariate setting.8–11 The primary objective of the study by Pataky et al. 10 was to conduct a comparative analysis between directional analysis and uni- as well as multivariate Cardan analysis with regard to representative joint kinematic data collected during gait. In the work authored by Telschow et al., 11 they addressed the issue of gait reproducibility by examining rotations of the tibia and femur at the knee joint. This investigation takes into account both the spatial influence of marker placement and the temporal variability introduced by different walking speeds in the experimental setup. The study conducted by Rivest et al. 9 centres their attention on estimating the orientations of the two rotation axes at the ankle joint. While Rivest et al. 8 developed a score statistic to evaluate the adequacy of the fixed-axis model. Furthermore, they proposed techniques to address the auto-correlation of errors among adjacent data points. These studies mentioned above represent the most immediate applications of directional statistics in the biomechanical field, indicating the existence of a significant research gap that needs to be addressed. The biomechanical domain is rich with angular data, however, the use of directional statistics techniques is overlooked.
The remainder of the article is organised as follows: Section 2 provides some background on the statistical techniques required to build our proposed model defined in Section 4. The motivation for the proposed model emanates from a specific data application which is discussed in Section 3. Section 5 contains the results and discussion of the data application. Section 6 concludes with some final remarks.
Background
There are a large number of applications which require the analysis of data not realised in the Euclidean space, but rather on some manifold (circles, spheres, hyperspheres, cylinders, torus etc.). This type of data arises in many fields such as life sciences, image analysis, astronomy, meteorology and earth sciences. Circular data is employed in numerous applications, as shown by Ley and Verdebout and 12 SenGupta and Arnold. 13 Directional statistics constitutes a specialised branch within statistical analysis that focuses on angular observations. An inherent challenge in handling directional data lies in the non-linearity of the sample space, typically represented as a circle or circular manifold. This unique feature has garnered increased attention in the past two decades, primarily driven by the increase of data and the corresponding demand for tailored statistical methodologies, as highlighted by Ley and Verdebout. 14 Existing directional models have been summarised in the works of Ley and Verdebout, 14 Mardia and Jupp, 15 and Pewsey and García-Portugués. 16 A fundamental query arises when considering the necessity of directional statistics: ‘Is there a need for specialised techniques due to the curvature of the sample space? Why are standard linear statistical methods insufficient?’ Mardia and Jupp 15 and Mardia 17 provided a simple example by considering the metric of the arithmetic mean to illustrate why it is necessary to account for the curvature of the sample space and how assuming linear techniques leads to inaccurate results. However, in many studies, the cyclicity of a circular variable is often neglected and mistreated as a linear variable.18–21 The biomechanical domain is one example where angular data is prevalent, however, the use of directional statistics techniques is overlooked.
Modelling of circular-linear (C-L) data is limited to the bivariate C-L joint distribution. Various bivariate C-L distributions have been proposed and studied, specifically focusing on meteorology and climatology studies. A well-known method to obtain a C-L distribution is via the conditional modelling approach. Mardia and Sutton 22 combined the von Mises and normal distribution to obtain a joint model with conditional independence. Thereafter, many other models have been proposed, however, this approach restricts the choice of the marginal distribution and may neglect the dependence structure between the variables. This concern becomes even more complicated when extending a C-L distribution to the multivariate setting. Even with the assumption of independence among the circular and linear variables, extending a C-L distribution past the bivariate setting is difficult due to the normalising constant being intractable in most cases as well as the restricted circumstances for the normalising constant to be approximated. Thus, resulting in additional complexity with regards to efficient estimation methods. An attempt to extend the model proposed by Mardia and Sutton 22 to the multivariate setting is given by Luengo-Sanchez et al. 23 In these studies, dependencies between linear variables were considered however, every circular variable was considered independent of all the other variables.
One of the most commonly used C-L distributions is the angular-linear model proposed by Johnson and Wehrly, 24 known as the JW model. This model was constructed using copulas. Copulas are used to link the marginal distributions of variables to their multivariate distributions. 25 Using copulas to build multivariate distributions is a flexible and convenient approach. In the linear domain, a substantial amount of literature can be found on multivariate copulas.25,26 However, these copulas cannot directly be applied to our problem as we need to consider the directional variables and account for their cyclicity. In the directional domain, there are limited studies on multivariate copulas. Bivariate copulas have been studied for circular-circular (C-C) variables27,28 and C-L variables, recently a trivariate circular copula was proposed. 29 The JW model being the most extensively considered and used. The JW model allows the distribution of the C-L variable pair to be written as a function of the marginals and the JW copula function. Since copula functions in the directional domain are limited to the bivariate case an alternative model needs to be considered to obtain our joint six-dimensional (6D) model.
A possible solution for the shortfall of multivariate C-L copulas is vine copulas. The origin of vine copulas stems from the hierarchical copula-based structure, specifically the pair copula construction proposed by Joe
30
and later investigated by Bedford and Cooke.31,32 There has been a growing interest in vine copulas due to their flexibility and intuitive decomposition. Vine copulas allow the construction of multivariate joint probability distributions by decomposing the multivariate function into simple building blocks comprising of bivariate copulas (pair copulas) based on the conditional probabilities.
33
Hence, the need for a multivariate C-L copula function is avoided. A vine is usually divided into D-vine (drawable vine) and C-vine (canonical vine) structures where the latter is more suitable for situations where a variable is key in controlling the dependency. A
By considering vine copulas with our linear-linear (L-L) variable pairs, C-C variable pairs and C-L variable pairs we can construct a multivariate joint probability model while also taking into account the cyclicity of the directional variables. For the trivariate case of a circular-linear-linear joint model this approach has proven to be useful in meteorology and oceanography.18,34
With the flexibility offered by vine copulas comes an increase in complexity in larger dimensions. This results in the computational effort required to estimate all the parameters growing exponentially with dimension. Given that our dataset is 6D we consider a truncated vine copula. Truncated vine copulas have been proposed by Kurowicka
35
and Brenchmann et al.
36
Truncated vine copulas are helpful as they can be constructed by using only pair-wise copulas and a lower number of conditional pair-wise copulas. This method is applied by considering a vine copula structure where all pair-wise copulas with conditioning set equal to or larger than
To summarise in this article, we propose a modelling framework applicable to the 6D joint distribution. Our proposed model makes use of directional statistics to account for the cyclicity of the rotational variables and is constructed based on truncated vine copulas. The pair copula decomposition concept of vine copulas represents the dependence structure as a combination of L-L, C-C and C-L pairs modelled by their respective copulas. This allows us to assess the dependencies in the joint distribution. An advantage of using vine copulas is the flexibility to build multivariate distributions via bivariate copulas that model the dependence between pairs of random variables. The truncation of the vine copula assists with the computation effort required to estimate our model.
Data and motivation
Let

External fixator on left. Fracture ends at the reference position in middle. Fracture ends at the deflected position on right.
For this application, we consider the fracture displacement data comprising of the translational and rotational variables for two different constructs of an Ilizarov ring fixator which we refer to as configuration 1 and configuration 2. In Table 1, the descriptive statistics for the translational (linear) variables of configuration 1 and configuration 2 are given; specifically, the mean, standard deviation (sd), interquartile range (IQR), skewness and kurtosis.
Descriptive statistics of the translational variables for each configuration.
sd: standard deviation; IQR: interquartile range.
In Table 2, the values of the main circular statistics
15
are given for the rotational variables of configuration 1 and configuration 2. Specifically, the mean resultant length (
Descriptive statistics of the rotational variables for each configuration.
In Figures 2 and 3, the data plots of the translation and rotational variables for configuration 1 and configuration 2 are provided, respectively. From Figures 2 and 3, bimodality and slight skewness are observed in some of the data plots indicating the need for a model that can account for multimodality, for example, the use of finite mixture models for specific variables. Based on these model requirements, a copula approach where the marginal distributions can be specified separately from the dependence structure is a useful technique for building a joint model. From the results in Table 2 and the histograms in Figures 2 and 3, we note that the rotational variables are highly concentrated which raises questions on the assumption of the periodicity of the variables. However, due to the nature of the variables we consider them to be circular in the modelling approach as the concentration might differ for various manufacturers and configurations but the nature and domain of the variable remains unchanged. As pointed out by Mardia, 17 the underlying geometry is the main driver motivating the use of statistics for non-Euclidean variables. Hence, directional statistical techniques are considered for this study.

Histograms of the data of the translational (left) and rotational (right) variables for configuration 1.

Histograms of the data of the translational (left) and rotational (right) variables for configuration 2.
Pearson’s correlation coefficient was considered for the paired translational variables. The bivariate relationship between the translational and rotational variables is measured by the linear-circular correlation coefficient,
15

Correlation plot of the paired translational variables, paired rotational variables and translation-rotational variables for configuration 1.

Correlation plot of the paired translational variables, paired rotational variables and translation-rotational variables for configuration 2.
A significance test was performed to evaluate the necessity of accounting for a dependence structure between the variables. Tables 3 and 4 provide the significant (at a 5% significance level) correlation coefficients for configuration 1 and configuration 2, respectively, where the null hypothesis assumes independency among the respective variables. The results from the correlation tests further emphasise the need for a joint model that accounts for dependencies.
In this section, we define the proposed modelling framework applicable to the 6D joint distribution for modelling the displacement of a fracture site. The six variables, of which three are translational and three are rotational, are taken into account in this framework. The linear and circular probability density functions (PDFs) considered are given as well as the copula functions.
Linear and circular distributions
If
Based on the preliminary analysis of the data, and the histograms of the variables provided in Figures 2 and 3, for the translational variables, the normal (N) distribution and a two component mixture of the normal (MN) distribution were considered to be most appropriate. A two component mixture of the normal distribution was considered to accommodate for the multimodality observed in the data. The expression for a finite mixture model is given as
Paired variables for configuration 1 with correlation coefficients significant at a
Paired variables for configuration 2 with correlation coefficients significant at a
For the rotational variables, we considered various circular distributions and found the wrapped Cauchy (wC) and a two component mixture of the wC (MwC) to be most appropriate. The peakedness of the wC makes it an appealing choice for this study. The PDF of the wC
15
is given by
To build our model, we consider the defined linear and circular distributions as the marginal distributions for our framework based on the analysis of the data (see Figures 2 and 3 and Tables 1 and 2). Various other distributions may also be considered depending on the complexity of a dataset.
The copula approach allows us to consider the marginal distributions separately from the dependence structure between the variables. Consider
In the linear domain, various multivariate copula functions have been defined. 25 Since in directional statistics copula functions are limited to the bivariate case, we consider the use of vine copulas.
For the L-L pair copula, we consider the conventional Farlie-Gumbel-Morgenstern (FGM) defined as follows:
For the C-L pair copula, we consider the most common function proposed by Johnson and Wehrly,
24
the JW copula, defined as
Jones et al. 27 proposed a similar approach to Johnson and Wehrly 24 to obtain a bivariate circular copula. The resulting copula function simplifies to a circular PDF where the argument is defined as in (9).
Based on the concept of vine copulas, our proposed model is built using the L-L, C-L and C-C pair copulas and their respective marginal distributions.

A schematic diagram of the canonical vine structure considered for the proposed model.
In Figure 6, the illustrated canonical vine structure considered is given. The paired variables that form the basis of the vine copula structure are denoted as
It is important to note that different vine structures may lead to different results. As mentioned by Aas et al.,
33
the decomposition should be selected by determining which pair relationships are most important. Based on expert opinion, in this application, we considered a structure where the variable,
For the parameter estimation of (10) the method of maximum likelihood estimation (MLE) is considered. Closed-form expressions for the MLEs cannot be obtained due to the complex functional form. Advanced optimisation algorithms are required to numerically compute the maximum likelihood and thus the MLEs.
In this section, we illustrate the validity of our proposed model by considering the fracture displacement data discussed in Section 3.
Combination of marginal and copula functions considered for each configuration.
Combination of marginal and copula functions considered for each configuration.
N: normal; MN: mixture of the normal; wC: wrapped Cauchy; MwC: mixture of the wrapped Cauchy; FGM: Farlie-Gumbel-Morgenstern; JW: Johnson and Wehrly.
The flexibility offered by vine copulas comes at a computational cost for high dimensions. As a result, we consider a simplified form of the joint PDF given in (10) namely a truncated vine copula. For the purposes of this study, we considered
To compute our joint distribution model we first need to specify the marginal and copula functions for each variable and variable-pair, respectively. Based on preliminary analysis of the data (as illustrated in Section 3) and models defined in Section 4, for the linear variables we consider the N distribution as well as the MN distribution. For the circular variables we consider the wC distribution as well as the MwC distribution. A combination of different marginal functions is considered for each configuration termed cases. Table 5 provides a summary of the different cases evaluated for each configuration. We consider two cases, A and B, for each configuration (
For the finite mixture models, the expectation-maximisation (EM) algorithm may be utilised. Due to the high-dimensional space of the parameter set, advanced optimisation algorithms such as the particle swarm optimisation (PSO) 37 and genetic algorithm (GA) 38 were used to efficiently estimate the parameters of the joint model.
In Table 6, the performance measures, for the different cases specified in Table 5, are provided. For the performance evaluation, two goodness-of-fit metrics are applied to evaluate the models. The Akaike information criterion (AIC) estimates the relative amount of information lost by a given model. The Bayesian information criterion (BIC) is widely used for model selection and is similar to the AIC, however, the BIC is more strict in its penalisation of model complexity. These two metrics are defined as follows:
The number of parameters (
It is important to note that the results of the proposed model cannot be compared with existing methods (defined in the linear domain) as the models are defined on different manifolds. Due to the nature of the rotational variables, implementing an approach that incorporates directional statistics is crucial for accurate modelling.
To illustrate the validity of the proposed model in comparison to the conventional use of an independent model, we consider a likelihood ratio test (LRT). We consider the model under the null hypothesis to be the independent model and case B1 and case B2 (more complex model) to be the model under the alternative hypothesis, respectively. For the independent model, we consider the same distributions as specified for case B1 and case B2 for the six variables (
In this article, we propose a modelling framework applicable to the 6D joint distribution. This model accounts for the cyclicity of the rotational variables by means of directional statistics as well as accounts for a dependence structure between the variables. The framework is constructed based on vine copulas. The pair copula decomposition concept of vine copulas represents the dependence structure as a combination of C-L, C-C and L-L pairs modelled by their respective copulas. This allows us to assess the dependencies in the joint distribution. An advantage of using vine copulas is the flexibility to build multivariate distributions via bivariate copulas that model the dependence between pairs of random variables. For efficient estimation, a truncation of the vine copula was considered. The analysis of this data motivates the need for a dependence structure to be accounted for when modelling this type of data. From the results of the real data application, the advantage of applying the joint dependence model is observed. Based on the LRT, we can conclude that the joint dependent model is a better choice for modelling the fracture displacements and will thus be more informative for evaluations of these devices and the design thereof. The proposed modelling framework can be adjusted for other practical cases depending on the desired dependency structure required and the relationship between the variables. The primary goal of an external fixator is injury rehabilitation. Fracture healing is inevitably influenced by the complex interplay of biology and biomechanics – that is, inter-fragmentary motion and biomechanics. The construct of an external fixator is determined by the configuration of the hardware. The construct’s configuration may lead to different inter-fragmentary motions and it is therefore important to accurately model and understand the motions in play to obtain the most appropriate construct for optimal healing. The modelling framework proposed in this article provides a more accurate view for fracture displacements thus leading to improved evaluations and design of these devices, thus, aligning with the United Nation’s Sustainable Development Goal (SDG) 3 to promote good health and well-being.
Footnotes
Acknowledgements
The authors would like to thank the anonymous reviewers for their insightful comments which led to an improvement in this paper.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was based upon research supported in part by the National Graduate Academy for Mathematical and Statistical Sciences (NGA) of South Africa; National Research Foundation (NRF) of South Africa, Reference: SRUG2204203965, grant no. 120839 and Reference: RA171022270376, grant no. 119109; DSI-NRF Centre of Excellence in Mathematical and Statistical Sciences (CoE-MaSS), South Africa and STATOMET at the Department of Statistics at the University of Pretoria. The research of M. Arashi is based upon research funded by Iran National Science Foundation, grant no. 4015320 as well as the National Research Foundation (NRF) of South Africa, Reference: RA211204653274, grant no. 151035.
