Abstract
The few existing statistical models of breast cancer recurrence and progression to distant metastasis are predominantly based on multi-state modelling. While useful for summarising the risk of recurrence, these provide limited insight into the underlying biological mechanisms and have limited use for understanding the implications of population-level interventions. We develop an alternative, novel, and parsimonious approach for modelling latent tumour growth and spread to local and distant metastasis, based on a natural history model with biologically inspired components. We include marginal sub-models for local and distant breast cancer metastasis, jointly modelled using a copula function. Different formulations (and correlation shapes) are allowed, thus we can incorporate and directly model the correlation between local and distant metastasis flexibly and efficiently. Submodels for the latent cancer growth, the detection process, and screening sensitivity, together with random effects to account for between-patients heterogeneity, are included. Although relying on several parametric assumptions, the joint copula model can be useful for understanding – potentially latent – disease dynamics, obtaining patient-specific, model-based predictions, and studying interventions at a population level, for example, using microsimulation. We illustrate this approach using data from a Swedish population-based case-control study of postmenopausal breast cancer, including examples of useful model-based predictions.
Introduction
The clinical staging of cancer is strongly correlated with prognosis and contributes to the choice of specific treatment regimes. In the United States, for breast cancers diagnosed between 2010 and 2016, the 5-year relative survival rates were 99%, 86% and 28% for localised, regional and distant stages, respectively (figures from www.cancer.org). Similarly, in Sweden, Bower et al. 1 reported relative survival rates of 97–98%, 92–94%, 75–79% and 25–33% for women diagnosed with Stage I to IV breast cancer, based on data between 1992 and 2012 and obtained from the Breast Cancer Data Base Sweden (BCBaSe). Few breast cancers are fortunately diagnosed with distant spread, but for those without distant spread at diagnosis, progression to distant metastatic cancer can occur years after diagnosis of the primary tumour, and unfortunately, once being diagnosed with distant metastases, prognosis is still, these days, poor. 2 Different subtypes of cancer are associated with different patterns of recurrence (with distant metastases). 3 Estrogen receptor (ER) negative cancers are associated with early recurrence, whilst ER-positive cancers are associated with later recurrence, but with more than 5 years sustained risk. 4 A deep understanding of metastatic recurrence is important for improving the prevention and treatment of breast cancer.
There are though few statistical models of breast cancer recurrence or progression to distant metastasis. Most of these are based on multi-state modelling.5,6 Whilst these approaches are very useful for summarising the risk of recurrence, they do not provide insights into the underlying biological mechanisms and have limited use in terms of understanding the implications of interventions (e.g. modifications to screening). In this paper, we describe an alternative approach to modelling (lymph node and distant) metastases, based on a biologically-inspired continuous tumour growth model. The strength of continuous growth models lies in their flexibility and parsimony. Although they rely on strong parametric assumptions they have the potential to be useful for understanding disease dynamics.7–15 Early continuous growth models, applied to breast cancer screening data, combined a tumour growth model with a continuous function of screening sensitivity and accommodated random effects to account for heterogeneity in tumour growth. Isheden et al. described a joint model of tumour size and lymph node spread, and Gasparini and Humphreys developed a model of tumour size and distant metastatic spread.14,15 To the best of our knowledge, no natural history model that jointly accommodates both local and distant metastatic spread alongside latent tumour growth has been proposed in the literature so far. Specifically, the model introduced in this paper is developed for breast cancer incident cases collected in a population of women in which screening is offered.
First, we describe the model for tumour growth in Section 2. We then describe the models for spread to the lymph nodes and distant metastatic spread in Sections 3 and 4, respectively; these models are based on previous work.14,15 We bring the models together using a copula-based approach (Section 5), and then formulate likelihood functions, first in the absence of screening (Section 6), and then for a screened population (Section 7). Possible choices for the copula function are discussed in Section 8. In Section 9, we summarise Monte Carlo simulation studies which check the computing algorithms. Results and additional details of the simulation studies are provided in the Supplementary Material. Model-based predictions that can be obtained after fitting the joint model are introduced and described in Section 10. In Section 11, we analyse data from a Swedish population-based case-control study of postmenopausal breast cancer. We conclude the article with a discussion in Section 12.
Modelling tumour growth
The growth of the primary tumour is assumed to follow an exponential function such that, for a tumour growing according to an inverse growth rate
Individual variation in growth rates is accounted for by assuming that inverse growth rates follow a Gamma distribution with shape
We assume that tumours are detectable with non-zero probability from a given volume
As in previous work,7,9,12,13 we assume the following hazard function for time to symptomatic detection
For an unscreened population, the density for tumour volume at symptomatic detection can, from the above assumptions, be calculated as follows:
The model for spread to the lymph nodes (seeding) is based on a non-homogeneous Poisson Process with intensity function
Given that breast cancer is a heterogeneous disease, we assume that spread to the lymph nodes occurs at different rates for different women. Therefore, we assume that
The probability of having
The model for time to distant metastatic spread is based on the same mathematical formulation as for lymph node spread; it is based on a non-homogeneous Poisson process, as in equation (4), but with parameters
Finally, for a tumour of volume
In Sections 6 and 7, we derive likelihood functions for the joint probability of tumour size, lymph node spread, and time to distant metastases. Because we assume that detection is a function of tumour size (and not lymph node or distant metastatic spread) it is convenient to work with the joint distribution of the number of affected lymph nodes
The joint bivariate density function follows as:
We note also that the copula
In this section, we describe a likelihood function for the joint model introduced in Section 5. We do this first in the absence of screening, as an intermediate step, before developing the algorithm that we will use in our observational data analysis (with screening incorporated). The contribution to the likelihood differs between observed events (patients who are diagnosed with distant metastasis during follow-up), left-censored subjects (with detected metastasis at diagnosis), and right-censored subjects, who are event-free at the end of follow-up.
The likelihood contribution for observed events (
The likelihood contribution for left-censored observations is defined as
Finally, the likelihood contribution for right-censored observations is defined as
We now formulate the likelihood function for a screened population. We assume that, for all incident cases patients, information is available on whether their primary tumour was detected by screening or via symptoms, and that dates of their (negative) screens prior to diagnosis are also known (as is the case for the participants of the study presented in Section 11). Individual contributions to the likelihood will be based on the joint probability of tumour size, number of affected lymph nodes, and time to distant metastasis, conditional on mode of detection and dates of prior negative screens.
We start by defining a model for screening sensitivity
The screening sensitivity is then used, together with the sub-models introduced in the previous section, to formulate the likelihood function. In essence, detection via screening and via symptoms act as competing events. We note that in the derivations below, we rely on stable disease assumptions (described by Isheden and Humphreys 13 ) to formulate the likelihood function, and we use notation that is similar to that of previous work13,15 :
Assuming all of the above, the likelihood contribution for screen-detected (
The term
As in the absence of screening (Section 6), we carried out a simulation study to validate the estimation procedure (and our implementation) in the presence of screening, based on using a Frank copula. This is described in brief in Section 9 and in full in Section A.2 of the Supplemental Material.
In principle, different copula functions
Most important Archimedean copulae, including their bivariate formulation, the domain of the association parameter
, and possible correlation values (in terms of Kendall’s
) that could be represented. Note that the product copula is also known as the independence copula.
Most important Archimedean copulae, including their bivariate formulation, the domain of the association parameter
It would also be possible to use a Gaussian copula, which is constructed from a multivariate normal distribution using the probability integral transform as follows (for the bivariate case):
The choice of the copula function affects the shape of the association structure and the possible correlation values that can be captured and described; for instance, some formulations yield a symmetric association structure (e.g. the Frank and Gaussian copulae), while others do not. Unless prior knowledge about the association structure is available, the choice of the copula structure can be data-driven, for example, using information criteria such as the Akaike information criterion (AIC) and the Bayesian information criteria (BIC) or the observed correlation between the margins.
We validated the estimation procedure by designing and running two simulation studies, one in the absence, and one in the presence, of screening; specifically, the aim of these simulation studies was to assess whether our estimation procedure could retrieve the correct values of the model parameters.
We simulated independent datasets, each with 1500 cancer patients, with outcomes simulated under the full joint model described above, both in the presence and absence of screening. We carried out our simulations assuming a Frank copula for simplicity, but we expect our results to generalise to other copulae functions. More details on the settings of the simulation studies are described in the Supplemental Material.
We ran 500 repetitions for the simulation in the absence of screening and 800 repetitions for the simulation for a screened population after preliminary analyses suggested these numbers of repetitions to be sufficient for constraining Monte Carlo errors to 0.01 or less.
The results of our simulations showed (1) that our estimation procedure could retrieve the correct model parameters with no to negligible bias, both in the absence and presence of screening, (2) that estimating the model standard errors using the inverse of the Hessian matrix at the optimum was satisfactory, and that (3) the small, aforementioned bias did not affect coverage probability which was (overall) close to the optimal value of 0.95. As expected, Monte Carlo errors were smaller than 0.01 (overall) showing that the numbers of repetitions were satisfactory for our aims. Further details and results of our simulations are available in the Supplemental Material available online.
Model-based predictions
After fitting the joint model introduced in Section 5, a variety of useful predictions can be obtained.
First, we can define the probability of having detected distant metastases at diagnosis of the primary tumour given size of the tumour and number of affected lymph nodes:
Second, we can define the probability of having diagnosed or latent/undiagnosed distant metastases given size of the tumour and number of affected lymph nodes at diagnosis of the primary tumour:
Finally, we can define the survival probability at any time
The model introduced in this manuscript was used to analyse data collected from incident cases in a case-control study of postmenopausal breast cancer in Sweden (CAHRES; Cancer And Hormone REplacement Study). 21 Data on the participants (women born and residing in Sweden, aged 50–74, diagnosed with an incident primary invasive breast cancer between 1 October 1993 and 31 March 1995) was linked to data from the Swedish Cancer Registry and the Stockholm-Gotland Breast Cancer Registry. The data on distant metastases has been used before to study the association of mammographic density and the risk of distant spread 22 ; the (marginal) natural history model of lymph node spread was also developed using CAHRES (and other) data.14,23,24 Mammographic images and screening histories were collected from mammography screening units and radiology departments, in an extension of the original case-control study; the collection of this data has also been described previously.25,26 CAHRES is an old study, which for us has several advantages: first, there is an adequate follow-up to study breast cancer spread to distant metastasis. Second, novel treatments for breast cancer were not available at the time, which may favour some of the basic assumptions of our model.
For this application, we only included women with information on the number of affected lymph nodes and time to distant metastasis, alongside data on tumour size, mode of detection, and screening history. This led to a dataset representing 1581 women, of which 1019 (64.4%) were detected through screening and 562 (35.6%) were detected symptomatically. The median tumour diameter at detection was 15 mm, with an inter-quartile interval of 10–22 mm.
Data for this study was collected before the introduction of sentinel lymph node biopsy; 1091 women (69.0%) had no affected lymph nodes at detection, 170 (10.8%) had one affected lymph node, and 91 (5.8%) had two affected lymph nodes; the remaining 229 women (14.4%) had three or more affected lymph nodes. The 99th percentile of the lymph nodes distribution was 17, and the maximum was 42.
Only one woman had detected distant metastasis at the time of diagnosis of the primary tumour; during follow-up, 288 more women (18.2%) were diagnosed with distant metastasis (Figure B1 in the Supplemental Material available online). Median follow-up time, estimated using the inverse Kaplan-Meier method, 27 was 5.50 years (95% CI: 5.41–5.59 years). For the estimation of the joint model described in Section 5, we discretised time to diagnosis of distant metastasis in years; as was done in simulations reported in the Supplemental Material available online, the performance of the estimation procedure was found to be acceptable in this setting.
On average, patients with a large number of affected lymph nodes at diagnosis have a higher risk of distant metastasis, with shorter times to detection of distant metastasis compared to patients without lymph node spread. In our data, the estimated Kendall’s
The fitted maximum likelihood values (and model-based estimates of Kendall’s

Fitted model coefficients under each copula specification, application to Cancer And Hormone REplacement Study (CAHRES) data. Confidence intervals for the model assuming a Clayton copula are omitted, as the fitted Hessian at the optimum was found to be numerically unstable.
Fitted maximum likelihood values, AIC values, and Kendall’s
CAHRES: Cancer And Hormone REplacement Study; AMH: Ali-Mikhail-Haq; AIC: Akaike information criterion.
Fitted parameters (with 95% CI) for the model assuming a Frank copula, application to Cancer And Hormone REplacement Study (CAHRES) data.
For the best fitting model, we calculated some of the model-based predictions that are described in Section 10. Specifically, we focus here on novel quantities that can be obtained by fitting the model introduced in this manuscript; additional quantities of interest can be calculated, see e.g. previous work.13–15 For instance, we estimate from this model a median tumour doubling time of 216 days (95% CI: 209–223), which is comparable to what has been reported elsewhere.15,28 Note that standard errors for all model-based predictions reported in this section (such as median doubling time) were estimated using the numerical delta method as implemented in the predictnl function from the rstpm2 package. 29 Model-based predictions for all copula formulations that were tested are included in the Supplemental Material.
We focus on predictions for the time to distant metastasis, conditional on being free of distant metastasis at the time of diagnosis of the primary tumour. This is relevant since almost all patients have no (detectable) distant metastases at detection of the primary tumour, and is clinically interesting as almost all breast cancer deaths are preceded by a diagnosis of distant metastatic spread. 30 We first used equation (35) to construct a predicted survival curve for each patient in CAHRES, conditional on their covariates (i.e. tumour size, mode of detection, and timing of prior negative screens). We then divided the patients into groups according to the number of affected lymph nodes at diagnosis of the primary tumour. For women with zero, one, and two affected lymph nodes we (separately) marginalised over the observed covariates’ distributions. The resulting curves, for both the model with the Frank copula and the model with the product copula, are depicted (up to 10 years after diagnosis) in Figure 2; 95% point-wise confidence intervals, indicated by shading, are included as well.

Marginal survival curves (in black) for time to diagnosis of distant metastasis for subjects with zero, one, and two affected lymph nodes at diagnosis. Curves are shown for both the model with a Frank copula and the model with a product (independence) copula and averaged over the observed covariates’ distribution in the Cancer And Hormone REplacement Study (CAHRES). Kaplan-Meier curves (in grey) for relevant subgroups of the study population are included for comparison.
Figure 2 also includes Kaplan-Meier curves (depicted in grey), for a reference. Our model-based predictions were reasonably close to the observed data for the model assuming a Frank copula. The other copula formulations showed a poorer fit to the observed data (Figure B2 in the Supplemental Material available online). This illustrates that, although the parameters of the marginal models were estimated to be quite close to each other (Figure 1), the shape of the correlation structure plays an important role in obtaining a good fit to the observed data. As expected, the model assuming independence between the underlying lymph node and distant metastatic spread processes performed less well; the differences in the survival curves across a number of lymph nodes, for the independence copula, are due to a number of affected lymph nodes and time to distant metastases both being dependent on tumour size.
We also estimated the marginal cure fraction, which can be obtained by first calculating one minus equation (34) for every subject in our data, and then taking the average: this yielded an estimated cure probability of 0.697 (95% CI: 0.658–0.736), marginally over the observed distribution of covariates. When averaging, separately, over women with 0, 1 and 2 affected lymph nodes the cure probability estimates were 0.805 (95% CI: 0.772–0.839), 0.553 (95% CI: 0.500–0.605) and 0.479 (95% CI: 0.423–0.535), respectively (Table B2 in the Supplemental Material).
To illustrate the types of insights that a joint model of lymph node and distant metastatic spread can provide, we performed a microsimulation study to demonstrate the potential consequences of early detection. We simulated 10 million tumours from the inverse growth rate distribution estimated in CAHRES (Table 3) and simulated number of affected lymph nodes at 15 mm, along with a time of detection of distant metastasis (relative to the time the tumour reached 15 mm); this was done using the parameter values from Table 3 for the Frank copula model. For women/tumours without detectable distant metastasis at 15 mm, we then calculated the proportion of women that would have a detected distant metastasis within 5 years of diagnosis of the primary tumour (i.e. from the time the tumour was 15 mm), separately for women with zero, one, or two affected lymph nodes at diagnosis and by tertiles of inverse tumour growth rates (defined according to their fitted distribution). These numbers are represented by the right-most dots in each subplot of Figure 3. Then, we examined what the impact would have been if all tumours had been detected one, two, and three years earlier. Metastases that were seeded after the earlier date of diagnosis would (according to our modelling assumptions) have not been seeded, as the primary tumours would have been removed at this earlier date. As a result, fewer tumours would have distant metastases detected in the future. The five-year risks of distant metastasis counting time from the earlier diagnosis are represented as grey dots in the subplots of Figure 3; these five-year risks are, however, of course, not directly comparable to the five-year risks counting time from when the tumour reached 15 mm. To correct for what is essentially lead-time bias (note that lead time is defined as the time between the early diagnosis, e.g., because of screening, and the time that cancer would have been otherwise diagnosed through symptoms31–33 ) we calculated five-year risks from the time the tumour reached 15 mm. These estimates are represented as black triangles in Figure 3; these represent the proportions that should be interpreted (we however keep the naive estimates as a reminder of the importance of incorporating lead time in interpreting early detection in studies of screening). It is clear that early detection substantially improves prognosis only for fast-growing tumours.

Five-year risks of distant metastasis for women with 15 mm wide tumours, calculated at diagnosis and with risks inferred from scenarios representing early diagnosis, by one, two and three years (Time). Probabilities are estimated using a simulation approach. We simulated 10 million tumours based on the joint model parameter estimates described in Table 3. The naive estimates (in grey) are affected by lead-time, while the lead-time corrected estimates (in black) are directly comparable to the risk estimates based on detection at 15 mm. Results are presented for tumours with 0, 1 and 2 lymph nodes and by tertiles of inverse growth rates. Each plot includes annotations with average tumour size (as diameter, in mm) at each detection time considered in this microsimulation study.
The comparison between growth rate groups with time on the x-axis is not entirely straightforward (or fair), since the faster-growing tumours have smaller sizes at any earlier fixed time than the medium and slow-growing tumours. We, therefore, in Figure 4, reformulate the results of our microsimulation example in terms of detection at fixed smaller sizes (thus using tumour diameter for the x-axis), to answer, for example, the question: what would be the risk of distant metastasis if the tumour was detected at a size of 10 mm compared to 15 mm? Specifically, we go back in time to when the detected tumour would have been 10, 5 and 1 mm wide (compared to 15 mm at diagnosis); at each time, we re-calculate the same probabilities that were described above. We come to the same conclusions when we view the results in these terms; together, the two figures provide a more complete picture of the implications of early detection. Note that average sizes at earlier detection times (in Figure 3) and average times to detect smaller tumours (in Figure 4) are approximately the same across the number of affected lymph nodes after conditioning on growth rates (i.e. row-wise). This is expected from the model for the lymph nodes (Section 3) has the property that the number of lymph nodes is conditionally independent of growth rates, given tumour volume.

Five-year risks of distant metastasis for women with 15 mm wide tumours, calculated at diagnosis and with risks inferred from scenarios representing early diagnosis at diameters of 10, 5 and 1 mm (Size). Probabilities are estimated using a simulation approach. We simulated 10 million tumours based on the joint model parameter estimates described in Table 3. The naive estimates (in grey) are affected by lead-time, while the lead-time corrected estimates (in black) are directly comparable to risk estimates based on detection at 15 mm. Results are presented for tumours with 0, 1 and 2 lymph nodes and by tertiles of inverse growth rates. Each plot includes annotations with the average time (in years) it would take to detect tumours at the smaller sizes.
Natural history models, such as the one introduced in this manuscript, can provide useful and statistically efficient tools to study the latent history of breast cancer and metastatic spread. An attractive feature is that they explicitly model different aspects of breast cancer (such as growth, detection) using biologically inspired components and assumptions. The joint, copula-based joint model for lymph nodes and distant metastasis that we have developed deviates slightly from this, as copulas are mathematical constructs that do not necessarily have a biological interpretation. We note that evidence showing that direct seeding of tumour cells from the primary tumour to distant organs is the main route of metastatic dissemination has been presented, 34 but little is known about the dynamic nature of the relationship between lymph node and distant metastasis.
The marginal sub-models that we use through a copula are based on biological concepts,14,15 and their interpretations are retained with our approach. Our approach is based on an assumption that the rates of metastatic spread are proportional to the number and rate of cell divisions – in previous work we have been able to demonstrate, in the context of lymph node spread, that this assumption is more consistent with observational data than an assumption that rate of spread is proportional to tumour volume. 14 The shape of the copula determines a non-linear association between the number of affected lymph nodes and time to distant metastasis, which is consistent with previously reported data. 35 Overall, the marginal models alone were found to fit observed data reasonably well,14,15 therefore we expected the joint model to show a good fit to the data. Results from the applied example in Section 11 aligned with our expectations.
The assumptions for the marginal models have been described in detail before for the lymph nodes model 14 and for the distant metastasis model. 15 We also note that assumptions regarding the shape of the association structure (e.g. the parametric form of the copula function) seemed to have a marginal impact on the estimation of the marginal model coefficients (Figure 1). It is important to be clear that our approach is based on somewhat strong assumptions about the effects of tumour removal. An important assumption is that the seeding of distant metastases fully stops when the primary tumour is diagnosed. As discussed by Gasparini and Humphreys, 15 this will not be fully realistic as treatments administered after a diagnosis of breast cancer (such as surgery followed by chemotherapy and/or radiotherapy) are unlikely to completely stop metastatic seeding. Our modelling assumptions may however be reasonable enough, given the strong evidence supporting the idea that most metastases are seeded early. 36 Another implicit assumption of our model is that metastases do not cross-seed new metastases; cross-seeding may contribute to faster, or increase heterogeneity in, metastatic growth. 15 A natural extension of our model consists of, assuming enough data is available, explicitly modelling the efficacy of treatment, for example, by allowing differential efficacy by molecular subtype, or by allowing a fraction of seeded metastasis to survive treatment.
Gasparini and Humphreys
15
have previously discussed possible extensions to the distant metastasis model; one of these has been dealt with here by generalising and extending the distant metastasis model by including a random effect on the spread parameter
The approach introduced in this manuscript can be defined as cause-specific, as we do not explicitly model competing events; thus, we censor study subjects if and when they experience competing events such as death. Despite this, being based on biologically inspired model assumptions, a benefit of our approach is that it still allows inference on the underlying processes of tumour cell spread irrespectively of the occurrence of competing events. For model-based predictions (e.g. of distant metastases), cumulative incidence functions that take into account the competing event of death could be calculated by formulating a second cause-specific model for death and by using the approach described by, for example, Hinchliffe and Lambert. 39 This would embed our approach within a multi-state modelling framework, that is, allowing us to incorporate competing events such as death (from causes other than breast cancer). In practice, this would mean adding additional states and modelling the relevant transition probabilities. Taking solely a multi-state approach to modelling distant metastases and lymph node spread (where lymph node spread would have to be coarsely discretised) would of course be more straightforward for handling competing risks, but without our natural history modelling component the (standard) multi-state models fail to capture important components such as heterogeneity in growth rates. In future work, we aim to add to our framework a component to explicitly model death due to breast cancer, where distant metastatic spread acts as an intermediate event, and we are currently considering approaches to incorporate death due to other causes.
An additional advantage of our approach is that it can be used to assess how interventions at a population level would impact patient outcomes: we focused on early detection and the risk of distant metastasis, but other interventions can, in principle, be studied, such as personalised mammography screening intervals. Interestingly, our model provided us with a way of estimating the risk of future events while at the same time correcting for lead time. Using our approach we could clearly demonstrate how fast-growing tumours stand to benefit most from early detection.
In conclusion, we have introduced a joint, copula-based model for the latent growth of breast cancer, detection, spread to the lymph nodes, and distant metastatic spread. We have shown that this model was able to capture relevant patterns in data and have demonstrated how, at least under our assumptions, fast-growing tumours stand to gain the most from early detection. Our framework, which can be further extended to better adapt to the complexities of breast cancer progression and treatment, can be fundamentally important for studying the natural history of growth, spread, and prognosis.
Supplemental Material
sj-pdf-1-smm-10.1177_09622802221122410 - Supplemental material for A natural history and copula-based joint model for regional and distant breast cancer metastasis
Supplemental material, sj-pdf-1-smm-10.1177_09622802221122410 for A natural history and copula-based joint model for regional and distant breast cancer metastasis by Alessandro Gasparini and Keith Humphreys in Statistical Methods in Medical Research
Footnotes
Acknowledgements
We are grateful to Gabriel Isheden for a discussion on the different possibilities for modelling the dependence between the metastatic spread processes.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The authors are supported by research grants from the Swedish Research Council (2020-01302), the Swedish Cancer Society (CAN-2020-0714), and the Swedish e-Science Research Center.
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
