Abstract
Psychosocial acceleration theory (PAT) posits that experiencing harsh and unpredictable environments during childhood cues the development of earlier and more frequent reproductive events. However, this developmental association has not been tested in large human populations. Visible minorities and Indigenous people are also subjected to harsher and more unpredictable circumstances than the general population. These circumstances might not be detected in common measures used in PAT literature. In four studies, we tested whether an exploratory analytical approach using measures of harshness and unpredictability from census data of a high-income country (Canada) would be relevant predictors of measures of reproduction 15 years later, whether the proportion of Indigenous people and of visible minorities are relevant predictors, and tested methodological and statistical assumptions and limitations of this approach. Following PAT assumptions, we hypothesize that higher rates of harshness and unpredictability will be associated with higher rates of reproduction. Results were mixed in offering support or working against PAT claims. A higher percentage of children in low-income households was predictive of a higher percentage of single-parent households in Canadian census divisions. However, measures indicative of lower access to resources and unpredictable parental availability were negatively predictive of reproduction outcomes. A higher percentage of Indigenous people was also predictive of a larger family size of single parents. Findings can help inform public policies around early pregnancy and family planning.
Keywords
Introduction
Life History Theory
Life history theory proposes that the optimal allocation of resources to different functions varies among species and across diverse environmental contexts (Del Giudice et al., 2015; Stearns, 1992). Resources such as energy and time are limited, and an organism has to allocate these resources across many functions. An organism needs energy to maintain its body functioning or to grow its body, depending on its time of development. It also needs to seek food and mates; build or acquire resources (e.g., find or defend territory, a nest or mound, tools); heal from wounds or fend off pathogens; reproduce; and, for many species, invest some resources in offspring.
Depending on the environment and niche, different patterns of investments are more adaptive (Ellis et al., 2009). For example, for a species under considerable predation, investing in acquiring resources is unlikely to pay off. It may be more adaptive to invest in reaching sexual maturity earlier and reproducing faster, reducing the chances of being preyed upon before passing on its genes. Differences in the optimal level of investment for each body function at each time of development lead to the development of different life history strategies (LHS). Life history theory had its origins in biology (LHT-B), and it started by describing differences between species (Del Giudice, 2009; Stearns, 1992), but it has also been used to describe differences within species (Albaladejo-Robles et al., 2023; Stearns et al., 2008; Stone et al., 2023), including humans (Del Giudice & Belsky, 2010; Dinh et al., 2022), which started the related field of life history theory in evolutionary psychology (LHT-P; Nettle & Frankenhuis, 2020).
A considerable amount of research in LHT-P has been focused on assessing how environments high in harshness—understood as extrinsic mortality—and unpredictability—understood as random variation of harshness—can influence humans’ LHS (Ellis et al., 2009; Simpson et al., 2012; Webster et al., 2014; Xu et al., 2018). The different LHS have been assumed to lie on a continuum from slow to fast. People on the faster end of this LHS continuum would favor investments in earlier and frequent reproduction, whereas people on the slower end would delay investments in reproduction to favor investments in growth. A faster LHS would be more adaptive in harsh and unpredictable environments because an individual would be less certain of their future chances of reproduction.
Density levels are also hypothesized to have an effect on LHS (Stearns, 1992). Low-density environments would favor a faster LHS because those adopting this strategy would have higher chances of occupying the area and benefit from the available resources. On the other hand, higher density environments would favor a slower LHS because offspring would have to compete with others to acquire resources (Del Giudice et al., 2015; Ellis et al., 2009; Sear, 2020). The effect of density on LHS was also hypothesized to be dependent on the levels of harshness and unpredictability. For example, low-density would particularly increase the fitness of those adopting a faster LHS in relatively stable and resourceful environments. LHT-P research, however, has mostly ignored the effect of density on LHS (Stearns & Rodrigues, 2020; Volk, 2025).
Within LHT-P, Belsky et al. (1991) have proposed the psychosocial acceleration theory (PAT). PAT posits that cues of harshness and unpredictability early in childhood can trigger a series of developmental milestones, including earlier puberty (Webster et al., 2014; Xu et al., 2018) and earlier and more frequent reproductive events (Dinh et al., 2022; Wilson & Daly, 1997). Parental absence—usually the father—has been argued to be the best predictor of this developmental shift (Ellis et al., 2003; Hartman et al., 2018; Webster et al., 2014), and the first 7 years of life would be the critical time of development to shape one's LHS (Ellis et al., 2003; Simpson et al., 2012; Webster et al., 2014; Xu et al., 2018).
LHT-P has been under criticism. One such criticism is that it has considerably departed from LHT-B, which uses more formal modeling and more specific predictions (Nettle & Frankenhuis, 2020). The fast–slow continuum of LHS has been abandoned in LHT-B. Research in the field of biology has focused on how specific environments would favor different life history traits and not a suite of traits as it has been studied in psychology (Nettle & Frankenhuis, 2020).
The existence of LHS—understood as a suite of correlated behaviors—and the utility of LHS to explain behaviors (Sear, 2020; Stearns & Rodrigues, 2020) or the validity of PAT assumptions has been questioned (Volk, 2023, 2025). In a meta-analysis, Webster et al. (2014) noted studies with bigger sample sizes found a small association between father absence and menarche. Some studies have also failed to find results that support LHT-P and PAT assumptions (e.g., Nolin & Ziker, 2016; Richardson et al., 2020, 2024; Wells et al., 2019).
The definitions or measures of harshness and unpredictability have also been criticized. For example, unpredictability lacks a clear statistical definition (Young et al., 2020), such as whether it means a sudden change point, high variance of harshness, or high autocorrelation with harshness. Would only unpredictability that increases harshness lead to the development of faster LHS, or would unpredictability that decreases the mean level of harshness also lead to a faster LHS? The definition of harshness is the level of extrinsic mortality in the population, but the connection of this definition and the usual measures of harshness used in LHT-P and PAT literature (e.g., socioeconomic status) are still not clear, especially in modern environments or high-income countries (Stearns & Rodrigues, 2020; Volk, 2023, 2025). Many have argued that LHT-P and PAT should go into considerable revision, and in doing so, researchers should aim to harmonize the theories in psychology with those in evolutionary biology (Nettle & Frankenhuis, 2020; Sear, 2020; Stearns & Rodrigues, 2020; Volk, 2025).
In several studies, PAT research has used socioeconomic indicators as a measure of resource access and therefore a measure of harshness (Copping & Campbell, 2015; Hartman et al., 2018; Simpson et al., 2012) and measures of parental transitions (e.g., household configuration change and employment change) and geographical moves as measures of unpredictability (Belsky et al., 2012; Young et al., 2020). Earlier or faster reproduction has been measured as time of menarche (because menarch is a direct and usually memorable puberty marker,Xu et al., 2018) and as first time having sex or first time having children (Ellis et al., 2003; Webster et al., 2014).
Considering the developmental aspect of PAT (i.e., exposure to certain environments in the first 7 years of life shaping a series of reproductive milestones), a longitudinal design would be ideal to test such a hypothesis. The measures that are often used in PAT (i.e., socioeconomic status, employment, geographic moves, household configuration, and fertility) are similar to measures usually present in censuses and other governmental reports (Statistics Canada, 2024). Censuses and other governmental reports are also often publicly available, which makes them an invaluable asset for research (Johnston, 2017; Trzesniewski et al., 2011). Leveraging these data is particularly true for research exploring PAT claims (Copping, 2017) because the periodicity of this governmental data fits well with PAT developmental description. Studies that use large sample sizes or large populations, and more importantly, studies that offer cross-cultural findings are also especially valuable for evolutionary psychology because they are supportive that such findings are reflective of an adaptation or of an evolved mechanism instead of some other phenomenon (Buss, 2024).
The use of census data, however, has not been well explored in PAT research. Most PAT research has either used convenience samples (Sear, 2020) or longitudinal surveys from a few samples (Kelly et al., 2017; Magnus et al., 2018; Young et al., 2020). To the best of our knowledge, research testing LHT-P hypotheses and using census or entire population data has only been conducted in England and Wales (Copping et al., 2013; Copping & Campbell, 2015).
Indigenous People and Visible Minorities in Canada
First, we would like to acknowledge that the use of
Indigenous people are more likely to be subject to harsh circumstances compared to non-Indigenous people in Canada (Honouring the Truth, Reconciling for the Future, 2015). Colonial history, confinement of its culture and ways of living to “reservations,” and structural inequities (Neu & Graham, 2006; Romaniuk, 2008) all cause Indigenous people to experience harsher environments, on average. According to LHT-P rationale, these circumstances may be part of the cause for Indigenous people to be younger (Statistics Canada, 2023), faster growing (Statistics Canada, 2017), and to have disproportionately high rates of teenage pregnancy (Reading & Wien, 2009) than non-Indigenous people.
Visible minorities are also socially disadvantaged in Canada.
Current Study
This study uses an exploratory analytical approach to assess five research questions. Our first research question asks whether indicators of harshness and unpredictability, as informed by PAT, present in the census successfully predict reproduction indicators 15 years later. This research question is intended to test if the general claim of PAT (i.e., early harshness and unpredictability shapes future reproduction) is also present in geographical data (i.e., describing entire populations).
The second research question is intended to inform us about the likelihood that population mobility across time is affecting the results. We are interested in knowing whether a model built using either a smaller or larger geography level will better predict reproduction 15 years later. Dissemination areas are the smallest geography level reported by the Canadian census, and it is an area that comprehends an average of 400 to 700 people, whereas census divisions are deemed the most stable geography level (Dictionary, Census of Population, 2021, 2023) that is usually composed of neighboring municipalities. Please see the description of these geography levels in the Method section.
The third and fourth research questions test the developmental aspect of PAT. The third research question asks which model will better predict reproduction: a model using data in the correct timeline (i.e., harshness and unpredictability predicting reproduction 15 years later) or a model using data in an inverted timeline (i.e., harshness and unpredictability predicting reproduction 15 years prior). This research question is intended to test if results are likely to be describing a developmental and longitudinal phenomenon or just some statistical association across time. The fourth research question tests whether the model will perform better in geographies with higher proportions of children. This is also intended to test the developmental assumption of PAT but taking into consideration the critical period (first 7 years of life) of exposure to harshness and unpredictability to shape future reproduction.
Our final research question asks whether the proportions of Indigenous people and of visible minorities will be relevant predictors of reproduction. Indigenous people and visible minorities are exposed to particularly harsh and unpredictable circumstances that may not be usually captured in the measures commonly used in PAT research. We ask then if the proportions of these two populations will be relevant predictors of reproduction even when indicators of harshness and unpredictability are included in the model.
Following PAT rationale, we hypothesize that the model will be successful at explaining a good amount of variance of reproduction indicators in Canada. However, we are not sure whether the model built with smaller geography or larger geography data will perform better. Data from smaller geographies will yield higher statistical power and more variance because of their smaller convergence to mean values. Larger geographies will be more stable and less susceptible to noise due to migration.
We hypothesize that harsher and more unpredictable measures will be predictive of higher reproduction rates 15 years later, but that such an association will not hold in a model with an inverted timeline. We finally hypothesize that the model using data from geographies with a higher proportion of children and with a higher proportion of Indigenous people and visible minorities will predict reproduction better than the model using data from geographies with a smaller proportion of these groups.
The studies in this manuscript will use geographic-level data (e.g., the rates or averages of people in a municipality that fall under a certain criterion) to assess the common claims and assumptions of PAT, a theory that aims to explain individual-level phenomena. The findings in this manuscript must be, therefore, interpreted with caution. For example, the discussion of the findings here relies on the assumption that most of the population of a given geography remained in that same geography across time. This assumption is not necessarily met. We discuss and assess the likelihood that this assumption is violated in the article.
Another issue is that associations that we find at the geographic level may not be present at the individual level. Confounding variables or associations that exist at the geographic level but that are actually describing two different populations limit the inferences that can be drawn from our findings. Thus, our focus will be on the prediction and detection of patterns, rather than on causal or individual-level inferences.
Methods
Data Selection and Transformation
We accessed census data through the Canadian Census Analyser (Statistics Canada, 2024) available through the university library. Data were extracted from 52,973 dissemination areas (DA) and 288 census divisions (CD) in the 2006 Census and 57,936 DA and 293 CD in the 2021 Census. A dissemination area is a “small, relatively stable geographic unit composed of one or more adjacent dissemination blocks with an average population of 400 to 700 persons” (Dictionary, Census of Population, 2021, 2023, p. 86), and they cover all Canadian territory, whereas CD are larger geographies composed of groups “of neighbouring municipalities joined together […]” and “are the most stable administrative geographic areas” (Dictionary, Census of Population, 2021, 2023, p. 68) next to provinces or territories. After merging the 2006 and the 2021 data frames, 48,867 DA and 286 CD were left. This loss in the number of cases is due to geographical redefinition or recoding in this period.
We extracted 120 variables that we considered of any relevance to the research question from the 2006 Census and 235 variables from the 2021 Census. These variables comprised information about age and sex, family and dwelling characteristics, income, immigration, labor, education, and Indigenous and visible minorities. These variables were assessed and thematically grouped by the first author to create the factors used in this analysis. The variables were selected based on their relevance to the research questions. The first iteration of the model used 40 variables. Table 1 describes the variables and the factors used in the model, and
Variables Fed Into the First Models in Study 1.
CD=census division; DA= dissemination area; LHT= Life history theory.
We assessed NAs next with a 5% cutoff established (i.e., if more than 5% of the values were NAs, the variable would be excluded), but no variables met such cutoff. Most of the variables were right skewed (i.e., with distribution close to 0). Aiming for a distribution closer to a normal distribution, all variables were square-root and log transformed and visually assessed with boxplots. The first author visually inspected the boxplots for each variable (original, square root-, and log-transformed) and selected the data transformation that resulted in a distribution closer to a normal distribution. This decision was only made when variables had a notable difference in distribution (i.e., the median was closer to the center of the quartiles, whiskers of relatively equal lengths, and fewer outliers). Whenever differences were not notable, the order of preference was (1) variable with no transformation; (2) variable with square root transformation; and (3) variable with log transformation (Supplemental Table S1).
In the transformation process, we identified that four variables had more than 75% of zeros (median male lone parent income, percentage of male lone parent income coming from other sources, prevalence of low income, and people speaking French and a nonofficial language). These variables were removed from the model because they could distort the relationship between variables. Finally, we defined outliers in this data frame as cases with a z-score above the absolute value of 3 on any variable. Outlier cases were also removed. The data frame used for the model had 38 variables and 39,481 cases in the DA sample and 240 cases in the CD sample (see Table 1).
Partial Least Squares Structural Equation Modeling
Partial least squares structural equation modeling (PLS-SEM) is an exploratory and predictive analytical approach focused on explaining the variance in the dependent variables (Hair et al., 2021). It does so by combining a measurement model (factor analysis) and a structural model (path analysis), and it relies on several statistics for the evaluation of the model's quality (Hair et al., 2022). PLS-SEM is a nonparametric analysis, and it is robust with formative factors (Hair et al., 2021). Formative factors are a group of items that are understood to be forming the factor. PLS-SEM also allows for single-item “factors,” which is a good advantage when working with secondary data. These make PLS-SEM well suited for this study.
Large samples generate lower
Model
All the models in this article had predictors set as formative factors and outcomes set as reflective factors. The biggest change between these two is whether the items that are making that factor are understood to form or to reflect their factor. The items are understood to describe the factors in a formative factor (e.g., one of our predictors,
Following Hair et al.’s (2019, 2021) guidelines, we assessed the reflective factors loadings (>.7), indicators reliability (loading2 > .05), internal consistency (α, ρC, ρA > .7) and reliability (AVE > .5), and discriminant validity (HTMT < .9 and Fornell-Larcker criterion, in which the constructs correlations should be lower than the square root of the AVE). Formative factors were assessed with collinearity (VIF < 5), and weights and loadings for significance and relevance of indicators. Convergent validity analysis was not possible because there would not be an alternative measure, nor would it be possible to resample participants who responded to the census. Finally, we assessed collinearity (VIF < 5), relevance and significance of paths (bootstrapped

Decision-making process in the PLS-SEM analysis.
Study 1: Are Dissemination Areas or Census Divisions the Best Geographical Level for Analysis?
In the first study, we aimed to build one model using DA and to build a second one using CD data. Our goal was to offer insight into the hypothesis of population mobility. We hypothesized that measures akin to the common measures of harshness and unpredictability used in research would predict measures of reproduction present in both datasets. We established an adjusted
Method
The method for this study followed the steps described in the methods section above. The data were transformed for both DA and CD because they were organized in two different data frames. See Table 1 for a list and description of the variables and factors used in the first iteration of the model.
Results
Figure 2 illustrates the models built with DA and Figure 3 illustrates the model built with CD data. Rectangles represent the variables extracted from the census, and hexagons represent the factors the variables were loaded into. The arrows represent either the paths between factors or between variables and factors. Arrows pointing from variables to the factors represent formative factors, and arrows pointing from the factors to variables represent reflective factors. In the case of single-item factors, the arrows convey no meaning other than indicating that the factor is composed of that single variable. An arrow's width represents the path's strength, and dashed lines indicate a negative association. All the variables present in the models on both DA and CD samples were above the criteria of model quality. In the reflective model assessment, loadings were

Proportion of young children, rates of visible minorities in the population, and socioeconomic factors predict reproduction in dissemination areas.
Formative Latent Variables Assessment of Dissemination Areas Model in Study 1.
Both models were not collinear and had statistically significant (
Structural Model Assessment in Study 1.
A few distinctions are worth noting. On the CD sample, only the variables of young children aged 0 to 4 years (
On the CD sample, the effect sizes of the variables predicting
Discussion
The model on CD was the most parsimonious model (i.e., using the smallest number of predictive variables) and was able to predict more variables with both higher explanatory power and effect sizes. This suggests that using CD is the most reliable geographic level to predict
It is worth noting that the variable
Naive linear models had fewer errors than the PLS-SEM in predicting the outcome variables. This has been a consistent observation in the multiple model iterations in this and in past studies, and it probably indicates that the factors we are using to describe the measures are not in fact functioning as factors. This would be an expected or common result when dealing with secondary data because the measures were not designed to be grouped as factors. On the other hand, both the naive linear model and the PLS model incurred considerably low errors. The RMSE varied between 2.06% and 15.4% of the mean value and between 41.9% and 71.9% of the standard deviation of the variables in
A smaller issue regards the discriminant validity between the variables (HTMT). Indeed, the measure used in
Study 2: Is It a Developmental Phenomenon or Just Statistical Artifacts?
In this study we were interested in testing the hypothesis that the results found were a mere correlational stability of our variables across time. Whether the findings of study 1 likely describe a developmental association or merely statistical artifacts. To achieve that, we reversed the years of predictor and outcome variables. We set the variables of harshness and unpredictability in 2021 to predict reproduction variables in 2006. Since our primary hypothesis is that harsh and unpredictable environments would predict more frequent reproduction in Canadian geographies, we expected that the hypothesis of mere correlational stability would not be supported. Therefore, we hypothesized that the longitudinal model in study 1 will have a better performance and a higher explanatory power than the model with the reversed timeline.
Method
We used the CD data frame because it had higher predictive power in study 1. The model in this study started with the first iteration of the CD model in the previous study. However, the predictor variables were selected from the 2021 Census, and the outcome variables were selected from the 2006 Census. Similar to the previous study, we established a threshold of a difference of
Results
Only two single-item predictors and two outcome variables remained in the reversed timeline model.

Proportion of young children, rates of indigeneity in the population, and socioeconomic factors predict reproduction in census divisions.

Later harshness and unpredictability are poor predictors of previous measures of early reproduction.
The reflective model assessment of
Figure 4 reports the structural model in study 2. The model assessment indicated no collinearity between the predictors of
Both out-of-sample metrics of predictive values were lower on the linear model than on the partial least squares model. RMSE LM:
One predictor that was excluded from the model is worth noting. The percentage of children aged 0 to 4 years was removed because it was highly collinear with other predictors (VIF = 21.0). This finding was dissimilar to the models in study 1. Several of the predictors were removed because they were not relevant (
Discussion
The reversed model had only two manifest variables predicting two factors:
The outcome
Study 3: Is There a Sensitive Period to Experience Harsh and Unpredictable Environments?
Here we are interested in further testing the developmental hypothesis that children who experience harshness and unpredictability are likely to have reproduced 15 years later instead of the alternative hypotheses that the model in study 1 is merely describing a geographical correlation or a statistical artifact. PAT literature points to the first 5 or 7 years of life as the most sensitive period in which exposure to harsh and unpredictable environments can shape reproductive patterns (Ellis et al., 2003; Simpson et al., 2012; Webster et al., 2014; Xu et al., 2018). We hypothesized that models using quantile subsamples of Canadian CD with the highest percentage of children will have better performance at predicting reproductive patterns than CD with a lower percentage of children.
Method
We subsampled the data into quantiles: (1) highest percentage of children aged 0 to 4 years; (2) lowest percentage of children aged 0 to 4 years; (3) highest percentage of children aged 5 to 9 years; (4) lowest percentage of children aged 5 to 9 years. The inclusion of such year gaps was intended to cover the 0- to 7-year range in which children would be most sensitive to their environment to adjust their LHS. This choice was also motivated by the fact that fertility has been globally declining (Roser, 2014) and the average age of the parents at the birth of the child has been increasing in Canada (Provencher & Galbraith, 2024). An older year range for children would, therefore, allow for a more probable association to adults at reproductive age 15 years later.
The CD sample was a subsample of tertiles of the data, given that it is composed of only 240 cases, while quartiles were used in the DA sample because it is a much bigger data frame. The final subsamples were composed of 80 cases for the CD tertiles and 9871 for the DA quartiles. We repeated the models reached in study 1 with the CD and DA subsamples. For brevity and simplicity, only the results observed with the CD sample—the model with better performance in study 1—will be reported in this manuscript. The full report can be found in
Results
Table 4 reports the collinearity values, coefficients, effect sizes, and explanatory power of CD tertiles with the highest and lowest percentage of children aged 0 to 4 years and Table 5 reports the same metrics with the highest and lowest percentage of children aged 5 to 9 years. Considering our established criteria of a
Structural Model Assessment of the Tertiles of Children Aged 0 to 4 Years in Study 3.
Structural Model Assessment of the Tertiles of Children Aged 5 to 9 Years in Study 3.
Interestingly, the percentage of Indigenous people was a statistically significant predictor and had a higher and positive coefficient in the tertile with a higher percentage of children in comparison to the tertile with the lowest percentage. This may indicate an interaction between these variables, but testing this hypothesis was beyond the scope of this study. Counterintuitively,
Discussion
Overall, PAT claims were supported when the model was predicting
Study 4: Do Indigenous People and Visible Minorities Face Different Circumstances?
Next, we tested whether the percentage of
Method
The method in this study followed the same procedure as the methods in study 3. However, in this study the subsamples using CD data were selected by the tertiles with the highest and lowest percentage of visible minorities, and the subsamples using DA data were selected by the quartiles with the highest and lowest percentage of Indigenous people. This choice aimed at avoiding subsampling a dataset using a variable that was already a significant predictor in that model. In other words, since
Results
Table 6 reports the structural model measurements of CD tertiles. The models performed mostly similarly based on our criteria for statistical significance (
Structural Model Assessment of the Tertiles of Visible Minorities in Study 4.
Discussion
Contrary to our prediction, selecting geographies with the highest and lowest percentages of Indigenous people and visible minorities did not affect the models’ performances. The percentage of
It is interesting and counterintuitive, though, that
Another explanation could relate to migration. DA are small geographic areas and susceptible to high migration. Even though CD are the most stable geographic unit (Dictionary, Census of Population, 2021, 2023), they are still susceptible to migration in a time span of 15 years. Canada has also observed high immigration from other countries in recent years (Statistics Canada, 2022), so it is likely that a considerable proportion of people answering the census in 2021 were not even in the country in 2006.
A final explanation could be related to the skewness of the data. As with many variables used in this manuscript, the percentages of
General Discussion
The results point to the viability of using census data, a longitudinal design, and an exploratory multivariate analytical approach to test common PAT assumptions. The models built with data from both geographic divisions could explain remarkable variance of reproduction patterns. Namely, the CD, which is the bigger and more stable geographic division (Dictionary, Census of Population, 2021, 2023), was able to explain 81% of the variance of indicators of larger family sizes (named here
These point to both geographic divisions being predictive of reproductive patterns in the Canadian population. The DA geographic division allows for more granular, although noisier, prediction, and the CD division allows for greater population and more precise prediction. For the aims in this article, the CD was the best-performing one and is the focus of this study. In general, these findings support that the use of such data and methodologies can effectively project future reproduction trends among Canadians, which can be a useful tool for policy development and various governmental initiatives.
The effect sizes that were found are high in comparison to what is typically reported in psychology studies. Studies in psychology are usually able to explain around 40% of variance (i.e.,
These results could be due to (1) the statistical power that census data offer; (2) the relatively exploratory analytical approach that we used; and (3) the PLS-SEM algorithm to calculate such effects. The census is the best description of a population. Even though we used percentages or mean values—which inherently reduces some of the variance—of Canadian CD or DA, we still analyzed data collected from millions of people. This statistical power could partly explain the remarkably high effects that have been found.
The first motivator to select variables was the variables present on the Canadian census (Statistics Canada, 2024) that were similar to usual measures of harshness, unpredictability, and reproduction in PAT research. After that, variables were removed from the model or regrouped in different factors for statistical reasons. This relatively exploratory analytical approach could also result in inflated findings. Finally, PLS-SEM algorithm has been criticized for how it calculates statistical significance and for inflating some of its reported values (Rönkkö et al., 2015). This was part of the reason why we adopted criteria that were stricter than what is conventional in research in psychology (e.g.,
The findings also mostly support the longitudinal and developmental aspects of PAT (Ellis et al., 2003; Simpson et al., 2012; Webster et al., 2014; Xu et al., 2018). In study 1, both models had the percentage of children (aged 0–4 in the CD sample and 0–9 in the DA sample) as the highest predictor of reproductive patterns 15 years later. The percentage of children under 6 years old in low-income families was also a significant predictor of the family size of one-parent families in the CD model. When we reversed the timeline between predictor and outcomes, a lot of the variables emerged as nonsignificant predictors, consistent with the idea that this is a developmental phenomenon. Several of the variables were not significant or not relevant in the reversed timeline model, which could mean that their prediction of reproduction was noisier (i.e., resulting in greater error). Reversing the timeline did not result in a prediction as accurate as the correct timeline, which, again, supports the argument of a developmental phenomenon instead of the hypothesis of geographical association.
Subsampling quantiles of the percentage of children did not result in consistently greater performance of the quantiles with a higher percentage of children. The explanatory power was greater in predicting
The PAT claim that environmental harshness and unpredictability (Ellis et al., 2009; Stearns, 1992) shape reproductive patterns was not strongly supported. The significant indicators in the CD sample—
There are, of course, explanations alternative to PAT that can help explain this phenomenon. Fertility has been dropping, and the time of first pregnancy is also being delayed worldwide (Roser, 2014), including in Canada (Provencher & Galbraith, 2024). These are thought to be the result of women having more access to education, healthcare, and employment (Behrman & Gonalons-Pons, 2020; Olowolafe et al., 2025) and of the reduction in child mortality (Roser, 2014). While these may be reflective of a more standardized, stable, and less harsh environment, there are other explanations, such as lack of institutional support for effective reproductive plan and control (South & Crowder, 2010; Wodtke, 2013), local cultures and values (Wilson, 1987; Wodtke, 2013), and local contagion (i.e., observing others having children or having children at a younger age could cue others in the community to also have children; South & Crowder, 2010).
Even within the scope of LHT-P, there are alternative explanations not considered in these analyses. The influence of genes, for instance, was not tested. Gene-environment interactions are the basis for evolutionary research (Stearns, 1992), and genetics should be playing a role in the phenomenon explored in this study. Genes influence the time of puberty (Del Giudice et al., 2015), and the same genes that select for or shape certain environments can also influence reproductive behavior (Belsky et al., 1991; Volk, 2025). In any case, these alternative explanations seem to agree that fertility is higher in places with harsher, resource-lacking conditions. Therefore, the results pointing to geographies with these harsher environments experiencing less
The percentage of
The long history of colonization, structural inequities (Goghari & Kassan, 2022) and the confinement of Indigenous people in “reservations” (Neu & Graham, 2006; Romaniuk, 2008) can create specific harsh and unpredictable environments that are not usually measured in PAT literature. Indigenous people and visible minorities suffer more marginalization and discrimination (Prather et al., 2016), experience or live in communities with higher levels of violence and crime (Griskevicius et al., 2011; Williams et al., 2022; M. Wilson & Daly, 1997), experience cultural differences from non-Indigenous or Caucasian ethnicities (Trovato & Burch, 1980), or experience differences in access to institutional support such as health care, education, or child care (Roser, 2014; Wilson, 1987; Wodtke, 2013). These are factors historically neglected by Western society that could be influencing the reproductive behavior of both Indigenous people and visible minorities in Canada. Future studies could explore how LHT-P can interact with the environments of these populations.
Caveats
There are many considerations and limitations to this study. The first is that this study uses geography-level data; therefore, conclusions about individuals are remarkably limited. Considering PAT and other sources, we draw some conclusions about individual circumstances and behaviors, but it is possible that associations at a population level do not exist at the level of the individual. For example, Canada has been experiencing a considerable amount of immigration (Statistics Canada, 2022), which is commonly associated with employment or education. Immigrants in Canada also have more children than their Canadian-born counterparts (Bélanger et al., 2006). Therefore, it may be likely that the geographies accepting more immigrants are geographies that tend to have lower unemployment rates and more children, but these would not be the same individuals or the same households.
The conclusions drawn here rely on an assumption that most of the population of a given geography will remain on the same geography 15 years later. This is not a guarantee, especially in a country with a high level of immigration (Statistics Canada, 2022) or particularly in smaller geographic units such as DA. Research conducted with a cohort of more than 800,000 people from the Canadian Community Health Survey identified that 54% of them moved within the past 10 years. However, around 37% of them are likely to have moved within the same CD (Mah et al., 2025), resulting in around 34% of the population moving to a different CD in a 10-year period. Arguing that the results from this survey apply to the entire country population and to the findings in this manuscript is a considerable extrapolation, though.
Another limitation of this study is its exploratory analytical approach. The choice of variables in this study was based on a largely used theory (PAT) and some of the tests and comparisons conducted were aimed at assessing the likelihood of statistical artifacts or alternative explanations. However, we started the first iterations of the models with 40 variables and selected or removed variables for statistical reasons. In addition, the analyses were all conducted with the same data from two time points (2006 and 2021). Future research should aim at confirming these findings to allow for more generalizable, reliable or even causal conclusions.
Finally, as mentioned earlier, LHT-P and PAT have been going on through serious criticisms (Nettle & Frankenhuis, 2020; Sear, 2020; Volk, 2025). These criticisms argue for the need of reconsideration of core aspects of the theories and toward a re-approximation to its origin in biology (Frankenhuis & Nettle, 2020, 2020; Stearns & Rodrigues, 2020; Volk, 2025). This significantly limits the inferences and conclusions that can be drawn from this study, but it also highlights its importance. More research using data descriptive of entire populations and aiming at specific outcomes can help LHT-P and PAT to refine its assumptions and generate more accurate and formal predictions.
Conclusion
Using Canadian census data, we showed that indicators of harshness and of the proportion of children and visible minorities can predict frequent reproduction and family sizes 15 years later. We also showed that using data from CD results in a more accurate model than data from DA. The proportion Indigenous people and Visible minorities in Canada are significant and relevant predictors of such reproductive outcomes, highlighting the importance of understanding their structural and cultural characteristics to better understand the ecological and social factors shaping reproductive strategies and outcomes. These findings help inform future research that can make use of more confirmatory approaches to confirm these results (Hair et al., 2022) and to make use of more formal modeling (Frankenhuis & Nettle, 2020; Nettle & Frankenhuis, 2020). These results and future results can also inform an array of public policies, especially those aiming at dealing with early pregnancy and family planning of Indigenous peoples and visible minorities.
Supplemental Material
sj-docx-1-evp-10.1177_14747049261432881 - Supplemental material for Childhood Demographics and Socioeconomic Conditions Predict Reproduction 15 Years Later
Supplemental material, sj-docx-1-evp-10.1177_14747049261432881 for Childhood Demographics and Socioeconomic Conditions Predict Reproduction 15 Years Later by Vinícius Betzel Koehler and M.D. Rutherford in Evolutionary Psychology
Supplemental Material
sj-zip-2-evp-10.1177_14747049261432881 - Supplemental material for Childhood Demographics and Socioeconomic Conditions Predict Reproduction 15 Years Later
Supplemental material, sj-zip-2-evp-10.1177_14747049261432881 for Childhood Demographics and Socioeconomic Conditions Predict Reproduction 15 Years Later by Vinícius Betzel Koehler and M.D. Rutherford in Evolutionary Psychology
Footnotes
Ethical Consideration
This study is exempt from Ethical approval from ethics review board because it only utilizes publicly available and anonymous secondary data.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Natural Sciences and Engineering Research Council of Canada (grant number RGPIN-2020-06761).
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data Availability Statement
Supplemental Material
Supplemental material for this article is available online.
