Abstract
Psychosocial acceleration theory (PAT) posits that harsh and unpredictable ecologies during childhood can cue humans into developing earlier and more frequent reproduction. This study tested whether variables measuring harsh and unpredictable circumstances in 4,135 Brazilian municipalities and in 2,763 US counties would predict reproductive behavior 10 to 14 years later. Data was extracted from the Brazilian Census and American Community Survey samples. A secondary analysis explored whether the percentage of visible minorities (Black and Indigenous population) would also be a predictor or mediator of the same outcomes. Partial least squares structural equation modeling and multivariate linear regression were used in the analysis of Brazil and US data, respectively. Municipalities with higher rates of lack of resources, with young mothers both married or separated, and with large families with many residents per room were predictive of higher rates of teenage and young adult mothers and of young children in Brazil. Harshness predicted the percentage of young children in US counties, but the direction of this association was mixed. Some findings were contrary to PAT predictions. Divorce rates were negative predictors of reproduction in both countries. Education and employment indicators were not significant predictors of reproduction in Brazil. Higher rates of perceived minorities were not a relevant predictor in Brazil, and they were a negative predictor of the percentage of children in the United States. Findings suggest that harsh ecologies and the proportion of children in the population impact patterns of reproduction a decade later.
Introduction
Life history theory in psychology (LHT-P; Frankenhuis & Nettle, 2020; Nettle & Frankenhuis, 2020; Sear, 2020) describes factors that inform the allocation of resources between reproduction versus body growth and maintenance (Ellis et al., 2009; Stearns, 1992; Xu et al., 2018). LHT-P originated from life history theory in evolutionary biology (LHT-E), which started describing variation between species (Del Giudice, 2009). However, the theory also accounts for within-species variation (Albaladejo-Robles et al., 2023; Malone et al., 2022; Stone et al., 2023), including humans (Dinh et al., 2022; Richardson et al., 2020; Stearns et al., 2008).
Depending on their pattern of investments in reproduction or body growth and maintenance, species or individuals can be classified as lying on a continuum from fast to slow life history strategies (LHSs; Copping & Campbell, 2015; Sear, 2020; Stearns & Rodrigues, 2020). Species or individuals on the fast LHS end favor investments in reproduction earlier in life and focus on offspring quantity, whereas those on the slow LHS end would prioritize body growth and maintenance and offspring “quality” (Ellis et al., 2009). There are critics about this continuum both in LHT-E and LHT-P, which we discuss below, but this description keeps being used in LHT-P research (e.g., Chang et al., 2019; Wang et al., 2022; Zhu & Chang, 2020).
Among LHT-P, a great deal of attention has been given to psychosocial acceleration theory (PAT), which posits that early environmental adversity can alter the onset of puberty (most studies measure time of menarche) and other developmental milestones such as sexual debut and the age of having children (Belsky et al., 1991; Ellis et al., 2003). In PAT research, harsh and unpredictable ecologies have been associated with fast LHSs (Copping & Campbell, 2015; Griskevicius et al., 2011; Webster et al., 2014). If resources are scarce or unpredictable or if there are high levels of predation, reproducing fast and creating numerous offspring may be advantageous because it takes the opportunity for passing on genes and spreading them in the environment while conditions allow. In PAT, harshness is defined as the rates of extrinsic morbidity and mortality in the environment, whereas unpredictability is defined as stochastic variation of such rates (Ellis et al., 2009).
In practice, PAT research has been using socioeconomic measures as a proxy for harsh conditions, whereas unpredictability has been typically measured with parental absence or parental transitions (i.e., changes in the presence of parental figures due to separation, divorce, or remarriage), parental employment change, and geographical moves (Belsky et al., 2012; Ellis et al., 2003; Hartman et al., 2018; Xu et al., 2018). Faster and earlier reproduction is typically measured as time of menarche (Webster et al., 2014), time of sexual debut, and age at first birth or age at marriage (Copping & Campbell, 2015; Ellis et al., 2003; Xu et al., 2018). Among the usual unpredictability measures, parental absence seems to be the best predictor of earlier reproduction, and the first 5 or 7 years of life seem to be a critical period for this cue (Ellis et al., 2003; Simpson et al., 2012; Webster et al., 2014; Xu et al., 2018).
Recently, the assumptions and even the findings of LHT-P and of PAT have been criticized. These critics include those questioning the existence of a single fast-to-slow LHS continuum among humans (Nettle & Frankenhuis, 2020) and its utility in explaining variation in adopted traits (Stearns & Rodrigues, 2020). In fact, the notion of coherent LHSs encompassing broad suites of traits has been challenged as conceptually inconsistent with LHT-E (Sear, 2020; Stearns & Rodrigues, 2020). Although LHT-E focuses on life history traits (e.g., growth, reproduction) and on the trade-offs that shape their timing and covariation, LHT-P tends to focus on the psychological and behavioral strategies by which these traits arise. This continuum has been mainly abandoned in LHT-E (Nettle & Frankenhuis, 2020), where researchers have been using formal mathematical models to make predictions that are typically focused on a limited number of traits (Frankenhuis & Nettle, 2020; Nettle & Frankenhuis, 2020). These differences have resulted in the suggestion that LHT-P should replace the notion of single “strategies” lying on a continuum by the analyses of specific trade-offs and of the environmental cues that influence such trade-offs.
Other critics hypothesize that mortality, morbidity, and resource scarcity may no longer be present in modernized environments as much as they would be present in the environment of evolutionary adaptedness (Belsky et al., 1991; Buss, 2024). As a result, these lower levels of harshness and unpredictability would not be sufficient to cue humans into a faster LHS adoption (Nolin & Ziker, 2016; Volk, 2023, 2025). This is even more likely to be the case in developed countries, where most of the studies in the field are conducted (Sear, 2020; Volk, 2023, 2025; Webster et al., 2014; Xu et al., 2018). Indeed, studies have failed to find the expected results derived from LHT-P and PAT (e.g., Nolin & Ziker, 2016; Richardson et al., 2020, 2024; Wells et al., 2019).
There are hypotheses to explain this phenomenon that are related to PAT but use a different mechanism of explanation. One of these alternative hypotheses is that genes could mediate or confound the association between harsh and unpredictable environments and earlier reproduction (Del Giudice et al., 2015; Volk, 2025). Shared genetic factors between parents and offspring could also result in a cyclical component. Low parental investment would result in harsher and more unpredictable environments early in childhood, which could be associated—due to genes, environment, or both—with faster and earlier reproduction and again lower investment in offspring. Other hypotheses focus on cultural differences or differences in access to institutional support (Wilson, 1987; Wodtke, 2013). For example, places with reduced access to high-quality education or health care could result in riskier reproductive behavior or more unplanned reproduction. These hypotheses, however, would not explain the studies that did not find the association expected in PAT, as they mostly argue that environments with scarcer resources would be associated with earlier and more frequent reproduction.
A longitudinal design would be ideal to test hypotheses between environmental conditions and a life trajectory of prioritizing investments in either reproduction or growth and maintenance, particularly considering the developmental aspect of a critical time of exposure (0–7 years of age) to the environmental predictors and the later appearance of the outcomes. When considering the measures that are typically used (e.g., socioeconomic status, family structure, and marital relationships), census and other governmental surveys would make a valuable data source for this field (Copping, 2017). Most countries conduct censuses or other surveys periodically and measure variables similar to the variables of interest in this field. Moreover, utilizing such data sources would allow for cross-country comparisons, including non-Western and developing populations, which is more representative of the majority of the human population (Henrich et al., 2010). Cross-cultural work within human behavioral ecology and evolutionary anthropology has examined life history traits across diverse populations, including non-industrialized societies (e.g., Martin et al., 2020; also see Sear, 2020). However, the use of census data to test PAT hypotheses has not been well explored. To the best of our knowledge, LHT-P research utilizing the census or surveys of entire populations has only been done in England and Wales (Copping, 2017; Copping & Campbell, 2015). Most studies predicting human LHSs are derived from surveys with convenience samples (Sear, 2020), with either adult (e.g., Wang et al., 2022) or adolescent participants (e.g., Lordelo et al., 2011). Many of the longitudinal studies exploring LHT-P questions rely on the same samples, like the Minnesota Study of Risk and Adaptation and the Study of Early Childcare and Youth Development (Young et al., 2020), the Avon Longitudinal Study of Parents and Children (e.g., Magnus et al., 2018), or the UK Millennium Cohort Study (e.g., Kelly et al., 2017). These studies are important because they address this longitudinal characteristic of LHT-P hypotheses. The samples are large and representative of their focal population. Repeated reports on the same samples, however, reduce the independent verification of the results and can also increase the risk of type I errors.
There are four aims in the studies in this article. The first is to test if an exploratory analytical method using both PAT as a theoretical frame of work and using governmental surveys could be useful in statistically predicting reproductive patterns in an entire country's population. To do this we will investigate whether variables extracted from the Brazilian Census that are akin to the constructs of harshness and unpredictability usually present in PAT literature can predict earlier reproduction 10 years later. Following PAT literature, we hypothesize that socioeconomic stressors—typically used as a proxy for harshness—and that family configuration stressors—typically used as a proxy for parental absence—will be statistically significant predictors of earlier reproduction.
This article will be using demographic variables descriptive of people residing in a given municipality but will make interpretations based on an evolutionary psychology theory, a field in which research is usually conducted with individual-level participant data. It is worth mentioning that the data used here is observational and aggregate across geographical regions. Thus, our approach focuses on prediction and detection of patterns across geographies and cross-culturally rather than causal inference. This will limit the validity or the scope of inferences we can make from the studies in the article. On the other hand, testing this theory using a whole country's population as the “sample size” is important considering how valuable cross-cultural or universal trait findings are for evolutionary psychology (Buss, 2024).
The second aim of this article is to identify which variables, among the ones selected, will predict earlier reproduction. Among potential indicators of harshness and unpredictability present in the Brazilian Census, which ones will an exploratory analysis show to be the most statistically significant and relevant predictors of our outcome variables? We have no specific hypothesis about which variables will be selected, but we aim to discuss potential explanations for variables being or not being significant predictors of early reproduction.
The third aim of this article is to test if the findings in a different country's population (the United States) and using a different data source (the American Community Survey (ACS)) would be similar to the findings observed with the Brazilian Census. We will use a more confirmatory analytical approach in this analysis. Considering the evolutionary background of PAT, we hypothesize that, in general, the results using US data will be in the same direction as they were with Brazilian data, although we expect to find some differences due to socioeconomic and cultural differences between the two countries.
The last aim of the study is to test whether a higher percentage of visible minority groups will be predictive of earlier reproduction in Brazilian municipalities and US counties. Both countries have a long history of Black and Indigenous communities facing discrimination in education, employment, health access, the justice system, and many other settings (Aliverti et al., 2023; Bleich et al., 2019; Couto & Brenck, 2024; Serchen et al., 2022; Silva et al., 2024), and the United States observes a somewhat more recent history of racism against and discrimination against Hispanics or Latinos (Canizales & Vallejo, 2021). Following PAT rationale, it is reasonable to assume that these conditions of discrimination and racism faced by visible minority groups configure as higher levels of harshness and unpredictability in life, which could alter the LHS of these populations. For example, visible minorities could experience poor conditions regarding employment, education, housing, access to health care, or more exposure to violence. Some of these measures are usually present in the census (e.g., employment or education), while others are not (e.g., access to health care or exposure to violence). Thus, a secondary question is whether the percentage of these ethnicities (i.e., understood here as an indicator of people living in particularly harsh and unpredictable ecologies) will be either predictors of earlier reproduction or mediating variables in our primary prediction. We hypothesize that municipalities and counties with higher percentages of visible minority groups will have higher percentages of reproduction indicators.
Young Parenthood in Brazil
Methods
Data Selection and Transformation
We accessed publicly available census data from 5,507 municipalities from the 2000 Census and 5,565 municipalities from the 2010 Census from Instituto Brasileiro de Geogragia e Estatística (IBGE; Brazilian Institute of Geography and Statistics). We selected a subset of variables that are publicly available online in the
All variables are expressed as the percentage of the population in a given category (e.g., people with income of one minimum wage or less, people unemployed or precariously employed, children aged 0–4 years). This subset was selected based on the variables available in the census that best map onto the variables frequently measured as proxies of harshness, unpredictability, and reproductive timing in studies in PAT. Thus, we aimed to select variables that seemed indicative of socioeconomic status (harshness), parental change (unpredictability), and earlier or frequent reproduction (reproductive timing). In many cases, as is common with secondary data research (Andersen et al., 2011), the ideal variable was not available, and we had to choose proxies for that variable. The measure of menarche was a clear case of a frequent measure in PAT literature that we could not include because this is not present in census data. Variables that did not resemble frequent measures in PAT literature were ignored, even if they could be relevant to the question in other disciplines (e.g., living in a rented or owned dwelling with or without a mortgage).
We then merged 40 variables from the 2000 Census with 10 variables from the 2010 Census and used them in an initial model. These variables were initially grouped into the following predictors:
Variables in the Brazilian Model and Inferred Concepts.
Variable abbreviation if retained in the final model.
Log transformation applied to the variables.
Square root transformation applied to the variables.
Included in first iteration of the model but planned to be used in model comparisons.
There are two groups of variables that are not common measures of harshness and unpredictability that are worth discussing here. These variables were used as both predictors and outcomes, although with the time difference, and composed the factors
We also used usual statistical treatments related to missing data, outliers, and data transformation. A cutoff criterion for missing values of 5% was established: any variable with 5% or more of missing values would be removed from the model. Four of the outcome variables were removed for this reason (women aged 10–14 with children; aged 10–14 living with spouse or partner; aged 10–14 married but not living together; and aged 20–24 married but not living together). None of the predictors were above the cut-off criterion. Municipalities that had any variables more than three
Lastly, the square root and log transformations were applied to all variables. For each measure, we then selected the one (i.e., original, square root-, or log-transformed) whose distribution was closest to normality based on visual inspection of histograms and boxplots. Log transformation of two
Partial Least Squares Structural Equation Modeling
Structural equation modeling is a second-generation statistical technique, sitting between analysis of variance or multiple regressions and machine learning (Hair et al., 2022). Structural equation modeling combines factor analysis and path analysis (i.e., a series of multiple regression analyses) to examine relationships between observed and latent variables (Hair et al., 2022; Kline, 2016). It has the advantage of describing complex relationships between several variables in a single model. It has been utilized in social sciences, business, and psychology to evaluate hypothesized causal relationships and make predictions of an outcome variable that is usually a construct that cannot be directly measured (Hair et al., 2022; Kline, 2016; Maruyama, 1997).
Covariance-based structural equation modeling (CB-SEM) is the most widely used, and it is primarily a confirmatory approach (Hair et al., 2022; Kline, 2016). Partial least squares structural equation modeling (PLS-SEM), on the other hand, is primarily an exploratory approach (Hair et al., 2022). Although CB-SEM relies on fit indices by comparing the covariance matrix implied in the model with the covariance matrix found in the data (Kline, 2016), PLS-SEM relies on several statistics for the evaluation of its measurement (factor analysis) and its structural (path analysis) model (Hair et al., 2022).
PLS-SEM is a nonparametric analysis, and it is more robust than CB-SEM with formative latent variables (i.e., where items are understood to be forming the latent variable instead of the latent variable being the assumed common cause for the items), which is true for all of our predictors. It also allows for single-item measures to be included in the structural model. PLS-SEM performs similarly to CB-SEM in a wide range of cases, especially with larger sample sizes or for simple mediation models (Hair et al., 2022; Willaby et al., 2015). All of the above contribute to PLS-SEM, as opposed to CB-SEM, being a better analytical approach for this study.
However, we acknowledge that PLS-SEM has been criticized as being merely a weighting system in its measurement model; for lacking theoretical bases for model fit, which CB-SEM has, therefore not allowing for overidentification tests; and for issues with its significance tests (Rönkkö et al., 2015). We are mindful of these limitations and of the exploratory nature of this research when interpreting our findings. All analyses were conducted in R using the seminr package (Ray et al., 2018).
Considering the large sample size, it would be very likely that we would find significant

Decision-making process in the PLS-SEM analysis.
Models
Predictors were set as formative latent variables (i.e., a group of items that sufficiently describe a factor), whereas outcomes were set as reflective latent variables (i.e., when the group of items are understood to be caused by the factor). We argue that our predictors are not reflective latent variables because constructs such as people living on minimum wage or less and lacking access to public services are caused by the greater latent construct of
Two model comparisons were conducted. One tested whether the percentage of visible minorities in Brazilian municipalities would be a statistically significant and relevant predictor of early reproduction or a mediator between harsh conditions and early reproduction. The other model used the selected model. The selected model, however, was tested with the subset of top and bottom quartiles of the sample based on the percentage of children 0–4 years old and 5–9 years old. In the first model comparison, we hypothesized that ethnicity would be a statistically significant predictor and/or mediator. In the second model comparison, we hypothesized that the paths and explanatory power of the model with top quartiles’ subset would be higher than the paths and explanatory power of the bottom quartiles’ subset.
The final iteration of each model was bootstrapped (nboot = 10,000). The raw and wrangled data, all materials, and model outputs used in this study are openly available on OSF at https://osf.io/akqxn/overview?view_only=9639feeb25614309a2fb8eddcba2fc4. See SM03 R files and SM04 model iterations output in the Supplemental materials for the full analytical report and results. See SM04 model iterations output in the Supplemental materials for the full analytical report and results of the different models tested. This study was not preregistered.
Results
The selected model used three variables

Model predicting early reproduction in Brazil.
Measurement Model
Reflective Measurement Model
Formative Measurement Model

Model predicting early reproduction in Brazil excluding lack of resources.
Assessment of Brazilian Formative Latent Variables.
Structural Model
The structural model was also assessed for collinearity (VIF < 5), path statistical significance (
Assessment of Brazilian Structural Model.
Predictive power was assessed with
Model Comparisons
To test whether ethnicity was a predictive factor of early reproduction, we collected data on the percentage of Black, Indigenous, and South Asian people in municipalities. The percentage of Indigenous and South Asian people was mostly absolute zeros in almost half of the municipalities (>2,000 cases), and these measures were removed. The percentage of Black people was then included as a single item both as a predictor and as a mediator in our most restricted model between
The second planned comparison was to divide the sample between the quartiles with the highest and lowest percentage of children aged 0–4 and of children aged 5–9. There were minor issues with some indices in some of the subsamples (i.e.,
Discussion
The first aim of this study was to test if an exploratory analytical approach using LHT-P as the theoretical approach to data selection and interpretation and using the census could successfully predict reproductive patterns in an entire country's population. To do this we tested whether census measures associated with familial socioeconomic stressors, young marriage, and living in a poorly resourced area could predict early reproduction as reported in the census 10 years later. The explanatory power for
The second aim was to identify which of the indicators of harshness and unpredictability would predict faster or earlier reproduction. Both the percentage of children aged 0 to 9 years in the population and the percentage of women 15–24 years old with children in the population of Brazilian municipalities can be substantially predicted by the percentage of (1) families with low resources (i.e., no electrical power or garbage collection service and living with one minimum wage or less); (2) women 15–24 years old with children and teenagers with a spouse, and people divorced or separated; and (3) large families with more than one person per bedroom in the previous census. Regardless of these variables being grouped into factors, these were the most successful predictors of earlier reproduction in Brazilian municipalities.
Notably, these effects were measured across a 10-year gap, with predictors measured in 2000 and outcomes measured in 2010. This time delay was chosen because of the assumed developmental phenomenon between harsh and unpredictable environments and later reproductive timing (Ellis et al., 2003; Simpson et al., 2012; Szepsenwol et al., 2019). Based on PAT, our hypothesis was that children experiencing harsh and unpredictable conditions in the first years of life are likely to adopt a “faster” LHS, experience puberty earlier, and start reproducing earlier. Our findings, in addition to findings from previous research, support the assumption that there is a critical period of development (0–7 years old) in which exposure to harsher and unpredictable environments is likely to cue individuals into reproducing more frequently or earlier. Other findings in the literature also support this assumption by showing that sudden or current change in harsh life conditions does not impact reproductive behavior (Richardson et al., 2020) or is associated with reproductive decline (Nolin & Ziker, 2016).
To further support the hypothesis that the observed association might be a developmental phenomenon and not due to some spatial association or some statistical artifact, we repeated the analyses using the quartiles with the highest and lowest percentages of children aged 0–4 and the quartiles with the highest and lowest percentages of children aged 5–9. The paths from predictors to outcome remained similar across the four subsamples (ranging from 0.35 to 0.48) and comparable to the main model, indicating a stability in the association between these variables. Crucially, the municipalities with the most children had greater explanatory power than those with fewer children, which is consistent with the idea that there is a developmental phenomenon.
Because we are using the census data, our analyses used municipal populations, not just a sample of the population. Therefore, conclusions can and should be drawn regarding nonpredictive variables, especially when there is a strong prediction that these variables will be associated with
Caveats
Proper interpretation of these findings should be mindful that this research has used aggregated municipal data to assess hypotheses regarding individuals' developmental processes. Finding patterns present in entire populations is a valuable contribution to PAT. However, aggregate municipal data is far from being ideal to assess these hypotheses, and these limitations are discussed in the General Discussion section.
Ten years is not an ideal time frame to test this developmental prediction. Children 0–7 years old experiencing harsh or unpredictable environments would be 10–17 years old by the time we assessed early reproduction. Regarding the outcomes, the parents of children 5–9 years old could not have been in the critical period of development of 0–7 years a decade earlier, especially for children aged 7–9 years. To offer some insight into this issue, we ran a post hoc model removing the variable
Some parameters in the model of the study required theory-informed interpretation. One of our discriminant validity measures (HTMT) was not attained for the three predictors. We decided to retain it because the other measure (Fornell–Larcker) passed the criteria and because the issue with a high HTMT is the risk of it being above 1 in the population (Hair et al., 2022). Bootstrapped measures did not reach 1; therefore, it is very unlikely that the HTMT is 1 in the population, which suggests that the variables attained discriminant validity.
Second,
For the comparison of ethnicity, however, we removed the variable
Assessing the predictive value of the model is a recommended step of PLS-SEM (Hair et al., 2019, 2022). In this step, the prediction of each of the directly observed variables (i.e., the four variables indicating
In addition, the predictive value assessment is meant to be indicative of the capacity of the model to predict out-of-sample outcomes (Hair et al., 2022; Shmueli et al., 2016). However, because this study is using census data, there is no alternative sample where this model can be replicated. We are not estimating what would be the association between predictors and outcomes in a population; we are describing what they are. This is another reason why we did not rely much on statistical significance when choosing the optimal model. Moreover, previous research has not reported the predictive value of PLS-SEM models (Kazár, 2014) or has tested the predictive value of PLS-SEM models with different approaches (Shmueli et al., 2016; see Riou et al., 2016 for an example).
Municipalities likely have higher correlations between variables when those municipalities are nearer geographically. It is possible that the predictive value has been affected by this spatial relationship. In addition, characteristics specific to a region, community, or the differences between urban and rural communities could confound the interaction between the variables studied, increasing error in the model prediction. Future studies using spatial cross-validation (Pohjankukka et al., 2017) or stratified cross-validation (Diamantidis et al., 2000) approaches could address this issue.
A final limitation of this study is its exploratory nature (Szollosi & Donkin, 2021). We used PAT to inform variables’ selection and, to the extent possible, the time frame of a phenomenon. Therefore, this study is descriptive and not ideal for hypothesis testing. Future studies could attempt to use more confirmatory approaches, CB-SEM for example. When supported by theoretical background, CB-SEM allows for hypothesized causal inferences (Kline, 2016; Maruyama, 1997). Indeed, there has been a claim for more formal and more refined models in LHT in psychology (Nettle & Frankenhuis, 2020; Stearns & Rodrigues, 2020). The second study in this article attempted to use CB-SEM with a different data set (ACS). Future studies could test whether this pattern is representative of a more longitudinal/developmental phenomenon or a mere geographical correlation by using same-year (i.e., predictors and outcomes extracted from the same census year) or reverse (i.e., predictors extracted from the most recent census and outcomes extracted from the past) models. These comparisons, however, are beyond the scope of this article.
Frequent Reproduction in the United States
In this study, we used a confirmatory method of analysis to determine if variables similar to the significant predictors of reproduction patterns in Brazilian municipalities would be significant predictors of reproductive indicators in a new data set. We chose a different country for this test (United States) and a different year range. We hypothesized that most of the variables would be significant predictors of reproduction.
Methods
Data Selection and Transformation
We utilized the variables that were relevant and significant predictors or outcomes in the first study (Figure 2) to explore potential variables for this study. Publicly available data was collected from the ACS: 5-Year Estimates aggregate tables (U.S. Census Bureau, 2020) using the Census Bureau of Statistics API. Predictors were extracted from the ACS 5-year aggregate table in 2009 (i.e., aggregate estimates from 2005 to 2009) and outcomes in 2023 (i.e., aggregate estimates from 2019 to 2023). This represented a time difference of around 14 years between predictors and outcomes. The ACS produces its estimates by collecting data from around 3.5 million addresses in the United States and Puerto Rico every year. These are reported as estimates of geographies with at least 65,000 people (e.g., census tracts or counties). The 5-year aggregate combines data from 5 consecutive years to produce estimates of geographies with fewer than 65,000 people (U.S. Census Bureau, 2020).
US counties were decided to be the geographic level of data collection. Counties or equivalents are the primary divisions of American states, and they usually include multiple cities or towns (U.S. Census Bureau, 2022). The choice of counties instead of smaller geographies is intended to reduce the likelihood of a high percentage of the population moving during the 14-year time span under which data was collected.
Data from ACS 2009 included 3,221 counties, and data from ACS 2023 included 3,222 counties of the US mainland, Alaska, Hawaii, and Puerto Rico. Table 4 reports the 25 variables that were collected for this study and how they compare to the variables that were used in the model in the first study. Similar to the study utilizing the Brazilian Census, variables were square-root and log-transformed, and the boxplots of these variables were used to choose the transformation (or non-transformed variable) that most closely resembled a normal distribution. Outlier (
Variables in the Model Using Brazilian Census Compared to Variables Using US American Community Survey.
Log-transformed variables.
Square root-transformed variables.
Manually calculated the sum of the variables to come up with a single variable representative of both groups.
Merging the data frames (2009 and 2023) resulted in 3,209 counties. The reduction from the original data frames (e.g., 3,221 in 2009) is usually due to the division or redefinition of a county, which would result in a different code for this new geographical division. Outlier removal resulted in a data frame of 2,763 counties, which represents a reduction of around 15% of the data frames that were collected or merged. No variable had an NA count higher than 5%; therefore, no variables were removed due to this process. The final data frame consisted of 19 predictor and 6 outcome variables and 2,763 cases. All scripts and a full report of data curation and transformation are available in the Supplemental materials.
Results
CB-SEM analysis was planned for this data. CB-SEM is an alternative to PLS-SEM that encompasses a more confirmatory approach (Hair et al., 2022; Kline, 2016). The measurement model is tested in confirmatory factor analysis, and in a subsequent step the structural model is tested with the paths between factors.
The confirmatory factor analysis step, however, did not reach a solution in different attempts to load the variables into factors. These attempts were composed of different variable organizations (as long as there was theoretical support in LHT-P) and of using variables that ACS reports separated by sex, both separately and added. The solutions consistently presented one variable with high loading (e.g., λ > 1) while the others had low loadings (e.g., λ < 0.3). See the Supplemental materials for a full report of these steps. Since we are using simple models (i.e., without mediation or multiple latent outcome variables), removing the factors rendered our CB-SEM analysis as only a series of multivariate linear regressions. Therefore, the analytical approach of US data was switched to multivariate linear regressions.
Different multivariate linear regressions used the outcome and predictor variables described in Table 2. Visual inspection of diagnostic plots indicated that the residuals were approximately normally distributed, with some deviations in the tails and with slight heteroskedasticity. Given that this is demographic data with non-normal distribution and the arguably large sample size, we utilized heteroskedasticity-consistent degrees of freedom correction (HC1) in order to inflate residuals and make the analysis more reliable (Long & Ervin, 2000; MacKinnon & White, 1985). We assessed collinearity with VIF > 3, but no predictor was removed because none was above this criterion. Nonsignificant variables were removed from the model until only statistically significant variables were left, a process similar to stepwise linear regression analysis (Harrell, 2015).
Tables 4 to 7 report the models for each of the outcome variables, and Table 8 reports the explanatory power of these four models. The models predicted a substantial proportion of variance of
Variables Predicting Percent of 0- to 4-Years Olds in US Counties.
Variables Predicting Percent of 5- to 9-Years Olds in US Counties.
Variables Predicting Percent of 15- to 19 Years of Age Who Had Given Birth.
Multivariate Linear Regression Model of Women Who Had Given Birth in the Past 12 Months.
Explanatory Power of the Four Models.
Percent of Age Group—0 to 4 Years of Age by Low, Medium, and High Tertile of Predictors.
Percent of Age Group—5 to 9 Years of Age by Low, Medium, and High Tertile of Predictors.
Percent of Women 15 to 19 Years of Age Who Had Birth by Low, Medium, and High Tertile of Predictors.
Percent of Women Who Had Birth in the Past 12 Months by Low, Medium, and High Tertile of Predictors.
Discussion
This study aimed to assess whether an analysis with US data would have similar findings to the Brazilian Census analysis, but this time with a confirmatory analysis. There were a few similarities. Linear regressions fitted were capable of explaining a substantial proportion of the variance of the percentage of the population that were children aged 0–4 and 5–9 years in US counties using variables that were also predictive in Brazil and that are akin to variables usually understood to be measures of harshness and unpredictability in PAT (Belsky et al., 2012; Ellis et al., 2003; Hartman et al., 2018; Xu et al., 2018). A higher percentage of children in a given area can be understood as more frequent reproduction. We assume that this higher percentage of children is indicative of one of two things: (1) a higher percentage of women are having children in these counties; or (2) a subset of women in these counties are having multiple children.
Even though the outcome variables of young motherhood (
Some considerations can help understand this poor performance. First, there is a global trend toward the decline and delay in fertility (Roser, 2014; Volk, 2023). Access to education and health care, especially among women, are understood to be the lead causes of this decline and delay. Observing the frequency tables, we can see that the non-transformed percentage of
Second, the notably small percentages, particularly in the first case, mean that outlier cases potentially resulting from confounding variables would exert a disproportionately large influence on the variance of these variables. This noise would likely have a reduced impact if these outcome variables encompassed a greater proportion of the population. Lastly, the considerable right-skewness observed (i.e., values clustering near zero) increases the likelihood of the model being impacted by the violation of distributional assumptions required for linear regression analyses. Future studies could repeat this analysis with a subset of the population in which there are higher percentages of young or recent parents or make use of a more robust analysis.
The model predicting the percentage of
Fewer resources in US counties in 2009—indicated by not having a complete kitchen, having income below poverty level, and having more than one occupant per room in the household—were positively associated with
While the percentage of women who gave birth in the past 12 months was positively associated with the percentage of children both aged 0–4 and aged 5–9 years, the percentage of divorced people was a negative and statistically significant predictor of the percentage of children in these age groups. The former may support the claim that the phenomenon being observed is of more frequent reproduction in harsher environment, while the latter is contrary to what has been observed in the literature. One of the explanations of LHT-P is that experiencing parental transitions during childhood would favor faster and more frequent reproductive strategies because it would alter the perception that the parental figure (usually the father) would remain in the relationship and invest in children (Belsky et al., 2012; Volk, 2023). Our data shows that having fewer divorced people in 2009 predicts a higher percentage of children aged 0–9 years.
Differently from the study using the Brazilian Census, a high percentage of Blacks in 2009 was a statistically significant predictor of the percentage of children aged 0–9 years in 2023, and the percentage of Hispanic people was a significant predictor of the percentage of children aged 0–4 years. These associations were all negative, meaning that once you remove the effect of socioeconomic predictors, the ethnicity itself is actually a predictor in the contrary direction of the common perception that these ethnicities have larger families (Pew Research Center, 2015).
This study suffers from similar limitations as the study with Brazil data. Considering that we are describing an observation made with aggregate values of large geographical areas (US counties), one should be very careful when considering how our results apply to individual behaviors. There is the possibility that the associations found at the county level would not be found at the individual level, that confounding variables are present, and that, if observed or measured, they would result in a different interpretation of the phenomenon at the individual level.
Data in both studies were transformed; therefore,
Linear regressions—used in the analysis of US data—assume normality of residuals. Transforming the data improved the non-normality and heteroskedasticity, and we also used HC1 correction to obtain robust standard errors. Finally, our sample consisted of more than 2,700 cases, and
General Discussion
Our first study determined if predictions derived from LHT-P could account for measures of reproductive behaviors contained in theBrazilian Census data. Studies with a similar approach have been conducted before in the United Kingdom (Copping et al., 2013; Copping & Campbell, 2015), but to our knowledge, this study is novel in a few characteristics. First, the analysis of the Brazilian Census was the first study to utilize population data from a developing country to assess PAT hypotheses. Second, our analyses used a longitudinal approach—using census data from one year to predict data collected a decade or more later—to assess the developmental prediction proposed by PAT (Ellis et al., 2003; Simpson et al., 2012; Webster et al., 2014). Third, this project is the first to compare the results obtained from two countries with very large populations (more than 200 million people). Fourth, the study is also the first to assess whether ethnicity—used as a proxy for harsh and unpredictable circumstances along with other measures common in PAT literature—would be predictive of reproduction.
These two studies were designed to answer several research questions. The first was whether an exploratory approach to a structural equation modeling analysis would converge and perform well using secondary data. This question is interesting because secondary data tend to be suboptimal for variable selection (Andersen et al., 2011; Johnston, 2017; Jones, 2010) and are unlikely to achieve model fit (Hair et al., 2022) because data collection was not designed for that purpose. This analysis is even more challenging because we used variables measured in one year to predict outcomes a decade later.
PLS-SEM analyses of the Brazilian data worked reasonably well. Some variables converged into satisfactory factors (e.g.,
In the study with US data, however, the variables did not converge into factors in the CB-SEM analysis. As discussed above, this nonconvergence might be caused by the nature of the data—secondary data, not intended to be grouped as factors in its conception—or by the nature of the analysis—confirmatory and assuming reflective factors. We believe that PLS-SEM is suitable for analyzing this type of data and encourage future studies to use it to analyze data from different countries.
The second aim of the study was to assess which predictors, among the ones that were akin to those frequently used in LHT-P, would predict variables that index early reproduction. In contrast to studies that identified unpredictability (Belsky et al., 2012; McLaughlin et al., 2021; Simpson et al., 2012), especially parental transitions (Hartman et al., 2018), as the strongest predictor of faster LHSs, our study found that
Most studies in the field have been conducted in developed countries. Such populations may have both lower levels of harshness and less variance in the measures of harshness frequently used in LHT-P (i.e., socioeconomic status). The variance in harshness in our data set may explain why harshness was the best predictor of early reproduction in our study but not in studies relying on data from developed countries.
The variance of the measures of unpredictability used in this study could be obscured by other variables. For example, the same variance or even the same regions in which there is a high level of unemployment (
The third aim was to assess whether an analysis of survey data from a developed country would yield similar findings to the ones obtained from an analysis of census data from a developing country. Results from Brazil mainly indicated that areas that were associated with harsher conditions reported—10 years later—as having more and younger children. In addition, the percentage of mothers aged 15–24 years and of people 15–19 years old living with a partner were also predictive of a higher percentage of children and of younger mothers. We argue that this is a developmental phenomenon: it is likely that many children living in those municipalities in 2000 grew up to become young mothers in 2010. The results from the United States were comparable. For models predicting the percentage of children aged 0–4 and children 5–9 year indicators of harshness (e.g., as lacking a complete kitchen, low income, more rooms per dwelling, more than one occupant per room), and a higher percentage of women who had births in past 12 months and the percentage of children aged 0–9 years old were predictive of the percentage of children aged 0–4 years old. In this study, predictors and outcomes were separated by roughly 14 years. This, once again, may support the developmental hypothesis that many children living in counties with higher levels of harshness in 2009 grew up to become parents in 2021.
In both studies, the percentage of divorced people was a negative predictor of the percentage of children. This result was surprising because “parental transitions” is a common measure associated with a faster LHS in LHT-P (Belsky et al., 2012; Ellis et al., 2009; Hartman et al., 2018). Parental transitions have been assumed to be a signal to children raised in these environments that relationships do not last, so one should expect lower partner investment in raising children, but whether this is really a cue has been questioned recently (Volk, 2023). This result is contradictory to PAT assumptions. It can be interpreted, though, in light of the recent shifts of marriage and divorce statistics (Herre et al., 2020; Kennedy & Ruggles, 2014). Marriage has been in a slight decline and there is an increase in conjugal relationships and parenting without marriage, which in turn lowers divorce rates. Marriage and divorce are also happening at later age. These could decouple the measure of divorce rates from the parental transition it was intended to measure. In addition, divorce in modern environments and divorce in our environment of evolutionary adaptedness can result in considerably different outcomes for children (Ellis et al., 2009; Volk 2023). In the environment of evolutionary adaptedness, the absence of a parent early in life is much more likely to decrease chances of survival, which is the original measure of harshness in PAT research. All of these can impact the predictive power of divorce rates.
Another comparison between the two countries that are of interest is the direction of the prediction. Results from Brazilian Census indicated that lack of access to recycling service and to electrical power was associated with a higher percentage of children and of early parenting. Lacking plumbing in the United States, however, was a negative predictor of the percentage of children 14 years later. This may be due to its reduced prevalence or more closely related to geographical conditions (e.g., rural environments) instead of an indication of a harsher environment.
Harshness indicators in the ACS were not significant predictors of percentage of children aged 5–9 years. The variables that were positively associated with the percentage of children were the percentage of children between 2005 and 2009, of women who had births in the previous year, and the median number of rooms. The percentage of children and of women who had births in the previous year being significant predictors may be due to the similarity in the variables. Geographies with a higher number of children in a given year are more likely to have a higher number of children 14 years later. Another possibility is that the age range and the time difference in our data were not aligned. The middle point of our predictors using US data was 2007, and of the outcomes it was 2021. The outcome variable being children who were 5–9 years old in 2021 means that they were born between 2012 to 2016 and means that the children who were 0–4 years old in 2007 would still not be in reproductive age then. Therefore, the time span does not allow for the phenomenon described by LHT-P to be found with this outcome variable.
With all these considerations, we suggest that results found with US data were somewhat similar to the ones found with the Brazilian Census. The United States and Brazil are two of the biggest countries in land mass and both countries have population counting in the hundreds of millions. Finding similar results between the two countries is a valuable contribution to LHT-P literature. Moreover, findings that are common among different environments and that are found among a large sample size are invaluable to evolutionary psychology because of its claims of adaptations that have been selected in our evolutionary history (Buss, 2024).
Finally, we aimed at assessing whether the percentage of visible minorities, which historically have faced and still face harsher circumstances, would be a statistically significant and relevant predictor of our reproduction measures. Our results point out they were not. Having a high percentage of Black people in Brazilian municipalities had a negligible effect in predicting early reproduction and the Black or Hispanic and Latino percentages had a statistically significant negative effect in predicting higher reproduction rates in US counties. Different fields of research have long established the association between Black ethnicity in the United States and early parenthood (Wilson, 1987; Wodtke, 2013) and both Hispanic or Latino and Black women have higher fertility rates than white women in the United States (Pew Research Center, 2015). We consider that the relationship between these previous findings and the findings in our studies help in demonstrating the different circumstances and the history of racism and discrimination that visible minorities face (Bleich et al., 2019; Canizales & Vallejo, 2021; Couto & Brenck, 2024). Lower or lack of resources effects and not a general effect of ethnicity explains reproductive behavior in our model, and in the case of United States, Black and Hispanics or Latinos are actually having fewer children when the effects of such socioeconomic conditions are removed.
In sum, the usual PAT hypothesis that harshness is associated with early or more frequent reproduction was supported—at a population level—in a developing country and partially supported in a developed country. The prediction of unpredictability indicators was inconsistent in both countries, and the percentage of visible minorities were either a nonrelevant predictor or a statistically significant predictor of reproduction, but in the opposite direction as commonly observed in literature. Future research could aim to confirm such results with different data sets.
Caveats
This research assessed hypotheses derived from LHT-P, specifically PAT. These theories are mostly used to explain the developmental trajectory and behaviors of individuals. The first study, however, used Brazilian Census data at the municipal level and the second used data at the county level. All the data utilized were the percentages of people in these geographies. Therefore, any conclusion or inference about individuals should be done with careful consideration. The first consideration is that this study relies on the assumption that most of the children from one municipality will grow up and remain in the same municipality 10 years later. If this were not true, we would be just measuring geographical correlations or some statistical artifact. The 2010 Census in Brazil reports that 62.8% of Brazilians were born in the municipality they live in (Sistema IBGE de Recuperação Automática—SIDRA, 2024) and the National Household Sample Survey indicated that in 2011 59.9% of Brazilians lived in the municipality they were born in (Goularti, 2016). The age range of most mobility is between 25 and 29 years. In comparison to the most mobile age range, the population between 15 and 19 years old was 74% as likely to move in the 2000 Census and 69% as likely to move in the 2010 Census (Nascimento et al., 2016). In addition, if the national average is around 60% of the population, it is reasonable to assume that this percentage will be negatively associated with age until the age range of most mobility. In other words, the younger the person (until 25 years of age), the more likely that she is in the municipality in which she was born. This will only not be true if one is expecting a considerable migration back to municipality of birth at a later age. We can then expect that among people 24 and younger, which is the population relevant to our outcome variable, 59.9% or more will be residing in the municipality they were born in. It is worth noting that the factor
Another caveat in interpreting our findings is that the associations found at the municipal level may not be found at the individual level. It is possible that people experiencing any of our predictors (e.g., Lack of resources, Youth: Married with children, and Family size) are not the same people experiencing another predictor. It is also possible that the people between predictors and outcomes were different. In addition, there was no manipulation or random assignment of participants (characteristics of experimental research), which then increased the chances of confounding variables. Many variables that are not available in the census (e.g., mortality, disease, use of contraceptive or family planning) could be equal or better predictors of our outcome variables and can be covarying with any of our predictors. Explaining these findings then would require measurement of these other variables or another theoretical framework, which were beyond the scope of this study.
Other theoretical frameworks that could help explain the associations found in this study are, for example, genetic mediation or confounding, or cultural and institutional factors. Genetic variation may have a role in the different LHSs developed by humans (Buss, 2016; Del Giudice, 2009). It could be that genes select for or shape environmental conditions and both of these would impact LHS factors such as time of puberty (Volk, 2025).
Cultural and institutional factors could help explain the association between poverty and early parenting (Wilson, 1987; Wodtke, 2013). For example, parents who experience longer commute to work or who have to work longer hours to provide necessary income may not have as much time as parents from more affluent conditions to support or invigilate their children in their sex and reproductive decisions. In addition, poorer areas are frequently the ones that usually lack more institutions to aid parents in such tasks. Early or more frequent reproduction could be more common and more accepted in some cultures (Wilson, 1987; Wodtke, 2013). In places where educational and employment opportunities are scarce, becoming a parent may also offer a sense of identity and an “adult status” among the community (Hanna, 2001). All of these alternative explanations could interact with PAT claims to explain the association between poor communities and early and frequent parenting.
LHT-P and PAT have been subjected to many criticisms (Nettle & Frankenhuis, 2020; Sear, 2020; Stearns & Rodrigues, 2020; Volk, 2023, 2025). Critics of LHT-P and of PAT propose a considerable revision of its claims and theoretical bases. The goal of this paper was to use census data that is similar to the most common or mostly tested measures in PAT research and see if the patterns proposed by PAT would be found in this population data. This can serve as basis to theorizing about what findings are consistent in human populations and to generate more specific and formal predictions.
In sum, our findings are consistent with the idea that growing up in an unpredictable and harsh environment predicts the development of a “faster” LHS using data from the census of a developing country. A model utilizing variables measuring lack of resources, larger families, parental transition, and young parents substantially predicted a variable measuring early reproduction in both the overall sample and in the geographic areas with the highest percentages of children. However, inferences from these findings should be careful because we did not assess individual-level data.
Conclusion
Using Brazilian Census data, we showed that harshness early in childhood in a given municipality predicts early reproduction in that municipality, consistent with the prediction of PAT. In the United States, harshness early in childhood in a particular county predicts higher reproduction rate in the same county, a finding that is similar to the findings in Brazil. However, in the United States the direction of some of the predictors was negative, which is contradictory to PAT. Ethnicity predicted reproduction in the United States but not in Brazil, suggesting that our data set cannot completely account for the effect of environmental circumstances associated with ethnicity on reproduction.
Supplemental Material
sj-zip-1-evp-10.1177_14747049261432896 - Supplemental material for Harshness Predicts Reproduction in Brazilian Municipalities and US Counties: A Life History Theory Approach
Supplemental material, sj-zip-1-evp-10.1177_14747049261432896 for Harshness Predicts Reproduction in Brazilian Municipalities and US Counties: A Life History Theory Approach by Vinícius Betzel Koehler and M. D. Rutherford in Evolutionary Psychology
Footnotes
Ethical Approval and Informed Consent Statements
This study is exempt from ethical approval from ethics review board because it only utilized publicly available and anonymous secondary data.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This project was funded by the Natural Sciences and Engineering Research Council of Canada( grant number RGPIN-2020-06761). The funding source had no involvement in the study design; in the collection, analysis, and interpretation of data; or in the writing of the report.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data Availability Statement
Supplemental Material
Supplemental material for this article is available online.
