Abstract
Researchers investigating the relationship between age and life satisfaction have produced conflicting answers, via disputes over whether to include individual-level control variables in regression models. Most scholars believe there is a ‘U-shaped’ relationship, with life satisfaction falling towards middle age and subsequently rising. This position emerges mainly in research that uses control variables (for example, for income and marital status). This approach is incorrect. Regression models should control only ‘confounding’ variables; that is, variables that are causally prior to the dependent variable
Introduction
What is the relationship between age and life satisfaction? That question has been notoriously difficult to answer. Most researchers (e.g. Blanchflower and Oswald, 2004, 2008; Graham and Ruiz Pozuelo, 2017) perceive a ‘U-shape’: life satisfaction is said to decline towards middle age and then rise with older age. Others (e.g. Diener et al., 1999) assert that the relationship is basically flat, with no particular trend. The U-shape idea receives regular reinforcement in research on other topics: whatever their research question, well-being researchers almost always build regression models that include a quadratic function for age (via use of raw age and age-squared as well), anticipating a negative coefficient for the former and a positive coefficient for the latter – and empirical results almost always verify this expectation. The U-shape idea is even the topic of a book addressed to non-academic readers (Rauch, 2018), suggesting (in the subtitle) that ‘life gets better after midlife’, with primary grounding in the work of Blanchflower and Oswald.
The discrepancy in findings pivots mainly on the question of whether to gauge the association via regression models that control for ‘other determinants’ of life satisfaction – or, via consideration of the relation between age and life satisfaction that emerges when one controls more narrowly for ‘confounders’. The debate has resisted resolution in part because the essence of the ‘control only for confounders’ position has not been absorbed. Effective interpretation of results gained from analyses that control more broadly for ‘other determinants’ has been undermined by imprecision (explored below). These points have been correctly described in previous research (e.g. Easterlin, 2006; Glenn, 2009; Hellevik, 2017).
However, some researchers have rejected or neglected the core points in later work (especially Blanchflower and Oswald, 2009, 2019). 1 Laaksonen (2018) takes an agnostic view about whether a broad set of controls is required and presents results from both approaches. Beja (2018: 1820) includes controls for socioeconomic profile as a matter ‘of course’; one can see the consequence of this decision in the way the age and age-squared coefficients get larger (approximately doubling) in models where such controls are added (Models 7 and 8 in his Table 1), deepening the impression of a U-shape. Another recent article (Frijters and Beatton, 2014) describes the relationship between age and happiness as a ‘mystery’; the situation likely remains mysterious in part because there is insufficient exploration of what it means to include controls in this context.
Univariate information.
In brief, the reason to avoid individual-level controls such as income, marital status and education is that these aspects of people’s lives cannot determine one’s age. In the first instance, the reason to include control variables in a regression model is to mitigate against the possibility of bias – confounding bias, in particular (e.g. Shahar and Shahar, 2013). Confounding bias emerges from the omission of variables that determine the dependent variable and
This article offers researchers greater clarity on these points by reinforcing important distinctions: between confounding and intervening variables, and between total effects, direct effects and indirect effects. To gauge ‘the effect’ of age, one starts with a regression model that includes confounders (if any) but excludes intervening variables. Having thus identified a
Many researchers investigating life satisfaction (as well as happiness) do not write about the important distinction between confounding variables and intervening variables. One therefore sees many regression models (in this field and many others) that are not properly specified, in the sense that they do not give results that identify ‘the effect’ (i.e. the total effect) of particular variables, because they include intervening variables. There is an obvious need for greater clarity in this regard, in particular to avoid what is sometimes called ‘overcontrol bias’ (e.g. Elwert and Winship, 2014).
The article proceeds in the next section with a discussion of the logic of control variables in regression models. This discussion comes in lieu of a (further) review of previous findings about age and life satisfaction. The dispute is effectively summarized above,
2
and the issue in contention is methodological, not substantive. (That is, the differences in findings are not rooted in disagreement about
Confounding vs. Intervening Variables
Commonly used statistics/econometrics textbooks teach students to deploy control variables without making an important distinction – between confounders and intervening variables (e.g. Hair et al., 2013; Wooldridge, 2015). Regression is conventionally treated as a fundamentally mathematical exercise, with ‘prediction’ as the core goal. While there is often a nod to ‘theory’ as a basis for selecting additional independent variables, the main criterion is mathematical: does adding a variable improve ‘model fit’? It is then unsurprising to encounter confusion and misspecification in empirical work. This tendency is likely reinforced by an implicit belief that models are ‘better’ (or perhaps at least more impressive) when they contain more control variables.
Two simple examples illustrate why a different approach is needed for gaining insight into how the social world works. An example sometimes used for teaching involves height and maths ability. Even without a regression model, one could see via a scatterplot that there is a positive relationship: taller people are better at maths. An empirical description along these lines would be entirely correct; a (bivariate) regression model would only provide additional detail, for example a coefficient for the slope. As odd as it might seem, height is a useful ‘predictor’ of maths ability – a point that shows the limitations of findings expressed via the language of prediction.
As the saying goes: correlation is not causation. We can reasonably ask: are taller people better at maths
But some potential controls are not
In general, then: regression models designed to gauge the effect of a particular variable should include confounding variables and exclude intervening variables (e.g. Berk, 1983; Gangl, 2010; Pearl, 2018; Smith, 1990). The purpose of control variables is to minimize bias in one’s estimates – but this purpose is achieved only if the controls we add are confounders; that is, variables that are causally prior to the independent variable whose impact we seek to identify. Adding control variables does not have a single/unitary impact on the results given by our models. To understand their impact, we must consider this distinction between confounders and intervening variables.
A necessary caveat is that for some variables the situation is less clear than a binary opposition between confounding and intervening variables might imply. Some variables might belong to both categories. In situations of that sort, one must make a judgement as to which type of effect is likely to dominate (i.e. the strength of the two effects), given the substantive question one seeks to answer. If the intervening path dominates the confounding path, one likely loses more than one gains by including the variable as a control; including it is likely to exacerbate bias rather than to reduce it. The matter should be discussed in substantive terms, perhaps as a limitation of one’s analysis.
Reconsidering the Relationship between Age and Life Satisfaction
The points in the preceding section offer increased clarity on what sort of analytical approach to use for investigating the relationship between age and life satisfaction. An analysis that mostly excludes individual-level controls is required. Such an analysis is not biased via omission of other variables – because most other variables cannot be confounders in this context. Other variables help determine life satisfaction, but they do not affect age. Again, they cannot do so; certain variables might affect the age composition of a population, but they cannot determine the age of any particular individual.
Researchers who produce results relying on regression models that contain individual-level controls sometimes describe the age coefficient(s) as the ‘net’ or ‘pure’ effect of age (Blanchflower and Oswald, 2019). Interpretations of that sort are formally correct – but they can be understood properly only via precise articulation regarding what the ‘net effect’ is net
So, if income, for example, is to be included as a control, we then need to understand how age would affect income, and how the resulting changes in income would affect life satisfaction. Assume a positive impact of income on life satisfaction, and assume as well that income rises towards middle age (or perhaps towards retirement age) and then declines after retirement (a pattern that reasonably describes the experience of many) – in other words, an inverse-U. If we control for income in a model investigating age, the results for the age coefficient will be altered by control of the path on which age (indirectly) increases life satisfaction towards middle age via its (age’s) earlier positive effect on income. We also mitigate the way age would (indirectly) decrease life satisfaction after retirement via its (age’s) later negative effect on income. If age in general had no particular impact on life satisfaction (i.e. if it was basically flat as per Diener et al.’s (1999) early position), controlling for income might foster the
The idea of a net/direct effect is sensible on its own terms; we might well be interested in some notion of a ‘pure’ impact of age, distinct from its impact via income (or indeed other circumstances). That impact might be articulated with reference to the way age itself can make people feel: being old might make one feel depressed and useless (especially in a society valuing youth) – or, in contrast, it might lead to a sense of peace rooted in ‘wisdom’ or acquiescence (notions that are not directly connected to income or other aspects of one’s circumstances). But a sensible interpretation along these lines requires explicit articulation – at a minimum, so that readers (and especially journalists) are not led to believe that they are being presented with the
The situation is quite complicated in any scenario where more than one intervening control variable is included. When regression models contain multiple controls, we must think of coefficients for particular variables in ceteris paribus terms – the effect of one variable holding
If a model is specified to control only for confounders pertaining to the core independent variable of interest, then considerations of this sort do not arise; the model will give the total effect for that variable. If the goal is to investigate ‘the effect’ (e.g. of age), then a properly specified model is one that includes controls for variables understood to be confounders (and excludes variables understood to be intervening). For age, there are
Data and Analytical Approach
With these considerations in mind, we turn to an empirical investigation of how (or indeed whether) age affects life satisfaction. Data are taken from Waves 1 through 6 (survey periods: 1981/1984 to 2010/2014) of the WVS (Inglehart et al., 2014). Countries that participated only once are not suitable for effective treatment of cohort and so are dropped from the analysis; we then have 69 countries representing a wide variety of (country-level) situations.
3
National samples are constructed to be representative of the population; interview mode is face-to-face at the respondent’s home. Total sample size is 304,131; the per-round average across the countries included here is 1505. The sample as analysed below is
Life satisfaction is given in answer to the question, ‘All things considered, how satisfied are you with your life as a whole these days?’ Respondents choose a number from 1 to 10, where 1 is ‘completely dissatisfied’ and 10 is ‘completely satisfied’ (with the intervening numbers unlabelled). Age is derived from a question about the respondent’s year of birth; respondents are also asked to confirm their age.
The relevant controls in this context are cohort, period and country, each of which can influence population age composition as well as life satisfaction. The effect of country on age composition is evident via the significant international variation in life expectancy. To correctly perceive the general relationship between age and life satisfaction, one must adjust for the age-composition effect of mortality: if there are contextual factors that increase early mortality and also tend to reduce life satisfaction, the absence (from the sample) of older respondents affected by those patterns will artificially raise the coefficient of an age variable describing the experience of older people. The same possibility pertains to cohort and period effects. One can also include sex for this purpose: it too can affect the age composition of a population and might help determine life satisfaction. Crucially, it is ‘safe’ as a control variable because it cannot be an intervening variable (age cannot determine one’s sex).
Following Yang (2008), cohort is constructed in five-year spans working backwards from the year of birth of the youngest respondent in the Wave 6 sample. The use of five-year spans for the cohort variable allows analysts to ‘break’ the linear dependency that otherwise impairs age–period–cohort analysis (without making arbitrary assumptions about time trends). The variable for period is simply the year in which the respondent participated in the survey.
The analysis presents results from cross-classified random-effects models (CCREM), developed by Yang (2008) and Yang and Land (2008) for investigation of age effects using repeated cross-sectional datasets (as with the WVS here). ‘Random effects’ address the clustering associated with the (Level 2) country, cohort and period variables, using a multi-level modelling framework; this approach is preferable to entering these variables as fixed effects in a way that ignores the clustering. The higher-level variables are not fully nested within one another; the ‘cross-classification’ angle allows in particular for the possibility that people in the same birth cohort might be interviewed in different periods (and vice versa). Within this framework, age and sex are entered as fixed effects at Level 1; in this respect the model yields coefficients for those variables that are equivalent to the results given by ordinary least-squares models. 4
The conventional approach for this question is to construct models containing a quadratic function for age; that is, age-squared as well as age. But this approach produces coefficients that are difficult to interpret.
5
If both terms are statistically significant, one is led to conclude that there is indeed a U-shape, but it is difficult to determine
Some researchers (e.g. Cheng et al., 2017; Jivraj et al., 2014; Kassenboehmer and Haisken-DeNew, 2012) turn to panel data, believing that a longitudinal analysis is superior to inferring change from cross-sectional comparison of older to younger people. If there were panel data that covered people’s entire adult lives, it would indeed be very useful for the purpose of understanding how life satisfaction changes across the life course. But existing panels are much shorter; to understand what will happen to younger people as they become much older, it is still necessary to extrapolate from the ‘current’ experience of much older people (and vice versa), in a manner equivalent to cross-sectional analysis (cf. Frijters and Beatton, 2014). Another reason to use panel data is to enable application of fixed-effects techniques that control for unmeasured time-invariant characteristics. If there were grounds for concern about the existence of unmeasured characteristics that amounted to potential confounders in this context, then one could make a good case that a fixed-effects analysis is superior. But we return to a core premise of this article: other individual-level factors (even the unmeasured ones)
A third potential advantage of longitudinal methods is that they enable consideration of ‘reverse causation’: perhaps people live longer when they are more satisfied with their lives, and if we do not account for the way this process leads to lower representation of older less-satisfied people in the sample, a cross-sectional analysis gives a biased result for the effect of age on life satisfaction (the main relationship under consideration) via sample composition. Importantly, the direction of that bias would be upwards especially for older people: if the sample contained people whose death had been caused in part by poor life satisfaction, the result describing the overall effect of ageing would be lower than it appears in a conventional cross-sectional analysis based on living respondents. We then face a trade-off: a cross-sectional approach might give results that are biased in the way we have just described, but on the other hand we can consider a much wider range of countries, so that our impressions are not determined solely by countries where panel data are available. Given this trade-off, a cross-sectional analysis is worth pursuing, and the possibility of bias can be considered a potential limitation (and will be discussed further in connection with results below).
Table 1 offers univariate information on the variables included in the analysis.
Results
Table 2 first reports results from separate models for older and younger respondents. The left portion of the table (Model 1) indicates that life satisfaction indeed declines across the younger age range – at an average rate of 0.0082 points per year on the 10-point scale. This is not a greatly ‘significant’ number in a substantive sense, though it is perhaps a genuine decrease. (After 10 years, a younger person’s life satisfaction will have declined by almost one-tenth of a point on the 10-point scale, on average.)
Cross-classified random-effects models of life satisfaction.
But Model 2 fails to provide support for the notion that life satisfaction
This latter finding is a significant blow against the idea of ‘U-shape’. It is robust to the choice of cut-off; for no value do we get a result suggesting any more substantial rise in life satisfaction after middle age (the additional values explored were 50, 55, 60 and 65; results available on request). Without some substantial increase after middle age, there can be no U-shape.
The results from Model 3 of Table 2 reinforce this conclusion: here as well there is no indication of an upturn in post-middle-age life satisfaction. In a conventional model containing a quadratic function for age, the coefficient of the age-squared term is negative (as is the coefficient for raw age), though also tiny at −0.00002 and not statistically significant.
Figure 1 offers a visual representation of the conclusions we derive from the linear analyses here (Models 1 and 2): the lines indicate a mostly ‘flat’ trend in life satisfaction especially among older respondents. A visual representation of the results from Model 3 would consist of a flat line, at least in the sense that the age coefficients from that model are not statistically significant.

Age and life satisfaction.
The WVS sample across the six rounds contains a wide range of countries, and we should expect to find diversity in findings across that range (Deaton, 2008; Steptoe et al., 2015). Table 3 replicates the analysis described above (with cut-off at 45)
8
for each of the 69 countries (a random-effect term for country is now excluded). The table (containing only the coefficients for age) indeed reveals a diverse set of patterns. For some countries we see results that support the conclusion of U-shape: life satisfaction generally falls and then rises in the Czech Republic, Mexico and Turkey. That conclusion uses a threshold of
Age and life satisfaction by country and age group (CCREM regressions, controlling for sex, cohort and period).
What remains is to consider the possibility that some potential individual-level control variables might in this context be confounding as well as intervening. The possibility of a confounding effect exists only at the level of sample composition; individual-level variables cannot determine how old a person is, they can only affect sample composition by affecting the likelihood of early mortality. An obvious candidate is health: to state the obvious, people in poor health (especially older people in poor health) are at greater risk of dying. To mitigate against bias emerging from that possibility, one could consider including health as a control. But the costs of doing this surely outweigh any benefits. Any confounding bias arising from the omission of health is likely to be dwarfed by the bias induced by including it, given the way health intervenes in the relationship between age and life satisfaction. Age generally leads to poorer health; poorer health leads to lower life satisfaction. If we control for health, we will get an artificially raised coefficient for the effect of age. That coefficient is now a direct effect, net of the (negative) indirect effect of age that travels via its effect on health. If we interpreted it as the total effect, we would be presenting a figure that we have good reason to think is substantially biased. In contrast, the confounding bias associated with omitting health is by definition marginal; it does not pertain to the experience of people still alive (who are thereby available to participate in surveys). Including it is likely to exacerbate bias, not reduce it.
Empirical results verify intuitions about the consequences of including health. In models equivalent to those in Table 1 (adding a five-category variable for subjective health: very poor; poor; fair; good; very good), the age coefficient for those under 45 is now less negative (–0.0020 instead of −0.0082), and for those over 44 the age coefficient is more positive (0.0109 instead of 0.0025). Adding health has raised the age coefficients – but it has not given us a better understanding of the way age affects life satisfaction, expressed as a total effect (so, a table is not included here). Instead, it has obscured the way age often leads to worse health, which then depresses life satisfaction; the age coefficients are artificially high as a consequence of excluding that indirect effect.
One could consider other variables (e.g. education, income, partner) in similar terms. In each case, the fact that any confounding effect would operate only via sample composition means that inclusion would predictably induce bias via intervening effects.
We note again the potential implication of bias in cross-sectional results, emerging from the possibility of reverse causation: people with lower life satisfaction might experience poorer health (Steptoe et al., 2015), which might lead to early mortality. We can do more than simply accept a generic limitation for our cross-sectional results in connection with this possibility (‘the results might be biased’): we can consider the likely direction of bias. As argued above, the bias is likely positive – so, unbiased results for the impact of age on life satisfaction are likely to be lower than what is reported above, especially for older people. If low life satisfaction increases the odds of mortality, we will lack data on some people who have low life satisfaction especially in old age. The absence of such people from our sample means the coefficient describing the age impact on older people is higher than it would otherwise be. The main substantive result of this article is that there is no substantial increase in life satisfaction after middle age (and thus no U-shape) – and the possibility of reverse causation does not undermine that conclusion (and in fact would tend to reinforce it).
Conclusion and Implications
In a key contribution to this debate, Blanchflower and Oswald (2008) ask (in their title): ‘is well-being U-shaped over the life cycle?’ At least for the countries analysed here, that question has a straightforward answer: no. That answer applies to these countries taken as a whole and individually to 66 of the 69. One might be tempted to draw on the statistically significant age coefficient for the regression model pertaining to older respondents. But a responsible reporting of that finding should contain the following caveat: if there is a U-shape, the actual change in life satisfaction associated with age is very small, with only a negligible upturn in later life.
This answer is rooted in the assertion that the right answer comes from models without individual-level controls apart from sex (so, including controls mainly for the higher-level variables of country, cohort and period). Blanchflower and Oswald (2019) offer a rebuttal that might seem appealing: neither approach is incorrect – each gives different information. 10 That position is appealing only on the surface; it holds up in regard to the ‘control for other determinants’ position only if the results from that approach are interpreted correctly – that is, as the direct effect of age when the indirect effects of age have been controlled away. Again, that interpretation is correct, but results are usually not articulated with that sort of precision. 11
Should researchers continue the common practice of including age as a control variable in models intended for other purposes? Yes. Even if there is no general U-shape, age affects people’s life satisfaction in a large number of countries. It also affects other aspects of their situations (e.g. income, health, marital status, etc.). It is therefore a plausible confounder in most situations – and it cannot be an intervening variable.
Whether it makes sense to include an
What is genuinely needed is a more detailed and substantive justification for selection of control variables in one’s model. Many researchers provide virtually no justification at all: one commonly sees (certainly in research on happiness/life satisfaction) a notion of a ‘standard set’ of controls. At a minimum, researchers need to use the core distinction emphasized here, between confounding and intervening variables. Heeding this distinction could be consequential especially for research exploring the effects of characteristics that for most people are very unlikely to change (e.g. sex and ‘race’), and for questions pertaining to the impact of experiences earlier in life, where control variables might create indirect effects on paths leading to the dependent variable.
Even if there is no apparent upturn in life satisfaction after middle age, it is perhaps remarkable that life satisfaction does not generally decline as people become old. Of course, in some countries there
Footnotes
Acknowledgements
I am very grateful for feedback on earlier drafts from Martijn Hendriks, Patrick White and Jing Shen. Early versions of the article were presented to the Mannheim Centre for European Social Research (March 2019) and to the Wittgenstein Centre Conference 2019 on ‘Demographic Aspects of Human Wellbeing’.
Funding
The author received no financial support for the research, authorship and/or publication of this article.
