Abstract
In a recent publication, we applied a novel model to address phenotypic heterogeneity in genetic research on alcohol misuse by stratifying individuals based on their patterns of alcohol use behaviours and comorbid psychopathology. In this Commentary, we provide further descriptions of the subtypes of alcohol misuse that emerged from the empirical mixture modelling approach and present new results comparing these groups on sociodemographic characteristics and additional alcohol use outcomes. We take a broad perspective to discuss how these results fit with existing typologies of alcohol misuse and how the results inform future genetic research. Our findings add further evidence that conceptualisations of a binary distinction between ‘internalising’ (relief-seeking) versus ‘externalising’ (reward-seeking) subtypes does not fully capture the complexity of alcohol misuse. However, accounting for individual differences in these dimensions is a promising means to reduce heterogeneity and thereby improve power for gene discovery and, eventually, personalised medicine applications. We argue that more detailed, person-specific assessment of alcohol misuse measures, particularly with attention to longitudinal trajectories, is needed to further advance this important line of research.
Overview
Our recent study 1 represents one of the few existing attempts to incorporate the well-known phenotypic heterogeneity of alcohol misuse 2 into large-scale genetic association (GWAS) research. Using a mixture modelling approach, we empirically identified four subgroups of adults based on their distinct patterns of alcohol use behaviours and comorbid psychopathology. These included a group with low levels of all psychopathology and alcohol misuse (‘low risk’), a group with anxiety and depression symptoms but little current alcohol use (‘internalising’) and two groups with similarly high levels of alcohol use but of which only one had high rates of alcohol use disorder (AUD) diagnoses and other psychopathology (‘heavy use, low problems’ vs. ‘broad risk’ groups). Using GWAS and bioinformatics tools, we identified numerous genetic differences between the groups. This multidimensional approach generated advances in gene discovery for alcohol misuse over previous studies using simpler measures such as AUD diagnoses or overall alcohol consumption levels. In this Commentary, we will expand upon the relevance of the modelling approach for uncovering the etiology of alcohol misuse and provide further results to support an understanding of the similarities and differences between groups.
Methodology
As previously described, 1 data were drawn from N = 410,414 individuals from the UK Biobank, a population-based cohort of adults in the UK with a broad array of assessments including surveys, biological samples and linked medical records. Each individual was assigned to one of four latent classes (low risk, n = 105,142; internalising, n = 125,318, heavy use, n = 94,731 and broad risk, n = 85,770) based on their patterns of self-reported symptoms/behaviours and recorded diagnoses related to internalising, externalising and alcohol use disorders. Mixture modelling was carried out in Mplus 3 to identify the best fitting model and match individuals to a latent class based on their highest posterior probability of class membership. In the current study, we make use of the previously derived latent classes to further characterise the similarities and differences between these groups. We compared the classes on 10 selected sociodemographic characteristics and alcohol-related diagnoses that were not included in the latent class definitions. Exploratory comparisons of the means/proportions between latent classes were conducted using ANOVA tests for continuous variables and χ2 tests for dichotomous variables, with a two-tailed Bonferroni-corrected significance threshold (p < .05/10 = .005). Furthermore, we repeated the previous mixture modelling approach 1 (using the same methods and input variables) within two subsets of younger (age ≤ 55 at intake; n = 237107) and older (age > 55, n = 173854) participants with the UK Biobank cohort in order to qualitatively compare the latent class patterns for indications of developmental trends. Results from these additional analyses are presented in context of the discussion of the original results, below.
Internalising/externalising typology of alcohol misuse
Alcohol misuse frequently co-occurs with an array of different psychiatric disorders. 4 This study was built upon a substantial body of theory and previous research pointing to an etiological distinction between ‘internalising’ drinkers, whose motivation stems from seeking relief from negative emotions, and ‘externalising’ drinkers, whose reward-seeking patterns of drinking are driven by behavioural undercontrol.5–9 Within this typology, internalising alcohol problems are thought to be more environmentally driven and externalising problems more heritable, making genetic research especially relevant. There is also evidence that environmental influences are responsible for the overlap between AUD and internalising spectrum disorders, while genetic influences account for the overlap between AUD and externalising spectrum disorders. 10 We aimed to leverage this knowledge to improve power for gene discovery within these distinct subtypes.
In this new study, 1 we did find evidence of an internalising class, but not one linked to alcohol problems. Rather, the internalising class had the highest proportion of lifelong abstainers and former drinkers and the lowest levels of current alcohol consumption. This would seem to suggest that a predisposition to internalising psychopathology (in the absence of other psychiatric disorders) may be protective against alcohol problems. Additionally, a specific externalising class did not emerge – although we should note that the externalising measures available in the UK Biobank sample were limited. Instead, we saw that alcohol problems were most prominent in the ‘broad’ class whose members exhibited both internalising and externalising psychopathology. The ‘heavy’ class additionally had high levels of alcohol use but remained relatively unburdened by other mental health problems. Although this class structure was not in line with the expected internalising/externalising typology, it is consistent with some studies that have found the groups to be more muddled or found that the co-occurrence of internalising and externalising symptoms is associated with alcohol misuse rather than wholly separate subtypes.11,12
In conjunction with the broader literature, our findings suggest that the internalising/externalising typology, while indexing important dimensions of individual variation, is not sufficiently nuanced to reflect the underlying etiology of alcohol misuse. In particular, this typology fails to capture longitudinal changes and (neuro)adaptations relevant to alcohol misuse. It is plausible that the latent class structure found in our study reflects the older age of the UK Biobank sample. An externalising subtype may have been evident had we looked earlier in life, before these individuals may have developed comorbid internalising psychopathology as a consequence of chronic heavy alcohol use. This would be consistent with the implications of the predominant model of the neurobiology of addiction, 13 which suggests that withdrawal-driven relief-seeking (internalising) processes become salient after chronic alcohol consumption, even if the initial motivations were reward-seeking (externalising). Indeed, the internalising/externalising distinction is more clearcut in some younger samples, 7 indicating that the inconsistency in support for this typology may be due to developmental changes in motivations for drinking across the lifespan. 14 It may be the case that the observed ‘protective’ effect of internalising psychopathology is also an artefact of the sample age. If an internalising pathway to alcohol misuse is indeed more environmentally influenced, 10 individuals in the internalising class might be more responsive to environmental moderators of alcohol use behaviours. In particular, this group could exhibit low current levels of alcohol use due to a sensitivity to negative alcohol-related consequences experienced earlier in life (perhaps more easily experiencing guilt or social pressures), spurring them to quit or reduce drinking. Such patterns cannot be accounted for with data that records only a snapshot of behaviour at one point in the lifespan.
To examine these possible alternative explanations, we carried out two exploratory analyses expanding on our previously published results. First, to investigate whether age differences may have biased the observed latent class structure, we stratified the UK Biobank data by age (older/younger than 55) and repeated the mixture modelling analysis. We found that the pattern of results was very consistent across both younger and older individuals (Figure 1). Second, we investigated whether the internalising group exhibited more characteristics of ‘responsiveness’ to negative consequences of their alcohol use by carrying out exploratory comparisons of the latent classes on specific alcohol-related diagnostic codes from lifetime medical records. ANOVA and chi-square tests (Table 1) showed that the internalising class had the lowest levels of recorded alcohol problems such as abuse or alcoholic liver disease, although they had the highest levels of a recorded ‘personal history of alcoholism’. This may indicate that high-internalising individuals are quicker to perceive their drinking as problematic and reduce alcohol use before physiological consequences manifest.

Mixture model results in (a) younger (age ≤ 55; n = 237,107) and (b) older (age > 55; n = 173,854) individuals from the UK Biobank. Note: Probabilities and standardised mean differences are presented in separate panels. MDD: major depressive disorder; PD: panic disorder; GAD: generalised anxiety disorder; SP: specific phobia; TUD: tobacco use disorder; SUD: substance use disorder; AUD: alcohol use disorder.
Comparison of selected medical, environmental and demographic characteristics between classes.
Field code from which the measures were derived, as listed in the UK Biobank showcase (https://biobank.ctsu.ox.ac.uk/showcase/). Two-tailed Bonferroni-corrected significance threshold = .005.
Our exploratory analyses, however, are limited by the age range (middle to older adulthood) and level of assessment detail available in UK Biobank. Formal comparisons using designs such as latent transition analysis in longitudinal samples are needed to explore how the internalising/externalising typology may manifest across salient developmental milestones starting in adolescence or younger adulthood. To fully answer these questions, a greater focus on trajectories of health and behaviour across the lifespan is essential, especially in genetic research of complex phenotypes.
Genetic comparisons of subgroups
The characteristic that separates the study by Thijssen et al. 1 from previous typologies of alcohol misuse is the comparison of groups on their underlying genetic differences. Using GWAS, we identified 96 regions of the genome at which allele frequencies differed significantly between latent classes, including many which were not found in previous studies of simpler, unidimensional measures of alcohol misuse. This study therefore highlighted the value of addressing phenotypic heterogeneity in genetic research. The strongest genetic signal distinguished the two low alcohol use classes (low risk; internalising) from the two high alcohol use classes (heavy; broad), most strongly broad from internalising. These genetic influences, however, did not map directly onto risk pathways earlier in life. Using a college student sample in which we previously observed distinct internalising and externalising subtypes of drinkers, 7 we found that genetic variants associated with the broad latent class corresponded to the internalising subtype in young adulthood, but neither the broad nor heavy class showed a genetic overlap with the young adult externalising subtype. This finding may simply reflect the lack of externalising measures in the UK Biobank, but may again highlight the need to consider important etiological changes across the lifespan.
An additional finding of interest was the lack of any robust genetic effects distinguishing the heavy from the broad class. These are perhaps the most relevant classes to compare, as they differ primarily on clinically significant symptoms/disorders while both having high levels of alcohol consumption. All pairwise class comparisons had similar statistical power (80,000–110,000 individuals in each of the four groups), yet only for the comparison of the heavy and broad classes were no genome-wide significant loci found. This suggests that environmental exposures, in the context of a similar genetic predisposition to alcohol misuse, may determine the manifestation of clinically relevant psychopathology. Exploratory comparisons of the latent classes on environmental measures such as trauma exposure, socioeconomic indicators and social support (Table 1) showed no dramatic differences in these measures between the heavy and broad groups which might explain their divergence in alcohol-related problems (although all comparisons are statistically significant given the large sample size). Much work remains to understand the reasons – both genetic and environmental – for individual differences in the propensity to develop alcohol misuse.
Conclusion
Overall, our recent study adds to the growing body of literature highlighting the complexity of the etiology and manifestation of alcohol misuse. Multidimensional deep phenotyping can be an innovative means to uncover some of the otherwise hidden genetic influences, although unravelling the biological mechanisms behind those influences is no trivial task. Our findings at both the phenotypic and genetic level point to the critical need to incorporate longitudinal data and developmental perspectives to push forward research of the genetic etiology of alcohol misuse.
Footnotes
Acknowledgements
The research has been conducted using the UK Biobank Resource (application no. 16406). We would like to thank many UK Biobank participants and staff.
Author contribution
J.E.S. contributed to conceptualization, data curation, formal analysis, writing – original draft and funding acquisition. A.B.T. contributed to data curation, formal analysis and writing – review and editing.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Ethical approval statement
This project involved secondary analysis of data from the UK Biobank. The National Research Ethics Service Committee North West–Haydock approved this initiative (reference 11/NW/0382) and data were accessed under application #16406.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the grants to J.E.S. from the Amsterdam Neuroscience Alliance Project and the Netherlands Organization for Scientific Research (NWO VENI VI.201G-064).
Author biographies
Jeanne E Savage is a research associate in the Department of Complex Trait Genetics at Vrije Universiteit Amsterdam. She received a BA in Psychology from the University of Missouri and a PhD in Psychiatric, Behavioral, and Statistical Genetics from Virginia Commonwealth University. She researches the interplay between genes, environment, and development on mental health, with a focus on substance use disorders.
Anaïs B Thijssen is a PhD student in the Department of Psychiatry at the Amsterdam UMC. She received her master's in Psychology at Utrecht University and her master's in Neuroscience at Vrije Universiteit Amsterdam. Her research focuses on the genetics of psychiatric disorders, with the aim of better understanding the similarities and differences both within and between psychiatric disorders.
