Abstract
The UPPS-P Impulsive Behavior Model and the various psychometric instruments developed and validated based on this model are well established in clinical and research settings. However, evidence regarding the psychometric validity, reliability, and equivalence across multiple countries of residence, languages, or gender identities, including gender-diverse individuals, is lacking to date. Using data from the International Sex Survey (N = 82,243), confirmatory factor analyses and measurement invariance analyses were performed on the preestablished five-factor structure of the 20-item short version of the UPPS-P Impulsive Behavior Scale to examine whether (a) psychometric validity and reliability and (b) psychometric equivalence hold across 34 country-of-residence-related, 22 language-related, and three gender-identity-related groups. The results of the present study extend the latter psychometric instrument’s well-established relevance to 26 countries, 13 languages, and three gender identities. Most notably, psychometric validity and reliability were evidenced across nine novel translations included in the present study (i.e., Croatian, English, German, Hebrew, Korean, Macedonian, Polish, Portuguese—Portugal, and Spanish—Latin American) and psychometric equivalence was evidenced across all three gender identities included in the present study (i.e., women, men, and gender-diverse individuals).
Keywords
Introduction
Impulsivity is a psychological construct included in most prominent personality models (Whiteside & Lynam, 2001) and is one of the most frequently encountered diagnostic criteria in nosography manuals (American Psychiatric Association, 2022; World Health Organization, 2019). Consistently, impulsivity is transdiagnostically implicated in the etiology of numerous psychopathological and neurological disorders (Berg et al., 2015; Evenden, 1999; Rochat et al., 2018). Among the most dominant impulsivity models, the UPPS-P Impulsive Behavior Model (Cyders et al., 2007; Whiteside & Lynam, 2001) conceptualizes impulsivity as a multidimensional construct encompassing five distinct facets, namely, (a) lack of premeditation (lack of reflection on the potential consequences of actions preceding their emission), (b) positive urgency (emission of sudden actions in intense positive emotional contexts), (c) sensation seeking (tendency to appreciate and seek excitement and to be open to new experiences), (d) negative urgency (emission of sudden actions in intense negative emotional contexts), and (e) lack of perseverance (difficulty focusing on the completion of demanding or monotonous tasks).
Initially, the UPPS-P Impulsive Behavior Model was developed based on four different impulsivity-related facets in the Revised NEO Personality Inventory (i.e., impulsiveness, excitement seeking, self-discipline, and deliberation) (Costa & McCrae, 2008) and 17 classic scales or subscales measuring impulsivity (Whiteside & Lynam, 2001). By federating 21 coexisting conceptualizations of impulsivity and thereby correcting the jingle (i.e., distinct constructs designated by one same label) and jangle (i.e., distinct labels designating one same construct) fallacies that characterized the research field of impulsivity, the UPPS-P Impulsive Behavior Model received great interest and exerted a significant impact on subsequent impulsivity research. Several psychometric instruments were developed and validated based on the UPPS-P Impulsive Behavior Model, such as the original 59-item UPPS-P Impulsive Behavior Scale (Cyders et al., 2007; Whiteside & Lynam, 2001), the 20-item short French version of the UPPS-P Impulsive Behavior Scale (Billieux et al., 2012), and the 20-item short English version of the UPPS-P Impulsive Behavior Scale (Cyders et al., 2014). Given their well-established psychometric properties and relevance for various problematic behaviors and mental disorders, UPPS-P Impulsive Behavior Scales were adapted to numerous languages (Bteich et al., 2017; Cándido et al., 2012; d’Orta et al., 2015; Wang et al., 2020; Zsila et al., 2020) and populations such as children (Geurten et al., 2021), adolescents (d’Acremont & Van der Linden, 2005), patients in psychiatric emergency settings (Dugré et al., 2019), patients with substance use disorders (Calzada et al., 2017; Kempeneers et al., 2023; Sánchez-Domínguez et al., 2022), and patients with neurological disorders (Rochat et al., 2008, 2010).
Recently, several studies examined and fully established the measurement invariance of UPPS-P Impulsive Behavior Scales across different groups, such as age-related (Argyriou et al., 2020; Donati et al., 2021; Wang et al., 2020), ethnicity-related (Liu et al., 2023; Stevens et al., 2018; Watts et al., 2020), gender-identity-related (Donati et al., 2021; Gialdi et al., 2021; Watts et al., 2020), and sex-related (Argyriou et al., 2020; Cyders, 2013) groups. Measurement invariance assesses whether the assumption of equivalence of a psychological construct—as measured by a corresponding psychometric instrument—holds across certain defined groups, which is a prerequisite for suggesting that the said psychological construct has comparable meaning for these groups (Jeong & Lee, 2019; Milfont & Fischer, 2010; Putnick & Bornstein, 2016). However, despite the extensiveness of UPPS-P-related research, no study published to date has examined the measurement invariance of UPPS-P Impulsive Behavior Scales across country-of-residence-related or language-related groups, and although three studies published to date have examined the measurement invariance of UPPS-P Impulsive Behavior Scales across gender-identity-related groups (Donati et al., 2021; Gialdi et al., 2021; Watts et al., 2020), none included gender-diverse individuals. Therefore, evidence is lacking to date regarding whether the impulsive behavior dimensions—as assessed by UPPS-P Impulsive Behavior Scales—have comparable meaning for individuals across the latter groups and whether one can engage in cross-group comparison analyses of the composite factor scores of UPPS-P Impulsive Behavior Scales. Moreover, in the interest of including country-of-residence-related, language-related, and gender-identity-related groups for which evidence is lacking to date, such evidence is critically warranted to support the integration of the assessment of impulsive behavior dimensions in clinical and research settings with respect to underrepresented and underserved groups.
In the present study, to address the abovementioned gaps, we probed the preestablished five-factor structure and the measurement invariance of the 20-item short version of the UPPS-P Impulsive Behavior Scale developed and validated by Billieux et al. (2012) across 34 country-of-residence-related, 22 language-related, and three gender-identity-related groups, with the overarching aims to examine whether (a) psychometric validity and reliability and (b) psychometric equivalence hold across different groups.
Method
Participants and Procedure
The total sample comprised participants recruited in the context of the International Sex Survey (Bőthe et al., 2021), a collaborative study conducted across 42 countries 1 , all of which received ethical clearance directly from local ethics committees or indirectly from the principal investigators’ institution’s local ethics committee (e.g., the Institutional Review Board of the Eötvös Loránd University, Budapest, Hungary). Detailed ethical information is available from the Open Science Framework (https://osf.io/e93kf). Participation consisted of completing online sociodemographic information questions and self-administered psychometric instruments, one of which, the 20-item short version of the UPPS-P Impulsive Behavior Scale (Billieux et al., 2012), was investigated in the present study.
The total sample included 82,243 participants from the general population of legal age residing across 42 countries and speaking 26 different languages. The age of the participants ranged between 18 and 99 (M = 32.391, SD = 12.524) years, with 56.995% identifying as women, 39.577% as men, and 3.384% as gender-diverse individuals. Detailed sociodemographic information is available from the Open Science Framework (https://osf.io/cj658).
Materials
The 20-item short version of the UPPS-P Impulsive Behavior Scale (Billieux et al., 2012) is a self-administered psychometric instrument. This instrument assesses the applicability of 20 statements related to the five different dimensions of the UPPS-P Impulsive Behavior Model, namely, (a) lack of premeditation (e.g., Item 13: “I usually make up my mind through careful reasoning.”), (b) positive urgency (e.g., reverse-scored Item 15*: “I tend to act without thinking when I am really excited.”), (c) sensation seeking (e.g., reverse-scored Item 9*: “I quite enjoy taking risks.”), (d) negative urgency (e.g., reverse-scored Item 12*: “I often make matters worse because I act without thinking when I am upset.”), and (e) lack of perseverance (e.g., Item 8: “I finish what I start.”) (Cyders et al., 2007; Whiteside & Lynam, 2001). Each of the five instrument’s dimensions includes four items that are scored (or reverse-scored) on a 4-point Likert-type scale (from 1 = strongly agree to 4 = strongly disagree) and that provide composite factor scores likewise ranging from 1 (i.e., the lowest level of endorsement of the corresponding impulsive behavior dimension) to 4 (i.e., the highest level of endorsement of the corresponding impulsive behavior dimension).
In the context of the International Sex Survey, the 20-item short version of the UPPS-P Impulsive Behavior Scale (Billieux et al., 2012) was first adapted from French—its original language—to English based on prevalidated English items from the original 59-item UPPS-P Impulsive Behavior Scale (Cyders et al., 2007; Whiteside & Lynam, 2001), then adapted from English to all target languages (for which no corresponding validated translation was available) following a preestablished translation protocol (Beaton et al., 2000). Detailed information regarding the materials in all 26 languages included in the International Sex Survey is available from the Open Science Framework (https://osf.io/b5tdw).
Data Analytic Plan
Data analysis was performed following a preregistered data analytic plan available from the Open Science Framework (https://doi.org/10.17605/OSF.IO/DK78R). All analyses were performed using R version 4.3.0 (R Core Team, 2023). Detailed information regarding the analyses and the code is available from the Open Science Framework (https://doi.org/10.17605/OSF.IO/UVPC2; https://doi.org/10.17605/OSF.IO/AS8R5). The corresponding data are not available from the Open Science Framework as the International Sex Survey involves sensitive data.
Item-level and construct-level missingness due to partial response rate (1.160% of the participants responded to between one and all but one item) on the 20-item short version of the UPPS-P Impulsive Behavior Scale was handled through multiple imputations (i.e., five iterations of five imputations of predictive mean matching) (Newman, 2014) using the R package mice version 3.16.0 (van Buuren et al., 2023).
Confirmatory factor analyses and measurement invariance analyses of the 20-item short version of the UPPS-P Impulsive Behavior Scale were performed within the framework of structural equation modeling analysis using the R packages lavaan version 0.6-15 (Rosseel et al., 2023) and semTools version 0.5-6 (Jorgensen et al., 2022).
Confirmatory factor analyses of the 20-item short version of the UPPS-P Impulsive Behavior Scale were performed with respect to its preestablished five-factor structure on the total sample (N = 82,243) and on all country-of-residence-related, language-related, and gender-identity-related groups that presented sufficient subsample size (n≥460) according to Monte Carlo simulation analyses (Type I error probability α = 0.050; Type II error probability β = 0.800) (Muthén & Muthén, 2002) conducted in the context of the preregistered data analytic plan. To fit the structural equation models, weighted least squares mean-and-variance-adjusted robust estimation methods were employed (Finney & di Stefano, 2013). To assess the quality of the structural equation models’ adjustment to the data, three conventional model fit indices were employed: the comparative fit index (CFI), the Tucker–Lewis index (TLI), and the root mean square error of approximation (RMSEA) along with its corresponding 90% confidence interval (Kline & Little, 2023). Following the preregistered data analytic plan, good fit was determined by a CFI ≥ 0.950, a TLI ≥ 0.950, and an RMSEA ≤ 0.050, while acceptable fit was determined by a CFI ≥ 0.900, a TLI ≥ 0.900, and an RMSEA ≤ 0.080 (Browne & Cudeck, 1992; F. Chen et al., 2008; Kenny et al., 2015; Marsh et al., 2005; Schermelleh-Engel et al., 2003).
Measurement invariance analyses of the 20-item short version of the UPPS-P Impulsive Behavior Scale were performed with respect to its preestablished five-factor structure on all country-of-residence-related, language-related, and gender-identity-related groups for which the abovementioned procedure yielded confirmatory factor analysis models with acceptable fit. Four measurement invariance hypotheses were sequentially and hierarchically examined by incrementally imposing cross-group equality constraints on the initial unconstrained models’ parameters (i.e., the preestablished five-factor structure of the 20-item short version of the UPPS-P Impulsive Behavior Scale) (Kline & Little, 2023). In the first examined measurement invariance hypothesis, item thresholds’ invariance, cross-group equality constraints were imposed on the model-implied non-null τ unstandardized estimates 2 (Wu & Estabrook, 2016). Item thresholds refer to the boundaries between adjacent categories in ordered observed variables by relating the latter boundaries to points on a continuous latent normal distribution (Kline & Little, 2023). Item thresholds’ invariance implies that the item thresholds with respect to the 20-item short version of the UPPS-P Impulsive Behavior Scale are equivalent across groups and, therefore, that individuals across different groups interpret the 4-point Likert-type scales of the impulsive behavior items similarly (Jeong & Lee, 2019; Milfont & Fischer, 2010; Putnick & Bornstein, 2016). In the second examined measurement invariance hypothesis, factor loadings’ invariance, cross-group equality constraints were imposed on the model-implied non-null λ unstandardized estimates (Wu & Estabrook, 2016). Factor loadings refer to the magnitudes of associations between latent variables and observed variables (Kline & Little, 2023). Factor loadings’ invariance implies that the factor loadings with respect to the 20-item short version of the UPPS-P Impulsive Behavior Scale are equivalent across groups and, therefore, that the magnitudes of associations between impulsive behavior dimensions and their corresponding items are similar for individuals across different groups (Jeong & Lee, 2019; Milfont & Fischer, 2010; Putnick & Bornstein, 2016). In the third examined measurement invariance hypothesis, item intercepts’ invariance, cross-group equality constraints were imposed on the model-implied non-null ν unstandardized estimates (Wu & Estabrook, 2016). Item intercepts refer to the observed variables’ means considering that all latent variables equal zero (Kline & Little, 2023). Item intercepts’ invariance implies that the item intercepts with respect to the 20-item short version of the UPPS-P Impulsive Behavior Scale are equivalent across groups and, therefore, that individuals across different groups who present similar levels of endorsement of the impulsive behavior dimensions also present similar levels of endorsement of their corresponding items (Jeong & Lee, 2019; Milfont & Fischer, 2010; Putnick & Bornstein, 2016). Of note, accepting all three aforementioned measurement invariance hypotheses would imply that the impulsive behavior dimensions—as assessed by the 20-item short version of the UPPS-P Impulsive Behavior Scale—have comparable meaning for individuals across different groups and that one can engage in cross-group comparison analyses of the composite factor scores of the latter psychometric instrument (Jeong & Lee, 2019; Milfont & Fischer, 2010; Putnick & Bornstein, 2016). In the fourth and last examined measurement invariance hypothesis, item residuals’ invariance, cross-group equality constraints were imposed on the model-implied non-null θ unstandardized estimates (Wu & Estabrook, 2016). Item residuals refer to the observed variables’ sum of unique and error variances (Kline & Little, 2023). Item intercepts’ invariance implies that the item residuals with respect to the 20-item short version of the UPPS-P Impulsive Behavior Scale are equivalent across groups and, therefore, that the measurement error between impulsive behavior dimensions and their corresponding items are similar for individuals across different groups (Jeong & Lee, 2019; Milfont & Fischer, 2010; Putnick & Bornstein, 2016). Following the preregistered data analytic plan, acceptable measurement invariance between sequential structural equation models’ fit was determined by a ΔCFI≥–0.010, a ΔTLI≥–0.010, and a ΔRMSEA≤ 0.015 (F. F. Chen, 2007; Cheung & Rensvold, 2002). If the latter decision rules were not met, cross-group equality constraints to the corresponding sequential structural equation models’ model-implied non-null unstandardized estimates were released based on univariate chi-square tests’ statistics (i.e., “modification indices”) until partial measurement invariance (Milfont & Fischer, 2010) was supported by the data (likewise determined by a ΔCFI≥–0.010, a ΔTLI≥–0.010, and a ΔRMSEA≤ 0.015).
Cross-group comparison analyses of the composite factor scores of the 20-item short version of the UPPS-P Impulsive Behavior Scale were performed with respect to its preestablished five-factor structure on all country-of-residence-related, language-related, and gender-identity-related groups for which measurement invariance analyses supported item intercepts’ invariance (i.e., “strong invariance”). Cross-group comparison analyses were performed using two-sided Kruskal–Wallis rank sum tests, and pairwise cross-group comparison analyses were performed using one-sided Wilcoxon rank sum tests. Cross-group comparison analysis results were interpreted in light of their probability values and effect sizes: negligible effect size was determined by an r < 0.100 or an η2 < 0.010, small effect size was determined by an r≥0.100 or an η2≥0.010, moderate effect size was determined by an r≥0.250 or an η2≥0.0625, and large effect size was determined by an r≥0.500 or an η2≥0.250 (Cohen, 1992, 2013).
Results
Factor-level and item-level descriptive analyses of the 20-item short version of the UPPS-P Impulsive Behavior Scale derived from analyses performed on the total sample (N = 82,243) yielded the descriptive values shown in Table 1.
Factor-Level and Item-Level Descriptive Analyses of the 20-Item Short Version of the UPPS-P Impulsive Behavior Scale (Billieux et al., 2012) Derived From Analyses Performed on the Total Sample (N = 82,243).
Note. M = variable’s mean; SD = variable’s standard deviation; γ1 = variable’s skew index; γ2 = variable’s kurtosis index; α = variable’s (confirmatory-factor-analysis-model-implied) Cronbach’s alpha internal consistency value; ω = variable’s (confirmatory-factor-analysis-model-implied) McDonald’s omega internal consistency value. Single asterisks indicate reverse-scored items. All reported values were obtained after handling multivariate missing data through multiple imputations (i.e., five iterations of five imputations of predictive mean matching) (Newman, 2014).
Confirmatory factor analyses of the 20-item short version of the UPPS-P Impulsive Behavior Scale performed with respect to its preestablished five-factor structure on the total sample (N = 82,243) yielded the model-implied non-null λ and φ standardized estimates shown in Figure 1. The model-implied fit indices showed an acceptable to good quality of adjustment to the total sample, N = 82,243, χ2(160) = 52,867.560, p < 0.001, CFI = 0.957, TLI = 0.949, RMSEA [90% CI] = 0.063 [0.063, 0.064]. The model-implied Cronbach’s alpha and McDonald’s omega internal consistency values are shown in Table 1.

Confirmatory Factor Analysis Model of the 20-Item Short Version of the UPPS-P Impulsive Behavior Scale (Billieux et al., 2012) Derived From Analyses Performed on the Total Sample (N = 82,243).
Confirmatory factor analyses of the 20-item short version of the UPPS-P Impulsive Behavior Scale performed with respect to its preestablished five-factor structure on all 34 country-of-residence-related, 22 language-related, and three gender-identity-related groups that presented sufficient subsample size according to the Monte Carlo simulation analyses conducted in the context of the preregistered data analytic plan (n≥460) yielded the model-implied fit indices, Cronbach’s alpha, and McDonald’s omega internal consistency values shown in Table 2. Of these, 26 country-of-residence-related, 13 language-related, and three gender-identity-related groups presented confirmatory factor analysis models with sufficient quality of adjustment to the data (see Table 2).
Confirmatory Factor Analysis Models of the 20-Item Short Version of the UPPS-P Impulsive Behavior Scale (Billieux et al., 2012) Derived From Analyses Performed on the Country-of-Residence-Related, Language-Related, and Gender-Identity-Related Groups.
Note. N = number of groups; n = variable’s subsample size; χ2 = model’s chi-square; df = model’s chi-square’s degrees of freedom; p = model’s chi-square’s probability value; CFI = model’s comparative fit index; TLI = model’s Tucker–Lewis fit index; RMSEA [90% CI] = model’s root mean square error of approximation along with its corresponding 90% confidence interval; α = group’s (confirmatory-factor-analysis-model-implied) Cronbach’s alpha internal consistency values range; ω = group’s (confirmatory-factor-analysis-model-implied) McDonald’s omega internal consistency values range. Boldfaced groups indicate acceptable confirmatory factor analysis models supported by the data. Single asterisks indicate groups that presented insufficient subsample size (n < 460) according to Monte Carlo simulation analyses (Type I error probability α = 0.050; Type II error probability β = 0.800) conducted in the context of the preregistered data analytic plan (https://doi.org/10.17605/OSF.IO/DK78R). Double asterisks indicate groups that presented confirmatory factor analysis models with nonacceptable fit determined by a CFI < 0.900, a TLI < 0.900, or an RMSEA > 0.080 following the preregistered data analytic plan (https://doi.org/10.17605/OSF.IO/DK78R). Triple asterisks indicate country-of-residence-related groups that were not included as collaborating countries and country-of-residence-related groups that were included as collaborating countries at the time of publication of the International Sex Survey’s study protocol (Bőthe et al., 2021) but that did not receive timely ethical clearance from local ethics committees (i.e., Egypt, Iran, Pakistan, and Romania). All reported values were obtained after handling multivariate missing data through multiple imputations (i.e., five iterations of five imputations of predictive mean matching) (Newman, 2014).
Measurement invariance analyses of the 20-item short version of the UPPS-P Impulsive Behavior Scale performed with respect to its preestablished five-factor structure on all 26 country-of-residence-related, 13 language-related, and three gender-identity-related groups that presented sufficient subsample size according to the Monte Carlo simulation analyses conducted in the context of the preregistered data analytic plan (n≥460) and that presented confirmatory factor analysis models with sufficient quality of adjustment to the data yielded the model-implied fit indices shown in Table 3. For country-of-residence-related and language-related groups, factor loadings’ invariance (i.e., “weak invariance”) was supported by the data (see Table 3). Several cross-group equality constraints to the initial unconstrained models’ model-implied non-null ν and θ unstandardized estimates ought to be released for item intercepts’ partial invariance (i.e., “partial strong invariance”) and item residuals’ partial invariance (i.e., “partial strict invariance”) to be supported by the data (see Table 3). For gender-identity-related groups, item residuals’ invariance (i.e., “strict invariance”) was supported by the data (see Table 3).
Measurement Invariance Analysis Models of the 20-Item Short Version of the UPPS-P Impulsive Behavior Scale (Billieux et al., 2012) Derived From Analyses Performed on the Country-of-Residence-Related, Language-Related, and Gender-Identity-Related Groups.
Note. N = number of groups; EC = cross-group equality constraints to the initial unconstrained models’ parameters (i.e., the preestablished five-factor structure of the 20-item short version of the UPPS-P Impulsive Behavior Scale) (Billieux et al., 2012); NA = no cross-group equality constraints to the initial unconstrained models’ model-implied non-null unstandardized estimates; τ = cross-group equality constraints to the initial unconstrained models’ model-implied non-null τ unstandardized estimates (i.e., item thresholds’ invariance); λ = cross-group equality constraints to the initial unconstrained models’ model-implied non-null λ unstandardized estimates (i.e., factor loadings’ invariance or “weak invariance”); ν = cross-group equality constraints to the initial unconstrained models’ model-implied non-null ν unstandardized estimates (i.e., item intercepts’ invariance or “strong invariance”); θ = cross-group equality constraints to the initial unconstrained models’ model-implied non-null θ unstandardized estimates (i.e., item residuals’ invariance or “strict invariance”); χ2 = model’s chi-square; df = model’s chi-square’s degrees of freedom; p = model’s chi-square’s probability value; CFI = model’s comparative fit index; TLI = model’s Tucker–Lewis fit index; RMSEA [90% CI] = model’s root mean square error of approximation along with its corresponding 90% confidence interval. Boldfaced cross-group equality constraints to the initial unconstrained models’ parameters indicate acceptable measurement invariance analysis models supported by the data. Single asterisks indicate that for Item 3* (sensation seeking), Item 14* (sensation seeking), Item 20* (positive urgency), and Item 10* (positive urgency), cross-group equality constraints to the initial unconstrained model’s model-implied non-null ν unstandardized estimates were released (i.e., partial item intercepts’ invariance or “partial strong invariance”). Double asterisks indicate that, for Item 3* (sensation seeking), Item 14* (sensation seeking), Item 20* (positive urgency), Item 10* (positive urgency), and Item 5 (lack of perseverance), cross-group equality constraints to the initial unconstrained model’s model-implied non-null θ unstandardized estimates were released (i.e., partial item intercepts’ invariance or “partial strong invariance”). Triple asterisks indicate that, for Item 5 (lack of perseverance), cross-group equality constraints to the initial unconstrained model’s model-implied non-null θ unstandardized estimates were released (i.e., partial item residuals’ invariance or “partial strict invariance”). All reported values were obtained after handling multivariate missing data through multiple imputations (i.e., five iterations of five imputations of predictive mean matching) (Newman, 2014).
Cross-group comparison analyses of the composite factor scores of the 20-item short version of the UPPS-P Impulsive Behavior Scale were performed with respect to its preestablished five-factor structure on groups for which item intercepts’ invariance (i.e., “strong invariance”) was supported by the data (i.e., all three gender-identity-related groups). Factor-level and descriptive analyses of the 20-item short version of the UPPS-P Impulsive Behavior Scale derived from analyses performed on all three gender-identity-related groups yielded the descriptive values shown in Table 4, two-sided Kruskal–Wallis rank sum tests yielded the results shown in Table 5, and one-sided Wilcoxon rank sum tests yielded the results shown in Table 6. All five two-sided Kruskal–Wallis rank sum tests were significant (p < 0.001); of these, four presented negligible effect sizes (η2∈ [0.004, 0.008]), whereas one presented a small effect size (η2 = 0.017) suggesting that the composite factor scores of negative urgency were not equal between women, men, and gender-diverse individuals (see Table 5). All 15 one-sided Wilcoxon rank sum tests were significant (p < 0.001; p < 0.010); of these, 13 presented negligible effect sizes (r∈ [0.011, 0.097]), whereas one presented a small effect size (r = 0.132) suggesting that the composite factor scores of negative urgency were higher for women than for men, and one presented a small effect size (r = 0.135) suggesting that the composite factor scores of lack of perseverance were lower for men than for gender-diverse individuals (see Table 6).
Factor-Level Descriptive Analyses of the 20-Item Short Version of the UPPS-P Impulsive Behavior Scale (Billieux et al., 2012) Derived From Analyses Performed on all 3/3 Gender-Identity-Related Groups.
Note. N = number of groups; n = variable’s subsample size; M = variable’s mean; SD = variable’s standard deviation; γ1 = variable’s skew index; γ2 = variable’s kurtosis index. All reported values were obtained after handling multivariate missing data through multiple imputations (i.e., five iterations of five imputations of predictive mean matching) (Newman, 2014).
Cross-Group Comparison Analyses of the 20-Item Short Version of the UPPS-P Impulsive Behavior Scale (Billieux et al., 2012) Derived From Analyses Performed on all 3/3 Gender-Identity-Related Groups.
Note. N = number of groups; H0 = null hypothesis; H1 = alternative hypothesis; H = two-sided Kruskal–Wallis rank sum test’s statistic; df = two-sided Kruskal–Wallis rank sum test’s degrees of freedom; p = two-sided Kruskal–Wallis rank sum test’s probability value; η2 = two-sided Kruskal–Wallis rank sum test’s effect size. All reported values were obtained after handling multivariate missing data through multiple imputations (i.e., five iterations of five imputations of predictive mean matching) (Newman, 2014).
Pairwise Cross-Group Comparison Analyses of the 20-Item Short Version of the UPPS-P Impulsive Behavior Scale (Billieux et al., 2012) Derived From Analyses Performed on all 3/3 Gender-Identity-Related Groups.
Note. N = number of groups; H0 = null hypothesis; H1 = alternative hypothesis; W = one-sided Wilcoxon rank sum test’s statistic; df = one-sided Wilcoxon rank sum test’s degrees of freedom; p = one-sided Wilcoxon rank sum test’s probability value; r = one-sided Wilcoxon rank sum test’s effect size. All reported values were obtained after handling multivariate missing data through multiple imputations (i.e., five iterations of five imputations of predictive mean matching) (Newman, 2014).
Discussion
The UPPS-P Impulsive Behavior Model (Cyders et al., 2007; Whiteside & Lynam, 2001) and the various psychometric instruments developed and validated based on this model are well-established in clinical and research settings. However, evidence regarding the psychometric validity, reliability, and equivalence across multiple countries of residence, languages, or gender identities, including gender-diverse individuals, had been lacking to date. In the present study, we addressed the aforementioned gaps by probing the preestablished five-factor structure and the measurement invariance of the 20-item short version of the UPPS-P Impulsive Behavior Scale (Billieux et al., 2012) across 34 country-of-residence-related, 22 language-related, and three gender-identity-related groups, with the overarching aims to examine whether (a) psychometric validity and reliability and (b) psychometric equivalence held across different groups.
First, our confirmatory factor analysis results showed that the preestablished five-factor structure of the 20-item short version of the UPPS-P Impulsive Behavior Scale yielded an acceptable to good quality of adjustment to the data across the total sample, 26 country-of-residence-related groups, 13 language-related groups, and three gender-identity-related groups. In line with and in addition to prior UPPS-P-related literature evaluating the psychometric properties of UPPS-P Impulsive Behavior Scales, our confirmatory factor analysis results suggest that the psychological construct of impulsivity—as assessed by the 20-item short version of the UPPS-P Impulsive Behavior Scale—is valid and reliable across an extended number of countries, languages, and gender identities.
With respect to the translations that were adapted from the 20-item short version of the UPPS-P Impulsive Behavior Scale (Billieux et al., 2012), validated in prior UPPS-P-related literature, and employed in our study, our confirmatory factor analysis results regarding the French (Billieux et al., 2012), Hungarian (Zsila et al., 2020), Italian (d’Orta et al., 2015), and Spanish–Spain (Cándido et al., 2012) validated translations supported and reinforced their preestablished psychometric validity and reliability. The sole exception concerns the validated Mandarin–Simplified (Wang et al., 2020) translation. Notably, it was previously highlighted that an item included in the dimension of lack of perseverance (i.e., Item 11: “Once I start a project, I almost always finish it.”) might not tap its corresponding psychological construct due to psycholinguistic factors, which are likely to account for this nuance with respect to the Mandarin–Simplified validated translation (Wang et al., 2020).
With respect to the novel translations that were adapted from the 20-item short version of the UPPS-P Impulsive Behavior Scale (Billieux et al., 2012) following a preestablished translation protocol (Beaton et al., 2000) and employed in our study, our confirmatory factor analysis results regarding the Croatian, English, German, Hebrew, Korean, Macedonian, Polish, Portuguese–Portugal, and Spanish–Latin American novel translations supported their psychometric validity and reliability. In light of these promising results, we, therefore, invite researchers to engage in further psychometric validation of the aforementioned novel translations of the 20-item short version of the UPPS-P Impulsive Behavior Scale available from the Open Science Framework (https://osf.io/b5tdw).
Second, our measurement invariance analysis results showed that, whereas measurement invariance of the preestablished five-factor structure of the 20-item short version of the UPPS-P Impulsive Behavior Scale partially held across country-of-residence-related and language-related groups, it fully held across gender-identity-related groups.
With respect to country-of-residence-related and language-related groups, our measurement invariance analysis results showed that (a) the preestablished five-factor structure of the 20-item short version of the UPPS-P Impulsive Behavior Scale, (b) the 4-point Likert-type scales of its impulsive behavior items, and (c) the magnitudes of associations between its impulsive behavior dimensions and their corresponding items were fully invariant for individuals across such groups, therefore fully establishing “weak invariance” (Jeong & Lee, 2019; Milfont & Fischer, 2010; Putnick & Bornstein, 2016). Our results also showed that (d) the levels of endorsement of its impulsive behavior items (considering similar levels of endorsement of their corresponding dimensions) and (e) the measurement error between impulsive behavior dimensions and their corresponding items were partially invariant for individuals across such groups, therefore partially establishing “strict invariance” (Jeong & Lee, 2019; Milfont & Fischer, 2010; Putnick & Bornstein, 2016). Item intercepts’ partial invariance (i.e., “partial strong invariance”) analysis results provided insight into which impulsive behavior items reflected differential item functioning (DIF). The DIF denotes instances where individuals across different groups who present similar levels of endorsement of a dimension present dissimilar probabilities of responding similarly to some or all of the said dimension’s corresponding items (Kline & Little, 2023). With respect to country-of-residence-related and language-related groups, our results highlighted five instances of DIF: two relating to the dimension of positive urgency (i.e., Item 10*: “When overjoyed, I feel like I can’t stop myself from going overboard.” and Item 20*: “When I am very happy, I feel like it is OK to give in to cravings or overindulge.”), two relating to the dimension of sensation seeking (i.e., Item 3*: “I sometimes like doing things that are a bit frightening.” and Item 14*: “I generally seek new and exciting experiences and activities.”), and one relating to the dimension of lack of perseverance (i.e., Item 5: “I generally like to see things through to the end.”). Discussing the interpretation of partial measurement invariance results is a critical notion that ought to be addressed by researchers performing measurement invariance analyses. If erroneous, suggesting that a psychological construct has comparable meaning across certain groups may yield potentially significant implications, such as biased and invalid cross-group comparisons (Jeong & Lee, 2019). In light of our measurement invariance results across country-of-residence-related and language-related groups, as (a) cross-group equality constraints to the initial unconstrained models’ parameters ought to be released for a fourth of all items for “partial strict invariance” over “weak invariance” to be supported by the data, and as (b) there is no prior UPPS-P-related literature to support theoretical justification regarding the comparability of the meaning of the impulsive behavior dimensions for individuals across such groups, we recommend adopting a conservative approach and not interpreting that the psychological construct of impulsivity—as assessed by the 20-item short version of the UPPS-P Impulsive Behavior Scale—has comparable meaning across country-of-residence-related and language-related groups. In line with our aforementioned recommendation, we, therefore, did not engage in cross-group comparison analyses of the composite factor scores of the 20-item short version of the UPPS-P Impulsive Behavior Scale with respect to the country-of-residence-related and language-related groups.
Considering the intersections between country-of-residence-related and language-related groups, it is tenable that the latter might contribute to a certain degree of convergence in our confirmatory factor analysis and measurement invariance analysis results across such groups. With respect to the psychometric validity and reliability of the 20-item short version of the UPPS-P Impulsive Behavior Scale, our confirmatory factor analysis results across country-of-residence-related groups examined in the present study reflect those of their respective majority-language-related group. For instance, the proportion of the majority-language-related group French by the country-of-residence-related group Switzerland equaled 83.042%, and the psychometric validity and reliability of the 20-item short version of the UPPS-P Impulsive Behavior Scale were supported for both latter groups. For another instance, the proportion of the majority-language-related group Czech by the country-of-residence-related group Czech Republic equaled 96.037%, and the psychometric validity and reliability of the 20-item short version of the UPPS-P Impulsive Behavior Scale were not supported for both latter groups. Although the proportion of the majority-language-related group Dutch by the country-of-residence-related group Belgium equaled 73.137%, this constitutes the sole exception among all country-of-residence-related groups examined in the present study, as the psychometric validity and reliability of the 20-item short version of the UPPS-P Impulsive Behavior Scale were supported for the latter country-of-residence-related group but not for the latter language-related group. In this perspective, our confirmatory factor analysis and measurement invariance analysis results with respect to country-of-residence-related groups are liable to be partially explained by their intersections with language-related groups and, to a lesser extent, vice versa. Detailed information regarding the intersections of country-of-residence-related and language-related groups is available from the Open Science Framework (https://doi.org/10.17605/OSF.IO/TDEJW).
With respect to gender-identity-related groups, our measurement invariance analysis results showed that (a) the preestablished five-factor structure of the 20-item short version of the UPPS-P Impulsive Behavior Scale, (b) the 4-point Likert-type scales of its impulsive behavior items, (c) the magnitudes of associations between its impulsive behavior dimensions and their corresponding items, (d) the levels of endorsement of its impulsive behavior items (considering similar levels of endorsement of their corresponding dimensions) and (e) the measurement error between impulsive behavior dimensions and their corresponding items were fully invariant for individuals across such groups. Therefore, the impulsive behavior dimensions—as assessed by the 20-item short version of the UPPS-P Impulsive Behavior Scale—have comparable meaning for individuals across the aforementioned groups (Jeong & Lee, 2019; Milfont & Fischer, 2010; Putnick & Bornstein, 2016). Our results extend those of the three studies published to date which examined and established the measurement invariance of UPPS-P Impulsive Behavior Scales across gender-identity-related groups (Donati et al., 2021; Gialdi et al., 2021; Watts et al., 2020) to gender-diverse individuals. In addition, our cross-group comparison analysis results suggested that with respect to the impulsive behavior dimensions, differences between women, men, and gender-diverse individuals were mostly negligible or small.
Our study contains limitations that ought to be acknowledged. First, the psychometric properties of the 20-item short version of the UPPS-P Impulsive Behavior Scale were evaluated by gathering construct validity and internal consistency reliability evidence. In contrast, other types of evidence (e.g., convergent validity, test–retest reliability) were not gathered. However, we do not consider this a critical limitation, as the psychometric properties of the latter psychometric instrument are well established. Second, measurement invariance analyses of the 20-item short version of the UPPS-P Impulsive Behavior Scale were performed within the framework of multiple-group confirmatory factor analysis (MGCFA), which imposes strict cross-group equality constraints and, by extension, constitutes a conservative approach to psychometric equivalence. Measurement invariance analyses can be performed within other frameworks, such as the alignment method, which does not impose strict cross-group equality constraints and, by extension, constitutes a liberal approach to psychometric equivalence (Asparouhov & Muthén, 2014). In this perspective, the framework in which measurement invariance analyses were performed might have contributed to their corresponding results. However, the alignment approach is particularly recommended when examining large numbers of groups or parameters as the computational complexity of the analysis might exceed the available space or time resources of one’s computer (Asparouhov & Muthén, 2014), and as we did not encounter the latter issue, we believe that the MGCFA constituted a suitable framework for performing measurement invariance analyses in the present study. Third, the psychological construct of impulsivity was assessed by the 20-item short version of the UPPS-P Impulsive Behavior Scale (Billieux et al., 2012) rather than its parent form—the original 59-item UPPS-P Impulsive Behavior Scale (Cyders et al., 2007; Whiteside & Lynam, 2001)—despite the loss of content validity inherent to short forms (Smith et al., 2000). However, given that—in the context of the International Sex Survey (Bőthe et al., 2021)—participation consisted of completing online sociodemographic information questions and self-administered psychometric instruments totaling a maximum of 338 items, we believe that the loss of validity is justifiable in view of the considerable time saved. Last, detailed general limitations regarding the International Sex Survey are available from the Open Science Framework (https://osf.io/6kscb).
Taken together, the results of the present study substantiate the considerable corpus of research demonstrating the psychometric validity and reliability of the 20-item short version of the UPPS-P Impulsive Behavior Scale (Billieux et al., 2012), extending the latter psychometric instrument’s well-established relevance for measuring the psychological construct of impulsivity to a total of 26 countries, 13 languages, and three gender identities. Most notably, psychometric validity and reliability were evidenced across nine novel translations included in the present study (i.e., Croatian, English, German, Hebrew, Korean, Macedonian, Polish, Portuguese–Portugal, and Spanish–Latin American) and psychometric equivalence was evidenced across all three gender identities included in the present study (i.e., women, men, and gender-diverse individuals).
Footnotes
Acknowledgements
The authors would like to thank Abu Bakkar Siddique, Anastasia Lucic, Anne-Marie Menard, Clara Marincowitz, Club Sexu, Critica, the Digital Ethics Center (Skaitmenines Etikos Centras), Día a Día, Ed Carty, El Siglo, Jakia Akter, Jayma Jannat Juma, Kamrun Nahar Momo, Kevin Zavaleta, Laraine Murray, L’Avenir de l’Artois, La Estrella de Panamá, La Voix du Nord, Le Parisien, the Lithuanian National Radio and Television (Lietuvos nacionalinis radijas ir televizija), Mahfuzul Islam, Marjia Khan Trisha, Md. Rabiul Islam, Md. Shahariar Emon, Miriam Goodridge, Most. Mariam Jamila, Nahida Bintee Mostofa, Nargees Akter, Niamh Connolly, Natasha Zippan, Rafael Goyoneche, Raiyaan Tabassum Imita, Raquel Savage, Ricardo Mendoza, Saima Fariha, SOS Orienta and Colegio de Psicólogos del Perú, Stephanie Kewley, Sumaiya Hassan, Susanne Bründl, Tamim Ikram, Telex.hu, Trisha Mallick, Tushar Ahmed Emon, Wéo, and Yasmin Benoit for their help with respect to participant recruitment and data collection.
Author Contributions
Declaration of Conflicting Interests
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: The authors declare no conflict of interest with respect to the content of this manuscript.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Loïs Fournier was supported by the Swiss National Science Foundation (SNSF) under a "Doc.CH" Doctoral Fellowship [Grant ID: P000PS_211887]. The SNSF had no role in the study design, collection, analysis, or interpretation of the data, writing the manuscript, or submitting the manuscript for publication. Zsolt Demetrovics was supported by the Hungarian National Research, Development, and Innovation Office (Grant number: KKP126835). Mónika Koós was supported by the ÚNKP-22-3 New National Excellence Program of the Ministry for Culture and Innovation from the source of the National Research, Development, and Innovation Fund. Shane W. Kraus was supported by the Kindbridge Research Institute. Léna Nagy was supported by the ÚNKP-22-3 New National Excellence Program of the Ministry for Culture and Innovation from the source of the National Research, Development, and Innovation Fund. Sophie Bergeron was supported by a Tier 1 Canada Research Chair. Lijun Chen was supported by the National Social Science Foundation of China (Grant No. 19BSH117). Rita I. Csako was supported by the Auckland University of Technology 2021 Faculty Research Development Fund. Hironobu Fujiwara was supported by a Grant-in-Aid for Transformative Research Areas (A) (Japan Society for The Promotion of Science, JP21H05173), a Grant-in-Aid for Scientific Research (B) (Japan Society for The Promotion of Science, 21H02849), a Grant-in-Aid for Scientific Research (C) (Japan Society for The Promotion of Science, 23K07013), and the Smoking Research Foundation. Roman Gabrhelík was supported by Charles University’s institutional support program Cooperatio - Health Sciences. Mateusz Gola was supported by the National Science Centre of Poland (Grant No. 2021/40/Q/HS6/00219). Joshua B. Grubbs was supported by grants from the International Center for Responsible Gaming and the Kindbridge Research Institute. Karol Lewczuk was supported by a Sonatina grant awarded by the National Science Centre of Poland (Grant No. 2020/36/C/HS6/00005). Chung-Ying Lin was supported by the WUN Research Development Fund (RDF) 2021, the Higher Education Sprout Project, and the Ministry of Education at the Headquarters of University Advancement at the National Cheng Kung University (NCKU). Christine Lochner was supported by the WUN Research Development Fund (RDF) 2021. Katerina Lukavská was supported by Charles University’s institutional support program Cooperatio - Health Sciences. Gábor Orosz was supported by an ANR grant of the Chaires de Professeur Junior of Artois University and by funding from the Dialogue Stratégique et de Gestion (Phases 1 and 2). Gabriel C. Quintero Garzola was supported by the Sistema Nacional de Investigación (SNI #073-2022) from Secretaria Nacional de Ciencia, Tecnología e Innovación (Republic of Panama). Kévin Rigaud was supported by funding from the Dialogue Stratégique et de Gestion (Phase 2). The Sungkyunkwan University Research Team was supported by the Brain Korea 21 (BK21) program of the National Research Foundation of Korea.
