Abstract
Background:
Over the past decade, wide-spreading streaming platforms have given rise to new ways of consuming TV series, but also to the potential for harmful excessive use and the concept of binge-watching addiction. This highlights a clear need for developing a deeper understanding of this emerging behavioral phenomenon that threatens human physical and mental health. Our present research endeavor intended to investigate the psychometric characteristics of a brief measure of binge-watching addiction designed on the basis of components that theoretically reflect the 6 facets operationalizing the concept the behavioral addiction, i.e. the Binge-Watching Addiction Scale (BiWAS).
Method:
We carried out a multisite research study among 4092 participants from 5 Arab countries (i.e., Palestine, Jordan, Oman, Egypt, and Lebanon) during the period September-October 2025.
Results:
Factorial Structure Analysis showed that the single-factor model that was postulated was an acceptable fit, except for the RMSEA value. Internal consistency of the instrument’s global score was satisfactory, with a Cronbach’s alpha of .78. In addition, the BiWAS featured adequate configural, metric, and scalar invariance across the 2 sexes and the 5 country groups. Furthermore, BiWAS scores showed significant positive correlations with higher smartphone addiction, increased psychological distress, more severe insomnia, and greater fatigue levels.
Conclusion:
Results indicated that the BiWAS is a psychometrically valid and reliable scale in both sexes and all countries involved in this study. It is hoped that the new scale will open up possibilities for quantitatively based investigations of binge-watching thus contributing to deepen our knowledge of this prominent phenomenon and enhance understanding of the mechanisms involved in increased vulnerability to becoming addicted. Further studies are warranted to offer additional information and a more comprehensive understanding of the scale’s performance across different populations, cultures and societies across the globe.
Introduction
Over the past decade, there has been a growing development of multiple on-demand streaming platforms such as Apple TV, Amazon Prime, Disney+, Hulu, Crunchyroll, and Netflix.
1
Due to the large availability of streaming services and digital video, a new pattern of TV viewing behavior has emerged, especially among young audience. Unlike traditional television where the watcher has to wait several days for the release of the next episode, the entire season of a TV series is made available at once by these technologies. A watcher can choose from diverse and extensive content and view as many episodes of a television show as desired. Watching TV series has become easily accessible anywhere including at home, workplace, or while traveling.
2
and via multiple devices.
3
Following this new trend, a new phenomenon has increasingly gained prominence and popularity and has rapidly become an integral part of millions of people’s daily routine worldwide,
4
which is called binge-watching. Binge-watching consists of the practice of watching multiple episodes (2-6) of a TV series sequentially in a single sitting.5,6 However, various other definitions have been used by researchers to describe the problem, such as viewing 3 to 4 episodes more than half an hour lengthy, or viewing 1 to 3 episodes in 1 sitting.
7
The prolonged involvement in this new normative way of watching TV series has led to problematic behavioral pattern
Despite the growing research on binge-watching, the increased awareness of its occurrence and potential deleterious impact, there is to date a lack of consensus on how to operationally define and measure it. 12 In the present research, binge-watching is approached as an addictive behavior because of a set of attributes reported in literature that accurately reflect and define behavioral addictions. Indeed, content of streaming media makes it particularly difficult and dissatisfying to stop watching at the end of 1 episode, which leads watchers to the desire to watch “just one more” and to feelings of “being drawn-in by a show.”13,14 This often results in viewing more episodes than initially planned or anticipated, and subsequent distressing feelings of guilt. 15 Therefore, people who engage in unhealthy patterns of binge-watching tend to lose time control and start to neglect their social, family, and school/work responsibilities, which may lead to psychosocial functioning disruption. 16 In addition, a key factor that motivates people to binge watch is relaxation and pleasure-seeking, 3 but also distraction and escapism from everyday distressing situations 17 ; these motives are commonly seen in other addictive behaviors such as online gaming addiction. 18 Researchers argued that binge-watching triggers the brain’s reward system and increases dopamine levels, as it allows watchers to meet their needs of instant gratification and immediate desire satisfaction. 11 Hence, the dopamine rush induced by binge-watching makes the individual engage in this behavior more and more. Furthermore, there is evidence that binge-watching episodes of TV series has a range of negative physical, psychological, and social consequences, 19 which resemble consequences seen in other technology addictions like Internet, smartphone, social networking sites, and gaming addictions. 20 All that said, it is worth noting that the concept of binge-watching behavior as an addiction is still evolving. It is not yet recognized by international classifications21,22 and is still in need for empirical and theoretical development. Based on these considerations, researchers called for further investigations of the addictive nature of binge-watching to clarify and better understand how a simple, rewarding daily habit can turn into an addictive, harmful behavior over time.5,23
Measurement of Binge-Watching
The assessment of the extent of engagement in TV series watching has varied widely across studies, which is mainly due to large differences in the operationalization of the binge-watching construct. 12 The majority of previous studies used quantitative estimates, such as the average duration of 1 viewing session,7,24 the average number of episodes watched per day or per viewing session,25,26 the frequency of binge-watching over the last week or month,27,28 the pace of watching a particular series,29,30 the general tendency to binge-watching, 31 or self-perceived binge-watching duration. 32 Other studies used self-developed, non-psychometrically-validated measures of binge-watching-related tendency, 33 excessiveness, 18 intention, 34 or cognitive/behavioral involvement. 35
Of note, there have been a few attempts to develop and validate psychometric tests to assess binge-watching. For instance, Orosz et al 36 created the problematic series watching scale based on the 6 component model of Griffiths. 37 The developers chose to call the measure “Problematic Series Watching” and not qualify the behavior as an addiction because there has been no evidence in 2016 regarding its health-related negative outcomes. In 2017, the Series Watching Engagement Scale (SWES) was designed to enable the differentiation between “highly engaged” and “problematic” viewers. 38 The scale is composed of 15 items and 5 factors assessing engagement (overuse, social interaction, identification, persistence, and self-development), a construct that is defined as a potential precursor for addiction. 38 However, the SWES has been critized because it involves dimensions that do not assess the behavior of binge watching per se, such as associated booster (“identification”) or motivational features (“social interaction”). 39 In 2019, the Binge-Watching Engagement and Symptoms Questionnaire (BWESQ) was developed, which comprises 40 items across 7 domains: Loss of control, Engagement, Dependency, Desire/Savoring, Positive Emotions, Binge-watching, and Pleasure preservation. 39 The BWESQ was defined by the developers as assessing a wide range of symptoms related to one’s involvement in TV series, including facets that reflect both positive expectancies (such as pleasure) and pathological high binge-watching engagement (such as loss of control and dependency). The authors argued that approach to conceptualization and measurement of binge watching was adopted to reflect the motive of binge watching as a “leisure habit,” and avoid overpathologization such a recreational behavior. However, such an approach poses the risk of overlooking the documented harmful effects addictive binge-watching may have on a person’s health.
Watching TV series can be regarded as an expanding technology addiction that mirrors other well-known addictions that apply to other technology-related online activities such as gaming, social networking sites, online shopping, online pornography, or smartphone use. Because these potential behavioural addictions are still relatively new for researchers and practitioners, there can be resistance to consider the term “addiction” as applying to these entities (e.g., de Alarcón et al 40 ). The reluctance to admit the obvious addictive potentials of technology-related activities was analogized to the era when alcohol addiction was regarded as a “personality problem.” 41 Hence, while most binge-watchers enjoy watching TV series as a leisure habit and are healthy, minimizing or disregarding the addictive risk potential of binge watching will not help unhealthy binge-watcher addicts in need for care. In this line of reasoning, Forte et al 42 developed the Binge-Watching Addiction Questionnaire (BWAQ) consisting of 20 items and 4-factor dimensions (i.e., “Dependency,” “Craving,” “Avoidance,” and “Anticipation”), which demonstrated good psychometric properties among Italian-speaking individuals. However, because it is relatively lengthy, the BWAQ can have limited utility in resource- and time-pressured real-life contexts. Overall, the plurality of definitions of the binge-watching construct and the ambiguity surrounding its conceptualization pose a major challenge to an adequate understanding of this behavior. In addition, the heterogeneity and lack of robustness of assessment of binge-watching impedes accuracy and consistency across existing research and hampers the replication and comparisons of findings of empirical studies. 12
Rationale and Purpose of This Study
In today’s technological landscape, traditional TV is experiencing a steady viewership decline while binge-watching is becoming an increasingly prominent behavior in Arab societies. 43 A study conducted among Egyptian Youth showed that 93.2% reported binge-watching (defined as viewing more than 2 episodes in 1-sitting), among them 49% reported that they either always or often binge-watch. 44 A cross-cultural study on binge-watching showed that 18.7% of Arabic-speaking participants from Egypt reported watching more than 6 episodes in 1 session, which is much higher than that reported by participants from other countries (ranging from 6.1% among Spanish-speakers to 15.4% among Chinese-speakers), and 24.6% reported a frequency of watching of once to several times a day. 45 One important barrier to advancing knowledge in this field is the lack of valid, reliable, yet convenient-to-use instruments to measure the binge-watching construct among Arabic-speakers. Echoing earlier research that has introduced brief measurement tools to the field of technology addiction, such as the Bergen Social Media Addiction Scale (BSMAS 46 ), the Bergen Facebook Addiction Scale (BFAS 47 ), and the TikTok Addiction Test (TAT, 48 a short Binge-watching addiction instrument may be particularly useful to capture core features of the construct while preserving sound psychometric properties.
Therefore, the main goal of this study was to create and validate a brief scale measuring the binge-watching construct based on the 6 key components of addiction via 1 item each. The newly developed measure is labelled Binge-Watching Addiction Scale (BiWAS). A multinational sample of Arabic-speakers is used to increase the cross-cultural generalizability of the reported findings. Our study sought to test whether the BiWAS was reliable and valid. Another aim of this work was to test and establish measurement invariance of the new scale across sex and countries. It is proposed to test the following hypotheses: (1) results from factor analysis will indicate a 1-factor structure of the BiWAS representing a unique dimension of binge-watching addiction with adequate internal consistency (Cronbach’s alpha >.70); (2) factor loadings connecting the items to the single dimension will be invariant across country and sex, indicating measurement equivalence; and (3) the BiWAS will positively correlate with relevant measures including smartphone addiction, insomnia, fatigue, and psychological distress.
Methods
Procedure and Sample
We carried out a multisite research study in 5 Arab countries (i.e., Palestine, Jordan, Oman, Egypt, and Lebanon) during the period September to October 2025. Our inclusion criteria encompassed adults from the general population, who were aged at least 18 years, who were able to read and understand Arabic, who had access to the internet, and who have been TV series viewers for at least the past 12 months. Eligible individuals who agreed voluntarily to take part in the study were administered an online survey anonymously and under conditions of confidentiality. The questionnaire was designed in Arabic using Google Forms application, and distributed to potential participants using a link shared via multiple social networking sites. Convenience and snowball sampling approaches have been adopted to collect data. Specific information about the research purpose and processes was provided to participants in the first section of the questionnaire, which was also designed to gather informed consent before allowing access to the rest of sections. Participants were also informed that they can withdraw from the study at any time without penalty and without giving reasons. The research protocol of this study was reviewed and approved by the Scientific Research Ethics Committee of Isra University, Jordan (Reference: SREC/25/09/158, Date: 15/9/2025). A total of 299 individuals ticked “No” to the informed consent statement and were asked to submit the form without going any further into the questionnaire. There were 4092 valid responses obtained and included in the analysis (Palestine: N = 520; Jordan: N = 552; Oman: N = 398; Egypt: N = 1892; and Lebanon: N = 730).
Minimum Sample Size
According to the suggestion of 20 times the number of BiWAS items, we estimated a minimum sample size of 120 participants based on the recommendation of per scale’s variables. 49
Measures
The Binge-Watching Addiction Scale (BiWAS)
The BiWAS has been designed in the Arabic language. The same approach and criteria used to develop several established measurements of behavioral addictions (such as sex addiction 50 ) were applied to develop the new scale. The items were constructed based on core criteria fitting behavioral addiction frameworks that have been established as relevant for content validity.21,37,51,52 The specific wording of items and response options were formulated on the basis of those used in scales assessing other technology addictions (such as social media, 46 TikTok addiction, 48 and conversational artificial intelligence 53 ), and were modified for the BiWAS to use only binge-watching-related terminology. For example, the modifications involve using the words “TV series” instead of the words “Social media” 46 “TikTok,” 48 “conversational artificial intelligence,” 53 or “sex.” 50
Similar to the above-mentioned measures, the BiWAS operationalizes binge-watching addiction according to the 6 components that define behavioral addictions – more particularly Gambling Disorder – following the Diagnostic and Statistical Manual of Mental Disorders (DSM) criteria. 21 The components consist of: (a) tolerance, reflecting important increase over time in TV series viewing to attain acceptable levels of satisfaction; (b) salience, designating excessive attention and preoccupation with TV series viewing; (c) mood change, reflecting improvements in mood as a result TV series viewing, (d) withdrawal, defined as unpleasant sensations (emotional or physical) that a person can experience when TV series viewing is abruptly interrupted, (e) relapse/loss of control, reflecting failure in attempts to reduce TV series viewing, uninstall, or block the app, and (f) conflict, indicating negative impacts of excessive TV series viewing on interpersonal relationships. For each criterion, 1 item was created that can be scored on a 5-point scale as follows: very rarely (1), rarely (2), sometimes (3), often (4), and very often (5). The Likert-type scale was chosen to allow for clearer evaluation of risk degrees and facilitate capturing of at-risk users. The time frame was set at the past 12-month period preceding the completion of the scale. After developing the scale, a pilot study of the Arabic version using a sample of 30 general population adults for clearer comprehensibility and cultural suitability. After pilot testing, no changes were deemed necessary.
The Smartphone Addiction Scale-Short Version (SAS-SV)
The SAS-SV consists of a self-administered instrument composed of 10 items measuring levels of smartphone addiction. 54 Each item can be rated on a 6-point from 1 (Strongly disagree) to 6 (Strongly agree), with total scores ranging from 6 to 60. A higher total score indicated greater severity of smartphone addiction. The version of the SAS-SV validated in Arabic was used, 55 which showed a Cronbach’s alpha of .93 in this study.
The Insomnia Severity Index (ISI)
The ISI is a self-rating scale consisting of 7 items that evaluate the severity of insomnia symptoms across the following domains: severity of sleep onset, sleep maintenance, early morning awakening problems, interference of sleep difficulties with daytime functioning, sleep dissatisfaction, distress caused by the sleep difficulties, and noticeability of sleep problems by others. 56 Items are scored on a 5-point Likert scale. Total scores range between 0 and 28, with greater total scores reflecting more severe insomnia. The version of the ISI validated in Arabic was used, 57 which showed a Cronbach’s alpha of .79 in this study.
The 11-Item Chalder Fatigue Scale (CFS)
The CFS is a self-report measure consisting of 11 items that assess physical and mental fatigue. 58 Response options are as follows: “much worse than usual,” “worse than usual,” “no more than usual,” and “better than usual.” The version of the CFS validated in Arabic was used, 59 which showed a Cronbach’s alpha of .90 in the present sample.
The Patient Health Questionnaire (PHQ-4)
The PHQ-4 is a short self-completed scale composed of 4 items measuring depression (2 items) and anxiety (2 items) over the past 15 days. 60 Respondents can answer each item on a scale from 0 (not at all) to 3 (almost every day), leading to a total score that ranges from 0 to 12. The version of the PHQ-4 validated in Arabic was used in our study, 61 which showed a Cronbach’s alpha of .85.
Sociodemographic Data
Participants were requested to provide information about their sex, age, country, educational level, occupation, and marital status. Financial burden level was assessed using the Single-Item Financial Stress Scale (SIFiS), 62 which consists of the following: “How stressed do you feel about your personal finances in general?”. The item can be answered on a 10-point scale ranging from 1 (“No Stress at All”) to 10 (“Overwhelming Stress”).
Data Analysis
We conducted the Exploratory Factor Analysis (EFA) and calculated reliability indices using FACTOR 12.04.01, 63 followed by Confirmatory Factor Analysis (CFA) with R software. To evaluate the internal structure of the scale, the full sample was randomly divided into 2 groups; the EFA was performed on the first group, representing one-third of the total sample (1349 participants), while the CFA was run on the second group (2743 participants). Prior to the EFA, data adequacy was assessed with the Kaiser-Meyer-Olkin (KMO) measure and Bartlett’s test of sphericity. Additionally, item-level suitability was examined through the Measure of Sampling Adequacy (MSA) 64 and the Anti-Image Correlation. 65 Items with MSA values below 0.50 were excluded, as such values indicate poor adequacy. 64 The Expected Residual correlation direct Change (EREC) index was further employed to evaluate residual associations between item pairs after accounting for common factors, where values should approximate zero. Strongly correlated item pairs (doublets) were identified and items repeatedly appearing in different doublets were removed. 66 Since the variables were ordinal and several items presented skewness and kurtosis beyond |1|, the EFA was based on a polychoric correlation matrix given the ordinal nature of the variables. 67 The method of estimation was Unweighted Least Squares (ULS) according the recommendations of the current literature. 68 Factor retention was determined through the Optimal Implementation of Parallel Analysis.69,70
The choice of FACTOR for EFA was motivated by its superior handling or ordinal Likert-type data. FACTOR allows the computation of polychoric correlation matrices and supports robust extraction methods and optimal factor retention procedures such as parallel analysis. In contrast, AMOS does not implement polychoric correlations for EFA and relies primarily on Pearson correlations, which may underestimate factor loadings when applied to ordinal data. Second, FACTOR provides reliability estimates that are directly aligned with latent variable modeling, including McDonald’s omega, Cronbach’s alpha, and factor score determinacy indices, which are more appropriate than classical Cronbach’s alpha in the context of ordinal constructs. CFA was subsequently conducted in R using a structural equation modeling framework, which allows flexible model specification, and the computation of standard fit indices, and average variance extracted. This separation of analyses follows established psychometric conventions. The use of different softwares does not compromise comparability of interpretability of the results.
We conducted a confirmatory factor analysis in R software (lavaan package). We used the Weighted Least Squares Mean and Variance Adjusted (WLSMV) estimator, the most appropriate for ordinal data. Model fit was assessed with several indices, including Standardized Root Mean Squared Residual (SRMR), root mean square error of approximation (RMSEA), Tucker-Lewis Index (TLI), and comparative fit index (CFI). Adequate model fit was considered for SRMR values ≤0.05, RMSEA ≤0.08, and CFI and TLI ≥0.90. 71 Multivariable normality was not verified as shown by the skewness (=2094.12; P < .001) and kurtosis (59.46; P < .001) values.
Prior to EFA/CFA, data were screened for careless responding using straight-lining, within-person response variance, and long-string response patter analyses; sensitivity analyses excluding flagged cases yielded comparable structures. Furthermore, multi-group CFA was applied on the second subsample to test measurement invariance across sex and countries. 72 Configural, metric, and scalar invariance were evaluated, with ΔCFI ≤0.010 and ΔRMSEA ≤0.015 or ΔSRMR ≤0.010 serving as evidence of invariance.73,74 Group differences in binge watching scores were examined with the Mann-Whitney U and Kruskal-Wallis tests.
Internal consistency was estimated using Cronbach’s α coefficient and McDonald’s ω, while validity evidence was explored through Spearman correlations between the new scale and related constructs.
Results
Participants’ characteristics are depicted in Table 1. The most frequent streaming platforms used were Chahed (35%) and Netflix (29%). The most common device used for binge-watching was the smartphone (71%), followed by the television (55%), tablet (38%), streaming player (6%), and game console (6%).
Sample Characteristics (n = 4092).
A total of 387 (9.5%) participants showed floor effect, whereas 6 (0.1%) showed ceiling effect. The 2 groups did not differ by sex (χ2 (1) = 0.93, P = .336), by country (χ2 (4) = 0.89, p = 0.927), or by age (t(4087) = −1.04, P = .301) (Supplemental Table 1).
Factorial Validity
Item relevance was first examined using the MSA index, which showed values above 0.50 for all items, indicating that each contributed to the same underlying construct. The EREC analysis did not identify any doublets. The analysis confirmed the adequacy of the data for EFA, as reflected by the KMO value of 0.787) and a significant Bartlett’s test (P ≤ .001). The results supported a unidimensional factor structure that explained 53.63% of the variance, with satisfactory indices: GFI = .972 (>.95) and UniCo = .970 (>.95). Parallel analysis further corroborated the 1-factor solution.
The unidimensional model was then validated through CFA on the second subsample. Initial model fit was acceptable (SRMR = 0.098, RMSEA = 0.173 [90% CI 0.163-0.184], CFI = 0.957, and TLI = 0.928). Internal consistency of the binge-watching score was good (α = .78/ω = .78) [95% CI 0.77; 0.79 for both]. The description of the items can be found in Table 2.
English Items of the Binge-watching Addiction Scale (BiWAS), Their Frequency, and Standardized Estimates of Factor Loadings from the Exploratory (EFA) and Confirmatory Factor Analysis (CFA).
Sex Invariance (Subsample 2)
The invariance across sexes and countries was established at all levels (Table 3). Higher binge-watching was found in females (Median = 8; IQR = 7) compared to males (Median = 6; IQR = 7), Mann-Whitney U = 1 266 959; Z = −6.99; P < .001.
Measurement Invariance of the Binge-watching Addiction Scale (BiWAS) Using the Total Sample.
Abbreviations: CFI, comparative fit index; RMSEA, root mean square error of approximation; SRMR, standardized root mean square residual.
No significant difference was found between countries: Egypt (Median = 7; IQR = 7), Oman (Median = 8; IQR = 7), Palestine (Median = 7; IQR = 7), Jordan (Median = 7; IQR = 6), and Lebanon (Median = 7; IQR = 6), Kruskal-Wallis H = 8.67; P = .070.
Concurrent Validity (Total Sample)
Higher binge watching was significantly associated with higher smartphone addiction, increased psychological distress, more severe insomnia, and greater fatigue (Table 4).
Correlation Matrix Between Continuous Variables.
Numbers reflect Spearman correlation coefficients (rho).
P < .001.
Discussion
Over the past decade, wide-spreading streaming platforms have given rise to new ways of consuming TV series, but also to the potential for harmful excessive use and the concept of binge-watching addiction. 11 This highlights a clear need for developing a deeper understanding of this emerging behavioral phenomenon that threatens human physical and mental health. Our present research endeavor intended to investigate the psychometric characteristics of a brief measure of binge-watching addiction designed on the basis of components that theoretically reflect the 6 facets operationalizing the concept the behavioral addiction. 21 The BiWAS was found to be unidimensional, invariant across sex and country, reliable, and valid in Arabic-speaking people residing in 5 Arab, Middle-Eastern nations. These findings are of substantial practical importance.
Both CFA and EFA were used to uncover and confirm the underlying structure of the BiWAS in our multi-country sample of community adult Arabic-speakers. Factorial Structure Analysis showed that the single-factor model that was postulated was an acceptable fit. In addition, internal consistency of the instrument’s global score was satisfactory, with a Cronbach’s alpha of .78, supporting that the 6 items analyzed had a good level of homogeneity among them. This suggests that the BiWAS may be structured on 1 dimension and that a global score might be a good indicator of binge-watching addiction. The BiWAS adequately and meaningfully measures the binge-watching addiction construct it intends to measure. Similar result was reported in previous brief, 6-item technology addiction measures that consistently produced a unidimensional construct, with the single dimension comprising the 6 components of behavioral addiction, namely withdrawal, relapse, tolerance, salience, mood modification, and conflict.46,48,53 This study designed the BiWAS to be brief enough to be practical and applicable for use among the target population for assessing binge-watching addiction. Nowadays technology consumers may tend to prefer activities that are less demanding and time-consuming, especially as increased screen time was shown to be associated with more distractibility, more inability to finish tasks, heightened impatience and waiting difficulty for rewards. 75
Regarding the fit indices, the CFI and TLI values were within acceptable ranges, whereas the SRMR was borderline but within the upper acceptable limit (<0.10), indicating that the standardized residuals did not diverge substantially from the observed data. However, the RMSEA value was substantially above the conventional cutoff suggesting poor absolute fit. Given the very small degrees of freedom (df = 9) and the short scale, the RMSEA is expected to be inflated. It is well documented that RMSEA overestimates misfit in models with low df, even when the model is correctly specified. 76 Therefore, greater weight should be given to incremental indices (CFI/TLI) and residual-based indices (SRMR) in this context. Considering these limitations, and given the theoretical justification for unidimensionality and the overall acceptable performance of CFI, TLI, and SRMR, the model can be considered to show adequate fit. It is noteworthy that the loading factor values of item 4 in both EFA and CFA was low (=0.45) but exceeded the cut-off value of 0.40 recommended by previous authors. 77 Its retention was justified by its theoretical relevance to loss-of-control processes in binge-watching addiction and by meeting established psychometric threshold for acceptable item performance, and was consequently kept in the analysis. Item 4, i.e. “Tried to cut down on the number of episodes you watch in 1 sitting, or uninstall the TV series app without success?”, reflects the relapse/loss of control component of behavioral addiction, which represents 1 of the 6 components model of addiction outlined earlier in the paper and largely supported by theory.21,37,51,52
A series of multi-group CFA were conducted to verify and establish measurement invariance across sex and country groups. In particular, 3 levels of measurement invariance were tested: (1) configural (i.e., items load onto the same factor across groups, (2) metric (i.e., item factor loadings are equal across groups), and (3) scalar (i.e., item intercepts are equal across groups). Findings indicated that the scale replicated adequate adjustments for a unifactorial model across the 2 sexes and the 5 country groups based on fit indices. In other terms, the BiWAS items are interpreted in a conceptually similar manner by male and female TV series binge-watchers, and whichever the country is. Invariance is considered a prerequisite to research and an important step to ensure that the interpretation of cross-group differences is not misleading and that the instrument performs equivalently across different groups or settings. After ensuring measurement invariance and soundness comparisons of BiWAS mean levels, results indicated more severe binge-watching addiction symptoms among females, whereas no significant differences were found based on the country of the participants. These findings build on prior evidence showing that females are more inclined towards binge-watching for longer periods of time 7 and are more likely to exhibit binge-watching addiction symptoms 78 than males.
Another objective of our study was to examine the concurrent validity of the BiWAS. To investigate this psychometric property, we correlated respondents’ BiWAS mean scores with variables that have empirically established relationships with binge-watching, namely smartphone addiction, fatigue, insomnia, and psychological distress. Findings showed that BiWAS scores correlated with these variables in the expected direction, thus supporting its practical usefulness as a construct. Binge-watching scores showed significant positive correlations with smartphone addiction scores. Notably, One’s smartphone is the handiest gadget, and was found to be the most common streaming device used to binge watch TV series in our sample. Consistently, smartphones and laptops were found to be the most frequently used devices to binge-watch among Arab people from the United Arab Emirates. 79 In addition, prior findings indicated that, in both males and females, smartphones are an important motivating factor and a major source behind binge watching. 33 Furthermore, addictive patterns of binge-watching may lead to both physical and psychological negative effects on the individual. 11 Physical impacts include sleep disruption and fatigability, 80 while psychological effects encompass depression and anxiety symptoms. 81
Practical Implications and Future Research Directions
An attempt conceptualization of binge-watching as a behavioral addiction, and the creation of a new instrument that allows its assessment based on core criteria of addiction will generate more consistency and coherence at theoretical and methodological levels in how the construct is measured. Results indicated that the BiWAS is a psychometrically valid and reliable scale in both sexes and all countries involved in this study. As a youthful region with increasing trends of binge-watching, the Middle East provides a fertile environment to study and understand the factors that relate to this behavioral problem. Users of the scale should bear in mind that although the binge-watching behavior can have a profound impact on individuals’ lives, this entity is not formally recognized as an addiction in many diagnostic frameworks and international classifications.21,22 Therefore, the main function of the BiWAS should be for screening rather than diagnostic classification.
It is hoped that the new scale will open up possibilities for quantitatively based investigations of binge-watching thus contributing to deepen our knowledge of this prominent phenomenon and enhance understanding of the mechanisms involved in increased vulnerability to becoming addicted. The brevity of the BiWAS makes it a valuable asset for use among binge-watchers in future clinical and research applications, especially in Arab countries with limited resources where researchers and practitioners often work under time and financial pressure.
Study Limitations
Although this study provides sufficient evidence to indicate the good validity and reliability of the BiWAS based on a large multisite sample, some limitations need to be discussed. The psychometric properties of the scale have only been examined in the adult general population, and their testing should be expanded to other age groups (such as adolescents) and other settings (including clinical populations). The study relied on cross-sectional data, precluding the examination of test-retest reliability. Future research with a longitudinal design still needs to verify the stability of the BiWAS over time to guarantee that differences observed across time in binge-watching scores reflect actual changes in the latent construct measured by the BiWAS. In addition, participants were recruited using a snowball sampling approach, which may have introduced selection bias by overrepresenting individuals who are socially connected, digitally active, or more interested in binge-watching behaviors. As a result, our sample may not be fully representative of the broader adult population, thereby limiting the generalizability of the findings. The cross-sectional design used in this study precludes the assessment of test-retest reliability and temporal stability of the BiWAS. Without longitudinal data, it is not possible to determine whether observed scores remain stable over time. Although data was collected across 5 Arabic-speaking countries, the age and country distributions seem broadly comparable between subsamples, whereas the sex distribution shows a noticeable imbalance, with a higher proportion of males in the CFA sample; therefore, cultural representation remains limited and does not capture the full sociocultural diversity of the Arab world. Finally, to improve the generalizability of the findings, there is a need to replicate the validation study with more culturally diverse samples from various regions within Arabic-speaking countries.
Conclusion
There is a clear need for the field to reach a consensus regarding the conceptualization and assessment of binge-watching addiction. This study is meant to contribute to this area by designing and validating a new measure of problematic behavioral patterns of binge-watching. The BiWAS showed good psychometric qualities, suggesting that it can have widespread clinical and research applicability among adults in Arabic-speaking communities. It is our hope that the BiWAS will help both researchers and practitioners to generate reliable information derived from accurate data that can be used to guide prevention and intervention strategies. Further studies are warranted to offer additional information and a more comprehensive understanding of the scale’s performance across different populations, cultures, and societies across the globe. Longitudinal studies are needed to evaluate the scale’s stability and sensitivity to change and should include more diverse samples from additional Arab countries.
Supplemental Material
sj-docx-1-jpc-10.1177_21501319261422127 – Supplemental material for Construction and Validation of the Binge-Watching Addiction Scale (BiWAS): Validity, Reliability, and Measurement Invariance in a Large Multinational Adult Sample From the Middle East
Supplemental material, sj-docx-1-jpc-10.1177_21501319261422127 for Construction and Validation of the Binge-Watching Addiction Scale (BiWAS): Validity, Reliability, and Measurement Invariance in a Large Multinational Adult Sample From the Middle East by Feten Fekih-Romdhane, Mai Helmy, Hanaa Ahmed Mohamed Shuwiekh, Abdallah Y. Naser, Mirna Fawaz, Kamel Jebreen, Diana Malaeb, Muna Barakat, Esra’ O. Taybeh, Eqbal Radwan, Sahar Obeid and Souheil Hallit in Journal of Primary Care & Community Health
Footnotes
Acknowledgements
Author KJ would like to acknowledge support through the ICTP-Arab Fund Associates Program (2024-2026).
Ethical Considerations
The research protocol of this study was reviewed and approved by the Scientific Research Ethics Committee of Isra University, Jordan (Reference: SREC/25/09/158, Date: 15/9/2025). All methods were performed in accordance with the relevant guidelines and regulations.
Consent to Participate
Written informed consent was obtained from all subjects; the online submission of the soft copy was considered equivalent to receiving a written informed consent.
Author Contributions
FFR and SH designed the study; FFR drafted the manuscript; SH carried out the analysis and interpreted the results; MH, HAMS, AYN, KJ, ER, and MF collected the data; DM, MB, and SO reviewed the paper for intellectual content; all authors reviewed the final manuscript and gave their consent.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data Availability Statement
All data generated or analyzed during this study are not publicly available due the restrictions from the ethics committee but are available upon a reasonable request from the corresponding author.
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
