Abstract
Affect, behavior, and severity of psychopathological symptoms do not remain static throughout the life of an individual, but rather they change over time. Since the rise of the smartphone, longitudinal data can be obtained at higher frequencies than ever before, providing new opportunities for investigating these person-specific changes in real-time. Since 2019, researchers have started using the exponentially weighted moving average (EWMA) procedure, as a statistically sound method to reach this goal. Real-time, person-specific change detection could allow (a) researchers to adapt assessment intensity and strategy when a change occurs to obtain the most useful data at the most useful time and (b) clinicians to provide care to patients during periods in which this is most needed. The current paper provides a tutorial on how to use the EWMA procedure in psychology, as well as demonstrates its added value in a range of potential applications.
Keywords
It is increasingly recognized that human psychology is highly changeable and dynamic (i.e., constantly in motion): Affect, behavior, environmental factors, and severity of psychopathological symptoms do not remain static throughout the life of an individual, but rather they change over time (Diener & Emmons, 1984; Guastello et al., 2008; Mischel & Shoda, 1995; Reis et al., 1980). Intensive longitudinal data is needed to properly investigate these changes and shed light on the rules governing these changes (Hayes et al., 2007; Molenaar, 2004; Molenaar & Campbell, 2009; Nelson et al., 2017). Ecological Momentary Assessment (EMA) is a popular method to collect such data. This method involves sampling the affect, behavior, and or environmental factors of an individual, typically several times a day for multiple days (Larson & Csikszentmihalyi, 1983; Myin-Germeys et al., 2018, 2009; Shiffman et al., 2008; Stone & Shiffman, 1994). Since the rise of the smartphone, collection of such data has become far more feasible in psychological research, sparking a large increase in both the interest in, as well as the availability of such data. Studies collecting EMA data for 1 or 2 weeks can be used to investigate how dynamics differ between persons or how experiences and behaviors covary within persons (e.g., higher levels of negative affect in stressful situations). However, in recent years it has been shown that it is also feasible to collect EMA data over a longer period of time (Schreuder et al., 2020; Smit et al., 2019; Smit, Snippe, et al., n.d.; Wichers et al., 2016, 2020). Such an extended research period increases the likelihood of substantial within-person changes in affect, behavior, and/or severity of psychopathological symptoms occurring during the research period, and allows researchers to investigate these intra-individual changes (Mehl & Conner, 2012; Nelson et al., 2017; Smit, Snippe, et al., n.d.; Wright & Woods, 2020).
Unfortunately, many of the methods currently used to analyze EMA data are unfit for investigating within-person changes over time. For example, commonly used (partial) correlation analysis, and (vector) autoregressive (VAR) models require the data to be stationary (Lütkepohl, 2005). In an intuitive sense, stationarity means that the statistical properties of the process generating the time series (e.g., means, variances, and serial correlations) remain the same throughout the research period. The assumption of stationarity directly contradicts the goal of studying change over time, and therefore any model assuming stationarity cannot be used to investigate such changes. Of course, methods do exist that do not assume stationarity, and can be used for studying change over time in intensive longitudinal data. However, most methods that do not assume stationarity, such as time-varying (V)AR models (Bringmann et al., 2018), and change-point detection methods (Cabrieto et al., 2018, 2019), can only be applied after the data collection has been completed. This is unfortunate, as it could be useful to be able to detect within-person changes in intensive longitudinal data prospectively, in real-time.
First, being able to detect changes in real-time provides new research opportunities. In most studies, EMA questionnaires are kept short to keep participant burden manageable. When within-person changes in psychological functioning can be detected in real-time, additional questionnaires or qualitative interviews could be added at these highly relevant times. Since this additional information can be gathered when recall bias is still limited, such additional data could provide otherwise unobtainable insights in how and why changes occur. Second, detecting changes in real-time could be highly relevant for clinical applications. Interventions could be started or adapted as soon as a change is detected. In some cases, it may even be possible to detect changes when they are still relatively small and harmless, and start interventions before these small changes grow into more problematic ones (e.g., a depressive episode). Real-time change detection could be a big step toward providing the right intervention, to the right patient, at the right time.
Recently it was shown that it may be possible to detect intra-individual changes in intensive longitudinal psychological data in real-time using statistical process control (SPC) methods (Schat et al., 2021; Smit et al., 2019; Smit & Snippe, n.d.). SPC methods were originally developed to monitor an industrial production process over time and indicate when changes in the process occurred. Several univariate SPC methods exist, including the Shewhart procedure (Shewhart, 1931), cumulative sum procedure (Page, 1954), and the exponentially weighted moving average (EWMA) procedure (Roberts, 1959). In this paper, we will focus on the latter procedure. The EWMA procedure, which is applied to monitor the average level of a variable in real-time seems particularly useful, since it is relatively easy to implement and interpret. Moreover, all statistical process control procedures are based on some (potentially unrealistic) assumptions (e.g., normal distribution, independence of observations), but EWMA tends to be quite robust against violations of these assumptions. Finally, as the EWMA procedure has been applied in many fields and investigated for the better part of a century, its behavior and statistical properties are well understood. Though its behavior on simulated EMA data has been investigated recently (Schat et al., 2021) showing promising results, tutorial applications on different types of empirical intensive longitudinal data are needed to gain insight in the practical possibilities and limitations of this method in psychology research. The current paper aims to do this, by analyzing three different data sets. In addition, we provide a tutorial, R-code and practical recommendations, paving the way for future applications of the EWMA procedure in psychological research.
Statistical Process Control
General Idea
In this section we will introduce SPC using two examples. A first example stays close to the origins of SPC and stems from industry, in that we monitor the industrial process of filling water bottles. The output of the filling machine is tracked, where the scores we observe are the amount of ml in each bottle. Second, we consider an example from psychology, where we monitor the affective fluctuations of an individual as measured through EMA.
SPC procedures are based on the idea that even if a process remains the same, observations of that process will exhibit natural variability (in the statistical sense). For instance, there might be a small variation in the amount of ml in the water bottles, but overall the machine still functions well. Similarly, a person’s affect is expected to fluctuate over time, for instance due to contextual changes (Kuppens & Verduyn, 2017). If this natural variability of the observations is known, control limits can be set so that the vast majority of new observations of the machine or person fall between these control limits, as long as the process does not change (i.e., remains in-control). However, when the process characteristics do alter, features of the observations also change, which should result in more observations exceeding the control limits. For example, the filling machine can break down causing the produced water bottles to be empty. The monitored scores (i.e., amount of ml) will then no longer fall within in the in-control distribution/range, and the process should be flagged as out-of-control. For our second example, the monitored person may fall into a depression. Among others, this change will be reflected in an increase of negative affect (Clark & Watson, 1991; Watson et al., 1988). To summarize, exceeding the control limits indicates that it is likely that the data generating process has changed. The process is then considered to be out-of-control, and an intervention may be necessary. A mechanic may have to repair our bottle filling machine, whereas a therapist may have to check on our monitored person. On the contrary, if no out-of-control scores occur, there is no evidence that the process has changed, implying that intervening is not needed.
SPC procedures thus require two distinct research phases. In Phase I, the natural variability of the in-control data is captured and used to establish the in-control distribution. The estimated mean ˆµ1 and standard deviation
EWMA Procedure
The EWMA procedure (Roberts, 1959) was proposed to detect mean changes across time. It monitors a real-time running estimate of the average in a control chart rather than the original observations. Specifically, the procedure combines past information with current information and tracks a weighted sum of the original observations, where more recent observations receive higher weights. At each measurement occasion

EWMA Control Charts for Different Sizes and Types of Mean Change.
Given
and
As
Figure 1A shows a situation in which no mean change was introduced in Phase II, and the EWMA scores correctly remain within the control limits. An abrupt change of .25
The EWMA procedure is often compared to other methods, such as the Shewhart and CUSUM procedures, as well as the simple moving average (SMA). Simulation studies by Schat et al. (2021) showed that the EWMA procedure performed considerably better than the Shewhart procedure. The CUSUM procedure, on the other hand, did not consistently perform worse than the EWMA procedure, but it is harder to implement as there is no simple formula to calculate the control limits. The SMA is often considered to be less complex than EWMA. Here, a time window of size
ARL
The expected behavior of SPC procedures is usually expressed in term of the run length, which indicates at which Phase II observation the process goes out-of-control for the first time. In case the process remains in-control, an out-of-control EWMA would be a false positive (i.e., type 1 error). The expected run length until the first false positive is encountered is called the
In practice, we suggest researchers to decide on a suitable

EWMA Control Charts Demonstrating the Impact of the Choice of
The
Notice that lower values of
Assumptions of SPC Procedures
SPC procedures are based on the assumption that observations are independent and normally distributed. This assumption is often violated in psychological research: observations are serially dependent (i.e., autocorrelated) and skewed distributed. Advantage of the EWMA procedure is that it is quite robust against violations of the assumption of normality (Montgomery, 2009). To deal with autocorrelation, Schat et al. (2021) recommend to monitor day averages rather than individual observations, which reduces or even removes the autocorrelation. Modeling and removing autocorrelation before running the EWMA is also an option (Montgomery, 2009; Smit et al., 2019). Although this procedure is more complicated especially in case of potential missingness in the data, it does allow the user to evaluate the EWMA at every individual observation rather than just once per day. An additional consequence of using day averages, is an increase in effect size as more fluctuations are averaged out, increasing the power of SPC procedures in detecting small changes. Therefore, we will monitor day averages in the remainder of this paper.
Next to distributional characteristics of the variable under investigation, potential users should be aware that sufficient Phase I data is needed to obtain accurate estimates of in-control behavior. Due to insufficient Phase I data, the in-control distribution may be either too wide or too narrow (i.e., too large or too small
Demonstrating the EWMA Procedure in Three Different Applications
In this section we present three applications that illustrate different purposes for using the EWMA procedure. In each application, we also vary one EWMA setting (i.e., Phase I length,
In Application 2, we again consider an example using EMA data in which changes in psychopathology occur, however this time without any sudden external triggers. In this application, there may a period before the onset of core symptoms in which an increase of prodromal symptoms can already be detected using the EWMA procedure. This has huge potential, as in such cases it would be possible to start an intervention when symptoms are still relatively harmless and manageable, which in turn may prevent a full-blown episode. Moreover, we investigate potential influences of the
In Application 3, we investigate whether changes can be detected in passively collected data using the EWMA procedure. As it is typically difficult to predict during what period a change is likely to occur, relatively long research periods are needed to capture the change of interest. While recent studies have shown that EMA questionnaire data (such as in Applications 1 and 2) can be collected during a continuous period of several months (Helmich et al., 2020; Schreuder et al., 2020; Smit et al., 2019; Smit, Snippe, et al., n.d.), such a design may not be feasible or ideal in all study populations or for all research questions. In some cases it may be more suitable to use measurements with a lower participant burden than high frequency questionnaires, such as passive measurements of physiology or actigraphy (Kunkels et al., 2021). It is therefore useful to also investigate if such, more passive, measurements also show meaningful changes that can be detected using the EWMA procedure. In Application 3, we also examine potential influences of the expected
Application 1: Detecting Change After it Occurred
Purpose
Application 1 focuses on detecting detrimental changes in a process as soon as possible after they happen in real-time. The timely real-time detection of elevations in psychopathological symptom levels could help start interventions as soon as possible. In Application 1, we test whether increases in feeling down and in experiencing craving to use drugs can be detected after adverse life events that are expected to potentially trigger these symptoms.
Data
We demonstrate the EWMA procedure on the data of a participant who was monitored using a maximum of 4 semi-random EMA observations daily, for a period of 114 days (yielding a total of ~400 EMA observations). This participant was diagnosed with major depressive disorder (MDD), remitted substance abuse (amphetamines), panic disorder with agoraphobia, and borderline personality disorder. During the research period two large external life events happened: The participant’s grandmother passed away on day 45 of the study, and the COVID-19 lockdown started on day 74 of the study. We expect that characteristics of the EMA observations may change as a result of these life events. We investigate changes in the items “to what extent do you feel down at this moment,” and “did you feel like using amphetamines since the previous beep” in particular, as these were the items with the strongest conceptual links to MDD and substance abuse, respectively. For a more complete description of this study, see Dejonckheere et al. (2021).
Results
Figure 3A and B shows EWMA control charts of the day averages of “down” and “craving,” respectively. The following settings were used:

EWMA Control Charts With Varying Phase I Lengths.
This example shows that different events may trigger changes in different symptoms. The passing of the participant’s grandmother may have led to a persistent change in “down,” which may be most relevant in the context of the major depressive disorder; the COVID-19 lockdown seems to have led to an increase in “craving,” which could be relevant in the context of the remitted substance abuse.
Impact of Chart Settings
This application provides an opportunity to gain insight in the relevance of the choice of the length of Phase I. As, to date, no EMA datasets have been gathered with the specific goal to analyze them using the EWMA procedure, the data used in Phase I has not specifically been collected with the aim of using it as in-control data. Therefore, the Phase I period needed to be defined post hoc in this case, with the possibility that a relevant change already occurred during Phase I.
Figure 3C and D shows EWMA control charts of “down” and “craving,” using the same EWMA settings, but setting the Phase I period to 74 days rather than 44 (i.e., all days before the COVID-19 lockdown). For “craving,” no change occurred during Phase I, and the different choice of Phase I period had minimal impact on the results. The change in “down” due to the passing of the participant’s grandmother now falls within the Phase I period. This had two effects on the chart: (1) the participant experienced a higher average level of “down” during Phase I, shifting the control limits upward and (2) the EMA observations of “down” had a higher variance during Phase I, widening the control limits. This meant that a larger upward change in “down” is needed in Phase II before the EWMA procedure marks it as significantly different from Phase I. Although in this specific case, an increase in “down” after both life events could still be detected, the EWMA goes back in-control between the two life events. This creates the impression that “down” returned to its normal level, even though it remains significantly higher than in the period before the participant’s grandmother passed away. When applied in a clinical setting, this may mean that with a 44-day Phase I an intervention would be started from the moment the participant’s grandmother passed away, while using a 74-day Phase I this intervention may have been stopped when the scores went in-control again.
Application 2: Detecting Change Before it Occurs
Purpose
Even before the onset of core symptoms there may be a period in which early changes can already be detected. For the purpose of such early stage detection, focusing on items that are expected to increase during the prodromal phase of a disorder might be useful, rather than items that are the closest proxies for core symptoms. Based on this idea, it has been hypothesized that an increase in the item “I feel restless” may be found before the onset of core depressive symptoms (Smit et al., 2019; Smit & Snippe, n.d.), as symptoms of anxiety often precede depressive episodes (Hetrick et al., 2008; Pede et al., 2017). Notice that items such as “I feel down” may not yet show a clear mean change during the prodromal phase, but rather only increase once the core depressive symptoms start to increase.
In Application 2, EMA data was collected before (Phase I), during, and after (both Phase II) gradual discontinuation of antidepressant medication. We test whether an increase in restlessness can be detected before the start of core depressive symptoms, using the EWMA procedure. The R code to construct the EWMA control charts for Application 2 can be found at [https://osf.io/nf7zk/].
Data
The EWMA procedure was performed on the day averages of the publicly available data described in Wichers et al. (2016). One participant filled out a maximum of 10 EMA questionnaires daily before, during, and after gradual antidepressant discontinuation (tapering), yielding a total of 1,474 EMA observations over a continuous period of 239 days. From days 42 to day 98, double blind tapering of the participant’s antidepressant medication started. It was hypothesized that this change in context may lead to an increase in depressive symptoms, and around day 127 of the experiment, a sudden increase in depressive symptoms indeed occurred. 1 For a more complete description of this study, see Wichers et al. (2016).
Results
Figure 4A and B shows EWMA control charts of “down” and “restlessness,” respectively. The following settings were used:

EWMA Control Charts Based on Varying
Note that the chart itself does not provide information on why “restlessness” starts changing at this early stage. The change may indicate a rise in prodromal symptoms as hypothesized, but could also reflect direct effects from antidepressant tapering on the EMA data. However, it can clearly be seen how relevant the choice of variable can be in the timely detection of changes using the EWMA procedure.
Impact of Chart Settings
This application provides an opportunity to gain insight in the relevance of the choice of the
Figure 4C and D shows the difference between the control chart using the commonly used
Application 3: Passively Collected Data
Purpose
If changes in psychopathology can be detected using passively collected data, this could be an important step toward reducing the participant burden in research using the EWMA procedure. Theoretically, any time series that is hypothesized to change in a meaningful way compared with the Phase I period can be used to construct a control chart. For example, we may hypothesize that physical activity measured using actigraphy reduces when depressive symptoms increase, as physical activity tends to be lower in depressed patients (Burton et al., 2013). Although this link may not be as direct as the link between an individual’s mood and depression, intensive longitudinal data on physical activity has the advantage that it can be collected using accelerometers that require no active attention from participants. In Application 3 we apply the EWMA procedure to actigraphy data, and test if a reduction in physical activity can be detected in real-time before or shortly after an increase in depressive symptoms.
Data
For the actigraphy measurements, participants wore the MotionWatch 8 accelerometer by CamNtech (CamNtech, 2020; Kunkels et al., 2020) on their wrist for a continuous period of 4 months during and shortly after (gradual) antidepressant discontinuation. A pilot case with 177,120 1-minute bins of actigraphy data covering 123 days will be used for the current study. The participant experienced an increase in depressive symptoms around day 68 of the study period. For a more complete description of this study, see Smit et al. (n.d., 2020).
Results
Figure 5A shows EWMA control chart of day averages of actigraphy data. The following settings were used:

EWMA Control Charts of the Actigraphy Data With Varying
Impact of Chart Settings
This application provides an opportunity to gain insight in the relevance of the choice of the
Summary of the EWMA Settings
Table 1 provides an overview of the EWMA settings discussed in the three applications (i.e., Phase I length,
Overview of the EWMA Settings.
Note. EWMA = exponentially weighted moving average.
Discussion
The three applications in the current paper demonstrate the potential usefulness of the EWMA procedure in psychological research, and demonstrate that it is feasible to apply it on a range of relevant time series data. Furthermore, this study shows that it is possible to construct person specific control charts with individualized control limits, that allow us to monitor single individuals without the need to obtain a sample of similar participants. This means the EWMA procedure allows the user to personalize variables and parameters for each person individually in a relatively simple way. In addition, the EWMA procedure can be used to analyze streaming data in real-time. This combination makes this method ideal for N = 1 research, and has high potential for clinical applications.
The results were in line with the idea that (a) different environmental factors can impact observed variables in different ways and (b) different variables may start to change at different stages in the development of psychopathology. In Application 1, feelings of sadness seemed to be triggered by the passing away of the participant’s grandmother and the COVID-19 lockdown, while “craving” only seemed to be strongly affected by the latter. In Application 2, the participant showed a large increase in “restlessness” during the prodromal stage of depression, while a large increase in “down” was found after the participant had already experienced a depressive relapse. This underlines the importance of variable selection in the EWMA procedure. As the EWMA procedure only requires the data of a single participant, the user has the freedom to select the variables that are expected to be most relevant for the individual under investigation. In Application 3, actigraphy data was used for the EWMA procedure, demonstrating the potential range of data types in which this method could be applied. This provides a lot of potential for personalizing the EWMA procedure, though this personalization needs further investigation.
In each of the three applications in this paper, we varied one EWMA setting (i.e., Phase I period,
Next to the chart settings, the statistical properties of the selected variables can also influence the performance of the EWMA procedure in terms of type I error and power to detect changes. Specifically, data are assumed to be independent over time and normally distributed. With autocorrelated data, the control limits are suboptimal, influencing both the type I error and power (e.g., Alwan & Roberts, 1988; Harris & Ross, 1991). However, a practical way to deal with autocorrelation in the context of EMA research is to monitor day averages rather than individual observations, as this reduces or even removes the autocorrelation (Schat et al., 2021). The EWMA procedure is known to be quite robust against violations of the normality assumption, meaning that the EWMA procedure can be applied to monitor variables that are skewed distributed (Schat et al., 2021; also see “craving” in Application 1). When items refer to more extreme behaviors (e.g., self-harm) or experiences (e.g., suspiciousness), observations may not vary at all during Phase I (i.e., floor effect items). For such items, control charts cannot be obtained using standard software. However, the principle of the control chart still holds and can be used in practice by manually setting the control limits at for instance 0, implying that any indication of self-harm will be flagged as an out-of-control score.
For other data characteristics, more research is needed to establish their impact on the EWMA procedure. First, it is unclear how missing data patterns (e.g., missing not at random) impact performance. For example, compliance has been shown to depend on the time of day (Rintala et al., 2019). Second, as holds for other time series methods (Vogelsmeier et al., 2019, 2021), SPC procedures implicitly assume measurement invariance across time, implying that participants always interpret the momentary questions in the same way as well as consistently use the answering scales. Given that SPC requires assessing participants across long stretches of time, finding ways of reducing or compensating for missing measurement invariance may improve the performance of the EWMA procedure. Third, ESM data may contain trends, such as diurnal patterns or specific context effects. Such trends violate the underlying EWMA assumption that all Phase I data are sampled from one and the same distribution. One way to deal with this is to detrend the data before applying EWMA, for instance by means of a smoothing procedure (Adolf et al., 2022; Cleveland et al., 1993) or by fitting a tailored time series model (for an overview, see Ariens et al., 2020). An alternative is to use the moving centerline EWMA (Mastrangelo & Brown, 2000). Fourth, though the EWMA procedure is aimed at detecting changes in the mean level, other changes (e.g., variance) can also affect the probability with which the control limits are exceeded. For instance, in Application 2, an alternative explanation for the two out-of-control periods around days 92 and 126 could be that antidepressant discontinuation lead to an increase in the variance of “down.” This is in line with the hypothesis that instability increases prior to transitions in depressive symptoms (Smit, Helmich, et al., n.d.; Wichers et al., 2016, 2020). If during Phase II the variance increases compared to Phase I, the process would tend to show more out-of-control periods.
SPC methods such as the EWMA procedure can be applied in real-time in the sense that the analysis can incorporate each new observation as soon as it becomes available. However, successful real-time implementation of the EWMA procedure comes with additional requirements. First, the collected data needs to be available for analysis shortly after it is obtained, and the data needs to be analyzed directly after becoming available. Although this is not necessarily very challenging as (a) several apps (e.g., PETRA and m-Path; Bos et al., 2022; Mestdagh et al., 2022) already upload data in real-time, and (b) there are many examples of analyzing regularly incoming data using the EWMA procedure (see Montgomery, 2009 for an overview of historical applications), researchers still need to keep this in mind when aiming to base an intervention on the EWMA procedure. Finally, changes can only be detected after they have occurred, and no form of analysis can change this. Whether this is soon enough to be useful strongly depends on the application. While in some cases it may be valuable to react as soon as possible after a patient has relapsed into substance abuse or depression, this would no longer allow us to prevent these highly detrimental changes. Preventive action is only possible if a variable can be found that already changes before the detrimental change occurs. For example, a patient may show increased craving for drugs before actually remitting into substance abuse (see Application 1), or start showing signs of restlessness before relapsing to depression (see Application 2).
It is important to note that in none of the applications above, the data was gathered specifically to be analyzed using the EWMA procedure. This means that the Phase I periods were defined post hoc, while real-time applications would require the user to define the Phase I period by collecting data on a predefined number of days before entering Phase II. Ideally, the Phase I data should be representative of how Phase II data is expected to behave when no change occurs in the participant, and should contain enough observations to reliably estimate control limits (see Schat et al., 2021 for guidelines for choosing an appropriate number of days). If Phase I contains data that is abnormal for the participant, this would impact the calculation of the control limits and therefore the performance of the chart. For example, as demonstrated in Application 1, if a change already occurs during Phase I, this can have a substantial effect on the width of the control limits. Also, life events may lead to abnormal variation in Phase I, which may not be expected to repeat in Phase II. It is important to note that most abnormalities in Phase I will represent additional variance on top of the natural variance we aim to capture, causing the control limits to be too wide and the EWMA procedure being on the conservative side. Thus, the main risk of a suboptimal Phase I period will be missing changes in Phase II, and improving the Phase I data will mainly help increase the power for detecting small changes in Phase II. As Application 1 and Schat et al. (2021) both demonstrate the importance of the Phase I period, future studies aiming to use the EWMA procedure should plan on collecting Phase I data. In addition, researchers may consider evaluating Phase I data to uncover and potentially control for abnormal sources of variance before beginning Phase II monitoring. Though the few existing empirical studies applying the EWMA procedure on EMA data seem to suggest that important changes are often large enough to be detected, even without having a strongly controlled Phase I period at hand (Smit et al., 2019; Smit & Snippe, n.d.). Information on how to collect Phase I data and evaluate its quality is provided elsewhere (Montgomery, 2009), but future studies are still needed to refine these procedures for application in psychology and test their usefulness.
Important property of the EWMA procedure is that it is a general purpose method, making it applicable in many research fields. This is an advantage from a statistical perspective, in that the framework is thoroughly tested and validated, as well as relatively straightforward to implement in a wide range of datasets. Whereas we focused on psychopathology, statistical process control can also be generalized to other fields in psychology, to study for instance personality development, cognitive development (gains or losses), or sudden gains in therapy. As evidenced by applications to daily COVID-19 data (Perla et al., 2021) or by applications to weekly or monthly hospital data (Thor et al., 2007), the frequency of the observations (e.g., weekly, monthly) does not play an important role in such generalizations, as long as the total number of in-control observations is high enough to obtain reliable control limits. However, this general purpose character is a disadvantage when looking for mechanistic insight in the onset and further development of psychopathology. Indeed, in contrast to network methods (Borsboom & Cramer, 2013) or computational models of affective dynamics (Loossens et al., 2020), SPC does not provide a causal theory about the etiology of psychopathology, such as vicious direct relations between symptoms. Also, though the EWMA procedure can be used to detect both sudden and gradual changes (see Figure 1), it does not provide information on whether the detected change occurred suddenly or gradually, and only provides an upper bound for the timing of the change. However, the simple interpretation of control charts may open new avenues of research regarding how and why changes occur. Specifically, both quantitative and qualitative measures could be intensified in out-of-control periods, to increase the information on how and why changes occurred.
Although the current paper showed the EWMA procedure in a range of N = 1 studies, applying this method in a sample of multiple participants that are all followed for an extended period using intensive longitudinal data could also be useful. This kind of research can be used to gain insight in how to personalize variables and parameters effectively, and investigate how the EWMA procedure will function when structurally applied in a specific population. Recently, Smit and Snippe (n.d.) performed such a study, where the N = 1 study in Application 2 was extended to a sample of 41 individuals. A pilot (Smit et al., 2019) was used to plan the study, and choose appropriate variables and settings for the EWMA procedure. The advantage of such a design is that it combines the personalized control limits for detecting within-person change, with the possibility to provide important between-persons summary statistics such as the sensitivity and specificity of the method, and the average timing of the first out-of-control EWMA score. Although a substantial investment of time and resources is required to obtain the data necessary, such studies do provide important information on the reliability, and overall usefulness of the EWMA procedure in psychological research and clinical practice.
Although SPC provides a practical statistical way of detecting significant changes in time series data, future research is needed to investigate the effectiveness of SPC-based interventions. Depending on the application and the intervention costs, benefits, and risks for both researchers/clinicians and participants/patients, parameters of the control chart need to be chosen in such a way that an appropriate balance between sensitivity and specificity is achieved. For low-cost interventions like Just-In-Time Adaptive Interventions, one could prefer a lower
In conclusion, the EWMA procedure is a general purpose statistical method that can be used to detect changes (a) in individual patients (i.e., without the need for a sample of multiple participants), allowing the user to personalize which variables are most relevant for each individual and (b) in real-time (i.e., data can be analyzed as soon as it is collected), making the EWMA is a unique new tool for analyzing time series data in psychology, that may be promising for clinical applications. Although some recent studies applying the EWMA procedure in multiple participants seem to confirm this potential usefulness (Smit et al., 2019; Smit & Snippe, n.d.), more research is needed to test the usefulness of this procedure in a wider range of psychological applications. The current study was an important step in this direction, by (a) demonstrating how the EWMA procedure was relatively straightforward to implement in three different psychological time series, and (b) exploring how the results could be used and interpreted in a range of applications.
Footnotes
Acknowledgements
We thank Dejonckheere, E., Houben, M., & Kuppens, P., for recruitment and data collection in Application 1, Wichers, M., & Groot, P. C., for the data collection in Application 2, and the TRANS-ID Tapering team including Wichers, M., Snippe, E., Riese, H., & Kunkels, Y.K., for recruitment, data collection and/or data cleaning in Application 3.
Author Contributions
A.C.S. and E.S. contributed equally to this manuscript and share first authorship.
A.C.S. developed the study concept, and E.S. and E.C. contributed to refining the study concept. All authors contributed to the study design. A.C.S. contributed to the data collection for Application 3. E.S. performed the analyses, and wrote the publicly available R-script used in this manuscript. All authors interpreted the results, drafted parts of the paper, and provided critical revisions. All authors approved the final version of the paper for submission.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The research presented in this article was supported by a research grant from the Research Council of KU Leuven (C14/19/054) awarded to E. Ceulemans.
Methodological Disclosure
“We report how we determined our sample size, all data exclusions, all manipulations, and all measures in the study.”
