Abstract
Despite empirical progress, theoretical understanding of mood instability remains stagnant. A major reason for this stagnation concerns the field’s reliance on narrative theories that cannot integrate disparate quantitative perspectives on mood dynamics. Here, we address the limitations of narrative theorizing by developing a formal theory of mood instability. Our theory is predicated on the computational process of “evaluation”: the process of appraising the value of stimuli, which has long been theorized to be central to mood dynamics. Building on reinforcement-learning models of evaluation, we propose a dynamic framework, which we use to simulate various evaluative situations. Our simulations can generate and thereby formally integrate three well-known types of mood instability: emotional rigidity/inertia, transience/instability, and sensitivity/reactivity. We discuss how this formal perspective could enhance the theory, clinical utility, and measurement of mood instability.
The core of affective experience, whatever its guise, is its evaluative nature. Affect introduces value in a world of factual perceptions and sensations.
Burdening up to 14% of the general population, mood instability (MI; or “emotional instability”) 1 is an important yet often neglected endophenotype (Black et al., 2006; Marwaha, Parsons, Flanagan, & Broome, 2013). Although archetypally synonymous with two psychiatric conditions (i.e., bipolar disorder and borderline personality disorder [BPD]), MI is found, to some extent, in virtually all psychopathologies—with, indeed, as many as eight out of 10 psychiatric inpatients reporting some level of MI (Gilbert et al., 2005; Marwaha et al., 2014). Recently, some progress has been made in understanding MI. Neurobiologically, for instance, projections from the limbic system (amygdala, in particular) to the salience network have been consistently and transdiagnostically implicated in MI (for a review, see Broome, He, et al., 2015). Likewise, psychologically, MI has been associated with various mental-health problems, such as poor functioning (Bowen et al., 2012), lower levels of well-being (Hills & Argyle, 2001), and increased suicidality (Marwaha, Parsons, & Broome, 2013).
Despite empirical progress, however, theoretical understanding of MI remains stagnant. Researchers in the field disagree on even basic questions, such as what the definition of MI is, what causes it, and whether different “types” of it exist (Broome, Saunders, et al., 2015). Consequently, the nomenclature surrounding MI is equivocal. Terms such as “emotional instability,” “mood swings,” “affective volatility” (or “lability”), “emotional dysregulation,” and “emotional impulsiveness” are used interchangeably by researchers and clinicians alike (for reviews, see Broome, Saunders, et al., 2015; Koenigsberg, 2010; Marwaha et al., 2014). Moreover, researchers from different disciplines (from psychiatry to psychology and neuroscience) focus on putatively distinct “aspects” of MI, yielding varied definitions—from “excessive rise of emotion and delayed return to baseline” to “frequent oscillations between affective categories” and “intense emotional reactivity” (Broome, He, et al., 2015; Koenigsberg, 2010). These definitional ambiguities impede the measurement and pragmatic utility of MI in both research and clinical settings (Broome, Saunders, et al., 2015; Zimmerman et al., 2010).
The ambiguities around MI can be traced back to the field’s reliance on natural language as a means of specifying its theoretical aspects. Because of the vagaries of language (Bolton & Hill, 2004; Wittgenstein, 2010), theories specified at the narrative level have been argued to be severely limited (see Fried, 2020; Haslbeck, Bringmann et al., 2021; Robinaugh et al., 2021; Smaldino, 2020; Vallacher et al., 2017). First and foremost, narrative theories are vulnerable to implicit assumptions and hidden contradictions—consider, as an example, the conflation of self-esteem instability and mood instability (Broome, Saunders, et al., 2015). Second, even without such internal contradictions, narrative theories cannot be rigorously examined against empirical reality—because of their generality, such theories cannot be subjected to significant risk of refutation and thus have the propensity of remaining neither fully corroborated nor refuted (Robinaugh et al., 2021). Finally, narrative theories cannot address the growing complexity of quantitative approaches to mood dynamics—for example, despite progress in creating quantitative definitions for several MI types (i.e., “affective inertia” or “stress reactivity”; Dejonckheere et al., 2019), theoretical research has not progressed in parallel to provide a unifying account of how these quantitative taxonomies might be understood theoretically.
Formal theories may address these shortcomings. First and foremost, formal theories can define constructs more precisely by formalizing them with the language of mathematics (see Adams et al., 2016; Huys et al., 2011; Lewandowsky & Farrell, 2011). Second, because of their mathematical specificity, formal theories can generate precise quantitative predictions that can be more explicitly examined against empirical observations (Huys et al., 2016; Zavlis, 2024a). Finally, because of their generative nature, formal theories enable researchers to unify disparate theoretical and quantitative perspectives by showing how they could all be generated from specific computational processes (Friston et al., 2014; Montague et al., 2012). Because of these properties, formal theories could present a fruitful avenue for clarifying the nature of MI.
In this article, we aim to develop a formal theory of MI. Our theory is grounded on Marr’s (1982) approach to computational modeling, which involves formalizing the psychological processes that the human mind/brain employs to solve particular life tasks. This approach has a long history in computational neuroscience (and most recently, computational psychiatry) and can provide a notable adjunct to contemporary quantitative perspectives to MI. For example, although several longitudinal models have already been constructed with the aim of elucidating mood dynamics (by creating well-defined parameters such as “affective inertia,” “instability,” and “reactivity”; Vanhasbroeck et al., 2021), these models do not address how their parameters might themselves be generated by a common computational process. As a consequence, whether these models could be unified under a common generative framework remains, at the time of writing, uncertain.
Our framework, as we outline later, bridges these disparate quantitative approaches by positing a computational process that can generate the various types of MI. In particular, our framework suggests that at least three distinct types of MI (i.e., “emotional rigidity,” or inertia; “emotional transience,” or instability; and “emotional reactivity,” or sensitivity) could be generated and thereby integrated by a common computational process: the process of assigning value to stimuli (i.e., “evaluation”). Before we turn to our model and the explanations it furnishes, some background information on these mood parameters is first necessitated. We provide this background information in the next section, in which we briefly review existing longitudinal approaches to MI.
Longitudinal Approaches to MI
Existing longitudinal approaches have demonstrated that MI can be parsed into at least three basic types: inertia, instability, and reactivity. These types have, in turn, been operationalized as statistical parameters (e.g., variance or regression slopes) in different quantitative models. We briefly review this line of research next.
First, emotional inertia can be interpreted as the tendency of emotions to resist change (Suls et al., 1998). This MI type is typically estimated as the correlation of a given (mood) variable with itself over time (autocorrelation); higher autocorrelations imply a greater tendency of a certain emotion to “carry over” to the next time point (Box et al., 2015). Research on inertia has typically relied on (discrete-time) (vector-)autoregressive (VAR) models (Bringmann et al., 2018; Ernst et al., 2024; Haslbeck, Bringmann et al., 2021), revealing that depression is typified by strong autocorrelations of negative emotions (Koval et al., 2013). More recent network approaches have extended these univariate models in a multivariate context by estimating networks of emotion autocorrelations, revealing that they are inflated in individuals who score high on trait neuroticism (Bringmann et al., 2016). Finally, more sophisticated dynamical models that operate on a continuous timescale, such as the continuous-time VAR model (see Guthier et al., 2020; Oud & Jansen, 2000; Uhlenbeck & Ornstein, 1930), the damped linear oscillator (Boker & Nesselroade, 2002; Chow et al., 2005; Hu et al., 2014), or the model of intraindividual variability in affect (Wirth et al., 2022), have operationalized inertia as a regulation parameter that enables individuals to return to their baseline mood after having deviated from it. Although the estimation of emotional inertia may differ across these models, its interpretation converges, implying that the only fundamental difference between the models may be in the types of data they accommodate (see Vanhasbroeck et al., 2021).
The second well-known MI type is emotional instability, which refers to the fluctuation of emotions over time. Traditionally, emotional instability was approximated with the variance of time series (Larsen & Diener, 1987), revealing inflated mood variability in people with borderline personality (Cowdry et al., 1991; Stein, 1996; Stiglmayr et al., 2001). Despite its straightforwardness, however, this operationalization was eventually argued to be limited because it did not consider the temporal dependency of mood variability: that is, the fact that although emotions may be highly variable, they may still be highly autocorrelated over time, implying mood rigidity (inertia) rather than instability (Larsen, 1987). Accordingly, measures that combine both time variability and time dependency have been introduced, including the mean of all squared differences of affect states (Ebner-Priemer et al., 2009) and the probability of acute change (Jahng et al., 2008). Research using these more sophisticated measures of “instability” has shown that they are more elevated in patients with BPD compared with individuals with depression (P. Santangelo et al., 2014; Trull et al., 2008) and that they are negatively correlated with inertia (e.g., Dejonckheere et al., 2019; Thompson et al., 2012). These patterns imply that the concept of emotional instability can be considered as the inverse of inertia because it reflects the tendency of emotions to rapidly change rather than persist over time.
The final type of MI refers to emotional reactivity: the tendency to experience strong emotions in response to particular life situations. Emotional reactivity is intimately dependent on life events, implying that its measurement differs from the other mood concepts in that it requires a variable denoting one’s “appraisal” (or evaluation) of life events (Silk, 2019; Thompson et al., 2012). In empirical research, affects are regressed on this appraisal of life events, yielding a coefficient that reflects one’s propensity to experience strong affects in response to certain life events within naturalistic (Dejonckheere et al., 2019) or laboratory contexts (Lapate & Heller, 2020). Traditionally, reactivity has been conceptualized in this discrete manner and shown to be elevated in individuals with BPD (Bortolla et al., 2020) and deflated in individuals with major depressive disorder (Bylsma et al., 2008). More recent generative models have extended these approaches by formalizing the link between appraisals and emotions in a probabilistic way (although this link has yet to be examined empirically; see Ryan et al., 2025). Whether reactivity is conceptualized with traditional statistical models or more recent generative models, its interpretation remains the same, reflecting the potency of emotions in response to particular life conditions.
To summarize, existing longitudinal work parses out MI into at least three quantitative subtypes: rigidity (or inertia), instability (or transience), and reactivity (or sensitivity). These quantitative types are immensely helpful quantitatively because they can reveal interesting patterns of MI over time (e.g., that people with borderline personality will be extremely reactive to social stimuli; Sadikaj et al., 2013). However, such approaches are somewhat limited theoretically because they do not elucidate the cognitive processes that humans employ to generate these patterns of MI in the first place.
Cognitive perspectives can address these explanatory shortcomings. Rather than directly modeling longitudinal data, these approaches leverage principles from cognitive sciences to posit a computational process assumed to be used by the human mind (Adams et al., 2016; Huys et al., 2016; Montague et al., 2012). This cognitive process is then used to simulate healthy psychological effects before turning to their maladaptive variants. In that sense, this cognitive approach offers a top-down (rather than bottom-up) perspective to mood dynamics because it starts from underlying cognitive principles and then derives formal predictions that can be more precisely examined against empirical observations.
In this article, we adopt this top-down perspective to build a formal theory of MI. This formal theory is based on the notion of evaluation: the process by which humans assign value to stimuli. This notion of evaluation (aka “appraisal”) is featured in virtually all theories on emotion, including traditional theories on basic emotions (Ekman & Cordaro, 2011), contemporary perspectives on the social nature of emotions (Barrett, 2017), and cognitive approaches on the functional aspects of emotions (Keltner & Gross, 1999; for an integrative review, see Lange et al., 2020). Here, we build on contemporary computational models of evaluation to assess whether these can generate the abovementioned MI types. Before we introduce our formal framework, though, we first review existing cognitive approaches to evaluation and emotion in the next section.
Cognitive Approaches to Evaluation and Emotion
Existing cognitive work has highlighted that the process of evaluation has at least three key properties: It is reference-dependent, predicated on precision (or certainty in evaluation), and encompasses a nonlinear value function. We briefly review these properties in turn before showing how they have been integrated by the model used in this paper.
The first property is based on a wealth of evidence illustrating how mood and emotions are fundamentally dependent on prediction errors, that is, on mismatches between rewards that a person receives (R) versus rewards the person expects (V). For example, research by Rutledge et al. (2014) revealed that self-reported happiness is predicated on the discrepancy between one’s expectation of rewards and one’s actual (received) rewards (for replications, see also Villano et al., 2020; Vinckier et al., 2018; Will et al., 2017). Likewise, seminal work by Eldar and colleagues showcased that the same reward-prediction errors are linked to momentary affective states (Emanuel & Eldar, 2022) and are robustly associated with phasic dopaminergic activity (Eldar et al., 2016; Eldar & Niv, 2015). Together, these patterns suggest that affective valence is based not on life outcomes per se but rather on their discrepancy with a person’s reference (e.g., expectation). A model of evaluative processes, then, should account for such reference effects.
A second important aspect emerging from empirical research concerns the role of (un)certainty: that is, how certain or precise individuals are in their evaluative inferences (an aspect also at the core of some recent reinforcement-learning models; Bellemare et al., 2023). This certainty facet has been well documented by research showing that the more certain one is in one’s evaluation, the stronger one’s affective response, either for better (reality exceeds expectations) or for worse (reality falls short of expectation; Clark et al., 2018; Hesp et al., 2021). Moreover, parameters of (un)certainty are at the core of recent formal models of various psychopathologies which assert that people with psychosis and borderline personality tend to experience stronger emotions because which are overcertain in their evaluations (Henco et al., 2020; Rigoli, 2022a; Zavlis et al., 2025a; Zavlis et al., 2025b). These patterns imply that the concept of evaluative certainty could be leveraged to formalize emotional reactivity, a possibility that we turn to subsequently.
The final property relating to evaluation is the well-known nonlinearity characterizing the value function, an aspect that is present in virtually all models of evaluation. To illustrate, consider well-replicated research that has indicated how income gains beyond a certain degree (i.e., ≈$75,000) yield progressively smaller gains in well-being (Kahneman & Deaton, 2010). Such research highlights that objective rewards (i.e., income) might map on subjective values (i.e., affect) in a logistic-like manner, implying that after a certain point, objective wealth accumulates without proportional affective gains. A wealth of evidence has showcased several such nonlinearities in evaluative inferences (see Barberis, 2013; Bruch & Feinberg, 2017), implying that formal models of evaluation need to account for the nonlinear relation between “objective” rewards and the “subjective” (aka “affective”) values experienced in response to those rewards.
To summarize, cognitive research on evaluative inference has highlighted at least three central properties that should be captured by a model of this process: the reference point, (un)certainty, and nonlinear nature of the value function. A recent computational model has attempted to synthetize these various properties in a unifying account (see Rigoli, 2019). This “logistic” account of evaluation has also been effectively applied to the field of psychopathology, illustrating how maladaptive evaluative processes could lead to various mental-health problems, such as depression, addiction, and personality problems (see Rigoli et al., 2021). Here, we build on this model to develop a formal framework on MI. Before we do so, however, we first outline the logistic model in its current form.
Static Evaluation
The logistic model of evaluation (Rigoli, 2019) proposes that the affective value V(R) of a stimulus is derived from a prediction error between its raw value (R) and reference value (m), moderated by a weight parameter (p) and filtered through a logistic function:
Given the logistic nature of Equation 1, an alternative expression is
where V(R) is the subjective/affective value of a given stimulus (which, given the logistic function, is bounded between 0 and 1), (R) is the “raw” value of a stimulus (e.g., $10), m is the reference point (e.g., a desired value $20), and p is the “weight” attached to a given prediction error (for a detailed outline, see Table 1).
Outline of Computational Parameters
To briefly expand each of these terms, consider first the prediction error: R − m. In a school context, as illustrated by Villano et al. (2020), the raw value of a stimulus (R) is a student’s actual grade, and the referenced value of a stimulus (m) is a student’s desired grade. Villano et al. demonstrated that students’ emotional responses are most robustly predicted by the difference between the students’ desired (referenced) grade and their actual (raw) grade (R − m), not by the raw grade itself. This naturalistic empirical pattern is consistent with a wealth of laboratory evidence showing that indeed, emotions are the outcomes of expectation-outcome prediction errors (Emanuel & Eldar, 2022). The logistic model leverages these error terms to suggest that the affective valence of an event is dependent on a frame of reference: When the raw value of an experience is higher than one’s reference, reality exceeds expectations (R > m), and affective valence is positive; when the raw value is lower, reality falls short of expectations (R < m), and affective valence is negative; finally, when the two are equal, reality meets expectations (R = m), and affective valence is neutral.
Note that the reference point can take various interpretations depending on a researcher’s theoretical tradition or empirical goals. In economics, for example, the reference point has traditionally been viewed as a rational expectation of a future reward (i.e., “Based on all available information, I rationally predict to receive a certain reward”; Muth, 1961). In reinforcement learning, however, the reference reflects a person’s subjective expectation of a future reward based on previous rewards (i.e., “Based on what I have encountered before, I subjectively expect to get a particular reward”; Sutton & Barto, 2020). Our model builds on this latter perspective by viewing the reference point as a person’s standard for rewards (i.e., “I want this” rather than “I rationally expect this”). We believe that this interpretation is more psychologically accurate because it naturally accounts for everyday scenarios whereby a person may still feel sadness even in response to rationally predictable negative events—for instance, losing a loved one from a terminal illness, which is something that one might have (rationally) predicted but still not wish to have happened.
Beyond such reference effects, the second aspect of the logistic model is the weight parameter, π, which either inflates or deflates the prediction error. To illustrate, consider two students who have the same standard for a grade (m = 70/100) but who both receive a much lower grade (R =, 30/100), resulting in a negative emotion of “disappointment” (R − m = 30 − 70 = −40). The weight parameter controls how much each student will “weigh” (i.e., ascribe meaning to) this feeling of “disappointment”: Although a student who weakly weighs this event (p = 0.001) will not be much upset (0.001 × (−40) = −0.04), the student who strongly weighs it (p = 0.1) will be devasted (0.1 × (−40) = −4). In that sense, the weight parameter shapes the intensity of affect by determining how strongly its prediction errors will be weighted. As argued later, this weight parameter can be interpreted as one’s certainty over the meaning of a prediction error: The more certain a person is regarding what the prediction error means, the stronger the person’s affective response will be.
The final key feature of the logistic model is its logistic function, which bounds the affective value of a stimulus between 0 and 1. This function implies that affective values below 0.5 are negatively valenced, those above 0.5 are positively valenced, and those equal to 0.5 are neutral. These outcomes emerge because negative values in a logistic function (here, negative prediction errors) yield V(R) < 0.5, positive values yield V(R) > 0.5, and values of 0 yield V(R) = 0.5. This function is a key aspect of this model because it accounts for the nonlinear relationship between objective rewards and subjective values, as evinced by a wealth of cognitive research on this topic (Rigoli, 2019).
Psychopathology
Having introduced the logistic model of evaluation, we now turn to its application in psychopathology, as exemplified by the work of Rigoli et al. (2021). The general idea around “adaptive evaluation” is that mental health ensues when an agent’s evaluation is based on parameters that reflect the true statistics of the agent’s environment: specifically, when the agent’s reference (μ) reflects the average of the agent’s environment’s rewards and the agent’s weight (π) reflects the inverse of standard deviation (i.e., the consistency) of these rewards. 2 By contrast, psychopathology ensues when either parameter deviates markedly from the true environment statistics. Thus, the logistic model suggests that various psychopathologies can be explained by distinct alterations in these two evaluative parameters.
To illustrate, consider an agent who dwells in an environment offering the following life outcomes with equal probability: R = {10,20,30,40,50,60,70}. (Note that the average and precision [defined as 1/SD] 2 of these outcomes are m = 40 and p = 0.05, respectively.) If the agent’s evaluation relies on these true context statistics (m = 40, p = 0.05), then the agent’s evaluation is deemed “adaptive,” resulting in the following affective values: V(R) = {0.1824,0.2689,0.3775,0.5,0.6225,0.7311,0.8176} (see Fig. 1). The reason these values are adaptive (or “healthy”) is because in accordance with the agent’s environment, R (raw) values below m = 40 are perceived negatively (V(R) < 0.5), R values above m = 40 are perceived positively (V(R) > 0.5), and R values equal to m = 40 are perceived as “neutral” (V(R) = 0.5).

(a) Maladaptive evaluation based on reference points. Blue line illustrates depressive evaluation (high reference), which yields consistently low affective values, V(R), even at high raw values, R; red line illustrates manic evaluation (low reference), which yields consistently high affective values, V(R), even at low raw values, R. (b) Maladaptive evaluation based on the weight parameter. Blue line illustrates apathetic evaluation (low weight), which yields similar affective values, V(R), across various raw values, R; red line illustrates reactive evaluation (high weight), which yields a wide range of affective values, V(R), across various raw values, R.
Consider now the case of another agent who, while dwelling in the same environment, embodies a reference point radically higher than the average of the agent’s surrounding rewards (e.g., m = 80 rather than m = 40). In this case, the reference point is higher than all life outcomes (μ > R), casting all such outcomes in a negative light. This scenario has been argued to explain depression and addiction, whereby a high reference point—that could be expressed either psychologically (in terms of an unrealistic standard for rewards) or neurobiologically (in terms of an altered neuromodulatory set point)—could result in persistently negative affect: Most stimuli pale in comparison with the high reference and are thereby experienced in a negative manner (see Rigoli, 2022b; Rigoli & Martinelli, 2021; Rigoli et al., 2021). The exact opposite scenario has been proposed to reflect (hypo)mania: If an agent’s reference point is smaller than the average of the agent’s contextual distribution (e.g., m = 0 rather than m = 40), then most (and in this case, all) stimuli will be experienced positively, yielding persistently inflated affect (Rigoli et al., 2021; see Fig. 1). In this sense, the reference point determines affective valence: Low and high reference points typically yield positive and negative emotions, respectively (e.g., Clark et al., 2018).
In turn, the weight parameter (π) shapes the intensity of these experiences, giving rise to other psychopathologies. In particular, when the weight is low (e.g., p = 0.025 rather than p = 0.05), emotional experience is less intense: less rewarding if positive and less punishing if negative. This situation has been argued to parallel the clinical condition of apathy, a negative symptom of psychosis and related pathologies, in which all stimuli are experienced as equally rewarding and punishing, resulting in lower motivation to pursue any one of them (see Rigoli & Martinelli, 2023). Conversely, when the weight is high (e.g., p = 0.1 rather than p = 0.05), intense emotionality (aka, emotional reactivity) emerges. An example here could be the strong emotional reactions that individuals with borderline personality typically display even in response to “objectively” mild events (Rigoli et al., 2021; see Fig. 1).
Thus, through aberrant alterations of the reference point, on the one hand, and the weight parameter, on the other, the logistic model can reproduce archetypal patterns of several psychopathologies. But can the same model also shed light on the nature of MI? We argue that as it currently stands, the model is unable to do so. This is because the logistic model assumes that its parameters (i.e., reference and weight parameters) remain fixed over time, thereby neglecting the mechanisms that underpin their evolution. This assumption is problematic when investigating phenomena such as MI, for which, as its very name suggests, dynamical processes are of central interest (e.g., Durstewitz et al., 2021). On this basis, we propose an extension of the logistic model that addresses the question of how evaluative parameters are formed and shaped over time. We then employ this dynamic version to investigate whether any insights on MI could be gained.
Dynamic Evaluation
Adding a temporal dimension to the logistic model, we propose a dynamic framework of (reference-based) evaluation that is predicated on the following discrete equations:
where V(Rt) refers to the affective value at time t, Rt refers to its raw counterpart at time t, mt and mt+1 are the reference points at times t and t + 1, π is the weight parameter, and a [0,1] is the learning rate (at which the reference point gets updated over time; Table 1).
Equations 3 and 4 describe dynamic evaluation, that is, evaluation over time. Equation 3 is similar to Equation 2; the only difference is its extension over time—whereby, at each time point (t), the raw value (Rt) of a given stimulus is converted into its subjective-affective counterpart, V(Rt), via the logistic function. Equation 4 implies that the reference point (m) changes over time based on the Rescorla-Wagner rule, according to which “errors” (i.e., mismatches between one’s reference [μ t ] and reality [Rt], at each time point) drive learning about what the reference point should be in the future (mt+1).
Three parameters define our dynamic model (Table 1). The first is the weight parameter (p), which, as with the static model, is a constant that determines how strongly agents weigh prediction errors during evaluation. The second parameter, also a constant, is the learning rate (a), which determines the extent to which the reference point (μ
t
) changes in response to new events (Rt); larger values reflect faster learning, that is, faster convergence of a reference with each new raw event:
Focusing on these parameters, we sought to examine whether dynamic evaluation could provide any insights on the nature of MI. To do so, we employed an approach akin to the one adopted previously with static evaluation. That is, we assumed mental health to be characterized by moderate parameter values and mental illness to be linked with excessively high or low parameter values. Looking at the consequences of altering these parameters (either in isolation or in combination), we assessed whether the ensuing temporal profiles resemble empirical manifestations of MI. As we detail below, this approach allowed us to generate the three basic forms of MI (inertia/rigidity, instability, and reactivity) from alterations in our model’s parameters (i.e., reference point, learning rate, and weight parameters, respectively).
For our analyses, we simulated a scenario in which on each trial, an agent experiences an objective (or raw) value (Rt) that is sampled from one of three distributions, or contexts, which alternate randomly every 50 trials. These include a high-value context (in which the objective value can be 90, 100, or 110 with equal probability), a moderate-value context (in which the objective value can be 50, 60, or 70 with equal probability), and a low-value context (in which the objective value can be 10, 20, or 30 with equal probability). These contexts were built to mimic life periods of prosperity (e.g., a particularly successful period characterized by an enjoyable and well-remunerated job), normality (e.g., a period characterized by somewhat ordinary life experiences), and adversity (e.g., a period characterized by unemployment and anxiety), respectively. The sequence of outcomes (R) sampled over time is illustrated in Figures 2a to 4a. The same sequence was used for all simulated agents.

(a) The raw value of stimuli, R. (b) Healthy evaluation is characterized by strong affects in response to contextual changes but gradual adaptation to each new context. (c) Bipolar rigidity is characterized by periods of inflated (mania) and deflated (depression) affective values, interspersed by moderate affective values (euthymia). (d) Depressive rigidity is characterized by persistently deflated affective values, interspersed by moderate affective values (euthymia) in overly positive life contexts. (e) Manic rigidity is typified by the opposite pattern of depressive rigidity, showing persistently inflated affective values that are interspersed by moderate affective values (euthymia) in overly negative life contexts.

(a) The raw value of stimuli, R. (b) Healthy evaluation is characterized by strong affects in response to contextual changes but gradual adaptation to each new context. (c) Emotional transience is characterized by fleeting affective values that are inconsistent across similar life events, R. (d) Combined emotional transience and rigidity are characterized by fleeting, inconsistent, and inflated affective values across all life events, R.

(a) The raw value of stimuli, R. (b) Healthy evaluation is characterized by strong affects in response to contextual changes but gradual adaptation to each new context. (c) Emotional reactivity is characterized by inflated affective values across all life events, R. (d) Combined emotional reactivity and bipolar rigidity are characterized by periods of extremely inflated (mania) and extremely deflated (depression) affective values, interspersed by extremely strong and fluctuating affective values (mixed states).
To begin with, we simulate the affective values experienced over time by a “healthy” agent who embodies the following moderate parameter values: m0 = 40, p = 0.05, and a = 0.2. These parameter values produce what can be referred to as “healthy evaluation” (see Figs. 2b–4b). In Figures 2b through 4b, we show that the healthy agent can gradually adapt to new contexts. Specifically, although the agent experiences extremely negative (positive) values when the agent shifts to a worse (better) context, the agent’s affective values progressively adapt to the new situation. This pattern of strong response followed by gradual adaptation resembles the affective dynamics that are typically experienced by nonclinical populations. For example, in response to typical adversities (e.g., bereavement) or successes (e.g., job promotion), most people tend to have strong initial reactions followed by gradual adaptation to the new status quo (Kalisch et al., 2019; Thompson et al., 2010). Our healthy agent reproduces this pattern because of the agent’s moderate parameter values—particularly in the learning rate (α), which allows the agent to gradually adapt to the new context (by flexibly shifting the reference point to the new state of affairs:
Using the scenario of healthy evaluation as a baseline, we next explore whether altering any parameters (m0, π, or α) yields maladaptive consequences that resemble empirical manifestations of MI. As we illustrate below, this procedure allowed us to generate, and thus mathematically integrate, three well-known flavors of MI: emotional rigidity (or inertia), emotional transience (or instability), and emotional reactivity (or sensitivity; see Fig. 5).

All maladaptive affective cases explored in this article: rigidity (including bipolar, manic, and depressive rigidities), transience (including pure transience but also transience that is combined with rigidity), and reactivity (including pure reactivity but also reactivity that is combined with bipolar rigidity).
Emotional rigidity
First, we consider the case of emotional rigidity (inertia): the tendency of experiencing prolonged emotions, which can be formalized using an extremely low learning rate (here, a = 0.001). Below, we detail three scenarios of emotional rigidity: one characterized by a moderate starting reference point (m0 = 60), another characterized by an extremely high starting reference point (m0 = 100), and a final one characterized by an extremely low starting reference point (m0 = 10).
Figure 2c depicts the first of these scenarios, that is, “bipolar rigidity,” which is characterized by a low learning rate (a = 0.001) and a moderate starting reference point (m0 = 60). Here, the bipolar agent’s behavior is similar to the healthy agent’s behavior during the moderate-value context but radically different during the low- and high-value contexts. Specifically, during the high-value (low-value) context, the bipolar agent experiences persistently positive (negative) affects. Furthermore, during these contexts, affective values get stuck at their extreme ends and oscillate less (e.g., in the low-value context, responses to outcomes 10, 20, and 30 are virtually identical), reflecting a reduced ability to discriminate life experiences.
This scenario bears striking resemblance to manic-depressive illness (or bipolarity). For example, much like our simulation illustrates (Fig. 2c), bipolarity is characterized by severe episodes of mania (or hypomania) and depression that persist temporally (more than a week for depression; more than 4 days for mania) and recur cyclically (Goodwin et al., 2007). Furthermore, such affective episodes are typically triggered by significant life experiences, such as severe adversities (in the case of depression) and goal-attainment events (in the case of mania; Johnson et al., 2011; Zavlis et al., 2023). Our model captures this pattern as well because it suggests that the mechanism by which manic and depressive episodes emerge, persist, and alternate is through emotionally intense events that signal significant life changes. Finally, consistent with our proposal (that places abnormal evaluative processes at the heart of bipolar disorder), empirical observations suggest that negative and positive appraisals (“I am a failure” and “I am the best,” respectively) can both generate and maintain bipolar episodes (for an integrative review, see Mansell et al., 2007).
Two further notable points can be made from our simulation on bipolarity. First, an implication from our proposal is that during depressive and manic episodes, individuals with bipolar disorder manifest a diminished ability to discriminate among stimuli. In other words, their emotional response to different experiences is predicted to be less differentiated. To our knowledge, this phenomenon has not yet been tested empirically and thus presents a novel prediction of our theory. Second, we note that a puzzling feature of some cases of bipolarity is mixed affective states, that is, states that combine depressive and manic features (Vieta & Valentí, 2013). Although our present simulation does not account for such states, in a subsequent section (Combined Mood States), we extend it so that it does.
A second form of emotional rigidity or inertia (i.e., “depressive rigidity,” as depicted in Fig. 2d) ensues when an extremely low learning rate (a = 0.001) is coupled with an extremely high starting reference point (m0 = 100). Compared with the healthy agent, this agent exhibits affective values that are persistently depressed: The values are moderate for the high-value context, low for the moderate-value context, and extremely low for the low-value context. Moreover, similar to the previous scenario of bipolarity, certain contexts (i.e., the medium- and low-value ones) are characterized by diminished oscillations in affective values, reflecting a reduced ability to discriminate life experiences.
This scenario is reminiscent of major depression. For instance, similar to what Figure 2d depicts, people with depression tend to persistently experience severely depressed mood (see Rottenberg, 2005; Rottenberg et al., 2005). Crucially, their depressed mood persists even within moderate and positive contexts, reflecting the clinical symptom of “anhedonia” (i.e., the inability to experience pleasure; Loas, 1996). On this, much like our model predicts unvarying responses across varying outcomes, empirical research has shown that depressed patients manifest the same negative affect in response to stimuli of varying emotional intensities (Rottenberg, 2005; Rottenberg et al., 2005). According to our proposal, this tendency of the depressed agent to gravitate toward low mood can be attributed to the agent’s excessively high reference point, which is resistant to change in light of new experiences (because of the low learning rate; Clark et al., 2018). This high reference point could be interpreted in at least two ways. Neurobiologically, it could be interpreted as an aberrant neuromodulatory set point, explaining the more “endogenous” cases of depression (e.g., high trait neuroticism; Barlow et al., 2021; Clark et al., 2018; Ormel et al., 2013). Psychologically, it might be interpreted as an unattainably high standard (desire) for rewards (see M. M. Smith et al., 2018), explaining the more psychological and “perfectionistic” cases of depression (e.g., those found in narcissism; M. M. Smith et al., 2016).
Our final scenario of emotional rigidity (see Fig. 2d, red line) occurs when both the learning rate (a = 0.001) and the reference point (m0 = 10) are extremely low. Affective values in this case are persistently inflated: They are moderate for the low-value context, high for the moderate-value context, and extremely high for the high-value context. Moreover, the low- and moderate-value contexts are once again characterized by diminished oscillations in affective values, reflecting a reduced ability to discriminate varying life experiences.
In principle, this scenario depicts a profile of “pure” (aka, unipolar) mania whereby euthymic states alternate with inflated affective states. Yet at the empirical level, there is ongoing debate over the existence of cases of “unipolar mania”; some clinicians have argued that such cases are, in fact, nonexistent (e.g., Solomon et al., 2003; Stokes et al., 2020). On that basis, notwithstanding its theoretical significance, this last scenario of mood rigidity might not be relevant empirically. Still, it raises an interesting question: If the general logic proposed by the model is sound, why does the case of pure mania never occur in practice?
A possibility is that within the population, the learning rate and the reference point are not independent but are negatively correlated. That is, people characterized by lower learning rate might also tend to have higher reference points. This may occur because, for instance, the neural substrate of the learning rate and of the reference point partially overlap, implying some degree of correlation between the two factors. This possibility is compatible with the empirical observation that cases of unipolar mania (defined by our model as having both excessively low reference point and low learning rate) are virtually nonexistent because they are almost always followed by at least one major depressive episode (Solomon et al., 2003). Moreover, this possibility implies that depression (characterized by an excessively high reference point and an excessively low learning rate) is more prevalent than bipolar disorders (characterized by low reference points and low learning rates), which is indeed the case at the empirical level (Mitchell & Malhi, 2004).
To summarize, the abovementioned mood disorders exhibit a flavor of MI that is characterized by emotional rigidity. In the psychological literature, this rigidity is typically operationalized by autocorrelations that denote the persistence (i.e., inertia) of particular emotions (e.g., Box et al., 2015). Here, we have formalized this phenomenon with our learning parameter, showing how a low learning rate leads to overly rigid evaluations that yield persistently inflated and/or deflated emotions. In that sense, our learning parameter has formalized the concept of inertia by suggesting that it is generated from an inability to adapt to changing life contexts.
Emotional transience
If emotional rigidity (or inertia) emerges when learning is too slow (rendering agents too rigid), then emotional transience (or instability) emerges when learning is too fast (rendering agents too plastic). Thus, emotional transience can be considered to be the polar opposite of emotional rigidity (because it is predicated on an excessively high, not low, learning rate).
Figure 3c illustrates the case of emotional transience (or instability), which ensues when learning is excessively high (α = 0.99). Comparing this scenario of transience against the one of health, we find that two key differences emerge. First, although the healthy agent requires some trials to adapt to a new context (i.e., the agent’s responses are initially extreme and gradually adapt), the transient agent adapts instantly (after a single trial). Second, once adaptation has ensued, the healthy agent manifests a consistent response to the same outcome; for instance, the healthy agent always elicit the same (neutral) response of V(R) ≈ 0.5 to the objective outcome Rt = 60. By contrast, the responses of the transient agent appear to be inconsistent; for example, the same outcome Rt = 60 is sometimes experienced positively (V[R] > 0.5; Trial 3) yet other times experienced negatively (V[R] < 0.5; Trial 50). These evaluative inconsistencies are due to the excessive learning rate (α = 0.99), which renders the agent’s reference point almost equivalent to the agent’s most recent experience (mt+1 ≈ Rt). Thus, the transient agent’s reference is in constant flux, producing inconsistent responses to the same outcomes.
This pattern of evaluative inconsistency and emotional transiency is reminiscent of the pervasive instability (in emotions, relations, and self-perceptions) that typifies BPD. For instance, in line with our simulation, people with BPD are known to be “a function of their present situations” (Hochschild Tolpin, et al., 2004, p. 112), frequently changing their personal tastes and preferences (e.g., their core values, goals, and friends) in accordance to their most recent experiences. Moreover, experimental evidence suggests that during mind-wandering tasks, people with BPD are more likely than healthy control subjects to generate inconsistent and extreme evaluations about themselves and others (Kanske et al., 2016). Note that such evaluations appear to vary quickly, in line with our notion of “transiency” (Kanske et al., 2016). Finally, evaluative inconsistencies have long been theorized to be central to “self-instability” (Kaufman & Meddaoui, 2021), an aspect of BPD that was recently demonstrated to account for its remarkable affective instability (Kockler et al., 2022; P. S. Santangelo et al., 2017, 2020).
Our model formalizes this self-instability by casting it as a form of evaluative inconsistency: the rapid changing of reference points by which one evaluates life events. To illustrate, consider the situation in which a seemingly “perfect” romantic date (R1 = 100) sets up a high “standard” for another one (m2 ≈ R1), only to lead to a strong disappointment (or even a “fear of abandonment”) in a BPD agent when it is merely rescheduled by the other party (R2 = 40 << m2 ≈ 100). Indeed, in line with this formulation, prominent treatment protocols for BPD, such as dialectical-behavior therapy, focus on diverting patients away from such “ruminative” states (regarding less-than-ideal outcomes) and toward states of “acceptance” (that could be interpreted as using more consistent reference points when evaluating suboptimal situations; Dimeff & Linehan, 2001). Our model is consistent with these psychotherapeutic approaches and provides a mathematical intuition of why they might be successful in treating conditions characterized by transient emotions and self-perceptions (including not only BPD but also its frequent comorbidities, such as eating pathologies; Linehan & Chen, 2005).
To summarize, emotional transience reflects fleeting, rather than persisting, emotionality. Also known as “emotional instability,” this type is usually operationalized in terms of high variability and low autocorrelation of mood time series (Dejonckheere et al., 2019). Note that these fleeting dynamics were shown to be the polar opposite of those that typify emotional rigidity (Thompson et al., 2012), in keeping with our model that places these types on opposite ends of a learning-rate spectrum. Our model clarifies the nature of these concepts by suggesting that rigidity can be generated from overly consistent evaluations (rooted in an inability to adapt to changing life conditions), whereas transience can be generated from overly inconsistent evaluations (rooted in the tendency to overidentify with changing life conditions; Deutsch, 1942).
Emotional reactivity
Our final type of MI is generated from an inflated weight parameter (p). When this occurs, agents become emotionally reactive: They experience inflated affects because they overweigh the “meaning” of their life experiences.
These patterns of reactivity are illustrated in Figure 4c, in which all parameters are set to a moderate and adaptive range (m0 = 60, a = 0.2) except the weight parameter, which is set to be high and maladaptive (p = 0.1). Comparing this scenario of reactivity to the one of health (Fig. 4b), we find that one key similarity emerges (because of the adaptive learning rate): Affective values are more extreme when the context shifts and progressively adapt to the new status quo. Nevertheless, one key difference is also evident: The reactive agent exhibits greater oscillations in affective values, reflecting an enhanced reactivity to life outcomes. Formally, these inflated life outcomes occur because the high weight inflates the discrepancy between one’s reference and reality (Rt − mt), magnifying their affective values V(Rt).
We argue that this magnification of affective values reflects several “reactive” temperaments that are typified by an overinterpretation of life events. For instance, the so-called “narcissistic rage,” which entails strong and inappropriate anger in the face of minimal provocation, can be considered a form of emotional reactivity (Krizan & Johar, 2015). Likewise, “defensive aggression,” which is typical of antisocial personalities, is a reactive form of aggression that occurs in response to perceived provocation (Vaidyanathan et al., 2011). Finally, contemporary classification systems describe a “marked reactivity in mood” as a key symptom of BPD (see American Psychiatric Association, 2013, p. 663), suggesting that individuals with borderline personality can also exhibit emotional reactivity (beyond the previously mentioned emotional transiency).
Note that our model pinpoints these reactive temperaments to a computational parameter (p) that has a clear and formal interpretation: that is, overweighing the meaning of life experiences, which results in overconfident inferences (e.g., “I am sure you hate me”). Our definition for this parameter mirrors that of “hyper-mentalizing,” which has been defined as the overcertainty regarding the meaning of social experiences when there is minimal or no objective data to support such inferences (Sharp, 2014). Accordingly, both emotional reactivity in general and hyper-mentalizing in particular have been reported in a number of psychiatric disorders, including psychosis (Myin-Germeys, Krabbendam, et al., 2003; Myin-Germeys, Peeters, et al., 2003), bipolar disorder (Henry et al., 2007), and attention-deficit/hyperactivity disorder (Isaksson et al., 2019; for a meta-analysis, see McLaren et al., 2022). Our model is in line with this evidence and provides a formal interpretation of why hyper-mentalizing (or overcertain) states may lead to emotional reactivity.
The clinical conditions just mentioned manifest excessive reactivity particularly in the domain of negative affect. Regarding the domain of positive affect, evidence suggests that various facets of extraversion (e.g., enthusiasm or sensation-seeking) can be interpreted in terms of an enhanced responsivity to reward (Lucas et al., 2000; Smillie, 2013). Note that when extreme, these personality facets can promote “impulsivity” (Revelle, 1997), a symptom found in a plethora of psychiatric conditions, including bipolar disorder, personality disorder, and substance abuse, to name a few (Moeller et al., 2001). Although impulsivity is a multidimensional trait (comprising emotional, motivational, and cognitive components), one of its central facets includes the “rapid and unplanned reaction to positive internal or external stimuli” (see Moeller et al., 2001, p. 1784). Taken from this perspective, our “positively valenced” reactivity could be regarded as one among many mechanisms that promote impulsivity (see Strickland & Johnson, 2021).
In summary, these patterns illustrate a type of MI that is characterized by strong emotional reactivity, sometimes also referred as “stress reactivity” or “emotional sensitivity” (Marwaha et al., 2014). Our model clarifies the nature of this MI pattern by suggesting that it occurs when the salience of stimuli is inflated. In this view, the notion of reactivity is distinct from its close relative, transience, because the former can be casted as a form of evaluative certainty (rooted in a trait of sensitivity; Linehan, 1987) and the latter can be casted as a form of evaluative inconsistency (rooted in a trait of self-instability; Deutsch, 1942).
Combined mood states
So far, we have explored each MI type in isolation. However, our three MI types can also be considered in combination in the sense that they co-occur within the same person (with the exception of emotional rigidity and transience, which are polar opposites). In this subsection, we briefly turn to two such combinations: emotional reactivity with either bipolar rigidity (to explain “mixed affective states”) or emotional transience (to explain “severe BPD”).
First, consider the scenario of combined bipolar rigidity and emotional reactivity (Fig. 4d), which ensues when a low learning rate (a = .001) and a moderate reference point (m0 = 60) are paired with a high weight parameter (p = 0.1). Given a = .001 and (m0 = 60), this scenario reproduces the previous case of bipolar rigidity: Affective values are extremely positive/negative during high-/low-value contexts, reflecting mania/depression, respectively. However, because of the strong weight parameter (p = 0.1), another pattern is now evident: Affective values oscillate greatly within the moderate context, reflecting high reactivity to all moderate stimuli. Thus, states of depression and mania here are interjected by another state of extreme reactivity to moderate stimuli.
We suggest that this extreme reactivity to varying stimuli may reflect what clinicians have termed as “mixed states.” Although the definition of such states has always been hazy, research in the past decade has demonstrated that one of their defining features is “irritability” (Vieta & Valentí, 2013, p. 33), which “includes a feeling that one’s emotional responses are unjustified or disproportionate to the immediate source” (Barata et al., 2016, p. 170). Indeed, both clinical observations and psychometric analyses have converged on similar models that place emotional responsivity, not valence, at the heart of mixed states (Henry et al., 2003, 2010; Pacchiarotti et al., 2013; Perugi & Akiskal, 2005; Swann et al., 2013), leading to the conclusion that “mixed states may be better defined by emotional hyper-reactivity, meaning that patients feel emotions with a greater intensity and depending on the environmental context” (Henry et al., 2007, p. 37). Our model is remarkably consistent with these perspectives because it suggests that mixed states can be formalized as fast oscillations between extreme emotions—oscillations that occur in response to varying events. Our model is also consistent with evidence suggesting that mixed states are linked to worse clinical outcomes (Swann et al., 2009, 2013; Vieta & Valentí, 2013) by revealing that depressive and manic episodes are magnified in bipolar agents who exhibit mixed states (see Fig. 4d).
Our other case of mixed affective states is one in which a high learning rate (a = 0.99) and a moderate starting reference (m0 = 60) are paired with a high weight parameter (p = 0.1) to produce an amalgam of emotional transience and reactivity, which exhibits by far the most unstable affective dynamics (Fig. 3d). In brief, this picture depicts an agent who experiences both brief emotions (because of inconsistent evaluations) and extreme emotions (because of reactive evaluations). Note that this fast-paced instability occurs across all contexts, suggesting that this agent is similarly unstable across all life situations.
This scenario of fast-paced instability mirrors severe borderline psychopathology. Indeed, researchers sometimes distinguish between two types of BPD based on severity. The less severe type (so-called “as-if” character or “quiet” BPD) is characterized by an unstable identity, internalizing psychopathology, and coping only through intense attachment to others (Deutsch, 1942; Sherwood & Cohen, 1994). Conversely, the more severe type is typified by externalizing psychopathology, strong emotional reactivity, and impulsivity (Arntz et al., 2003; Bales et al., 2012). Our model is in line with these perspectives because it distinguishes emotional transience from reactivity, placing the former on a spectrum of mood transience (in line with the less-severe type) and the latter on separate spectrum of personality pathology (in line with the more-severe type). Our model further implies that the combination of both types (i.e., both transience and reactivity) yields the most unstable BPD profile, in line with empirical observations showing that severe cases of BPD exhibit both internalizing and externalizing psychopathology (Gunderson et al., 2018).
To summarize, in this section, we have shown how emotional reactivity can combine with bipolar rigidity and borderline transiency to exacerbate mood problems: specifically, by generating patterns that resemble mixed episodes (in bipolar disorder) and extreme emotional oscillations (in BPD). In the next and final section, we outline more formally the main predictions of our theory.
Model Predictions
In this final section, we outline the core predictions of our theory and illustrate how they can help advance the theoretical status of MI regardless of whether they prove to be right. In doing so, we focus on a brief exposition of each prediction and its corresponding empirical examination (aimed to corroborate or refute the prediction). For more details on how to test each prediction, interested readers are referred to the Supplemental Material available online.
Our model and its overarching theory make at least three novel predictions. The first prediction is that mood valence will be predicated entirely on prediction errors. Although this prediction is inspired by past reinforcement-learning research, we note that it forms one of the primary tests of our model for the following reason: Our model assumes that neutral affective experiences are independent of the objective value of a reward. Thus, a student who expects a high grade of 99/100 and receives exactly that high grade of 99/100 will be just as “satisfied” as the student who expects a low grade of 10/100 and receives exactly that low grade of 10/100. Although this is a rather strong and novel assumption, we believe it is worth examining for the following reason: If it is supported, it would provide evidence in favor of a simple prediction-error way of understanding mood valence; if it is refuted, however, it would offer preliminary evidence for a more expanded way of understanding mood valence (one whereby valence is determined by both prediction errors and raw value of rewards). In the Supplemental Material, we illustrate in more detail how researchers can test this prediction of our model by pinning two alternative, nested models against each other, yielding novel insights on mood valence regardless of which model is supported.
The second prediction of our model is clinical and suggests that our MI parameters will robustly predict specific mental-health problems. Although one way to test this hypothesis is with a case-control design (whereby our formal parameters are compared in a discrete manner across clinical and nonclinical samples), we believe that a dimensional approach is more robust, especially in light of recent findings implying that computational parameters are normally distributed and can predict mental-health problems across clinical and nonclinical samples (e.g., Story et al., 2024). On that basis, we suggest that a better test of our clinical predictions is to (a) recruit a diverse general-population sample (or a transdiagnostic clinical sample) who will complete a (b) set of self-reported scales (measuring psychopathology symptoms) and (b) a relevant monetary-reward experiment (see the Supplemental Material) to derive our computational parameters. Hypotheses in this setting would be as follows:
Hypothesis 1: Reference points will be positively associated with perfectionism, narcissism, and neuroticism (or depressed mood), highlighting that people with these problems may have higher standards for reward.
Hypothesis 2: Reference points will be negatively associated with (hypo-)mania.
Hypothesis 3: Learning rates will be positively associated with identity disturbance, suggesting that emotional instability in “borderline personality” may be rooted in self-instability.
Hypothesis 4: Learning rates will be negatively associated with anhedonia and depressed mood, suggesting that rigid mood states are based on an inability to adapt to different settings.
Hypothesis 5: Weight parameters will be positively associated with antagonism in relational pathology and aberrant salience in psychosis, formalizing their strong reactivity to external stimuli.
Hypothesis 6: Weight parameters will be negatively associated with apathy, formalizing its nonreactivity to external stimuli.
In our Supplemental Material, we flesh out the details of these hypotheses and comment specifically on the theoretical value that could be gained if they are supported or refuted by future research.
Finally, we highlight one last and rather tentative prediction: that is, that our computational parameters will also be associated with particular “longitudinal affects.” To elaborate, our model makes the following exploratory but clear predictions:
Prediction 1: Learning rates, a, will be negatively related to measures of mood rigidity, such as high affect autocorrelations.
Prediction 2: Learning rates, a, will be positively related to measures of mood transience, such as the mean square of successive differences.
Prediction 3: Weight parameters, p, will be positively related to measures of mood reactivity, such as regression slopes of how life stressors predict certain affects.
Although these predictions are explicit, we emphasize that we view them mostly as exploratory and hypothesis-generating because at the time of writing, there are several conceptual and methodological uncertainties surrounding the fields of both computational and longitudinal modeling. Specifically, there is limited evidence that cognitive mechanisms from controlled laboratory settings (e.g., “learning rates” or “sensitivity parameters”) map on mood dynamics from real-life settings (Bennett et al., 2019; Huys et al., 2021; Karvelis et al., 2023; Zavlis et al., 2025b). Likewise, several methodological concerns, such as limited reliability and validity in both experimental (Karvelis et al., 2023; Zavlis et al., 2025b) and longitudinal research (Jahng et al., 2008; Trull & Ebner-Priemer, 2013), limit the power of a thorough test of our model. In that sense, we believe that our hypotheses above are best regarded not as conclusive tests of our formal model but rather as scaffolds for an agenda of translational research that aims to enhance both the computational and longitudinal fields by integrating lab-based computational findings with real-life mood dynamics (see Discussion).
To summarize, we have outlined two confirmatory predictions (relating to mood valence and instability patterns) and one exploratory prediction (relating to the integration of computational and longitudinal lines of research). Although we believe that these hypotheses are plausible given prior computational work (e.g., Emanuel & Eldar, 2022), we note that even if they are refuted, they could still point to notable insights regarding the most likely mechanisms that generate disparate mood dynamics (for details, see the Supplemental Material).
Discussion
In this article, we have outlined a formal theory of MI that can generate three distinct MI types: emotional rigidity (the tendency to experience prolonged emotions), emotional transiency (the tendency to experience fleeting emotions), and emotional reactivity (the tendency to experience strong emotions; see Fig. 5). In the discussion that follows, we explore the contributions of this computational perspective on the theory, clinical utility, and measurement of MI before concluding on current limitations and future directions for research in this line of inquiry.
Theory
To begin with, we note that our formal perspective could help address existing debates on the nature of MI by integrating disparate theoretical viewpoints. To date, much ink has been spilled on theoretical debates around MI, including whether it contains various subtypes (Hamaker et al., 2015), to what extent these subtypes are pathognomic or transdiagnostic (Trull et al., 2015; Tsanas et al., 2016), and how they could best be defined (Broome, He, et al., 2015; Broome, Saunders, et al., 2015). Our formal theory could help address these debates by showing how three well-known MI types (concerning rigid, transient, and strong emotions) can be generated from a single computational process: the process of evaluating stimuli. In so doing, our theory provides a formal way of thinking about different forms of MI and a pathway toward integrating longitudinal and experimental approaches to mood dynamics.
To briefly elaborate, our theory outlines how three central concepts from longitudinal research (rigidity/inertia, transience/instability, and reactivity) can be generated from three cognitive parameters (reference point, learning rate, and weight parameters), providing a blueprint for how the disparate lines of longitudinal versus cognitive research could be integrated. For example, reward-based paradigms (which are typically employed in experimental settings to quantify cognitive parameters from behaviors) could be employed to measure the learning rate (how fast participants adjust their expectations of future rewards), weight parameter (how strongly participants weigh prediction errors), and reference point (for reviews, see Hitchcock et al., 2022; Huys et al., 2021; Zavlis et al., 2025a,b). In turn, these cognitive parameters can be linked to corresponding longitudinal parameters, examining whether rigidity-inertia can be understood as a form of evaluative consistency (based on a slow learning rate), transience-instability can be understood as a form of evaluative inconsistency (based on a fast learning rate), and reactivity can be understood as a form of evaluative certainty (based on an increased weight parameter). So far, research in computational psychiatry has investigated evaluative processes in cross-sectional research, revealing notable findings, including that individuals with major depression tend to evaluate disparate stimuli in a consistently pessimistic way (implying a low reference point; Kube, 2023) and that individuals with BPD and psychosis tend to overweigh the value of social stimuli (implying a higher weight parameter; Henco et al., 2020). Adding a temporal dimension to these analyses could enhance ecological validity by showing how aberrant computational processes are linked to mood outcomes in naturalistic settings—and how they ameliorate in therapeutic settings (Hitchcock et al., 2022; Karvelis et al., 2023; Moutoussis et al., 2018; Zavlis et al., 2025a,b).
Beyond elucidating the links between experimental and longitudinal parameters, our formal framework was able to generate novel predictions (that were not a priori specified). For example, our simulations indicated that manic/depressive episodes will be typified by a reduced ability to discriminate positive/negative stimuli, providing a novel hypothesis that was not prespecified but is still in line with existing evidence on mood disorders (see Kube, 2023). Likewise, our model predicted that severe borderline psychopathology will be typified by both transience and reactivity, mirroring traditional theories on the topic (e.g., O. Kernberg, 1967; Knight, 1953) that have not yet been precisely examined (Zavlis et al., 2025a). These hypotheses are notable because they emerged without a priori specification, highlighting the utility of formal, as opposed to verbal, theorizing in generating precise, quantifiable, and undogmatic predictions that can be more explicitly examined against empirical observations (see Fried, 2020; Haslbeck, Ryan, et al., 2021; Lewandowsky & Farrell, 2011; Robinaugh et al., 2021; Smaldino, 2020). In our Supplemental Material, we expand on these hypotheses further, commenting on how they could be tested such that they are informative regardless of whether they prove to be “right” (Nosek et al., 2019).
Clinical practice
Beyond theory, our formal framework may also prove useful to clinical work. For example, our framework clarifies the clinical underpinnings of various maladaptive emotions by suggesting that rigid emotions stem from overly consistent evaluations (“I am worthless”; “I am incredible”), transient emotions stem from overly inconsistent evaluations (“I hate you” → “I love you”), and strong emotions stem from overly precise evaluations (“I am sure you hate me!”). Transient and rigid emotions were generated from the opposite ends of a single computational parameter, implying that they may form a continuum of internalizing psychopathology that includes emotionally rigid disorders on the one end (e.g., unipolar vs. bipolar depression) and emotionally unstable disorders on the other (e.g., borderline personality; Barlow et al., 2021; MacKinnon & Pies, 2006; Paris et al., 2007; Tyrer, 2009). Of course, it is also possible that both sets of disorders exhibit the third type of MI (emotional reactivity), complicating diagnostic practice (e.g., the presence of emotional reactivity in bipolar cases leading to their misdiagnosis as “personality disorder” cases; Morgan & Zimmerman, 2015; Zimmerman et al., 2010). Our framework is consistent with dimensional classification systems and suggests that the continuum of emotional transience-rigidity might belong to the internalizing spectrum, whereas that of emotional reactivity may be more characteristic of the externalizing spectrum.
This dichotomy may prove fruitful in elucidating the nature of personality psychopathology by parsing it into two types: an internalizing one (typified by an inconsistency in evaluating oneself, which yields emotional transience; Deutsch, 1942) and an externalizing one (typified by an overcertainty when evaluating social stimuli, which yields emotional reactivity; Linehan, 1987). Currently, these two types are conglomerated within the general concept of “personality functioning,” which is associated with the entire tapestry of personality, including its more internalizing and externalizing aspects (Hopwood, 2025; Kerber et al., 2024). Our theory is consistent with traditional and recent perspectives that have called for the delineation of these two aspects (and the move away from the stigmatizing label “personality disorder”) by suggesting that to the extent that someone’s “personality” problems concern inconsistent evaluations of “the self,” they may be more appropriately considered as “internalizing mood problems” (Barlow et al., 2021; Bowen et al., 2012; Ellard et al., 2010; Ormel et al., 2013), whereas to the extent that they concern reactive evaluations of others, they may be better considered as “externalizing social problems” (Hopwood, 2024; Wright et al., 2022; Zavlis, 2023, 2024b; Zavlis et al., 2025a).
Beyond diagnostic matters, we note that several predictions of our framework align with existing evidence on therapeutic processes, implying that our parameters could be leveraged to formalize mechanisms of therapeutic change. For example, our theoretical prediction that rigid mood disorders (i.e., depression) ameliorate with increased learning of positive reference points may formalize recent theories suggesting that antidepressants work by enhancing neuroplasticity for positive information (Page et al., 2024). Likewise, our prediction that transient emotions ameliorate with more consistent evaluations could formalize the evidence-based observation that borderline instability subsides when patients adopt more integrated and consistent ways (i.e., reference points) of viewing themselves and others (O. F. Kernberg et al., 2008). Finally, our conjecture that reactive emotions subside when patients adopt more uncertain evaluations could formalize the idea that hyper-mentalizing states ameliorate when patients are encouraged to engage in more “nuanced” and “uncertain” mentalizing (Bateman & Fonagy, 2016). Together, these patterns illustrate that it might be possible to link well-defined computational parameters to therapeutic processes, examining them based on experimental paradigms from the nascent field of computational psychotherapy (see Moutoussis et al., 2018; R. Smith et al., 2020).
Measurement
This brings us to the final possible contribution of our theory: the longitudinal measurement of mood dynamics. Contemporary research on mood dynamics is primarily based on “clock time,” examining how emotions vary either within days or across days/weeks (e.g., Dejonckheere et al., 2019). Although somewhat informative, such approaches have been criticized because clock time does not necessarily map onto psychological mechanisms (for a commentary, see Hopwood et al., 2022). For instance, research on BPD has showcased that its remarkable instability is better predicted by fluctuating social stressors (Lazarus et al., 2014; Sadikaj et al., 2013) rather than by the passage of time (Russell et al., 2007; Trull et al., 2008). Likewise, recent experience-sampling studies have illustrated that mood problems are typically preceded by important life events, such as daily social interactions (Benson et al., 2019), but also strong life stressors, such as divorce (Bleidorn et al., 2020) or unemployment (Luhmann et al., 2014). Our theory is consistent with these event-contingent approaches (Moskowitz & Sadikaj, 2012) because it suggests that affect is interwoven with life events, implying that a deeper insight of the former could perhaps be achieved only through concomitant measuring and modeling of the latter.
Indeed, focusing on life events, rather than clock time, may reap additional benefits, including informing researchers about the number, frequency, and timing of longitudinal assessments. As an example, consider research on mood reactivity, which has indicated that mood gets triggered differently across clinical populations—for example, people with borderline traits are more reactive to less affiliative people (Sadikaj et al., 2013) and people with narcissistic traits are more reactive to assertive people (Wright et al., 2017). This research implies that “timing” the assessments so that these disparate triggers get captured is vital when examining reactive dynamics. The same event-contingent approaches might be necessary when investigating transience and reactivity, which, by definition, vary, respectively, based on fast-scale events (e.g., daily hassles; Mauss et al., 2005) and longer-scale events (e.g., divorce; Bleidorn et al., 2020). Our theory is consistent with this evidence and embraces recent calls suggesting that “the spacing of assessments in longitudinal designs should be based on a formal theoretical model about the underlying psychological process” (Hopwood et al., 2022, p. 888).
Nevertheless, although our theory presents an advancement in this sense, it does not yet elucidate the timescales of particular mood dynamics (given insufficient empirical evidence). In that sense, more work is needed on event-contingent approaches to uncover the kinds of life events that trigger particular kinds of affects (see Moskowitz & Sadikaj, 2012). Such research could start with pilot longitudinal investigations that oversample life events to adjudicate how many and which ones are, in fact, needed for the modeling of specific mood patterns (for a discussion, see Boker & Nesselroade, 2002). Researchers could then more specifically test how frequently specific event-affect contingencies occur in people’s lives and whether those contingencies vary across psychiatric disorders (Hopwood et al., 2022). Ultimately, the understanding of mood timescales will be predicated on the synergistic use of data-driven models (that uncover event-affect patterns) and theory-driven models (that instantiate those patterns in well-defined formal theories; Haslbeck, Ryan, et al., 2021; Ryan et al., 2025).
Limitations and future directions
Despite the strengths of our framework, several of its limitations must also be acknowledged. First and foremost, we have favored model parsimony rather than complexity. As an example, we have assumed that our only dynamic (time-varying) parameter was the reference point. Arguably, however, the learning rate and weight parameters could have also been time-varying (with their own update equations), yielding interesting affective dynamics; for instance, with higher learning for positive information (Pulcu & Browning, 2017) or situational, rather than pervasive, reactive patterns (Bellemare et al., 2023). However, it is not clear how (or whether) such dynamics inform MI, which is why we have opted for fixing these parameters to first explore whether any insights could be garnered with a more parsimonious model. Future work may wish to expand our framework by adding dynamic equations for weight or learning, examining whether they can add further insights on MI. In the same spirit, future work may wish to also expand our framework by including computations for different kinds of evaluations, including evaluations of human behavior (which can yield social emotions, such as anger) or evaluations for future prospects (which can yield future-oriented emotions, such as anxiety; for details, see Emanuel & Eldar, 2022).
A second and related limitation is that our model treats valence as a bipolar dimension, varying from positive to negative emotion. By contrast, some researchers have long noted that positive and negative affect could exist as two independent dimensions (see Diener & Emmons, 1984; Warr et al., 1983; Watson et al., 1999). Following this work, an interesting research avenue could be to extend our model by distinguishing positive and negative emotions. Still, we note that the unidimensional approach employed here is already sufficient to explain a wealth of mood dynamics. Moreover, recent work on mood dynamics has supported this assumption by illustrating that maladaptive ways of feeling reflect “a more bipolar experience of positive and negative affect, reflecting reduced emotional complexity and flexibility” (Dejonckheere et al., 2018). This research suggests that maladaptive mood patterns could be captured by simpler models that treat valence in more “unidimensional” and “inflexible” terms, precisely as our framework suggests (see also Brose et al., 2015; Feldman, 1995; Rafaeli et al., 2007).
A final limitation of our work may concern its purely theoretical nature. Indeed, here, we have focused on simulations to examine whether a simple computational model of evaluation can shed light on key aspects of MI. Still, although we have not conducted any empirical tests, we note that our model is firmly grounded on previous empirical research. For example, the notion that emotion is fundamentally reference-dependent has been supported by various studies illustrating how emotions are dependent not on life outcomes per se (e.g., a student’s grade on a school test) but rather on the discrepancy between those outcomes and reference outcomes (e.g., the mismatch between a student’s actual grade and desired grade; Eldar et al., 2016; Emanuel & Eldar, 2022; Otto & Eichstaedt, 2018; Villano et al., 2020). Likewise, the idea that evaluation is central to emotion is a core tenet of validated cognitive models (Koszegi & Rabin, 2006; Louie et al., 2013, 2014, 2015; Rigoli, 2019; Stewart et al., 2006, 2015). In that sense, plenty of research already supports the central tenets of our framework, implying that future work may wish to examine its more novel predictions, including the link between our cognitive parameters and (a) specific clinical problems and (b) longitudinal affect patterns. To these ends, we refer readers to our Supplemental Material and GitHub repository (https://github.com/OrestisZavlis/MoodInstability), which provide more information on how to test our theoretical predictions.
Conclusion
To conclude, we have developed a theory that formally integrates three well-known MI types: emotional rigidity, transience, and reactivity. Our theory illustrates how specific evaluative processes (i.e., evaluative consistency, inconsistency, and certainty) can generate specific emotional experiences (respectively, emotional rigidity, transiency, and reactivity). In that sense, our theory suggests that the disparate lines of experimental versus longitudinal research can be fruitfully united by examining how well-defined computational parameters relate to longitudinal mood patterns. Future work is necessary to examine these computational-longitudinal linkages and enrich them by elucidating the timescale of their dynamics.
Supplemental Material
sj-docx-1-cpx-10.1177_21677026251363862 – Supplemental material for A Formal Theory of Mood Instability
Supplemental material, sj-docx-1-cpx-10.1177_21677026251363862 for A Formal Theory of Mood Instability by Orestis Zavlis, Richard P. Bentall, Peter Fonagy and Francesco Rigoli in Clinical Psychological Science
Footnotes
Transparency
Action Editor: Aleksandra Kaurin
Editor: Jennifer L. Tackett
Author Contributions
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
