Abstract
This survey focuses on the standard assumption in DSGE models: rational expectations (RE) with perfect information (PI) aka full information (FI)—hence FIRE. RE means model consistent expectations—agents be they households, firms, banks or policymakers know your model. PI (or FI) means agents observe or can infer the current and past state variables in your model. RE + PI (or FIRE) is a strong assumption. The purpose of this survey is to examine the literature that relaxes RE or PI or both. This is relevant for DSGE models in general, but particularly so for the efficacy of monetary policy in a New Keynesian environment when the expectation by agents of future policy is of crucial importance.
Introduction
There have been a number of recent assessments of the ‘state of macro’ and the contribution of dynamic stochastic general equilibrium (DSGE) models—a list that is by no means exhaustive would include: Blanchard (2009, 2016), Blanchard et al. (2010, 2013), Driffill (2011), Pesaran and Smith (2011), Blanchard and Summers (2017), Vines and Wils (2018), Christiano et al. (2018) and Levine (2020).
This survey has a more narrow focus on the standard assumption in DSGE models: rational expectations (RE) with perfect information (PI) aka full information (FI)—hence FIRE. RE means
We begin with departures from RE and a recent behavioural macroeconomics literature. The ‘Behavioural Macro models’ section sets out the most common equilibrium concepts found in this literature. The third section sets out a standard NK model we use as an application in the rest of the paper. The fourth section moves on to models with heterogeneous agents consisting of both RE and non-RE agents and examines a class of equilibria when the latter can learn from the former through reinforcement learning. The fifth section then moves on to RE models where the PI assumption is relaxed in favour of imperfect information (II). The sixth section reviews important empirical results that assess, first, what we describe as the ‘wilderness’ of departures from RE and, second, the ability of the RE NK model with the II assumption to provide a better data fit than PI. The seventh section concludes the article.
Beyond RE: Equilibrium Concepts
In departures from RE, two sets of equilibrium concepts and related literature need distinguishing. The first is
Statistical Learning
Applications to macroeconomics were pioneered by Evans and Honkapohja (2001). The main idea is to replace RE with statistical forecasts based on knowledge of the structure of the RE solution—
To formalize the concept, consider the state-space form of a log- linearized DSGE model:
where
The
In the
where [
Behavioural Macro-models
This class of models have one or more of the following features: (a) adaptive expectations in models of individual rationality (b) heterogeneous expectations and reinforcement learning (c) cognitive discounting and (d) agent inattention in otherwise rational models. We examine five concepts in turn.
Let
Solving for the actual law of motion, this leads to first-order autocorrelations in the stochastic steady state
See Hommes and Zhu (2014) and Hommes et al. (2023). It should be stressed that the SCEE is
Gobbi and Grazzini (2015) perform OLS on a first-order VAR of the full state, including shock processes. Eusepi and Preston (2011) perform OLS on a finite approximation of an infinite VAR of a subset of the state space. Hommes and Zhu (2015) use a parsimonious first-order VAR to fit mean and persistence of each state variable to data. All these papers assume the solution of an RE model can be approximately expressed as a finite VAR, which in itself can be a strong assumption as shown by Fernandez-Villaverde et al. (2007). All these papers also use the SCEE concept (aka a
with perfect foresight E[
Similarly, for
and so on. In the applications of this concept cited, as
By iteration, this can be written as
Thus, the expected value is a weighted average of past values of
Anufriev et al. (2015) propose a more general adaptive expectations rule:
This conforms with lab experiments, a speciality of Hommes and colleagues.
Turning to AU, a closely related literature develops the concept of internal rationality (IR) (see Adam & Marcet, 2011). Under both IR and AU, agents maximize utility under uncertainty, given their constraints and a consistent set of probability beliefs about payoff-relevant variables that are
This subsection has reviewed a number of equilibrium concepts found in the literature that relax the RE assumption. In the rest of the paper, we will compare a standard NK that assumes RE with a number of behavioural counterparts. In the third and fourth sections, the behavioural model chosen is that with AU learning (concept IV). In the sixth section, the need for robust policy is demonstrated in its most clear fashion by comparing the RE model with EL (concept IV) and the inattention-myopia model (concept V). Finally, the section ‘Does Imperfect Information Improve Data Fit?’, reverts to AU in a comparison between RE with perfect and imperfect information.
RE and Bounded Rationality in the NK Model
Ultimately, our application will be conducted in terms of a linear NK RE model, under both perfect and imperfect information, and in a behavioural NK model. But first we step back to the underlying
This subsection has reviewed a number of equilibrium concepts in the literature that go beyond RE.
Households
Household
and the value function of the representative household at time
The household’s problem at time
where
where
where Λ
The first-order conditions up to now are suitable for the RE solution. We now express the solution in a form suitable for moving from a RE to a learning equilibrium. We consider the limit as
where the present (expected) value of a series
writing
The forward-looking budget constraint (18) holds for the representative household. If we allow RE and BR agents to borrow from or lend to one another, we must allow for
Solving (16) forward in time and using the law of iterated expectation, we have for
We now express the solution to the household optimization problem for
Substituting (21) and (22) into the forward-looking household budget constraint, using
which can be written in recursive form as
Consumption is then given by (23) assuming point expectations or by the symmetric form of the Euler equation (16) under full rationality (i.e., households know symmetric nature of equilibrium with
Firms, Government Expenditures and Monetary Policy
This section sets out the wholesalers and the retail sector which optimizes using Calvo-pricing contracts. We close the non-linear set-up with resource and balanced government budget constraints, a monetary policy rule and by specifying the structural shocks in the economy. Wholesale firms employ a Cobb–Douglas production function to produce a homogeneous output
where
The retail sector costlessly converts a homogeneous wholesale good into a basket of differentiated goods for aggregate consumption
where ζ is the elasticity of substitution. For each
where
Following Calvo (1983), we assume that there is a probability of 1
subject to (26), where
Denoting the numerator and denominator by
Using the fact that all resetting firms will choose the same price, by the law of large numbers, we can find the evolution of inflation given by
Price dispersion lowers aggregate output as follows. Market clearing in the labour market gives
using (26). Hence equilibrium for good
Assuming that the number of firms is large from Online Appendix E, we obtain the following dynamic relationship:
To close the model, we first require total profits from retail and wholesale firms, T
using the first-order condition (24). Then to complete closure, we have resource and balanced government budget constraints
where
and
Recovering the NK Workhorse Model
We now show that the linearized form of the non-linear model about the steady state reduces to the standard workhorse model where rational expectations E
from (23) where lower case variables
in a symmetric equilibrium. Under RE, (32) or (33) leads to the same equilibrium, but under BR, this is no longer the case.
Linearizing the household supply of hours decision, the resource constraint and the Fisher equation, we have
which completes the decisions of the household. Substituting out for
Turning to the supply side, for the wholesale sector
For retail firm
Then, in a symmetric equilibrium, we have
where E
to obtain in a symmetric equilibrium
Then, substituting back into (40), we arrive at
which omits learning about aggregate inflation. Under RE, (41) and (43) are equivalent. (43) is equivalent to
where
where we note that
The form of the Phillips curve (43) is often used in the behavioural NK literature (see, e.g., De Grauwe, 2012b), but as we have shown, this assumes that firms
AU Learning and Market-consistent Information
With AU learning, our learning model is one where agents make fully optimal decisions given their individual specification of beliefs but have no macroeconomic model to form expectations of aggregate variables. We draw a clear distinction between aggregate and internal quantities so that identical agents in our model are not aware of this equilibrium property (nor any others).
To close the model, we need to specify the manner in which households and firms form their expectations. To do so, we assume that variables which are local to the agents, in a geographical sense, are observable within the period, whereas variables that are strictly macroeconomic are only observable with a lag. This categorization regarding information about the current state of the economy follows Nimark (2014). He distinguishes between the local information that agents acquire directly through their interactions in markets and statistics that are collected and summarized, usually by governments, and made available to the wider public.
3
The policy rate is announced by the central bank, so it is observed without a lag and it is common knowledge. Given this, we assume an adaptive expectations forecasting rule given below by (47) and (48) about variables external to agents’ decisions. Let
Expressing E
which is now expressed in terms of one-step-ahead forecasts by the standard adaptive expectations rule
4
:
Households make inter-temporal decisions for their consumption and hours supplied given adaptive expectations of the wage rate, the nominal interest rate, inflation and profits. These macro-variables may in principle be observed with or without a one-period lag (
We distinguish household and firm expectations
where again one-step-ahead forecasts are given by the adaptive expectations rule:
Retail firms make inter-temporal decisions for their price and output given adaptive expectations of the aggregate inflation rate and their post-shock real marginal shock wage rate. As before, these variables may be observed with or without a one-period lag (
Heterogeneous Expectations and Reinforcement Learning
There is a growing literature within behavioural macro-models based on the Brock and Hommes (1997) framework where agents learn from each other through
RE expectations are then replaced with
Heterogeneous Expectations with Fixed Proportions of RE and BR Agents
Now we turn to the heterogeneous expectations model with BR(AU) agents alongside RE agents with fixed proportions of each type. We assume all RE agents
Zero net wealth
RE vs RE–BR Composite Expectations with nh = nf = 0.5, λx = 0.25, 1.0; Taylor rule with ρr = 0.7, υpi = 1.5 and υy = 0.3, υdy = 0; Monetary Policy Shock.
For our model of BR with AU, Figure 1 plots the impulse response functions (IRFs) with standard parameters for the rule for a shock to monetary policy under fast and slow learning. Not surprisingly, fast learning sees an IRF converge faster to the RE case, but in either case BR introduces
Endogenous Proportions of Rational and Non-rational Agents: Reinforcement Learning
Up to now we assume that the proportions of rational and non-rational agents
where
where
which is pinned down by the
A complete treatment of the model would require a departure from the linear Kalman filter solution for the II case for which we exploit the closed-form saddlepath solution that Pearlman et al. (1986) show both exists and is unique. We have also exploited the convenience of linear Bayesian estimation. In what follows we confine ourselves to the RE PI case and use the linear estimates obtained up to now.
Agents with reinforcement learning that now have proportions of rational households (
Third-order Solution of the Estimated NK RE(PI)-BR Model;
The main results from these simulations are as follows. First, reinforcement learning introduces
Perfect Versus Imperfect Information
The seminal paper on the general solution of linear RE models assuming
Perfect information means that at time
Angeletos and Lian (2016) provide an important survey paper on what they refer to as incomplete information literature. Here a comment on terminology is called for. Our use of perfect/imperfect information corresponds to the standard use in dynamic game theory when describing the information of the history of play driven by draws by nature from the distributions of exogenous shocks. Complete/incomplete information refers to agent’s beliefs regarding each other’s payoffs and information sets. In our set-up, the latter informational friction is absent.
Minford and Peel (1983) were the first to show the importance of information sets for the IRFs and second moments of RE models. Pearlman et al. (1986) generalized this for the general linear model. Pearlman (1992) extended this to optimal policy for fully optimal and time-consistent rules.
Why II? Some Empirical Motivation
Real Effects of Monetary Policy
II with the diverse information pricing model predicts
In log-linear form, let
where
Empirical Results
The Wilderness of Non-rationality
This section demonstrates the need for robust policy design using a special case of the four models for which in a balanced growth deterministic steady state both net inflation and growth is zero. Then about such a steady state, the linearized models take the form:
where
To formulate possible heuristic rules that encompass those in these papers, we draw upon the general form of adaptive expectations from Anufriev et al. (2015) discussed in the ‘Behavioural Macro models’ section that takes the log-linear general form
This encompasses simple adaptive expectations (
plus (54) as before where E
In Figures and 2 and 3, parameter values are set at their priors used later in the estimation. The demand shock follows an AR(1) process with persistence
Does Imperfect Information Improve Data Fit?
We estimate five NK models with different assumptions regarding expectations and information summarized in Table 2. For the RE agents in either the ‘pure’ or composite RE–BR model, we compare the PI or II assumptions.
For each of these five models, Bayesian methods are employed to separately estimate the model parameters using Dynare adapted to handle II.
7
The sample period is 1984:1–2008:2, a subset of that used in Smets and Wouters (2007), which is also used extensively in the empirical and RBC literature. These observable variables are the log differences of the real GDP (
We first focus on Pure RE, Pure BR(AU) and Comp RE(PI)–BR(AU) when RE agents have a PI set. We employ the Bayes factor (BF) from the model marginal likelihoods to gauge the relative merits across the three models in Table 3.
Impulse Responses Comparison Between Four Log-linearized Models to a Demand Shock.

Summary of Estimated Models.
Log-likelihood Values and Posterior Model Odds: RE Agents with PI.
The BR models—Pure BR(AU) and Comp RE(PI)-BR(AU)—all substantially outperform, their RE counterpart, which is firmly rejected by the data. Formally, using the Bayesian statistical language of Kass and Raftery (1995), a BF, the quotient of the probabilities reported, greater than 100 (marginal log-likelihood difference over 4.61), offers ‘decisive evidence’. Thus, we have decisive support for the pure BR and some composite behaviour from the US data we observe. The BF difference between the non-RE models is also strong.
Next we assume a II set for the RE agents:
Log-likelihood Values and Posterior Model Odds: RE Agents with II.
A very different picture now emerges when comparing the RE model with the behavioural alternatives. Two results are worth noting. First, RE with imperfect information (Pure RE(II)) wins the likelihood race against both Pure BR(AU) and Pure RE(PI). Again, in formal Bayesian language, the RE(II) model decisively dominates the pure BR-AU learning model and, not surprisingly, decisively dominates RE(PI), a finding that is consistent with that in Levine et al. (2012a). The second interesting result is that, when the composite heterogenous expectations model is estimated assuming the same II information set for everyone (Comp RE(II)–BR(AU)), it generates the highest log-likelihood value and outperforms all the competing models in fitting the data.
These results suggest that persistence can be injected into the NK model to improve data fit in two contrasting ways: bounded rationality with learning through heuristic rules or retaining RE but with II and Kalman-filtering learning.
Concluding Remarks
Our results for the workhorse NK model suggest a new perspective for the macro/NK/learning literature. Avenues for future work could embed the RE–BR composite model into a richer NK model along the lines of Smets and Wouters (2007), extend the linear Kalman filter to accommodate the non-linearity in reinforcement learning and use non-linear estimation methods to identify a number of parameters that cannot be identified using linear Bayesian estimation. The latter two non-linear extensions are major challenges. Future work could also examine optimal monetary policy and follow Geweke and Amisano (2012) and Deák et al. (2023) to address what has been called the ‘wilderness of non-RE’ to design a robust rule across all the BR model variants discussed in the article.
Footnotes
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.
Funding
The authors received no financial support for the research, authorship and/or publication of this article.
