Testing axioms of revealed preference in Stata

Abstract

The revealed preference approach in economics is central to the empirical analysis of consumer behavior. In this article, we introduce the commands checkax, aei, and powerps as a bundle within the package rpaxioms. The first command allows a user to test whether consumer expenditure data satisfy several revealed preference axioms; the second command calculates measures of goodness of fit when the data violate these axioms; and the third command calculates power against uniformly random behavior as well as predictive success for each axiom. We illustrate the commands using individual-level experimental data and household-level aggregate consumption data.

Keywords

st0673, rpaxioms, checkax, aei, powerps, revealed preference, generalized axiom of revealed preference, Afriat efficiency index, power, predictive success

1 Introduction

The econometrics of consumer demand is central to economic analysis—it involves testing economic theories, making out-of-sample predictions, and drawing welfare comparisons across different environments and policy regimes. Thus, the research program on empirical consumer demand has held a central position within the economics literature for many decades (see, for example, McFadden [1974] and Deaton and Muellbauer [1980]). Much of the emphasis within this literature has been on consistently estimating consumer preferences, the inherently unobservable primitive from which tests, predictions, and welfare statements can be derived.

The analogue to this research program within the context of finite data is known as revealed preference. The revealed preference approach involves checking whether a finite set of price and demand observations made on an individual consumer is compatible with economic rationality, that is, rationalizable by some form of utility maximization. Revealed preference is fully nonparametric, in the sense that it does not impose any auxiliary functional form or distributional assumptions, only the basic primitives of utility maximization. Famously, Afriat’s (1967) theorem states that a dataset is rationalizable by the maximization of a well-behaved utility function if and only if it obeys an intuitive no-cycling condition on the data. This property is commonly referred to as the generalized axiom of revealed preference (garp), and there are efficient algorithms for checking this axiom empirically (Varian 1982). Other notions (including special cases) of rationalizability can also be characterized in terms of observables, and the approach has given rise to a suite of revealed preference tests in the tradition of Afriat (1967) that can be used in applied empirical work.

In general, revealed preference tests are “sharp”, in the sense that they deliver a binary response as to whether observed expenditure data are compatible with an underlying behavioral model. However, given sufficiently rich data, an outright failure of even fairly permissive notions of rationalizability should not come as unexpected, and it may well be that the data are in fact very close to rationalizability. As Varian (1990) notes, “for most purposes, ‘nearly optimizing behavior’ is just as good as ‘optimizing’ behavior.” Afriat (1973) proposes to test for nearly optimizing behavior by allowing a part of the consumer’s expenditure to be “wasted”. The fraction of expenditure that is not being wasted by the consumer is usually referred to as the efficiency level of the test. Varian’s (1982) original formulation of garp implicitly assumes an efficiency level of 1; that is, the consumer is not allowed to waste any part of his or her expenditure.

Varian (1982) introduces a simple combinatorial algorithm to test whether consumer expenditure data obey garp. This algorithm can be easily adapted to test garp at any efficiency level. Our first command, checkax, implements Varian’s algorithm to test whether a dataset satisfies garp at any efficiency level specified by the user. The command also allows a user to test whether the data obey the following revealed preference axioms at any efficiency level: the strong axiom of revealed preference (sarp), the weak generalized axiom of revealed preference (wgarp), the weak axiom of revealed preference (warp), the symmetric generalized axiom of revealed preference (sgarp), the homothetic axiom of revealed preference (harp), and cyclical monotonicity (cm). These axioms and their behavioral implications are described in detail in section 2.5.

Afriat (1973) proposes an upper limit on the efficiency level at which a dataset satisfies garp, or the critical cost efficiency, as a measure of approximate rationalizability. Hence, this index, called the Afriat efficiency index (aei), also known as the critical cost efficiency index (ccei), measures the severity of violations as the minimal expenditure adjustment that is required for the data to comply with garp. Thus, Varian (1990) interprets (and extends) this measure as a “goodness-of-fit” criterion. The approach can also be applied to other axioms, and our second command, aei, implements the aei for each of the following seven axioms: garp, sarp, wgarp, warp, sgarp, harp, and cm. The aei is discussed in more detail in section 2.3.

In addition to goodness-of-fit, the outcome of a revealed preference test in many empirical applications is often reported alongside some measure of power. The power of a revealed preference test, say, for garp, is defined as the probability of rejecting garp, given that the data were generated by some type of “irrational” consumer behavior. Bronars (1987) proposes a power index where the irrational behavior is based on Becker’s (1962) uniformly random consumption model. Thus, for this widely used power index, the choices generated by an irrational consumer are uniformly distributed along the budgetary boundaries. Our third command, powerps, implements the Bronars power index for any of the axioms above at any efficiency level. This command also reports a measure of “predictive success” originally introduced by Selten (1991) and adapted to the revealed preference framework by Beatty and Crawford (2011). This measure is motivated by the idea that if the data satisfy a given revealed preference axiom, then any robust conclusion on rationalizability should, at a minimum, require the test to have high power against uniformly random behavior. Thus, the predictive success measure combines the pass rate of the revealed preference test with Bronars power index. Power and predictive success are further discussed in section 2.4.

In summary, for applied practitioners, it is important that revealed preference methods are easily implementable and reproducible. To this end, we present the package, rpaxioms. We illustrate our three commands—checkax, aei, and powerps—on two types of data that are commonly used in empirical applications of revealed preference. First, using experimental data collected by Andreoni and Miller (2002), we test whether the social allocations selected by individual subjects are compatible with utility maximization taking several different forms. Second, using aggregate household consumption data on four food categories from Poi (2002), we test whether these data can be rationalized by preferences that are common across all households.¹

2 Revealed preference

Suppose that there are T observations of the prices and quantities of K goods. At each observation t = 1, …, T, the price vector is denoted by $p^{t} = (p_{1}^{t}, \dots, p_{K}^{t}) ≫ 0$ and the quantity bundle by $x^{t} = (x_{1}^{t}, \dots, x_{K}^{t}) ⩾ 0$. We assume that all prices are strictly positive and that all quantities are nonnegative (note that some but not all quantities at any given observation may be equal to 0; that is, all expenditures are strictly positive). The T observations of $(p^{t}, x^{t})$ then form the finite dataset $D = \{(p^{t}, x^{t})\}_{t = 1}^{T}$.

2.1 Rationalizability

The dataset $D = \{(p^{t}, x^{t})\}_{t = 1}^{T}$ is said to be rationalizable by utility maximization if there is a utility function $U : ℝ_{+}^{K} \to ℝ$ such that, at every observation t = 1, …, T,

$U(x^{t}) ⩾ U(x) \text{ for any } x \in \{x \in ℝ_{+}^{K} : p^{t} \cdot x ⩽ p^{t} \cdot x^{t}\}$

Rationalizability means that we can find a utility function defined on the consumption space that assigns (weakly) higher utility to the quantity bundle $x^{t}$ than to any other bundle x that is affordable at the prevailing prices $p^{t}$. Without any further restrictions on U, any dataset D is rationalizable because U could simply assign the same level of utility to every bundle in the consumption space. For the question to be meaningful, we require further structure on the utility function U.

Afriat (1967) was the first to show that a finite dataset $D = \{(p^{t}, x^{t})\}_{t = 1}^{T}$ is rationalizable by a locally nonsatiated utility function U if and only if it obeys an intuitive property now known as the generalized axiom of revealed preference.² A dataset $D = \{(p^{t}, x^{t})\}_{t = 1}^{T}$ obeys garp so long as the preference cycles it reveals are weak rather than strict; that is, for any cycle represented by

$p^{t} \cdot x^{t} ⩾ p^{t} \cdot x^{u}, \quad p^{u} \cdot x^{u} ⩾ p^{u} \cdot x^{v}, \quad \dots, \quad p^{w} \cdot x^{w} ⩾ p^{w} \cdot x^{t}$

the inequalities cannot be strict. The intuition of garp as a no-cycling condition on the data ought to be strong, and it should also come as no surprise that garp is necessary for (or implied by) the maximization of a locally nonsatiated utility function. Afriat’s (1967) theorem shows that garp is also sufficient for rationalizability by a locally nonsatiated utility function U.³ The importance of the result is that garp completely characterizes the content of utility maximization in terms of observables, and it can therefore be used as an empirical test for rationalizability.

2.2 Approximate rationalizability

In a sufficiently rich empirical setting, it is unlikely that any dataset is exactly rationalizable, and so we require some notion of its distance to or departure from exact rationalizability. Loosely speaking, one could think of this as allowing for “error”, which has long been the convention in the econometrics of consumer demand (see, for example, McFadden [1974] and Deaton and Muellbauer [1980]). To this end, the convention in the revealed preference literature is to accommodate error through cost inefficiency, an idea first developed by Afriat (1972, 1973).

According to Afriat’s (1973) formulation, the consumer “has a definite structure of wants” and “programs at a level of cost-efficiency e”, which is tantamount to relaxing the definition of rationalizability. Consider any efficiency level e ∊ (0, 1]. The dataset $D = \{(p^{t}, x^{t})\}_{t = 1}^{T}$ is said to be rationalizable at efficiency level e if there is a utility function $U : ℝ_{+}^{K} \to ℝ$ such that, at every observation t = 1, …, T,

$U(x^{t}) ⩾ U(x) \text{ for any } x \in \{x \in ℝ_{+}^{K} : p^{t} \cdot x ⩽ e p^{t} \cdot x^{t}\}$

When e = 1, this definition corresponds to exact rationalizability and, for any e < 1, to approximate rationalizability. Afriat (1973) shows that a dataset $D = \{(p^{t}, x^{t})\}_{t = 1}^{T}$ is rationalizable at efficiency level e by a locally nonsatiated utility function U if and only if, for any cycle represented by

$e p^{t} \cdot x^{t} ⩾ p^{t} \cdot x^{u}, \quad e p^{u} \cdot x^{u} ⩾ p^{u} \cdot x^{v}, \quad \dots, \quad e p^{w} \cdot x^{w} ⩾ p^{w} \cdot x^{t}$

the inequalities cannot be strict. The equivalent condition known as e garp is necessary and sufficient for approximate rationalizability by a locally nonsatiated utility function U.⁴ When e = 1, e garp and garp coincide.

The operationalization of e garp as an empirical test is straightforward. Consider any efficiency level e ∊ (0, 1]. For any pair of observations (t, s), we say that $x^{t}$ is directly revealed preferred to $x^{s}$ at efficiency level e, written $x^{t} R_{e}^{D} x^{s}$, if $e p^{t} \cdot x^{t} ⩾ p^{t} \cdot x^{s}$. This means that $x^{t}$ is chosen even though the cost of the bundle $x^{s}$ (at the prevailing prices $p^{t}$) does not exceed $e p^{t} \cdot x^{t}$. Analogously, we say that $x^{t}$ is strictly directly revealed preferred to $x^{s}$ at efficiency level e, written $x^{t} P_{e}^{D} x^{s}$, if $e p^{t} \cdot x^{t} > p^{t} \cdot x^{s}$. We say that $x^{t}$ is revealed preferred to $x^{s}$ at efficiency level e, written $x^{t} R_{e} x^{s}$, if there exists a sequence of observations (t, u, v, …, w, s) such that $x^{t} R_{e}^{D} x^{u}, x^{u} R_{e}^{D} x^{v}, \dots, x^{w} R_{e}^{D} x^{s}$. Hence, $R_{e}$ is the transitive closure of $R_{e}^{D}$. When e = 1, these relations reduce to the usual revealed preference relations (Varian 1982).

A dataset $D = \{(p^{t}, x^{t})\}_{t = 1}^{T}$ satisfies eGARP if $x^{t} R_{e} x^{s}$ implies $e p^{s} \cdot x^{s} ⩽ p^{s} \cdot x^{t}$.

The e garp condition can be tested at any efficiency level e by slightly modifying the algorithm proposed by Varian (1982). First, the relations $R_{e}^{D}$ and $P_{e}^{D}$ are represented by T × T matrices, also denoted $R_{e}^{D}$ and $P_{e}^{D}$: the (t, s)th element of $R_{e}^{D}$ is equal to 1 if $e p^{t} \cdot x^{t} ⩾ p^{t} \cdot x^{s}$ and 0 otherwise, and the (t, s)th element of $P_{e}^{D}$ is equal to 1 if $e p^{t} \cdot x^{t} > p^{t} \cdot x^{s}$ and 0 otherwise. Second, the relation $R_{e}$ is formed by calculating the transitive closure of the matrix $R_{e}^{D}$, which gives a T × T matrix $R_{e}$ whose (t, s)th element is equal to 1 if $x^{t} R_{e} x^{s}$ and 0 otherwise. Varian (1982) suggests calculating $R_{e}$ using Warshall’s (1962) algorithm. Finally, e garp is violated whenever the (t, s)th element of $R_{e}$ and the (s, t)th element of $P_{e}^{D}$ are both equal to 1. The total number of violations is the number of pairs (t, s), with t ≠ s, for which this occurs. Therefore, with T observations, the total possible number of e garp violations is T(T − 1), and the fraction of violations is the ratio of the number of violations to T(T − 1).

Our first command, checkax, constructs $R_{e}^{D}$, $P_{e}^{D}$, and $R_{e}$ at any efficiency level e specified by the user. The last is constructed using a vectorized version of Warshall’s algorithm. The command checkax then reports to the user whether the data satisfy e garp, as well as the number and fraction of violations.⁵
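The three-step test described in this section can be sketched in a few lines. The Python code below is only an illustration and is not the package's implementation; the function name `egarp_check` and the list-of-rows layout of the price and quantity matrices are assumptions made for this example.

```python
from typing import List, Tuple

Matrix = List[List[float]]


def egarp_check(prices: Matrix, quantities: Matrix, e: float = 1.0) -> Tuple[bool, int, float]:
    """Return (pass, number of violations, fraction of violations) for e-GARP."""
    T = len(prices)
    # cost[t][s] = p^t . x^s, the cost of bundle s at the prices of observation t
    cost = [[sum(p * x for p, x in zip(prices[t], quantities[s]))
             for s in range(T)] for t in range(T)]

    # Step 1: direct (RD) and strict direct (PD) relations at efficiency level e
    RD = [[e * cost[t][t] >= cost[t][s] for s in range(T)] for t in range(T)]
    PD = [[e * cost[t][t] > cost[t][s] for s in range(T)] for t in range(T)]

    # Step 2: transitive closure of RD using Warshall's algorithm
    R = [row[:] for row in RD]
    for k in range(T):
        for t in range(T):
            if R[t][k]:
                for s in range(T):
                    if R[k][s]:
                        R[t][s] = True

    # Step 3: a violation is a pair t != s with x^t R_e x^s and x^s P_e^D x^t
    n_vio = sum(1 for t in range(T) for s in range(T)
                if t != s and R[t][s] and PD[s][t])
    frac = n_vio / (T * (T - 1)) if T > 1 else 0.0
    return n_vio == 0, n_vio, frac


# Hypothetical two-observation dataset: at each observation's prices, the
# other bundle is strictly cheaper than the chosen one, so the choices cross.
P = [[1.0, 2.0], [2.0, 1.0]]
X = [[1.0, 2.0], [2.0, 1.0]]
print(egarp_check(P, X, 1.0))    # exact test
print(egarp_check(P, X, 0.75))   # relaxed test at e = 0.75
```

The plain triple loop is chosen for readability; a vectorized closure, as mentioned above, would do the same work with matrix operations.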

2.3 The AEI

It is clear that any dataset is approximately rationalizable at some sufficiently small efficiency level e ∊ (0, 1]. Afriat (1973) defines the ccei or the aei as the maximal value of e (the supremum, to be precise) such that a dataset obeys e garp.⁶ Varian (1990) interprets the aei as a measure of “goodness of fit” in terms of wasted expenditure: if a consumer has an aei of $e^{*} < 1$, then he or she could have obtained the same level of utility by spending only the fraction $e^{*}$ of what he or she actually spent. This is the sense in which the consumer is exhibiting cost inefficiency, and in many applications, the aei is interpreted as a measure of “rationality” or “decision-making quality” (see, for example, Choi et al. [2014]).⁷

Our second command, aei, calculates the aei by implementing the binary search algorithm described in Varian (1990).
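A binary search of this kind can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the code used by aei: the function names, the starting interval (0, 1], and the choice to return the largest level known to pass are all assumptions made here. The search relies on the fact that a dataset satisfying e garp at some level also satisfies it at every lower level.

```python
from typing import List

Matrix = List[List[float]]


def satisfies_egarp(prices: Matrix, quantities: Matrix, e: float) -> bool:
    """True if the data obey e-GARP at efficiency level e."""
    T = len(prices)
    cost = [[sum(p * x for p, x in zip(prices[t], quantities[s]))
             for s in range(T)] for t in range(T)]
    # Direct relation, then its transitive closure (Warshall)
    R = [[e * cost[t][t] >= cost[t][s] for s in range(T)] for t in range(T)]
    for k in range(T):
        for t in range(T):
            if R[t][k]:
                for s in range(T):
                    if R[k][s]:
                        R[t][s] = True
    # Violation: x^t R_e x^s together with e p^s.x^s > p^s.x^t
    return not any(R[t][s] and e * cost[s][s] > cost[s][t]
                   for t in range(T) for s in range(T) if t != s)


def afriat_efficiency_index(prices: Matrix, quantities: Matrix,
                            tol: float = 1e-6) -> float:
    """Bisection on e; returns a level within tol of the supremum."""
    if satisfies_egarp(prices, quantities, 1.0):
        return 1.0
    lo, hi = 0.0, 1.0            # assumed bracket: passes near 0, fails at 1
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if satisfies_egarp(prices, quantities, mid):
            lo = mid             # mid passes: the index is at least mid
        else:
            hi = mid             # mid fails: the index is below mid
    return lo


# Hypothetical data: own expenditure is 5 and the other bundle costs 4,
# so the strict cycle disappears once e falls to 4/5.
P = [[1.0, 2.0], [2.0, 1.0]]
X = [[1.0, 2.0], [2.0, 1.0]]
print(afriat_efficiency_index(P, X))
```

In this example the supremum is 0.8, and the function returns a value within the tolerance of it.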

2.4 Power and predictive success

Within applications of revealed preference, alongside goodness of fit, it is important to know something about the power of the test or empirical environment. To this end, the convention in the applied revealed preference literature is to test against an alternative behavioral model, which is typically random choice and interpreted as “naive” or “irrational”. The notion of “irrationality” that underpins the Bronars (1987) power index is based on a model of uniformly random consumption, in which all feasible consumption allocations (that is, frontier bundles) are equally likely to be chosen.

Bronars (1987) suggests implementing the index using Monte Carlo methods. The first step consists of generating artificial budget shares that are consistent with uniformly random consumption. At each observation, this involves generating K random variables drawn from the Dirichlet distribution with all parameters (characterizing this distribution) set equal to 1. By construction, at each observation, these random variables are uniformly distributed on the (K − 1)-dimensional unit simplex and, consequently, can be interpreted as budget shares in the uniformly random model. The second step solves for each uniformly random consumption quantity (denoted by $q_{k}^{t}$) from the budget share equation given by $w_{k}^{t} = p_{k}^{t} q_{k}^{t} / (p^{t} \cdot x^{t})$, where each $w_{k}^{t}$ denotes an artificial budget share generated in the first step. (Notice that $p^{t}$ and $x^{t}$ are given in the original dataset.) Thus, the first two steps generate a synthetic dataset across K goods and T observations that is compatible with uniformly random behavior. The third step repeats the first two steps many times and, for each repetition, checks whether the synthetic dataset of prices and uniformly random quantities satisfies e garp at a given efficiency level e. The power measure is the fraction of these synthetic datasets that violates e garp.
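The three Monte Carlo steps can be sketched as below. This Python code is an illustrative assumption-laden sketch rather than the package's routine: the function names are hypothetical, the Dirichlet(1, …, 1) shares are generated by normalizing independent unit-exponential draws, and the random stream is Python's, so results will not match those of any Stata implementation even with the same seed value.

```python
import random
from typing import List

Matrix = List[List[float]]


def satisfies_egarp(prices: Matrix, quantities: Matrix, e: float) -> bool:
    """True if the data obey e-GARP at efficiency level e."""
    T = len(prices)
    cost = [[sum(p * x for p, x in zip(prices[t], quantities[s]))
             for s in range(T)] for t in range(T)]
    R = [[e * cost[t][t] >= cost[t][s] for s in range(T)] for t in range(T)]
    for k in range(T):                      # Warshall transitive closure
        for t in range(T):
            if R[t][k]:
                for s in range(T):
                    if R[k][s]:
                        R[t][s] = True
    return not any(R[t][s] and e * cost[s][s] > cost[s][t]
                   for t in range(T) for s in range(T) if t != s)


def bronars_power(prices: Matrix, quantities: Matrix, e: float = 1.0,
                  simulations: int = 1000, seed: int = 12345) -> float:
    """Fraction of uniformly random synthetic datasets that violate e-GARP."""
    rng = random.Random(seed)               # fixed seed for replicability
    T, K = len(prices), len(prices[0])
    # Observed expenditure p^t . x^t at each observation
    spend = [sum(p * x for p, x in zip(prices[t], quantities[t]))
             for t in range(T)]
    rejections = 0
    for _ in range(simulations):
        synthetic = []
        for t in range(T):
            # Step 1: budget shares uniform on the simplex
            draws = [rng.expovariate(1.0) for _ in range(K)]
            total = sum(draws)
            shares = [d / total for d in draws]
            # Step 2: q_k = w_k * (p^t . x^t) / p_k
            synthetic.append([shares[k] * spend[t] / prices[t][k]
                              for k in range(K)])
        # Step 3: test the synthetic dataset and record a rejection
        if not satisfies_egarp(prices, synthetic, e):
            rejections += 1
    return rejections / simulations


# Hypothetical data with crossing budget lines
P = [[1.0, 2.0], [2.0, 1.0]]
X = [[1.0, 2.0], [2.0, 1.0]]
print(bronars_power(P, X, e=1.0, simulations=500))
```

When all observations share the same price vector, the budget sets coincide and no synthetic dataset can violate the axiom, so the sketch returns a power of zero; this is a useful sanity check of the construction.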

Our third command, powerps, calculates power at any efficiency level e and for any number of repetitions specified by the user. Moreover, the command allows the user to set the random seed in the generation of the Dirichlet random variables (in the first step of the procedure) to ensure that power calculations are perfectly replicable. The command powerps also reports Beatty and Crawford’s (2011) revealed preference measure of “predictive success”. For a given dataset, this measure is defined as the difference between the pass or fail indicator and one minus the Bronars power index, where the pass or fail indicator takes the value 1 if the original data obey e garp at a given efficiency level e and 0 otherwise, and where the power index corresponding to e garp is calculated at the same efficiency level e. This measure of predictive success can then be straightforwardly aggregated across individual datasets.
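The definition of predictive success above can be written compactly. The symbols $r_{e}$ (pass or fail indicator), $\beta_{e}$ (Bronars power), and $\mathrm{PS}_{e}$ are notation introduced here for illustration only:

```latex
\mathrm{PS}_{e} \;=\; r_{e} - (1 - \beta_{e}) \;=\; r_{e} + \beta_{e} - 1,
\qquad r_{e} \in \{0, 1\}, \quad \beta_{e} \in [0, 1],
\qquad \text{so that} \quad \mathrm{PS}_{e} \in [-1, 1].
```

As a hypothetical example, a dataset that passes the test when power is 0.9 has predictive success 1 − (1 − 0.9) = 0.9, whereas a dataset that fails the same test has predictive success 0 − (1 − 0.9) = −0.1.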

2.5 Other axioms

Our commands are also implementable for other revealed preference axioms that characterize some of the common forms of utility or preference maximization. The default axiom in every command is e garp (with e = 1), but each command can also be executed for six other revealed preference axioms at any efficiency level e specified by the user.

sarp. A dataset $D = \{(p^{t}, x^{t})\}_{t = 1}^{T}$ satisfies the strong axiom of revealed preference at efficiency level e, abbreviated e sarp, if $x^{t} R_{e} x^{s}$ implies $e p^{s} \cdot x^{s} < p^{s} \cdot x^{t}$ whenever $x^{t} ≠ x^{s}$. Matzkin and Richter (1991) show that sarp (e = 1) is necessary and sufficient for rationalizability by a continuous, strictly increasing, and strictly concave utility function. Notice that the difference between garp and sarp is that garp allows for “flat spots” of indifference (demand correspondences versus demand functions). Like e garp, there can be up to T(T − 1) violations of e sarp.

wgarp. A dataset $D = \{(p^{t}, x^{t})\}_{t = 1}^{T}$ satisfies the weak generalized axiom of revealed preference at efficiency level e, abbreviated e wgarp, if $x^{t} R_{e}^{D} x^{s}$ implies $e p^{s} \cdot x^{s} ⩽ p^{s} \cdot x^{t}$. Aguiar, Hjertstrand, and Serrano (2020) show that wgarp (e = 1) is necessary and sufficient for rationalizability by a continuous, strictly increasing, piecewise concave, and skew-symmetric preference function (see Aguiar, Hjertstrand, and Serrano [2020] for the definition of a preference function and its properties). Banerjee and Murphy (2006) show that wgarp and garp are equivalent when K = 2 (when there are two goods). The total possible number of violations of e wgarp is T(T − 1)/2.

warp. A dataset $D = \{(p^{t}, x^{t})\}_{t = 1}^{T}$ satisfies the weak axiom of revealed preference at efficiency level e, abbreviated e warp, if $x^{t} R_{e}^{D} x^{s}$ implies $e p^{s} \cdot x^{s} < p^{s} \cdot x^{t}$ whenever $x^{t} ≠ x^{s}$. Aguiar, Hjertstrand, and Serrano (2020) show that warp (e = 1) is necessary and sufficient for rationalizability by a continuous, strictly increasing, piecewise strictly concave, and skew-symmetric preference function. The difference between warp and wgarp is analogous to the difference between sarp and garp. Furthermore, Rose (1958) shows that warp and sarp are equivalent when K = 2. Like e wgarp, there can be up to T(T − 1)/2 violations of e warp.

sgarp. For any (t, s), we can modify the definition of $R_{e}^{D}$ so that $x^{t} R_{e}^{D} x^{s}$ if $e p^{t} \cdot x^{t} ⩾ p^{t} \cdot y^{s}$, where $y^{s}$ is any permutation of $x^{s}$ and where the transitive closure $R_{e}$ of $R_{e}^{D}$ follows accordingly.⁸ A dataset $D = \{(p^{t}, x^{t})\}_{t = 1}^{T}$ satisfies the symmetric generalized axiom of revealed preference at efficiency level e, abbreviated e sgarp, if $x^{t} R_{e} x^{s}$ implies $e p^{s} \cdot x^{s} ⩽ p^{s} \cdot y^{t}$ (where $y^{t}$ is any permutation of $x^{t}$). Nishimura, Ok, and Quah (2017) show that e sgarp is necessary and sufficient for rationalizability by a continuous, strictly increasing, concave, and symmetric utility function. Polisson, Quah, and Renou (2020) implement e sgarp in the context of symmetric risk; that is, the utility function must also obey first-order stochastic dominance. The total possible number of violations of e sgarp is T².

harp. A dataset $D = \{(p^{t}, x^{t})\}_{t = 1}^{T}$ satisfies the homothetic axiom of revealed preference at efficiency level e, abbreviated e harp, if for all distinct sequences of observations (s, t, u, …, v), it must be the case that $(p^{t} \cdot x^{s})(p^{s} \cdot x^{u}) \cdots (p^{v} \cdot x^{t}) ⩾ (e p^{t} \cdot x^{t})(e p^{s} \cdot x^{s}) \cdots (e p^{v} \cdot x^{v})$. Varian (1983) shows that harp (e = 1) is necessary and sufficient for rationalizability by a continuous, strictly increasing, concave, and homothetic utility function. Heufer and Hjertstrand (2019) provide a more general characterization (e < 1) and refer to $e^{*}$ in this case as the homothetic efficiency index. The command checkax implements e harp as described in Varian (1983) using the Floyd–Warshall algorithm. The total possible number of violations of e harp is T.

cm. A dataset $D = \{(p^{t}, x^{t})\}_{t = 1}^{T}$ satisfies a cyclical monotonicity condition at efficiency level e, abbreviated e cm, if for all distinct sequences of observations (s, t, u, …, v), it must be the case that $p^{t} \cdot (x^{s} − e x^{t}) + p^{s} \cdot (x^{u} − e x^{s}) + \cdots + p^{v} \cdot (x^{t} − e x^{v}) ⩾ 0$. Brown and Calsamiglia (2007) show that cm (e = 1) is necessary and sufficient for rationalizability by a continuous, strictly increasing, concave, and quasilinear utility function. The command checkax implements e cm similarly to e harp using the Floyd–Warshall algorithm. Like e harp, there can be up to T violations of e cm.
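The cycle conditions for e harp and e cm can both be read as "no negative cycle" conditions on a weighted graph over observations, which is what makes a Floyd–Warshall pass applicable. The Python sketch below illustrates that idea only; it is an assumption about one way to organize the computation, not a description of how checkax counts violations. The function names and the small numerical tolerance `tol` are hypothetical choices made for this example.

```python
import math
from typing import List

Matrix = List[List[float]]


def _cost(prices: Matrix, quantities: Matrix) -> Matrix:
    """cost[t][s] = p^t . x^s"""
    T = len(prices)
    return [[sum(p * x for p, x in zip(prices[t], quantities[s]))
             for s in range(T)] for t in range(T)]


def _has_negative_cycle(w: Matrix, tol: float) -> bool:
    """Floyd-Warshall shortest paths; a negative diagonal flags a negative cycle."""
    T = len(w)
    d = [row[:] for row in w]
    for k in range(T):
        for i in range(T):
            for j in range(T):
                if d[i][k] + d[k][j] < d[i][j]:
                    d[i][j] = d[i][k] + d[k][j]
    return any(d[i][i] < -tol for i in range(T))


def satisfies_ecm(prices: Matrix, quantities: Matrix,
                  e: float = 1.0, tol: float = 1e-12) -> bool:
    """e-CM: every cycle sum of p^t . (x^s - e x^t) is nonnegative."""
    c = _cost(prices, quantities)
    T = len(c)
    w = [[c[t][s] - e * c[t][t] for s in range(T)] for t in range(T)]
    return not _has_negative_cycle(w, tol)


def satisfies_eharp(prices: Matrix, quantities: Matrix,
                    e: float = 1.0, tol: float = 1e-12) -> bool:
    """e-HARP: taking logs turns the product condition into a cycle sum."""
    c = _cost(prices, quantities)
    T = len(c)
    w = [[math.log(c[t][s]) - math.log(e * c[t][t]) for s in range(T)]
         for t in range(T)]
    return not _has_negative_cycle(w, tol)


# Hypothetical data with crossing choices: both conditions fail at e = 1
P = [[1.0, 2.0], [2.0, 1.0]]
X = [[1.0, 2.0], [2.0, 1.0]]
print(satisfies_ecm(P, X), satisfies_eharp(P, X))
print(satisfies_ecm(P, X, 0.75), satisfies_eharp(P, X, 0.75))
```

The logarithms are well defined because prices are strictly positive and every bundle has at least one positive quantity, so every cost $p^{t} \cdot x^{s}$ is strictly positive.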

We conclude this section with two comments.

First, notice that in general a dataset is approximately rationalizable if it could have arisen from the maximization of some utility or preference function subject to a modified budget set. Explicit theoretical support for these notions of rationalizability has been developed in the case of e garp, e sgarp, and e harp but not for the other axioms.

Second, we note that smoothness or differentiability has no material empirical content once cost inefficiency has been taken into account. For example, Chiappori and Rochet (1987) show that strong sarp (ssarp) is necessary and sufficient for rationalizability by an infinitely differentiable, strictly increasing, and strictly concave utility function. Suppose that a dataset obeys sarp but fails ssarp, which amounts to the same consumption bundle being chosen at two or more distinct price vectors. If we set the efficiency level to 1 − ε, for some ε > 0 arbitrarily small, then we could always find a smooth rationalization. Because the aei is defined as a supremum, the aei for ssarp would still be equal to 1. In other words, smoothness or differentiability is “untestable” in a meaningful way. See also the discussion in Polisson, Quah, and Renou (2020).

3 The checkax, aei, and powerps commands

All three commands take as their two main (required) arguments the T × K price and quantity matrices:

price( string ) specifies a T ×K price matrix, where each row corresponds to an observation t and each column to a good k. All prices are required to be strictly positive. If any of the elements in the price matrix are nonpositive (or if the price and quantity matrices have different dimensions), the commands return an error message.

quantity( string ) specifies a T × K quantity matrix, where each row corresponds to an observation t and each column to a good k. All quantities are required to be nonnegative. Some (but not all) quantities at a given observation may be equal to 0. If the quantity matrix violates these conditions (or if the price and quantity matrices have different dimensions), the commands return an error message.

3.1 The checkax command

The syntax of checkax is

checkax, price( string ) quantity( string ) [axiom( string ) efficiency( # )]

3.1.1 Options for checkax

axiom( string ) specifies the axiom or axioms that the user would like to test. The default is axiom(eGARP). There are seven axioms that can be tested: eGARP, eSARP, eWGARP, eWARP, eSGARP, eHARP, and eCM. The user may also test all axioms simultaneously by specifying axiom(all).

efficiency( # ) specifies the efficiency level at which the user would like to test each axiom. The default is efficiency(1). The efficiency level must be strictly positive and no greater than 1.

3.1.2 Stored results for checkax

checkax stores the following in r():

3.1.3 Examples of checkax

The following examples illustrate the command checkax using a dataset of 20 observations on the prices and quantities of 5 goods. The prices of goods 1 to 5 are p1, p2, p3, p4, and p5. The quantities are x1, x2, x3, x4, and x5. The price and quantity matrices are P and X, respectively. The first example runs checkax using its default options, that is, for e garp at the efficiency level e = 1. The second example runs checkax for e garp and e harp at the efficiency level e = 0.95.

3.2 The aei command

The syntax of aei is

aei, price( string ) quantity( string ) [axiom( string ) tolerance( # )]

3.2.1 Options for aei

tolerance( # ) sets the tolerance level of the termination criterion $10^{-n}$ by specifying the integer n. For example, tolerance(10) sets the tolerance level to $10^{-10}$. The default is tolerance(6), which gives the default tolerance level $10^{-6}$. The integer n in the termination criterion $10^{-n}$ cannot be less than 1 or greater than 18.
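To give a sense of what the tolerance implies for a binary search, suppose (as an assumption about the search, not a documented property of the command) that the search starts from an interval of length 1 and halves it at every step. The number of steps m needed to bring the interval length below $10^{-n}$ is then

```latex
2^{-m} \le 10^{-n}
\;\Longleftrightarrow\;
m \ge n \log_{2} 10 \approx 3.32\, n ,
\qquad\text{so}\qquad
m = 20 \ (n = 6), \quad m = 34 \ (n = 10), \quad m = 60 \ (n = 18).
```

Under this assumption, each additional digit of precision costs roughly three to four additional evaluations of the axiom.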

3.2.2 Stored results for aei

aei stores the following in r():

3.2.3 Examples for aei

The following examples illustrate the command aei using the same data as above. The first example runs aei using its default options, that is, for e garp with a tolerance level of $10^{-6}$. The second example runs aei for e garp and e harp with the tolerance level set to $10^{-10}$ and shows that the command quietly can be used to suppress the output.

3.3 The powerps command

The syntax of powerps is

powerps, price( string ) quantity( string ) [axiom( string ) efficiency( # ) simulations( # ) seed( # ) aei tolerance( # ) progressbar]

3.3.1 Options for powerps

efficiency( # ) specifies the efficiency level at which the user would like to test each axiom. The default is efficiency(1). The efficiency level must be strictly positive and no greater than 1.

simulations( # ) specifies the number of repetitions of the simulated uniformly random data. The default is simulations(1000).

seed( # ) specifies the random seed in the generation of the Dirichlet random numbers. The default is seed(12345).

aei specifies whether the user wants to compute the aei for each simulated dataset and specified axiom. Note that including this option may increase computation times substantially.

tolerance( # ) sets the tolerance level of the termination criterion $10^{-n}$ by specifying the integer n when computing the aei. See section 3.2 for a more detailed description. This option is useful only in combination with the aei option.

progressbar specifies if the user wants to display the number of repetitions that have been executed.

3.3.2 Stored results for powerps

powerps stores the following in r():

3.3.3 Examples for powerps

The following examples illustrate the command powerps using the same data as above. The first example runs powerps for the axioms egarp and eharp. All other options are set to their defaults. The second example also runs powerps for the axioms egarp and eharp but now includes the option aei, which calculates the aei for both axioms for each of the 1,000 simulated datasets. Note that including the aei option increases computation time substantially.

4 Empirical illustrations

This section illustrates how to implement our commands using two types of data that are common in many revealed preference applications. The first dataset contains the individual choices of experimental subjects. Such controlled environments are desirable from the perspective of empirical testing because relative prices can be randomized or calibrated across observations to engineer a sufficiently powerful test of, say, utility maximization. In our empirical illustration, we analyze the budgetary data collected in Andreoni and Miller (2002); other prominent examples of experiments involving budgetary designs include Choi et al. (2007, 2014); Andreoni and Sprenger (2012); and Halevy, Persitz, and Zrill (2018). The second dataset contains annual household food consumption within broad categories. Aggregate household-level data have long been used to estimate parametric demand systems (see, for example, Deaton and Muellbauer [1980]; Banks, Blundell, and Lewbel [1997]; and Lewbel and Pendakur [2009]); and moreover, Poi (2002) makes use of the same dataset to illustrate the estimation of parametric demand systems in Stata.

4.1 Experimental data

Andreoni and Miller (2002) test whether the social choices of experimental subjects are rational, using a dictator game in which one subject (the dictator) allocates token endowments between himself and another subject (the beneficiary) according to some rate of transfer. The payoffs of the dictator and the beneficiary are essentially two distinct goods, and the transfer rates are the price ratios. The experiment contains 2 parts, where 142 subjects (group 1) face T = 8 decision rounds, and where 34 subjects (group 2) face T = 11 rounds. In this illustration, we focus on subjects in group 1.

Andreoni and Miller (2002) find that 13 subjects in group 1 violate rationality and, for each of these 13 subjects, report the aei (for e garp) and the number of violations of e garp, e sarp, and e warp at the efficiency level e = 1 (see table II in Andreoni and Miller [2002]). Banerjee and Murphy (2006) complement this analysis and report the number of violations of e wgarp at the efficiency level e = 1 (see table 1 in Banerjee and Murphy [2006]). Using the commands checkax and aei, the following code replicates these results:

The results from the preceding code are reported in table 1.

Table 1. Replication of results in Andreoni and Miller (2002, table II) and Banerjee and Murphy (2006, table 1)∗

	Number (fraction) of violations
Subject	e garp	e wgarp	e sarp	e warp	aei (e garp)
3	2 (0.036)	1 (0.036)	4 (0.071)	1 (0.036)	1.000†
38	8 (0.143)	2 (0.071)	9 (0.161)	2 (0.071)	0.917
40	8 (0.143)	3 (0.107)	11 (0.196)	3 (0.107)	0.833
41	1 (0.018)	1 (0.036)	2 (0.036)	1 (0.036)	1.000†
47	1 (0.018)	1 (0.036)	2 (0.036)	1 (0.036)	1.000†
61	4 (0.071)	1 (0.036)	5 (0.089)	1 (0.036)	0.917
72	1 (0.018)	1 (0.036)	2 (0.036)	1 (0.036)	1.000†
87	1 (0.018)	1 (0.036)	2 (0.036)	1 (0.036)	1.000†
90	2 (0.036)	1 (0.036)	2 (0.036)	1 (0.036)	0.975
104	1 (0.018)	1 (0.036)	3 (0.054)	1 (0.036)	1.000†
126	1 (0.018)	1 (0.036)	4 (0.071)	1 (0.036)	1.000†
137	1 (0.018)	1 (0.036)	2 (0.036)	1 (0.036)	1.000†
139	1 (0.018)	1 (0.036)	2 (0.036)	1 (0.036)	1.000†

∗ The number (and fraction) of violations is reported at the efficiency level e = 1.

† The symbol (†) indicates that an ε-change in choices eliminates all garp violations.

In figure 1, we plot the fraction of the 142 subjects satisfying e garp, e sgarp, e harp, and e cm for values of e between 0.85 and 1 in an equally spaced grid with increments of 0.01. The results used to generate figure 1 are obtained by looping over all subjects, axioms, and efficiency levels in the grid and evaluating the command checkax for each subject, axiom, and efficiency level. The following line of code illustrates one such evaluation:

Figure 1. aei distributions for e garp, e sgarp, e harp, and e cm

Because subjects choose among bundles of two goods, e garp (e sarp) and e wgarp (e warp) are equivalent and must by construction deliver identical pass rates (but not identical numbers and fractions of violations). Furthermore, although e garp (e wgarp) and e sarp (e warp) can differ in theory, the empirical differences between them are negligible, implying that the distinction between demand correspondences and demand functions is not of first-order importance in these data. Because neither Andreoni and Miller (2002) nor Banerjee and Murphy (2006) reports any results for e sgarp, e harp, or e cm, we give these axioms more attention: we calculate the mean, standard deviation, minimum, first quartile (Q1), median, third quartile (Q3), and maximum of the number (and fraction) of violations and of the aeis corresponding to e sgarp, e harp, and e cm. The results are displayed in table 2.

Table 2. Summary statistics for e sgarp, e harp, and e cm∗

	Number (fraction) of violations			aei
Statistic	e sgarp	e harp	e cm	e sgarp	e harp	e cm
Mean	16.47 (0.257)	6.29 (0.789)	7.68 (0.960)	0.745	0.976	0.935
Std. dev.	16.80 (0.262)	2.90 (0.362)	1.03 (0.129)	0.288	0.049	0.035
Minimum	0 (0.000)	0 (0.000)	0 (0.000)	0.333	0.707	0.800
Q1	0 (0.000)	5 (0.625)	8 (1.000)	0.333	0.966	0.905
Median	8 (0.125)	8 (1.000)	8 (1.000)	0.875	1.000	0.957
Q3	37 (0.578)	8 (1.000)	8 (1.000)	1.000	1.000	0.957
Maximum	41 (0.641)	8 (1.000)	8 (1.000)	1.000	1.000	1.000

∗ The number (and fraction) of violations is reported at the efficiency level e = 1.

Finally, we turn to power and predictive success. By looping over all subjects, axioms, and values of e between 0.4 and 1.0, we calculate the power and predictive success for every subject, axiom, and efficiency level in the grid. The following line of code illustrates one such evaluation:

powerps, price(P) quantity(Q`subject') efficiency(0.4)

( output omitted )

We summarize the results in three different ways. First, figure 2 plots the power of e garp, e sgarp, e harp, and e cm for every efficiency level in the grid. Note that because all subjects face the same budgets, the power of each test is identical across subjects. Second, table 3 gives the mean, standard deviation, minimum, first quartile (Q1), median, third quartile (Q3), and maximum of the number (and fraction) of violations and of the aeis corresponding to e sgarp, e harp, and e cm, over all repetitions in the simulated uniformly random data. Third, figure 3(a) plots the mean predictive success across all subjects at each efficiency level in the grid, and figure 3(b) is a subject-level scatterplot of e harp versus e garp at selected efficiency levels.
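To make the two diagnostics concrete, the following Python sketch simulates uniformly random behavior on a set of budgets and reports power and predictive success for e garp. It is an independent illustration, not the implementation behind powerps. The function names, the number of simulations, the seed, and the toy budgets are all assumptions. Random budget shares are drawn uniformly from the simplex, power is taken to be the share of simulated datasets that violate the axiom, and predictive success is taken to be the pass indicator minus (1 − power).

```python
import random

def dot(p, q):
    """Inner product of a price vector and a quantity vector."""
    return sum(pk * qk for pk, qk in zip(p, q))

def violates_egarp(prices, quantities, e=1.0):
    """True if the data violate e-GARP at efficiency level e (assumed definition)."""
    T = len(prices)
    spend = [e * dot(prices[t], quantities[t]) for t in range(T)]
    cost = [[dot(prices[t], quantities[s]) for s in range(T)] for t in range(T)]
    # Weak and strict direct revealed preference relations.
    R = [[spend[t] >= cost[t][s] for s in range(T)] for t in range(T)]
    P0 = [[spend[t] > cost[t][s] for s in range(T)] for t in range(T)]
    # Warshall's algorithm turns R into its transitive closure in place.
    for k in range(T):
        for i in range(T):
            if R[i][k]:
                for j in range(T):
                    if R[k][j]:
                        R[i][j] = True
    # A violation is a revealed preference chain answered by a strict direct relation.
    return any(R[t][s] and P0[s][t] for t in range(T) for s in range(T))

def random_dataset(prices, incomes, rng):
    """One bundle per budget, with budget shares uniform on the simplex."""
    data = []
    for p, m in zip(prices, incomes):
        draws = [rng.expovariate(1.0) for _ in p]  # normalized exponentials
        total = sum(draws)
        data.append([m * (d / total) / pk for d, pk in zip(draws, p)])
    return data

def power(prices, incomes, e=1.0, simulations=1000, seed=12345):
    """Share of simulated random datasets that violate e-GARP."""
    rng = random.Random(seed)
    fails = sum(violates_egarp(prices, random_dataset(prices, incomes, rng), e)
                for _ in range(simulations))
    return fails / simulations

def predictive_success(passes, power_value):
    """Pass indicator minus the share of random datasets that also pass."""
    return (1.0 if passes else 0.0) - (1.0 - power_value)

# Hypothetical example: two crossing budgets over two goods.
prices = [(1.0, 2.0), (2.0, 1.0)]
incomes = [5.0, 5.0]
pw = power(prices, incomes, simulations=2000)
print(pw, predictive_success(True, pw))
```

Because the simulated bundles depend only on prices and incomes, the sketch returns the same power for every subject who faces the same budgets. Nested budgets (for example, identical prices with increasing incomes) give a power of zero, because random choices on such budgets can never produce a violation.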

Figure 2. Power of e garp, e sgarp, e harp, and e cm

Table 3. Power summary statistics for e sgarp, e harp, and e cm∗

	Number (fraction) of violations			aei
Statistic	e sgarp	e harp	e cm	e sgarp	e harp	e cm
Mean	17.53 (0.274)	7.96 (0.995)	7.93 (0.992)	0.693	0.763	0.761
Std. dev.	12.29 (0.192)	0.47 (0.058)	0.65 (0.081)	0.181	0.120	0.124
Minimum	0 (0.000)	0 (0.000)	0 (0.000)	0.335	0.358	0.358
Q1	8 (0.125)	8 (1.000)	8 (1.000)	0.551	0.684	0.675
Median	15 (0.234)	8 (1.000)	8 (1.000)	0.667	0.773	0.769
Q3	27 (0.422)	8 (1.000)	8 (1.000)	0.840	0.856	0.859
Maximum	53 (0.828)	8 (1.000)	8 (1.000)	1.000	1.000	1.000

∗ The number (and fraction) of violations is reported at the efficiency level e = 1.

Figure 3. (a) Mean predictive success for e garp, e sgarp, e harp, and e cm; (b) scatterplot of e harp versus e garp. In panel (b), the dashed line is the 45° line, and the marker numbers refer to efficiency levels.

4.2 Aggregate household consumption data

In the second empirical illustration, we use aggregate household consumption data from the 1987–1988 Nationwide Food Consumption Survey conducted by the United States Department of Agriculture. This dataset is used by Poi (2002) to illustrate how Stata’s ml command can be used to fit the quadratic almost-ideal demand system. This dataset is named food.dta in the repository, Datasets for Stata Base Reference Manual, Release 17 (https://www.stata-press.com/data/r17/r.html), and contains budget shares and prices for the following four aggregated food categories: meats, fruits and vegetables, breads and cereals, and miscellaneous. As in Poi (2002), we use a sample of 4,048 households.

To test whether the data can be rationalized by preferences that are common across all households, we compute the aei for garp and wgarp.

We find that testing for e garp takes considerably longer than testing for e wgarp, which suggests that the main computational burden in testing for e garp is associated with the calculation of the transitive closure of the revealed preference relation. Interestingly, we find identical values of the aei for garp and wgarp, suggesting that none of the violations of garp can be attributed to violations of transitivity.
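The role of the transitive closure can be seen in a small Python sketch. It is an illustration under stated assumptions rather than the code behind checkax or aei: the function names and the toy data are hypothetical. Here e garp is checked on the Warshall (1962) transitive closure of the direct revealed preference relation, e wgarp is checked on the direct relation only, and the aei is approximated by bisection over e.

```python
def dot(p, q):
    """Inner product of a price vector and a quantity vector."""
    return sum(pk * qk for pk, qk in zip(p, q))

def direct_relations(prices, quantities, e):
    """Weak (R0) and strict (P0) direct revealed preference at efficiency e."""
    T = len(prices)
    spend = [e * dot(prices[t], quantities[t]) for t in range(T)]
    cost = [[dot(prices[t], quantities[s]) for s in range(T)] for t in range(T)]
    R0 = [[spend[t] >= cost[t][s] for s in range(T)] for t in range(T)]
    P0 = [[spend[t] > cost[t][s] for s in range(T)] for t in range(T)]
    return R0, P0

def transitive_closure(R0):
    """Warshall's algorithm: O(T^3) boolean closure of the relation R0."""
    T = len(R0)
    R = [row[:] for row in R0]
    for k in range(T):
        for i in range(T):
            if R[i][k]:
                for j in range(T):
                    if R[k][j]:
                        R[i][j] = True
    return R

def satisfies_egarp(prices, quantities, e=1.0):
    """No chain t R s may be answered by a strict direct relation s P0 t."""
    R0, P0 = direct_relations(prices, quantities, e)
    R = transitive_closure(R0)
    T = len(R)
    return not any(R[t][s] and P0[s][t] for t in range(T) for s in range(T))

def satisfies_ewgarp(prices, quantities, e=1.0):
    """Same check on the direct relation only: O(T^2), no closure needed."""
    R0, P0 = direct_relations(prices, quantities, e)
    T = len(R0)
    return not any(R0[t][s] and P0[s][t] for t in range(T) for s in range(T))

def aei(prices, quantities, check, tol=1e-6):
    """Approximate the largest e in [0, 1] at which `check` passes, by bisection."""
    if check(prices, quantities, 1.0):
        return 1.0
    lo, hi = 0.0, 1.0  # the check passes at lo and fails at hi
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if check(prices, quantities, mid):
            lo = mid
        else:
            hi = mid
    return lo

# Hypothetical three-good data with a revealed preference cycle of length three.
P = [(2.0, 1.0, 3.0), (3.0, 2.0, 1.0), (1.0, 3.0, 2.0)]
Q = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
print(satisfies_egarp(P, Q), satisfies_ewgarp(P, Q))
print(aei(P, Q, satisfies_egarp), aei(P, Q, satisfies_ewgarp))
```

In this toy dataset every violation runs through a chain of three observations, so the pairwise check passes while the closure-based check fails, and the two aei values differ. Identical aei values for garp and wgarp, as found in the household data, correspond to the opposite case. The cubic cost of the closure step relative to the quadratic pairwise check is also consistent with the longer run time reported for e garp.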

Finally, because e wgarp is considerably faster to test than e garp, we calculate the power of e wgarp at an efficiency level equal to the aei for wgarp.

5 Conclusions

In this article, we presented the new commands checkax, aei, and powerps to test whether observed data on prices and quantities can be rationalized by different notions of utility maximization. The commands are implementations of nonparametric revealed preference restrictions that can be formulated as combinatorial algorithms. An important property of such algorithms is that they converge in a finite number of steps and, consequently, can be implemented on rather large datasets. Although the commands are implementations of characterizations that are intrinsically deterministic (in the sense that they lack stochastic components), they also allow the user to calculate diagnostic measures such as goodness of fit, power, and predictive success of the underlying behavioral model.

The package rpaxioms contains implementations of perhaps the most basic concepts in the empirical revealed preference literature. Two natural extensions for future work come to mind. The first is to provide implementations of revealed preference characterizations of other behavioral models, including special cases in which preferences satisfy different forms of separability (for example, expected utility under risk and exponential discounted utility over time). The second is to provide implementations of more disaggregated measures of goodness of fit and power. Although some of these models and measures can be implemented by solving (mixed-integer) linear programming problems, this is not a trivial task, and the computational complexity of doing so depends crucially on the algorithms used to solve such problems.

Supplemental Material

Supplemental Material, sj-zip-1-stj-10.1177_1536867X221106374 for Testing axioms of revealed preference in Stata by Marcos Demetry, Per Hjertstrand and Matthew Polisson in The Stata Journal

6 Acknowledgments

We thank the editor Stephen Jenkins, a reviewer, Glenn Nielsen, and John Quah for helpful comments and useful suggestions. Per Hjertstrand thanks Torsten Söderbergs stiftelse for financial support.

7 Programs and supplemental materials

To install a snapshot of the corresponding software files as they existed at the time of publication of this article, type

To install checkax, aei, and powerps and obtain the data from the Statistical Software Components Archive, type

Note that typing net get downloads the dataset and do-file to your current working directory.

References

Afriat, S. N. 1967. The construction of utility functions from expenditure data. International Economic Review 8: 67–77. https://doi.org/10.2307/2525382.

Afriat, S. N. 1972. Efficiency estimation of production functions. International Economic Review 13: 568–598. https://doi.org/10.2307/2525845.

Afriat, S. N. 1973. On a system of inequalities in demand analysis: An extension of the classical method. International Economic Review 14: 460–472. https://doi.org/10.2307/2525934.

Aguiar, V. H., P. Hjertstrand, and R. Serrano. 2020. A rationalization of the weak axiom of revealed preference. Bravo Working Paper 2020-016, Brown University and Research Institute of Industrial Economics (ifn) Working Paper No. 1321. https://dx.doi.org/10.2139/ssrn.3543674.

Andreoni, J., and J. Miller. 2002. Giving according to garp: An experimental test of the consistency of preferences for altruism. Econometrica 70: 737–753. https://doi.org/10.1111/1468-0262.00302.

Andreoni, J., and C. Sprenger. 2012. Estimating time preferences from convex budgets. American Economic Review 102: 3333–3356. https://doi.org/10.1257/aer.102.7.3333.

Banerjee, S., and J. H. Murphy. 2006. A simplified test for preference rationality of two-commodity choice. Experimental Economics 9: 67–75. https://doi.org/10.1007/s10683-006-4313-6.

Banks, J., R. Blundell, and A. Lewbel. 1997. Quadratic Engel curves and consumer demand. Review of Economics and Statistics 79: 527–539. https://doi.org/10.1162/003465397557015.

Beatty, T. K. M., and I. A. Crawford. 2011. How demanding is the revealed preference approach to demand? American Economic Review 101: 2782–2795. https://doi.org/10.1257/aer.101.6.2782.

Becker, G. S. 1962. Irrational behavior and economic theory. Journal of Political Economy 70: 1–13. https://doi.org/10.1086/258584.

Bronars, S. G. 1987. The power of nonparametric tests of preference maximization. Econometrica 55: 693–698. https://doi.org/10.2307/1913608.

Brown, D. J., and C. Calsamiglia. 2007. The nonparametric approach to applied welfare analysis. Economic Theory 31: 183–188. https://doi.org/10.1007/s00199-006-0087-5.

Chiappori, P.-A., and J.-C. Rochet. 1987. Revealed preferences and differentiable demand. Econometrica 55: 687–691. https://doi.org/10.2307/1913607.

Choi, S., R. Fisman, D. Gale, and S. Kariv. 2007. Consistency and heterogeneity of individual behavior under uncertainty. American Economic Review 97: 1921–1938. https://doi.org/10.1257/aer.97.5.1921.

Choi, S., S. Kariv, W. Muller, and D. Silverman. 2014. Who is (more) rational? American Economic Review 104: 1518–1550. https://doi.org/10.1257/aer.104.6.1518.

Deaton, A. S., and J. Muellbauer. 1980. An almost ideal demand system. American Economic Review 70: 312–326.

Diewert, W. E. 1973. Afriat and revealed preference theory. Review of Economic Studies 40: 419–425. https://doi.org/10.2307/2296461.

Famulari, M. 1995. A household-based, nonparametric test of demand theory. Review of Economics and Statistics 77: 372–382. https://doi.org/10.2307/2109872.

Halevy, Y., D. Persitz, and L. Zrill. 2018. Parametric recoverability of preferences. Journal of Political Economy 126: 1558–1593. https://doi.org/10.1086/697741.

Heufer, J., and P. Hjertstrand. 2019. Homothetic efficiency: Theory and applications. Journal of Business and Economic Statistics 37: 235–247. https://doi.org/10.1080/07350015.2017.1319372.

Houtman, M., and J. A. H. Maks. 1985. Determining all maximal data subsets consistent with revealed preference. Kwantitatieve Methoden 6: 89–104.

Lewbel, A., and K. Pendakur. 2009. Tricks with Hicks: The easi demand system. American Economic Review 99: 827–863. https://doi.org/10.1257/aer.99.3.827.

Matzkin, R. L., and M. K. Richter. 1991. Testing strictly concave rationality. Journal of Economic Theory 53: 287–303. https://doi.org/10.1016/0022-0531(91)90157-Y.

McFadden, D. 1974. Conditional logit analysis of qualitative choice behavior. In Frontiers in Econometrics, ed. P. Zarembka, 105–142. New York: Academic Press.

Nishimura, H., E. A. Ok, and J. K.-H. Quah. 2017. A comprehensive approach to revealed preference theory. American Economic Review 107: 1239–1263. https://doi.org/10.1257/aer.20150947.

Poi, B. P. 2002. From the help desk: Demand system estimation. Stata Journal 2: 403–410. https://doi.org/10.1177/1536867X0200200406.

Polisson, M., and J. K.-H. Quah. 2022. Rationalizability, cost-rationalizability, and Afriat’s efficiency index. Working Paper 22/04, Institute for Fiscal Studies (ifs). https://doi.org/10.1920/wp.ifs.2022.0422.

Polisson, M., J. K.-H. Quah, and L. Renou. 2020. Revealed preferences over risk and uncertainty. American Economic Review 110: 1782–1820. https://doi.org/10.1257/aer.20180210.

Rose, H. 1958. Consistency of preference: The two-commodity case. Review of Economic Studies 25: 124–125. https://doi.org/10.2307/2296210.

Selten, R. 1991. Properties of a measure of predictive success. Mathematical Social Sciences 21: 153–167. https://doi.org/10.1016/0165-4896(91)90076-4.

Swofford, J. L., and G. A. Whitney. 1987. Nonparametric tests of utility maximization and weak separability for consumption, leisure and money. Review of Economics and Statistics 69: 458–464. https://doi.org/10.2307/1925533.

Varian, H. R. 1982. The nonparametric approach to demand analysis. Econometrica 50: 945–973. https://doi.org/10.2307/1912771.

Varian, H. R. 1983. Non-parametric tests of consumer behavior. Review of Economic Studies 50: 99–110. https://doi.org/10.2307/2296957.

Varian, H. R. 1990. Goodness-of-fit in optimizing models. Journal of Econometrics 46: 125–140. https://doi.org/10.1016/0304-4076(90)90051-T.

Warshall, S. 1962. A theorem on Boolean matrices. Journal of the American Association of Computing Machinery 9: 11–12. https://doi.org/10.1145/321105.321107.
