
Abstract

The ubiquitous presence of endogenous regressors presents a significant challenge when drawing causal inferences using observational data. The classical econometric method used to handle regressor endogeneity requires instrumental variables (IVs) that must satisfy the stringent condition of exclusion restriction, rendering it unfeasible in many settings. Herein, the authors propose a new IV-free method that uses copulas to address the endogeneity problem. Existing copula correction methods require nonnormal endogenous regressors: Normally or nearly normally distributed endogenous regressors cause model nonidentification or significant finite-sample bias. Furthermore, existing copula control function methods presume the independence of exogenous regressors and endogenous regressors. The authors’ generalized two-stage copula endogeneity-correction (2sCOPE) method simultaneously relaxes the two key identification requirements while maintaining the Gaussian copula regressor–error dependence structure. They prove that under the Gaussian copula dependence structure, 2sCOPE yields consistent causal-effect estimates with correlated endogenous and exogenous regressors as well as normally distributed endogenous regressors. In addition to relaxing the identification requirements, 2sCOPE has superior finite-sample performance and addresses the significant finite-sample bias problem due to insufficient regressor nonnormality. Moreover, 2sCOPE employs generated regressors derived from existing regressors to control for endogeneity, and can thus considerably increase the ease and broaden the applicability of IV-free methods for handling regressor endogeneity. The authors further demonstrate 2sCOPE's performance using simulation studies and illustrate its use in an empirical application.

Keywords

causal inference, endogeneity, IV-free method, copula, correlated regressors, identifiability

Causal inference is central to many problems that both academics and practitioners face. It has become increasingly important as rapidly available observational data in the current digital era promise to offer real-world evidence on cause-and-effect relationships for better decision-making. However, empirical researchers attempting to draw valid causal inferences from these data often encounter the presence of endogenous regressors correlated with the structural error in the population regression model representing the causal relationship of interest. Notably, omitted variables, such as unobserved demand shocks (e.g., product attributes, taste changes), may cause price endogeneity in observational scanner data. Ignoring this endogeneity can yield severely biased estimates of pricing effects on consumer demand (Villas-Boas and Winer 1999).

Instrumental variables (IVs) have traditionally been used to address the endogeneity issue. The ideal IV must satisfy two requirements: First, it must be correlated with the endogenous regressor via an explainable and validated relationship (i.e., relevance restriction), and, second, it must be uncorrelated with the structural error and must not directly affect the outcome (i.e., exclusion restriction). Although the theory of IVs is well-developed, researchers often struggle to find good IVs that meet both criteria. Potential IVs are often deficient by virtue of either weak relevance or challenging justification for exclusion restriction, hindering their use for addressing underlying endogeneity concerns (Rossi 2014).

The development and application of IV-free endogeneity-correction methods to address the lack of suitable IVs have gained traction in recent decades (Ebbes, Wedel, and Böckenholt 2009). Park and Gupta (2012) propose an IV-free method that uses the copula model (Christopoulos, McAdam, and Tzavalis 2021; Danaher 2007; Danaher and Smith 2011) to directly model the regressor–error dependence.¹ In addition to requiring no IVs, the copula approach is straightforward to use: One can simply add the latent copula data for the endogenous regressors as control variables to correct for endogeneity. These features considerably enhance the feasibility of endogeneity correction, as evidenced by the prolific use of the copula correction method (see examples of recent applications noted in the literature review in the next section). Like other IV-free methods, however, the copula correction methods also require distinctiveness between the distributions of the endogenous regressor and the structural error. Assuming Gaussian copula regressor–error dependence, the endogenous regressor is required to have a nonnormal distribution for model identification with the commonly assumed normal structural error distribution (Becker, Proksch, and Ringle 2021; Eckert and Hohberger 2023; Haschka 2022; Papies, Ebbes, and Van Heerde 2017; Park and Gupta 2012; Qian, Koschmann, and Xie 2024; Qian, Xie, and Koschmann 2022). Furthermore, we demonstrate that the existing copula control function correction methods implicitly assume that all exogenous regressors are uncorrelated with the linear combination of copula transformations of endogenous regressors (hereinafter referred to as copula transformed terms, CTT, used in these methods to control for endogeneity) and may yield significant bias when noticeable correlations are present between the endogenous and exogenous regressors.

In practical applications, the requirements of sufficient regressor nonnormality and absence of correlations between endogenous and exogenous regressors can be excessive and pose significant challenges to the application of the copula correction method. We often encounter endogenous regressors or include transformations of endogenous variables as regressors that have close-to-normal distributions. Examples of such regressors in economics and marketing management studies include stock market returns (Sorescu, Warren, and Ertekin 2017), corporate social responsibility (Eckert and Hohberger 2023), the organizational intelligence quotient (Mendelson 2000), and the logarithm of price (see the “Empirical Application” section). Theoretically, the endogenous regressor and the structural error can contain a common set of unobservables that collectively have a normal distribution, which can lead to a close-to-normal distribution of the endogenous regressor. In such scenarios, even if the model is identified asymptotically, the close-to-normality of endogenous regressors can cause estimation bias even in moderate sample sizes and require large sample sizes to mitigate the finite-sample bias (Becker, Proksch, and Ringle 2021). Correlations between the endogenous and exogenous regressors are also common in practical applications, particularly when the exogenous regressors are included to control for potential confounders. Examples of such exogenous control variables abound in marketing and management studies, including customer-specific variables (e.g., location, age, household size, income, past purchase behaviors) when estimating the returns of consumer targeting strategies on product sales (Papies, Ebbes, and Van Heerde 2017) and firms’ similarity when estimating the effect of competition on innovation (Aghion et al. 2005).

The purpose of this article is to provide a generalized copula control function procedure that relaxes the stringent requirements of both sufficient regressor nonnormality and the absence of correlations between endogenous and exogenous regressors. Like the existing copula methods, our proposed two-stage copula endogeneity-correction (2sCOPE) method requires neither IVs nor the assumption of exclusion restriction while assuming the Gaussian copula dependence structure. The 2sCOPE method corrects for endogeneity by adding residuals, obtained by regressing latent copula data for each endogenous regressor on the latent copula data for exogenous regressors, as generated regressors in the structural model. Unlike the original copula method (Becker, Proksch, and Ringle 2021; Eckert and Hohberger 2023; Park and Gupta 2012; hereinafter Copula_Origin), 2sCOPE can account for the dependence between endogenous and exogenous regressors. Copula_Origin thus constitutes a special case of 2sCOPE. Assuming that a Gaussian copula correlation model captures dependence among the endogenous regressors, correlated exogenous regressors, and the structural error, we prove that 2sCOPE can identify causal effects under weaker assumptions than Copula_Origin and overcome the two key limitations mentioned previously.
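The residual-based generated regressor described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: it handles one endogenous and one exogenous regressor, takes the latent copula data to be the standard normal quantile of the empirical CDF rescaled as rank/(n + 1), assumes no tied values, and fits the first stage through the origin because the latent series are centered at zero by construction. The function names and the price and temperature numbers are hypothetical.

```python
from statistics import NormalDist


def copula_transform(values):
    """Latent copula data: Phi^{-1}(rank / (n + 1)).

    The rank / (n + 1) rescaling is an assumption of this sketch; it keeps
    the empirical CDF strictly inside (0, 1). Ties are not handled.
    """
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0] * n
    for rank, idx in enumerate(order, start=1):
        ranks[idx] = rank
    std_normal = NormalDist()
    return [std_normal.inv_cdf(r / (n + 1)) for r in ranks]


def first_stage_residual(p, w):
    """Residual from regressing the latent data of p on the latent data of w."""
    p_star = copula_transform(p)
    w_star = copula_transform(w)
    # Least-squares slope through the origin (both series have mean ~0).
    slope = sum(a * b for a, b in zip(p_star, w_star)) / sum(b * b for b in w_star)
    return [a - slope * b for a, b in zip(p_star, w_star)]


# Hypothetical data: price (endogenous) and temperature (exogenous).
price = [2.5, 1.2, 3.9, 0.7, 5.1, 2.0, 4.4, 3.1]
temperature = [18.0, 11.0, 25.0, 9.0, 30.0, 21.0, 24.0, 16.0]
resid = first_stage_residual(price, temperature)
```

In the second stage, `resid` would enter the structural model as one additional regressor alongside the original regressors. By construction it is orthogonal to the latent copula data of the exogenous regressor.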

The remainder of this article begins with a review of relevant literature on methods for causal inference with endogenous regressors and an overview of this work's contributions. We then propose 2sCOPE and prove its consistency with normally distributed and correlated regressors. Next, we evaluate 2sCOPE's performance using simulation studies under different scenarios and provide a flowchart to guide its use in practical applications. We further apply 2sCOPE to estimate price elasticity using store purchase databases.

Literature Review and Contributions

Marketing, economics, and statistics research has developed a rich array of methods for drawing causal inferences. Randomized experiments, such as controlled lab experiments and field experiments (Anderson and Simester 2004; Godes and Mayzlin 2009; Johnson, Lewis, and Nubbemeyer 2017), have long been the gold standard for estimating causal effects. When controlled experiments are not feasible, quasi-experimental designs, such as regression discontinuity, difference-in-difference, and synthetic control, are used to mimic randomized experiments and enable the identification of causal effects with observational data (Athey and Imbens 2006; Hartmann, Nair, and Narayanan 2011; Kim, Lee, and Gupta 2020; Narayanan and Kalyanam 2015; Shi et al. 2017). However, these quasi-experimental designs have special data and design requirements, and are not designed to cope with the general issue of endogenous regressors when estimating causal effects using observational data.

There exists a large literature on various approaches to addressing endogenous regressors when inferring causal effects. Papies, Ebbes, and Van Heerde (2017), Rutz and Watson (2019), and Park and Gupta (2012) provide overviews of approaches to addressing endogeneity in marketing. Among the three broad classes of solutions that they discuss, the most widely employed is the IV approach (Aghion et al. 2005; Ataman, Van Heerde, and Mela 2010; Li and Ansari 2014; Novak and Stern 2009; Qian 2008; Van Heerde et al. 2013). Rossi (2014) surveys a decade of publications in Marketing Science and Quantitative Marketing and Economics and reveals that the most commonly used IVs are lagged variables, costs, fixed effects, and Hausman-style variables from other markets. However, the survey findings show that the IVs’ strengths are rarely measured/reported, despite the necessity of doing so to detect weak IVs. Moreover, the exclusion restriction condition cannot generally be tested to verify the IVs’ validity. The survey findings also show that most articles lack a discussion of why the IVs used are valid. In short, although IVs have a sound theoretical grounding, good ones are difficult to find, making the IV approach difficult to implement in practice.

The second class of solutions for mitigating endogeneity involves specifying the economic structure that generates the observed data on endogenous regressors (e.g., a supply-side model for marketing-mix variables) (Chintagunta et al. 2006; Dotson and Allenby 2010; Otter, Gilbride, and Allenby 2011; Sudhir 2001; Sun 2005; Yang, Chen, and Allenby 2003). A key concern with this approach is that incorrect assumptions or insufficient information on the supply side may generate biased estimates (Chintagunta et al. 2006).

The third class of solutions to endogeneity correction involves IV-free methods, a more recent methodological development. Ebbes, Wedel, and Böckenholt (2009) discuss three extant IV-free approaches: the higher moments approach (Lewbel 1997), the identification through heteroskedasticity estimator (Rigobon 2003), and the latent IV method (Ebbes et al. 2005). Wang and Blei (2019) propose a deconfounder approach that has some flavor of the latent IV approach. All these methods decompose an endogenous regressor into an exogenous part and an endogenous part. The assumption that the endogenous regressor contains an exogenous component that does not directly affect the outcome is akin to the stringent condition of exclusion restriction for observed IVs and thus can be difficult to justify. Park and Gupta (2012) introduce another IV-free method that does not require the stringent condition of exclusion restriction but assumes a Gaussian copula dependence between the structural error and the endogenous regressor. Researchers have enthusiastically adopted the copula method owing to its feasibility without the need for instruments (Atefi et al. 2018; Becker, Proksch, and Ringle 2021; Datta, Foubert, and Van Heerde 2015; Eckert and Hohberger 2023; Elshiewy and Boztug 2018; Haschka 2022; Heitmann et al. 2020). The 2sCOPE contributes to the field by overcoming significant limitations of existing copula correction methods and by virtue of its broader applicability (Table 1).

Table 1.
A Comparison of Copula Correction Methods.

| Features | Park and Gupta (2012) | Haschka (2022) | 2sCOPE |
| --- | --- | --- | --- |
| Nonnormality of endogenous regressors^a | Required | Required | Not required^b |
| Handle correlated exogenous regressors | No | Yes | Yes |
| Intercept included | Yes | No^c | Yes |
| Theoretical proof | Yes | No | Yes |
| Estimation method | Control function and MLE | MLE | Control function |
| Structural model | Linear regression; RCL; slope endogeneity | LPM-FE | Linear regression; LPM-FE, LPM-RE, LPM-ME; RCL, slope endogeneity |

^a When required, normality of any endogenous regressor leads to nonidentifiable models. Insufficient nonnormality of endogenous regressors can also cause poor finite-sample performance (finite-sample bias and large standard errors) and require extremely large sample sizes for the model to perform well.

^b Nonnormality of endogenous regressors is not required provided at least one correlated exogenous regressor is not normally distributed.

^c The approach cannot estimate the intercept term, which is removed from the panel model prior to estimation using first-difference or fixed-effects transformation (Web Appendix A.8 of Haschka [2022]). Becker, Proksch, and Ringle (2021) show the importance of including the intercept in marketing applications.

Notes: LPM = linear panel model; FE = fixed effects for individual-specific intercepts with common slope coefficients; RE = random effects; ME = mixed effects (including both fixed effects and random coefficients); RCL = random coefficient logit.

The contributions of 2sCOPE are threefold. First, to our knowledge, this work is novel in providing formal proofs for copula correction methods’ theoretical properties. These theoretical results are necessary given that model identifiability is central to addressing the endogeneity issue. Recent work notes the lack of rigorous proofs of required model identification conditions and estimation properties (consistency and efficiency) for copula correction as a major area requiring further research (Becker, Proksch, and Ringle 2021; Haschka 2022).² The theoretical results presented herein help bridge this gap and facilitate a better understanding of the properties of the copula correction methods and their optimal use.

A useful theoretical outcome is that the existence of the correlations between endogenous and exogenous regressors alone does not automatically introduce bias into Copula_Origin. Rather, we demonstrate that the implicit assumption for Copula_Origin is the exogenous regressors’ uncorrelatedness with the CTT, the linear combination of copula transformations of endogenous regressors used to control for endogeneity. The difference between the implicit assumption and the condition of zero pairwise correlations between endogenous and exogenous regressors can be substantial, particularly with multiple endogenous regressors.³ We prove that the proposed 2sCOPE yields consistent causal effect estimates when the preceding implicit assumption is violated, which can cause biased causal effect estimates for Copula_Origin. Our other novel finding is that although the exogenous regressors that are correlated with the CTT require special handling for consistent causal effect estimation, they can be leveraged efficiently by 2sCOPE to substantially improve the identification and finite-sample performance of copula correction. Significantly, we prove that the structural model with normally distributed endogenous regressors can be identified using 2sCOPE, provided that one of the exogenous regressors correlated with endogenous ones is nonnormal, which is considerably more feasible in many practical applications.

Second, the proposed 2sCOPE method is the first copula correction method that simultaneously relaxes the nonnormality assumption of endogenous regressors and handles correlated endogenous and exogenous regressors (Table 1). Most existing copula correction methods do not account for correlated endogenous and exogenous regressors. An exception is Haschka (2022), who generalizes Park and Gupta (2012) to fixed-effects linear panel models with correlated regressors by jointly modeling the structural error and endogenous and exogenous regressors using copulas and maximum likelihood estimation (MLE). However, Haschka's approach still requires the nonnormality of endogenous regressors. Thus, all existing copula correction methods require sufficient nonnormality of endogenous regressors for model identification and acceptable finite-sample performance (Becker, Proksch, and Ringle 2021; Eckert and Hohberger 2023; Haschka 2022). Becker, Proksch, and Ringle (2021) suggest a minimum absolute skewness of 2 for an endogenous regressor to ensure good performance of Gaussian copula correction methods in samples of fewer than 1,000 observations (Figure 8 in Becker, Proksch, and Ringle 2021). These requirements can significantly limit the practical deployment of copula correction methods.

Our proposed 2sCOPE method overcomes these important restrictions of existing copula correction methods. Consistent with our theoretical results, the evaluation in Cases 2 and 3 of the simulation demonstrates the superior finite-sample performance of 2sCOPE and shows that 2sCOPE eliminates or substantially reduces the significant problem of finite-sample bias associated with insufficient regressor nonnormality raised in Becker, Proksch, and Ringle (2021) and Eckert and Hohberger (2023). Even when the endogenous regressor is normal or close-to-normal with a skewness of 0, the estimation bias of 2sCOPE remains negligible for a sample size as small as 200 (Figure 1). We further conduct systematic simulation studies and provide an actionable guideline for using 2sCOPE (Figure 2), establishing sufficient conditions regarding exogenous regressors that are verifiable using tests of nonnormality and relevance to endogenous regressors, to effectively handle endogenous regressors with insufficient nonnormality using data at hand. Overall, 2sCOPE can substantially broaden IV-free methods’ practical applicability for handling endogeneity issues.

Figure 1.
Average Absolute Estimation Bias of All Regression Parameters (μ, α, β) in the Structural Model for Different Endogenous Regressor Distributions.

Figure 2.
Decision Tree for Using 2sCOPE.

Third, the proposed 2sCOPE provides a versatile and feasible copula control function method for handling regressor endogeneity. Although the vast majority of applications using the copula correction method have employed the generated-regressor approach (Becker, Proksch, and Ringle 2021; Eckert and Hohberger 2023), no existing copula control function method can handle endogenous regressors that have insufficient nonnormality or are correlated with exogenous regressors. The 2sCOPE method addresses this need and enjoys several benefits associated with the control function versus the alternative MLE approach. These include—but are not limited to—little extra computational and modeling burdens to be integrated with complex outcome models commonly used in marketing studies (Table 1), broader applicability with weaker assumptions, and increased robustness to model misspecifications.⁴

In many such models, the MLE approach becomes considerably more difficult or computationally infeasible while 2sCOPE is straightforward. Footnote 8 offers an example demonstrating that extension of Haschka's (2022) MLE approach to random coefficient linear panel models with correlated endogenous and exogenous regressors requires the numerical evaluation of potentially high-dimensional integrals of complicated functions containing the product of copula density functions, evaluated at repeated measurement occasions. However, 2sCOPE involves none of these integrals and can be implemented using standard software programs for random coefficient linear panel models, assuming all regressors are exogenous. Furthermore, although 2sCOPE assumes a normal error distribution, we show its robustness to symmetric nonnormal error distributions (see the Web Appendix), in contrast to existing methods’ sensitivity to such error misspecifications (Becker, Proksch, and Ringle 2021). Thus, the 2sCOPE control function approach leveraging correlated exogenous regressors can enhance robustness to model misspecifications.

Methods

In this section, we develop a copula-based IV-free method for handling endogenous regressors that have insufficient nonnormality or are correlated with exogenous regressors. We first review Copula_Origin, demonstrate that it implicitly assumes no correlations between exogenous regressors and the CTT, and characterize the bias in the structural model parameter estimates that may arise when this assumption is violated. We then propose a new approach to the problem and describe the detailed estimation procedure. We also demonstrate how exogenous regressors correlated with endogenous regressors can sharpen structural model parameter estimates and enable the identification of the structural model containing normally distributed endogenous regressors, which are known to cause model nonidentifiability for Copula_Origin.

Assumptions of the Existing Copula Endogeneity-Correction Method (Copula_Origin)

Consider the following linear structural regression model:⁵
$Y_{t} = μ + P_{t} α + {W^{'}}_{t} β + ξ_{t},$
(1)
where t = 1, 2, …, T indexes either time or cross-sectional units; Y_t is a (1 × 1) dependent variable; P_t is a (1 × 1) continuous endogenous regressor (e.g., price); W_t is a (k_W × 1) vector of exogenous regressors; ξ_t is the structural error term; and (μ, α, β) are model parameters. P_t is correlated with ξ_t, and this correlation causes the endogeneity problem. W_t is exogenous; that is, it is not correlated with ξ_t, but can be correlated with the endogenous variable P_t. An example of W_t is weather when estimating the pricing effect on consumer demand of ice cream using historical sales data. Weather is an exogenous factor that can affect both the pricing and sales of ice cream: The inclusion of weather in the sales model for ice cream mitigates the concern of price endogeneity and improves the demand prediction accuracy.

The key idea of Copula_Origin (Park and Gupta 2012) is to use a copula to jointly model the correlation between the endogenous regressor P_t and the error term ξ_t. This method has the advantage that marginals are not restricted by the joint distribution. Thus, the copula model allows researchers to construct a flexible multivariate joint distribution that captures the correlation among these variables.

Let F(P, ξ) be the joint cumulative distribution function (CDF) of the endogenous regressor P_t and the structural error ξ_t with marginal CDFs H(P) and G(ξ), respectively. For notational simplicity, we may omit the index t in P_t and ξ_t subsequently when appropriate. According to Sklar's theorem (Sklar 1959), there exists a copula function C(·,·) such that
$F (P, ξ) = C (H (P), G (ξ)) = C (U_{p}, U_{ξ}),$
(2)
where U_p = H(P) and U_ξ = G(ξ), and both follow uniform(0, 1) distributions. Thus, the copula maps the marginal CDFs of the endogenous regressor and the structural error to their joint CDF and makes it possible to separately model these random variables’ marginals and correlations. To capture the association between the endogenous regressor P and the error ξ, Park and Gupta (2012) use the following Gaussian copula for its desirable properties (Danaher 2007; Danaher and Smith 2011):
$\begin{aligned} F (P, ξ) &= C (U_{p}, U_{ξ}) = Ψ_{ρ} (Φ^{- 1} (U_{p}), Φ^{- 1} (U_{ξ})) \\ &= \frac{1}{2 π {(1 - ρ^{2})}^{1 / 2}} \int_{- \infty}^{Φ^{- 1} (U_{p})} \int_{- \infty}^{Φ^{- 1} (U_{ξ})} \exp \left[\frac{- (s^{2} - 2 ρ s t + t^{2})}{2 (1 - ρ^{2})}\right] ds \, dt, \end{aligned}$
(3)
where Φ(·) denotes the univariate standard normal distribution function and Ψ_ρ(·,·) denotes the bivariate standard normal distribution with the correlation coefficient ρ. With empirical marginal CDFs, the preceding Gaussian copula model depends on the rank order of raw data only, and is invariant to strictly monotonic transformations of variables in (P, ξ). Thus, the preceding Gaussian copula model is regarded as general and robust for most marketing applications (Danaher and Smith 2011). In the Gaussian copula model, ρ captures the endogeneity of the regressor P, and a nonzero value of ρ corresponds to P being endogenous.
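The rank-order property can be seen directly in a small sketch. It assumes the empirical CDF is rescaled as rank/(n + 1) so that it stays strictly inside (0, 1), and that there are no tied values; the function name and the price values are hypothetical.

```python
import math
from statistics import NormalDist


def copula_transform(values):
    """Return Phi^{-1}(rank / (n + 1)) for each value.

    The rank / (n + 1) rescaling is an assumption of this sketch.
    Ties are not handled.
    """
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0] * n
    for rank, idx in enumerate(order, start=1):
        ranks[idx] = rank
    std_normal = NormalDist()
    return [std_normal.inv_cdf(r / (n + 1)) for r in ranks]


# Hypothetical prices and a strictly increasing transformation of them.
price = [2.5, 1.2, 3.9, 0.7, 5.1, 2.0]
log_price = [math.log(p) for p in price]

p_star = copula_transform(price)
p_star_log = copula_transform(log_price)
```

Because the logarithm preserves the ordering of the prices, `p_star` and `p_star_log` are identical: the transformed data depend only on ranks.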

Let $P_{t}^{*} = Φ^{- 1} (U_{p})$ and $ξ_{t}^{*} = Φ^{- 1} (U_{ξ})$; the preceding Gaussian copula means that $[P_{t}^{*}, ξ_{t}^{*}]^{'}$ follows the standard bivariate normal distribution with the correlation coefficient ρ as follows:
$\begin{pmatrix} P_{t}^{*} \\ ξ_{t}^{*} \end{pmatrix} \sim N \left(\begin{bmatrix} 0 \\ 0 \end{bmatrix}, \begin{bmatrix} 1 & ρ \\ ρ & 1 \end{bmatrix}\right) .$
(4)
Based on the assumption that the structural error ξ_t follows $N (0, σ_{ξ}^{2})$ , Park and Gupta (2012) demonstrate that the structural error can be divided into two parts, as follows:
$ξ_{t} = σ_{ξ} ξ_{t}^{*} = σ_{ξ} ρ P_{t}^{*} + σ_{ξ} \sqrt{1 - ρ^{2}} ω_{t},$
(5)
where the first part, $σ_{ξ} ρ P_{t}^{*}$, captures the correlation between ξ_t and the endogenous regressor, and the other part, $σ_{ξ} \sqrt{1 - ρ^{2}} ω_{t}$, is a new independent error term. Equation 1 can then be rewritten as follows:
$Y_{t} = μ + P_{t} α + W_{t}^{'} β + σ_{ξ} ρ P_{t}^{*} + σ_{ξ} \sqrt{1 - ρ^{2}} ω_{t} .$
(6)
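As a quick consistency check on Equations 4 and 5, the decomposition of the standardized error, $ξ_{t}^{*} = ρ P_{t}^{*} + \sqrt{1 - ρ^{2}} ω_{t}$ with $P_{t}^{*} = Φ^{- 1} (U_{p})$, reproduces the unit variance and the correlation ρ required by Equation 4. This is a sketch that assumes $ω_{t}$ is standard normal and independent of $P_{t}^{*}$, which the text implies but does not state explicitly:

$\begin{aligned} Var (ξ_{t}^{*}) &= ρ^{2} Var (P_{t}^{*}) + (1 - ρ^{2}) Var (ω_{t}) = ρ^{2} + (1 - ρ^{2}) = 1, \\ Cov (P_{t}^{*}, ξ_{t}^{*}) &= ρ Var (P_{t}^{*}) + \sqrt{1 - ρ^{2}} Cov (P_{t}^{*}, ω_{t}) = ρ . \end{aligned}$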
Based on the preceding representation, Park and Gupta suggest augmented ordinary least squares (OLS) estimation of Equation 6 with $P_{t}^{*} = Φ^{- 1} (U_{p})$ included as an additional regressor to correct for the endogeneity of P_t. Park and Gupta further note that for the preceding approach to work, P_t must have a nonnormal distribution. If P_t is normally distributed, $P_{t} = P_{t}^{*} \times σ_{p}$ (up to a location shift that the intercept absorbs), resulting in perfect collinearity between P_t and $P_{t}^{*}$ and violating the full-rank assumption required to identify the linear regression model in Equation 6.
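The collinearity problem can be illustrated with a small deterministic sketch. It builds one regressor whose values are an exact linear function of normal scores (an idealized normally distributed regressor) and one whose values are the exponential of those scores (a skewed regressor), then compares each with its own copula transformation. The rank/(n + 1) rescaling of the empirical CDF, the helper names, and the constants 10 and 2 are assumptions made for illustration.

```python
import math
from statistics import NormalDist


def copula_transform(values):
    """Return Phi^{-1}(rank / (n + 1)) for each value (no ties assumed)."""
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0] * n
    for rank, idx in enumerate(order, start=1):
        ranks[idx] = rank
    std_normal = NormalDist()
    return [std_normal.inv_cdf(r / (n + 1)) for r in ranks]


def pearson(x, y):
    """Sample Pearson correlation coefficient."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


n = 200
scores = [NormalDist().inv_cdf(i / (n + 1)) for i in range(1, n + 1)]

normal_price = [10.0 + 2.0 * z for z in scores]  # idealized normal regressor
skewed_price = [math.exp(z) for z in scores]     # idealized lognormal regressor

# Correlation between each regressor and its own generated regressor.
r_normal = pearson(normal_price, copula_transform(normal_price))
r_skewed = pearson(skewed_price, copula_transform(skewed_price))
```

For the normal-shaped regressor the correlation equals 1 up to floating-point error, so the regressor and its generated counterpart cannot be separated in Equation 6. For the skewed regressor the correlation is clearly below 1, which is what leaves room for identification.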

Next, we demonstrate that an implicit assumption for the preceding generated-regressor approach to yield consistent model estimates is the uncorrelatedness between $P_{t}^{*}$ and W_t. A nonzero correlation between the exogenous regressor W_t and the generated regressor $P_{t}^{*}$ would cause biased OLS estimates of Equation 6 using Copula_Origin owing to the induced correlation between the error term ω_t and W_t, which Theorem 1 formally proves.
Theorem 1.
Inconsistency of Copula_Origin with Correlated W_t. Assuming (1) (1, P, W) is full rank and W is exogenous, (2) the error term is normal, (3) the structural error term and P_t follow a Gaussian copula, (4) P_t is endogenous: ρ ≠ 0, and (5) $P_{t}^{*}$ and W_t are correlated, we have $Cov (ω_{t}, W_{t}) = - \frac{ρ}{\sqrt{1 - ρ^{2}}} Cov (W_{t}, P_{t}^{*}) \neq 0$, and, consequently, the OLS estimates of Equation 6 are inconsistent.
Proof:
See the Web Appendix.
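The covariance expression in Theorem 1 can be motivated in one step. The following is a sketch for a scalar W_t, not the full proof given in the Web Appendix. Taking the covariance of W_t with both sides of Equation 5, where $P_{t}^{*} = Φ^{- 1} (U_{p})$, and using the exogeneity of W_t, that is, Cov(W_t, ξ_t) = 0, gives

$\begin{aligned} 0 = Cov (W_{t}, ξ_{t}) &= σ_{ξ} ρ \, Cov (W_{t}, P_{t}^{*}) + σ_{ξ} \sqrt{1 - ρ^{2}} \, Cov (W_{t}, ω_{t}) \\ \Rightarrow \; Cov (ω_{t}, W_{t}) &= - \frac{ρ}{\sqrt{1 - ρ^{2}}} Cov (W_{t}, P_{t}^{*}), \end{aligned}$

which is nonzero whenever ρ ≠ 0 and $Cov (W_{t}, P_{t}^{*}) \neq 0$ (with |ρ| < 1 so that the division is valid). The new error term in Equation 6 is therefore correlated with an included regressor, which is the source of the inconsistency.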

To summarize, Copula_Origin based on Equation 6 makes the set of assumptions listed in Table 2. Assumption 5 was identified by Haschka (2022). However, as detailed in the Web Appendix, Assumption 5 should be replaced with the more general Assumption 5(b) for multiple endogenous regressors.⁶ Assumptions 5 and 5(b) are verifiable and provide users with criteria to determine whether Copula_Origin will provide consistent estimation when exogenous regressors exist. With only one endogenous regressor, one can simply check the correlations between the copula transformation of this endogenous regressor and each exogenous regressor. For multiple endogenous regressors, one should check the correlations between the CTT (i.e., the linear combination of copula transformations of these endogenous regressors used to control for endogeneity) in Copula_Origin and each exogenous regressor, using Fisher's Z test, as described in the Web Appendix. Where W_t contains at least one exogenous regressor that fails Assumption 5 or 5(b), Copula_Origin yields biased estimates, and our proposed 2sCOPE can be used, as derived in the next subsection.
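The correlation check for a single endogenous regressor can be sketched as follows. The exact test specification is in the Web Appendix, which is not reproduced here, so this sketch assumes the textbook Fisher z transformation: atanh(r) multiplied by sqrt(n - 3) is compared with a standard normal distribution. The function name and the example correlations and sample size are hypothetical.

```python
import math
from statistics import NormalDist


def fisher_z_pvalue(r, n):
    """Two-sided p-value for H0: correlation = 0, via Fisher's z transform.

    r is the sample correlation between an exogenous regressor and the
    copula transformation of the endogenous regressor; n is the sample size.
    Requires |r| < 1 and n > 3.
    """
    z_stat = math.atanh(r) * math.sqrt(n - 3)
    return 2.0 * (1.0 - NormalDist().cdf(abs(z_stat)))


# Hypothetical sample correlations with n = 100 observations.
p_weak = fisher_z_pvalue(0.10, 100)    # weak correlation
p_strong = fisher_z_pvalue(0.50, 100)  # strong correlation
```

A small p-value (as for `p_strong`) would indicate that the exogenous regressor is correlated with the generated regressor, so Assumption 5 is in doubt. A large p-value (as for `p_weak`) gives no evidence against it.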

Table 2.
Assumptions in Copula_Origin.

Assumption 1. Full rank^a of all regressors and exogeneity of W.^b

Assumption 2. The structural error follows a normal distribution.

Assumption 3. P_t and the structural error follow a Gaussian copula.

Assumption 4. Nonnormality of the endogenous regressor P_t.

Assumption 5. For a scalar endogenous regressor P_t, W_t and $P_{t}^{*}$ are uncorrelated.

Assumption 5(b). For multiple endogenous regressors, W_t and the CTT are uncorrelated.^c

^a Full rank means rank(X′X) = k, in which X = (1, P, W) with column dimension of k.

^b Exogeneity of W means Cov(W, ξ) = 0. When P and W are uncorrelated, the exogeneity assumption of W can be relaxed if the interest is only on α, the coefficient of P (see the Web Appendix), but P still must be nonnormally distributed to achieve identification.

^c CTT is the linear combination of $P_{t}^{*}$ used to control for endogenous regressors. See an example of CTT for two endogenous regressors in footnote 6.

The full rank of all regressors and exogeneity of W_t (Assumption 1 of Table 2) are assumptions that are made in several other commonly used econometric methods, including OLS and IV methods, to ensure estimation consistency. For Assumptions 2 and 3, Park and Gupta (2012) demonstrate their copula method's reasonable robustness to nonnormal distributions of the structural error (Assumption 2) and alternative forms of copula functions (Assumption 3), although it is not surprising to observe Copula_Origin's sensitivity to gross violations of these assumptions, such as highly skewed error distributions (Becker, Proksch, and Ringle 2021; Eckert and Hohberger 2023). By contrast, the assumption that the endogenous regressor P_t follows a nonnormal distribution (Assumption 4) is critical. An endogenous regressor following a normal distribution violates the full-rank condition in Equation 6 and causes model nonidentification regardless of the sample size; a nearly normally distributed endogenous regressor may require a very large sample size for the method to perform well and may cause the method to perform poorly in finite samples. Assumptions 4 and 5 (or 5(b)) can both be too strong and substantially limit the copula method's applicability.

Proposed Two-Stage Copula Endogeneity-Correction Method

Here, we propose our 2sCOPE method and demonstrate its ability to relax both the uncorrelatedness assumption between CTT and the exogenous regressors (Assumption 5(b)) and the key identification assumption of nonnormal endogenous regressors (Assumption 4). The 2sCOPE method jointly models the endogenous regressor, P_t, the correlated exogenous variable, W_t, and the structural error term, ξ_t, using the Gaussian copula model, which implies that $[P_{t}^{*}, W_{t}^{*}, ξ_{t}^{*}]$ follows the multivariate normal distribution:
$\begin{pmatrix} P_{t}^{*} \\ W_{t}^{*} \\ ξ_{t}^{*} \end{pmatrix} \sim N \left(\begin{bmatrix} 0 \\ 0 \\ 0 \end{bmatrix}, \begin{bmatrix} 1 & ρ_{pw} & ρ_{p ξ} \\ ρ_{pw} & 1 & 0 \\ ρ_{p ξ} & 0 & 1 \end{bmatrix}\right),$
(7)
where $P_{t}^{} = Φ^{- 1} (H (P_{t}))$ , $W_{t}^{} = Φ^{- 1} (L (W_{t}))$ , and $ξ_{t}^{} = Φ^{- 1} (G (ξ_{t}))$ , and H(·), L(·), and G(·) are marginal CDFs of P_t, W_t, and ξ_t, respectively.

Under the Gaussian copula model in Equation 7, one may develop a direct extension of Copula_Origin that adds the generated regressors $P_{t}^{*}$ and $W_{t}^{*}$ into the structural regression model to correct for endogeneity bias. The resulting method, COPE (COPula Endogeneity correction), is shown to yield consistent causal effect estimates without requiring the exogeneity of W and Assumption 5 (or Assumption 5(b)), which are needed for Copula_Origin (see the Web Appendix). However, COPE requires that endogenous regressors P_t and exogenous regressors W_t both have sufficient nonnormality and yields substantial bias when regressors have insufficient nonnormality (see the Case 2 simulation results and Figure 1). Furthermore, the addition of numerous generated regressors for control variables W can cause severe multicollinearity issues and exert significantly adverse impacts on causal effect estimation efficiency and stability (the simulation results in the Web Appendix demonstrate that COPE can require a sample size five times greater than our proposed method to achieve the same estimation precision). Overall, COPE suffers from the low face-validity problem; that is, it requires generated regressors for exogenous regressors even though, conceptually, only generated regressors for endogenous regressors are needed to control for the endogeneity problem. To overcome these limitations, we have derived the 2sCOPE method, which relaxes both Assumptions 4 and 5(b) of Copula_Origin, as shown subsequently.

Under the Gaussian copula model in Equation 7, we have the following system of equations:
$Y_{t} = μ + P_{t} α + W_{t} β + ξ_{t},$
(8)
$P_{t}^{*} = W_{t}^{*} γ + ϵ_{t} .$
(9)
Assuming that ξ_t follows a normal distribution, ε_t and ξ_t follow a bivariate normal distribution because both are linear combinations of the trivariate normal vector $(ξ_{t}^{*}, P_{t}^{*}, W_{t}^{*})$ under the Gaussian copula assumption. Equation 9 expresses the copula transformation of the endogenous regressor, determined by the rank order of P_t, as a linear combination of observed and unobserved variables. The two error terms ε_t and ξ_t are correlated owing to the endogeneity of P_t. For example, both ξ_t and ε_t may contain an additive component corresponding to a common omitted variable. The preceding model is then obtained when the omitted variable and regressors follow a Gaussian copula model.

The main idea of 2sCOPE is to make use of the fact that, by conditioning on ε_t, the structural error ξ_t becomes independent of both P_t and W_t. That is, by conditioning on the component of P_t that causes its endogeneity (here, ε_t), the structural error is not correlated with either P_t or W_t, thereby ensuring the consistency of standard estimation methods. In this sense, ε_t serves as a (scaled) control function to address the endogeneity bias. To demonstrate this point, we rewrite the Gaussian copula model in Equation 7 as
$\begin{aligned} \begin{pmatrix} P_{t}^{*} \\ W_{t}^{*} \\ ξ_{t}^{*} \end{pmatrix} &= \begin{pmatrix} 1 & 0 & 0 \\ ρ_{pw} & \sqrt{1 - ρ_{pw}^{2}} & 0 \\ ρ_{pξ} & \frac{- ρ_{pw} ρ_{pξ}}{\sqrt{1 - ρ_{pw}^{2}}} & \sqrt{1 - ρ_{pξ}^{2} - \frac{ρ_{pw}^{2} ρ_{pξ}^{2}}{1 - ρ_{pw}^{2}}} \end{pmatrix} \times \begin{pmatrix} ω_{1, t} \\ ω_{2, t} \\ ω_{3, t} \end{pmatrix}, \\ \begin{pmatrix} ω_{1, t} \\ ω_{2, t} \\ ω_{3, t} \end{pmatrix} &\sim N \left(\begin{bmatrix} 0 \\ 0 \\ 0 \end{bmatrix}, \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}\right) . \end{aligned}$
(10)
Given the preceding joint normal distribution for $(P_{t}^{*}, W_{t}^{*}, ξ_{t}^{*})$ and $ξ_{t} = σ_{ξ} ξ_{t}^{*}$, we have
$P_{t}^{*} = ρ_{pw} W_{t}^{*} + ϵ_{t},$
(11)
and
$\begin{aligned} Y_{t} &= μ + P_{t} α + W_{t} β + \frac{σ_{ξ} ρ_{pξ}}{1 - ρ_{pw}^{2}} P_{t}^{*} + \frac{- σ_{ξ} ρ_{pw} ρ_{pξ}}{1 - ρ_{pw}^{2}} W_{t}^{*} + σ_{ξ} \sqrt{1 - ρ_{pξ}^{2} - \frac{ρ_{pw}^{2} ρ_{pξ}^{2}}{1 - ρ_{pw}^{2}}} \times ω_{3, t} \\ &= μ + P_{t} α + W_{t} β + \frac{σ_{ξ} ρ_{pξ}}{1 - ρ_{pw}^{2}} (P_{t}^{*} - ρ_{pw} W_{t}^{*}) + σ_{ξ} \sqrt{1 - ρ_{pξ}^{2} - \frac{ρ_{pw}^{2} ρ_{pξ}^{2}}{1 - ρ_{pw}^{2}}} \times ω_{3, t} \\ &= μ + P_{t} α + W_{t} β + \frac{σ_{ξ} ρ_{pξ}}{1 - ρ_{pw}^{2}} ϵ_{t} + σ_{ξ} \sqrt{1 - ρ_{pξ}^{2} - \frac{ρ_{pw}^{2} ρ_{pξ}^{2}}{1 - ρ_{pw}^{2}}} \times ω_{3, t} . \end{aligned}$
(12)
Equation 12 suggests adding the estimate of the error term ε_t from the first-stage regression as a generated regressor instead of adding $P_{t}^{*}$ and $W_{t}^{*}$. As Theorem 2 subsequently illustrates, the linear model in Equation 12 satisfies both the full column rank condition of the regressor matrix and zero correlation between the new error term ω_3,t and each regressor in Equation 12, ensuring the consistency of OLS estimates (Wooldridge 2010, chap. 4). This two-step procedure, termed 2sCOPE, adds the first-stage residual ${\hat{ϵ}}_{t}$ to control for endogeneity and, in this respect, is similar to the control function approach adopted by Petrin and Train (2010). However, unlike Petrin and Train, 2sCOPE requires no use of IVs. When P and W are uncorrelated (i.e., ρ_pw = 0), the generated regressor ε_t becomes $P_{t}^{*}$ in Equation 12, and, consequently, 2sCOPE is reduced to Copula_Origin. The 2sCOPE method is more general than Copula_Origin by permitting nonzero values of ρ_pw.
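As a numerical sanity check on Equations 10 and 12 (an illustrative sketch, not part of the original derivation), the following Python snippet rebuilds the lower-triangular factor in Equation 10, confirms that it reproduces the correlation matrix in Equation 7, and evaluates the coefficient on ε_t and the scale of ω_3,t in Equation 12. The function names are hypothetical, and the values ρ_pw = ρ_pξ = .5 with σ_ξ = 1 are assumptions borrowed from the Case 1 simulation design.

```python
import math


def cholesky_factor(rho_pw, rho_pxi):
    """Lower-triangular factor of Equation 10 for the vector (P*, W*, xi*)."""
    s = math.sqrt(1.0 - rho_pw ** 2)
    l32 = -rho_pw * rho_pxi / s
    l33 = math.sqrt(1.0 - rho_pxi ** 2 - (rho_pw ** 2 * rho_pxi ** 2) / (1.0 - rho_pw ** 2))
    return [
        [1.0, 0.0, 0.0],
        [rho_pw, s, 0.0],
        [rho_pxi, l32, l33],
    ]


def implied_correlation(factor):
    """Return factor * factor', which should equal the matrix in Equation 7."""
    n = len(factor)
    return [
        [sum(factor[i][k] * factor[j][k] for k in range(n)) for j in range(n)]
        for i in range(n)
    ]


def control_function_terms(rho_pw, rho_pxi, sigma_xi=1.0):
    """Coefficient on eps_t and scale of omega_3 in the last line of Equation 12."""
    eta = sigma_xi * rho_pxi / (1.0 - rho_pw ** 2)
    scale = sigma_xi * math.sqrt(
        1.0 - rho_pxi ** 2 - (rho_pw ** 2 * rho_pxi ** 2) / (1.0 - rho_pw ** 2)
    )
    return eta, scale


# Illustrative values (assumed): rho_pw = rho_pxi = 0.5, sigma_xi = 1.
factor = cholesky_factor(0.5, 0.5)
corr = implied_correlation(factor)        # rows: (1, .5, .5), (.5, 1, 0), (.5, 0, 1)
eta, scale = control_function_terms(0.5, 0.5)
print(corr)
print(round(eta, 4), round(scale, 4))     # 0.6667 0.8165
```

Under these assumed values, the zero in the (W*, ξ*) position of the implied correlation matrix reflects the exogeneity of W, and the positive coefficient on ε_t is the part of the structural error that the control function absorbs.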
Theorem 2.
Consistency of the 2sCOPE Estimator. Assuming (1) (1, P, W) is full rank and W is exogenous, (2) the error is normal, (3) either the endogenous regressor P_t or one correlated regressor in W_t is nonnormal, and (4) (ξ_t, P_t, W_t) follow a Gaussian copula, the 2sCOPE estimator is consistent.
Proof:
See the Web Appendix.

According to Theorem 2, the proposed method 2sCOPE can yield consistent estimates when assumptions are met. Specifically, Assumptions 5 and 5(b) are relaxed because 2sCOPE can handle the case in which the model includes exogenous regressors correlated with the CTT. Theorem 3 further demonstrates that 2sCOPE relaxes Assumption 4 (the nonnormality assumption on endogenous regressors), a critical model identification condition required in all other copula correction methods.
Theorem 3.
Nonnormality Assumption Relaxed. Assuming (1) (1, P, W) is full rank and W is exogenous, (2) the error term is normal, (3) one of the correlated exogenous regressors W_t is nonnormal, and (4) (ξ_t, P_t, W_t) follow a Gaussian copula, the 2sCOPE estimator is consistent when P_t follows a normal distribution.
Proof:
See the Web Appendix.

Theorem 3 shows that as long as one exogenous regressor correlated with the endogenous regressor P_t is nonnormally distributed, 2sCOPE can correct for endogeneity for a normal regressor P_t while COPE cannot. Intuitively, when P_t (or W_t) is normal, $P_{t}^{*}$ (or $W_{t}^{*}$) becomes a linear function of P_t (or W_t) under the Gaussian copula assumption, causing COPE to fail the full-rank assumption and become unidentified. As such, COPE cannot deal with normal endogenous/exogenous regressors. For 2sCOPE in Equation 12, adding the first-stage residual ${\hat{ϵ}}_{t}$ as the generated regressor improves model identification. Provided not all W_t are normal, ${\hat{ϵ}}_{t}$ would not be a linear function of P_t and W_t and thus the second-stage model (Equation 12) in 2sCOPE would satisfy the full-rank requirement for model identification. Thus, 2sCOPE can relax the nonnormality assumption on the endogenous regressor required by the method of Park and Gupta (2012) provided one of the W_t is nonnormally distributed.
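The intuition that the copula transformation of a normal regressor is (nearly) a linear function of the regressor itself can be illustrated with a small Python sketch. It is an illustration under stated assumptions rather than the authors' code: the empirical CDF is computed as rank/(n + 1) so that it stays strictly inside (0, 1), the helper names are hypothetical, and the sample size and seed are arbitrary.

```python
import math
import random
from statistics import NormalDist, mean

STD_NORMAL = NormalDist()


def copula_transform(x):
    """Phi^{-1} of the empirical CDF of x (ranks scaled by n + 1, an assumption)."""
    n = len(x)
    order = sorted(range(n), key=lambda i: x[i])
    scores = [0.0] * n
    for rank, i in enumerate(order, start=1):
        scores[i] = STD_NORMAL.inv_cdf(rank / (n + 1))
    return scores


def pearson(a, b):
    """Sample Pearson correlation of two equal-length lists."""
    ma, mb = mean(a), mean(b)
    sab = sum((u - ma) * (v - mb) for u, v in zip(a, b))
    saa = sum((u - ma) ** 2 for u in a)
    sbb = sum((v - mb) ** 2 for v in b)
    return sab / math.sqrt(saa * sbb)


rng = random.Random(0)
normal_p = [rng.gauss(0.0, 1.0) for _ in range(2000)]   # normal regressor
expo_p = [rng.expovariate(1.0) for _ in range(2000)]    # clearly nonnormal regressor

# Correlation between each regressor and its own copula transformation.
corr_normal = pearson(normal_p, copula_transform(normal_p))
corr_expo = pearson(expo_p, copula_transform(expo_p))
print(corr_normal, corr_expo)
```

For the normal draw, the correlation between P and its transformation should be very close to 1, so adding the transformation next to P leaves almost no independent variation; for the exponential draw, the correlation is visibly lower, which is the nonlinearity that copula correction relies on.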

Table 3 summarizes the assumptions used in the proof of the properties of 2sCOPE. Among these assumptions, Assumption 4 and the full rank of the regressor matrix in Assumption 1 are required and verifiable. We highlight the remaining assumptions that may be difficult to verify given that they involve the unobserved error term. W's exogeneity is required when P and W are correlated, and this assumption should be evaluated based on institutional knowledge or economic theory. Although Assumptions 2 and 3 are used in the proof, they are not strictly needed. Our simulation study demonstrates that 2sCOPE is robust to a range of nonnormal error distributions and reasonable departures from the Gaussian copula dependence model (see the Web Appendix). Furthermore, it is often reasonable to assume that the error term ξ may be expressed as ξ = U + V, where the normally distributed U stands for the joint effect of confounders (a linear combination of confounders) and V represents an independent disturbance term. When U and regressors jointly follow the Gaussian copula model, 2sCOPE corrects endogeneity bias. The assumption that U and regressors follow a Gaussian copula dependence model appears plausible given that the Gaussian copula is a widely used dependence model and is deemed sufficiently flexible to adequately capture multivariate dependence in many practical applications (Danaher and Smith 2011; Eckert and Hohberger 2023). Meanwhile, we emphasize that the Gaussian copula assumption warrants attention from those who employ copula methods.

Table 3.
Assumptions in 2sCOPE.

Assumption 1. Full rank^a of all regressors and exogeneity of W.^b

Assumption 2. The structural error follows a normal distribution.^c

Assumption 3. P_t, W_t, and the structural error follow a Gaussian copula.^c

Assumption 4. Either P_t or one related regressor in W_t is nonnormally distributed.

^a Full rank means rank(X′X) = k, in which X = (1, P, W) with column dimension of k.

^b Exogeneity of W means Cov(W, ξ) = 0. When P and W are uncorrelated, the exogeneity assumption of W can be relaxed if the interest is only on α, the coefficient of P (see the Web Appendix), but P must be nonnormally distributed to achieve identification as, in this special case, 2sCOPE reverts to Copula_Origin (see footnote b under Table 2).

^c These two assumptions can be relaxed. 2sCOPE can be derived as long as the endogenous part of the structural error is normally distributed and jointly follows a Gaussian copula dependence structure with P and W.

In sum, we have demonstrated the consistency of 2sCOPE (Theorem 2). Theorem 3 and Proposition 1 (see the Web Appendix) further establish that 2sCOPE outperforms COPE, the direct extension of Copula_Origin, in estimation efficiency, and that it replaces the nonnormality assumption on endogenous regressors in Copula_Origin with a looser condition.

Multiple Endogenous Regressors

Herein, we extend 2sCOPE to the general case of multiple endogenous regressors. Consider the following structural linear regression model with two endogenous regressors (P_1,t and P_2,t) that are potentially correlated with the exogenous regressor W_t:
$Y_{t} = μ + P_{1, t} \times α_{1} + P_{2, t} \times α_{2} + W_{t} β + ξ_{t} .$
(13)
Under the multivariate Gaussian distribution assumption on $(ξ_{t}, P_{1, t}^{*}, P_{2, t}^{*}, W_{t}^{*})$, the system of equations for the 2sCOPE method in Equations 8 and 9 is readily extended to
$Y_{t} = μ + P_{1, t} α_{1} + P_{2, t} α_{2} + W_{t} β + ξ_{t},$
(14)
$P_{1, t}^{*} = ρ_{wp1} W_{t}^{*} + ϵ_{1, t},$
(15)
$P_{2, t}^{*} = ρ_{wp2} W_{t}^{*} + ϵ_{2, t},$
(16)
where (ξ_t, ε_1,t, ε_2,t) are linear transformations of $(ξ_{t}, P_{1, t}^{*}, P_{2, t}^{*}, W_{t}^{*})$ and thus also follow a multivariate Gaussian distribution. Consequently, we can decompose the structural error ξ_t into additive terms in ε_1,t and ε_2,t plus the remaining independent error term ω_4,t as follows:
$Y_{t} = μ + P_{1, t} α_{1} + P_{2, t} α_{2} + W_{t} β + η_{1} ϵ_{1, t} + η_{2} ϵ_{2, t} + σ_{ξ} \times m \times ω_{4, t},$
(17)
where $ϵ_{1, t} = P_{1, t}^{*} - ρ_{wp1} W_{t}^{*}$ and $ϵ_{2, t} = P_{2, t}^{*} - ρ_{wp2} W_{t}^{*}$; m is a constant that depends solely on the correlation coefficients in the Gaussian copula (see the Web Appendix). The new (scaled) error term ω_4,t is independent of latent copula data $(P_{1, t}^{*}, P_{2, t}^{*}, W_{t}^{*})$ and all functions of these latent data including P_1,t, P_2,t, W_t, ε_1,t, and ε_2,t. Because ω_4,t is independent of all regressors in Equation 17, the OLS estimation of Equation 17 yields consistent estimates of structural model parameters provided the regressor matrix (1, P₁, P₂, W, ε₁, ε₂) is of full column rank.

The Web Appendix presents the proof for the estimation consistency, relaxation of the regressor-nonnormality assumption, and estimation efficiency gain for 2sCOPE with multiple endogenous regressors under the related Theorems 2 and 3 and Proposition 1. Table 4 summarizes the estimation procedure of 2sCOPE.

Table 4.
Estimation Procedure for 2sCOPE.

Stage 1

Obtain empirical CDFs for each regressor in P_t and W_t, $\hat{H} (P_{t})$ and $\hat{L} (W_{t})$.

Compute $P_{t}^{*} = Φ^{- 1} (\hat{H} (P_{t}))$ and $W_{t}^{*} = Φ^{- 1} (\hat{L} (W_{t}))$.

Regress each endogenous regressor in $P_{t}^{*}$ separately on $W_{t}^{*}$ and obtain the residual ${\hat{ϵ}}_{t}$.

Stage 2

Add ${\hat{ϵ}}_{t}$ to the outcome structural regression model as generated regressors.

Notes: Standard errors of parameter estimates are estimated using bootstrapping (see the Web Appendix).
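The two stages in Table 4 can be sketched in a few lines of dependency-free Python for the case of one endogenous regressor P and one exogenous regressor W. This is a minimal illustration under stated assumptions, not the authors' implementation: the empirical CDF is taken as rank/(n + 1) to keep it strictly inside (0, 1), the first stage includes an intercept, the function names (copula_transform, ols, two_stage_cope) are hypothetical, and bootstrap standard errors are omitted.

```python
from statistics import NormalDist

STD_NORMAL = NormalDist()


def copula_transform(x):
    """Stage 1a-1b: Phi^{-1} of the empirical CDF (ranks scaled by n + 1, an assumption)."""
    n = len(x)
    order = sorted(range(n), key=lambda i: x[i])
    scores = [0.0] * n
    for rank, i in enumerate(order, start=1):
        scores[i] = STD_NORMAL.inv_cdf(rank / (n + 1))
    return scores


def ols(columns, y):
    """OLS coefficients from the normal equations, solved by Gauss-Jordan elimination."""
    k, n = len(columns), len(y)
    # Augmented matrix [X'X | X'y].
    a = [
        [sum(columns[i][t] * columns[j][t] for t in range(n)) for j in range(k)]
        + [sum(columns[i][t] * y[t] for t in range(n))]
        for i in range(k)
    ]
    for c in range(k):
        pivot = max(range(c, k), key=lambda r: abs(a[r][c]))
        a[c], a[pivot] = a[pivot], a[c]
        for r in range(k):
            if r != c:
                f = a[r][c] / a[c][c]
                a[r] = [u - f * v for u, v in zip(a[r], a[c])]
    return [a[i][k] / a[i][i] for i in range(k)]


def two_stage_cope(y, p, w):
    """2sCOPE sketch for one endogenous regressor p and one exogenous regressor w."""
    n = len(y)
    ones = [1.0] * n
    p_star = copula_transform(p)
    w_star = copula_transform(w)
    # Stage 1c: regress P* on W* and keep the residual.
    g0, g1 = ols([ones, w_star], p_star)
    eps_hat = [ps - g0 - g1 * ws for ps, ws in zip(p_star, w_star)]
    # Stage 2: add the residual to the structural regression as a generated regressor.
    mu, alpha, beta, eta = ols([ones, p, w, eps_hat], y)
    return {"mu": mu, "alpha": alpha, "beta": beta, "eta": eta}
```

Calling two_stage_cope(y, p, w) on equal-length lists returns the intercept, the coefficients on P and W, and the coefficient on the generated regressor. With several endogenous regressors, the first stage would be repeated for each one and all residuals added in the second stage, as the table describes.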

2sCOPE for Random Coefficient Linear Panel Models

We consider the following random coefficient model for linear panel data:
$Y_{it} | μ_{i}, α_{i}, β_{i} = \bar{μ} + μ_{i} + {P^{'}}_{it} α_{i} + {W^{'}}_{it} β_{i} + ξ_{it},$
(18)
where i = 1, …, N indexes cross-sectional units and t = 1, …, T indexes occasions. P_it (W_it) denotes a vector of endogenous (exogenous) regressors. P_it and W_it may be correlated. The error term ξ_it is i.i.d. $N (0, σ_{ξ}^{2})$ , which is correlated with P_it owing to the endogeneity of P_it but is uncorrelated with the exogenous regressors in W_it. The individual-specific intercept μ_i and individual-specific slope coefficients (α_i, β_i) permit heterogeneity in both intercepts and regressor effects across cross-sectional units. Extant marketing studies have attested the ubiquitous presence of heterogeneous consumer responses to marketing-mix variables (e.g., price sensitivity) and substantial bias associated with ignoring such slope heterogeneity. Thus, it is important to allow individual-specific slope coefficients in marketing studies.

The linear panel data model as specified in Equation 18 is general and includes the linear panel model with only the individual-specific intercepts considered by Haschka (2022) as a special case. Specifically, Haschka fixes (α_i, β_i) to be the same value (α, β) across all units, assuming that all cross-sectional units have the same slope coefficients. By contrast, the model in Equation 18 relaxes this strong assumption and can generate unit-specific slope parameters that may be used for targeting purposes.

A random coefficient model typically assumes that (μ_i, α_i, β_i) follows a multivariate normal distribution. When all regressors are exogenous, estimation algorithms for such random coefficient models are well established and computationally feasible, even for a high-dimensional vector of random effects (μ_i, α_i, β_i). With the normal conditional distribution for Y_it | (μ_i, α_i, β_i) in Equation 18 and the multivariate normal prior distribution for random effects (μ_i, α_i, β_i), Y_it marginally follows a normal distribution with a closed-form expression that contains no integrals with respect to random effects (μ_i, α_i, β_i), leading to an easy-to-evaluate likelihood function (Wooldridge 2010). Alternatively, one may assume a mixed-effect model where μ_i is a fixed-effect parameter and can be correlated with the regressors P_it and W_it. In this case, the first-difference or fixed-effects transformation is typically used to eliminate the incidental intercept parameters as follows:
${\tilde{y}}_{it} | α_{i}, β_{i} = {\tilde{P}}^{'}_{it} α_{i} + {\tilde{W}}^{'}_{it} β_{i} + {\tilde{ξ}}_{it},$
(19)
where ${\tilde{y}}_{it}$ , ${\tilde{P}}_{it}$ , ${\tilde{W}}_{it}$ , and ${\tilde{ξ}}_{it}$ denote new variables obtained from the first-difference or fixed-effect transformation. Haschka (2022) considers a special case of Equation 19 by fixing (α_i, β_i) as homogeneous (i.e., not varied by i).

It is straightforward to apply 2sCOPE to address regressor endogeneity in the general random coefficient model for linear panel data in Equation 18 and the transformed model without intercepts in Equation 19.⁷ The 2sCOPE procedure adds the residuals obtained from regressing $P_{it}^{*}$ on $W_{it}^{*}$ to the model as generated regressors. The 2sCOPE method may thus be implemented using standard software programs for random coefficient linear panel models assuming that all regressors are exogenous (see Case 4 in the “Simulation Study” section for an illustration using the lme function in R). By contrast, the MLE approach for copula correction in the random coefficients model accounting for correlated endogenous and exogenous regressors has yet to be developed and would require the construction of a complicated joint likelihood on the error term, P_t, and W_t, which involves newly appearing numerical integrals with respect to random effects and cannot be maximized by standard estimation algorithms for random coefficient models.⁸

2sCOPE for Slope Endogeneity and Random Coefficient Logit Model

In the Web Appendix, we derive 2sCOPE to tackle slope endogeneity and address endogeneity bias in random coefficient logit models with correlated and normally distributed regressors. Unlike the MLE approach, 2sCOPE can be implemented using standard estimation methods through the addition of generated regressors to control for endogeneity.

Simulation Study

Here, we conduct Monte Carlo studies to assess the proposed method's (1) performance for correlated regressors, (2) performance under regressor normality and near normality, (3) performance under various types of structural models, and (4) robustness to violations of model assumptions. We measure the estimation bias using t_bias, calculated as the ratio of the absolute difference between the mean of the sampling distribution and the true parameter value to the standard error of the parameter estimate (Park and Gupta 2012). Thus, t_bias represents the size of bias relative to the sampling error.
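The t_bias measure can be written as a short helper. The sketch below is illustrative only: the function name is hypothetical, and it assumes that the standard error is the sample standard deviation (n − 1 denominator) of the estimates across replicates, which the text does not specify.

```python
from statistics import mean, stdev


def t_bias(estimates, true_value):
    """|mean of replicate estimates - true value| divided by their standard deviation.

    Assumption: the standard error is the sample standard deviation of the
    estimates across simulated datasets.
    """
    return abs(mean(estimates) - true_value) / stdev(estimates)


# Toy replicate estimates (made up purely to exercise the function).
toy_estimates = [1.0, 1.2, 1.4]
print(t_bias(toy_estimates, true_value=1.0))
```

A value well below 1 indicates that the bias is small relative to sampling variability, whereas values far above 1 signal that bias dominates.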

Case 1: Nonnormal Regressors

In the first case, P and W are correlated. The data-generating process (DGP) is summarized as follows:
$\begin{pmatrix} P_{t}^{*} \\ W_{t}^{*} \\ ξ_{t}^{*} \end{pmatrix} \sim N \left(\begin{bmatrix} 0 \\ 0 \\ 0 \end{bmatrix}, \begin{bmatrix} 1 & ρ_{pw} & ρ_{pξ} \\ ρ_{pw} & 1 & 0 \\ ρ_{pξ} & 0 & 1 \end{bmatrix}\right) = N \left(\begin{bmatrix} 0 \\ 0 \\ 0 \end{bmatrix}, \begin{bmatrix} 1 & .5 & .5 \\ .5 & 1 & 0 \\ .5 & 0 & 1 \end{bmatrix}\right),$
(20)
$ξ_{t} = G^{- 1} (U_{ξ, t}) = G^{- 1} (Φ (ξ_{t}^{*})) = Φ^{- 1} (Φ (ξ_{t}^{*})) = 1 \times ξ_{t}^{*},$
(21)
$P_{t} = H^{- 1} (U_{P, t}) = H^{- 1} (Φ (P_{t}^{*})), \quad W_{t} = L^{- 1} (U_{W, t}) = L^{- 1} (Φ (W_{t}^{*})),$
(22)
$Y_{t} = μ + α \times P_{t} + β \times W_{t} + ξ_{t} = 1 + 1 \times P_{t} + (- 1) \times W_{t} + ξ_{t},$
(23)
where $ξ_{t}^{*}$ and $P_{t}^{*}$ are correlated (ρ_pξ = .5), generating the endogeneity problem; $W_{t}^{*}$ is exogenous and uncorrelated with $ξ_{t}^{*}$; and $W_{t}^{*}$ and $P_{t}^{*}$ are correlated (ρ_pw = .5), and therefore W_t and P_t are correlated. We consider four different estimation methods: (1) OLS regression of Equation 23; (2) Copula_Origin, which is the OLS regression of Equation 6; (3) the extended method COPE, which is the OLS regression of the equation $Y_{t} = μ + P_{t} α + W_{t} β + P_{t}^{*} η_{1} + W_{t}^{*} η_{2} + e_{t}$; and (4) the proposed 2sCOPE, which is the OLS regression of Equation 12. The generated regressors $P_{t}^{*}$, $W_{t}^{*}$, and ε_t are estimated according to Table 4. We set the sample size T = 1,000 and generate 1,000 datasets as replicates using the preceding DGP. In the simulation, we use the gamma distribution Gamma(1, 1) with shape and rate equal to 1 for P_t and the exponential distribution Exp(1) with rate 1 for W_t. Models are estimated on all generated datasets, providing the empirical distributions of parameter estimates.
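One replicate of the DGP in Equations 20 to 23 can be generated with the following Python sketch. It is an illustration rather than the authors' simulation code: the function name simulate_case1 and the seed handling are hypothetical, the latent normals are built from the triangular factor in Equation 10, and the code uses the fact that Gamma(1, 1) with shape and rate 1 coincides with Exp(1), whose quantile function is −log(1 − u).

```python
import math
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()


def simulate_case1(T=1000, rho_pw=0.5, rho_pxi=0.5, mu=1.0, alpha=1.0, beta=-1.0, seed=0):
    """Return lists (y, p, w, xi) for one dataset from the Case 1 DGP."""
    rng = random.Random(seed)
    s = math.sqrt(1.0 - rho_pw ** 2)
    l32 = -rho_pw * rho_pxi / s
    l33 = math.sqrt(1.0 - rho_pxi ** 2 - l32 ** 2)
    y, p, w, xi = [], [], [], []
    for _ in range(T):
        o1 = rng.gauss(0.0, 1.0)
        o2 = rng.gauss(0.0, 1.0)
        o3 = rng.gauss(0.0, 1.0)
        # Equation 20 via the factorization in Equation 10.
        p_star = o1
        w_star = rho_pw * o1 + s * o2
        xi_t = rho_pxi * o1 + l32 * o2 + l33 * o3   # Equation 21: xi_t = xi_t^*
        # Equation 22 with Exp(1) marginals: H^{-1}(u) = -log(1 - u),
        # and 1 - Phi(z) = Phi(-z) avoids taking the log of 1 - u near u = 1.
        p_t = -math.log(STD_NORMAL.cdf(-p_star))
        w_t = -math.log(STD_NORMAL.cdf(-w_star))
        # Equation 23.
        y.append(mu + alpha * p_t + beta * w_t + xi_t)
        p.append(p_t)
        w.append(w_t)
        xi.append(xi_t)
    return y, p, w, xi


y, p, w, xi = simulate_case1(T=1000, seed=1)
print(len(y), sum(p) / len(p), sum(w) / len(w))
```

In such a draw, P is positively correlated with the structural error while W is not, which is the pattern that produces the OLS bias reported for this case.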

Table 5 reports estimation results. As expected, OLS estimates of both α and β are biased (t_bias = 15.75 and 8.24) due to the regressor endogeneity. Copula_Origin reduces the bias, but still shows significant bias for the coefficient estimates of P_t and W_t. The bias of Copula_Origin depends on the strength of the correlation between W and P. Stronger correlations between P* and W* may cause the Copula_Origin estimates to show a larger bias. For example, when the correlation between W* and P* increases from .5 to .7, the mean α estimate changes from 1.055 to 1.260 (see the Copula_Origin column in Table 5), and, consequently, the bias of the estimated α increases nearly fivefold (from .055 to .260). The bias confirms our derivation in the model section, demonstrating that using the existing copula method may not wholly resolve the endogeneity problem with correlated regressors.

Table 5.
Results of Simulation Study Case 1: Nonnormal Regressors.

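| ρ_pw | Parameter | True | OLS Mean | OLS SE | OLS t_bias | Copula_Origin Mean | Copula_Origin SE | Copula_Origin t_bias | COPE Mean | COPE SE | COPE t_bias | 2sCOPE Mean | 2sCOPE SE | 2sCOPE t_bias |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| .5 | μ | 1 | .689 | .045 | 6.964 | 1.231 | .081 | 2.849 | 1.012 | .093 | .129 | 1.009 | .059 | .157 |
| .5 | α | 1 | 1.571 | .036 | 15.75 | 1.055 | .069 | .791 | .985 | .072 | .213 | .986 | .070 | .197 |
| .5 | β | −1 | −1.259 | .031 | 8.236 | −1.289 | .031 | 9.169 | −.997 | .067 | .038 | −.995 | .042 | .123 |
| .5 | ρ_pξ | .5 | n/a | n/a | n/a | .570 | .047 | 1.504 | .505 | .055 | .090 | .504 | .038 | .097 |
| .5 | σ_ξ | 1 | .862 | .020 | 6.902 | 1.011 | .043 | .244 | 1.008 | .041 | .206 | 1.006 | .040 | .143 |
| .7 | μ | 1 | .730 | .041 | 6.629 | 1.307 | .076 | 4.037 | 1.011 | .085 | .124 | 1.005 | .053 | .088 |
| .7 | α | 1 | 1.800 | .041 | 19.67 | 1.260 | .068 | 3.838 | .988 | .078 | .148 | .991 | .075 | .118 |
| .7 | β | −1 | −1.529 | .037 | 14.21 | −1.567 | .037 | 15.36 | −.997 | .071 | .041 | −.994 | .056 | .110 |
| .7 | ρ_pξ | .5 | n/a | n/a | n/a | .633 | .043 | 3.130 | .503 | .057 | .048 | .500 | .026 | .000 |
| .7 | σ_ξ | 1 | .799 | .018 | 11.18 | .980 | .044 | .468 | 1.007 | .041 | .160 | 1.003 | .040 | .084 |

D-error (ρ_pw = .5): OLS n/a; Copula_Origin n/a; COPE .002613; 2sCOPE .001614.

D-error (ρ_pw = .7): OLS n/a; Copula_Origin n/a; COPE .002902; 2sCOPE .001760.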

Notes: Mean and SE denote the average and standard deviation of parameter estimates across all 1,000 simulated samples.

The proposed 2sCOPE method provides consistent estimates without using instruments. The average estimate of ρ_pξ is close to the true value of .5 and differs significantly from 0, implying that 2sCOPE correctly detects the regressor endogeneity. Moreover, 2sCOPE shows greater estimation efficiency. The standard error of α (β) in 2sCOPE is .070 (.042), which is 2.78% (37.31%) smaller than the corresponding standard error of .072 (.067) using COPE. We further calculate COPE's and 2sCOPE's estimation precision using the D-error measure $|Σ|^{1/K}$ (Arora and Huber 2001; Qian and Xie 2022), where Σ is the covariance matrix of the regression coefficient estimates and K is the number of explanatory variables in the structural model. A smaller D-error indicates greater estimation efficiency and improved estimation precision. When ρ_pw = .5, the D-error measure is .002613 for COPE and .001614 for 2sCOPE (Table 5). Thus, 2sCOPE increases estimation precision by 38.2% (1 − .001614/.002613), meaning that for 2sCOPE to achieve the same precision as COPE, the sample size can be reduced by 38.2%. A 39.3% efficiency gain for 2sCOPE is observed for ρ_pw = .7 (Table 5).
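The D-error comparison can be reproduced from the reported values with a short Python sketch. It is illustrative only: the function names are hypothetical, the determinant routine is a generic Gaussian elimination, and defaulting K to the dimension of Σ is an assumption, since K is described only as the number of explanatory variables in the structural model.

```python
def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in matrix]
    n = len(a)
    det = 1.0
    for c in range(n):
        pivot = max(range(c, n), key=lambda r: abs(a[r][c]))
        if a[pivot][c] == 0.0:
            return 0.0
        if pivot != c:
            a[c], a[pivot] = a[pivot], a[c]
            det = -det
        det *= a[c][c]
        for r in range(c + 1, n):
            f = a[r][c] / a[c][c]
            for j in range(c, n):
                a[r][j] -= f * a[c][j]
    return det


def d_error(cov, k=None):
    """D-error |Sigma|^(1/K); K defaults to the dimension of cov (an assumption)."""
    k = len(cov) if k is None else k
    return determinant(cov) ** (1.0 / k)


def precision_gain(d_reference, d_new):
    """Relative reduction in D-error of the new method versus the reference."""
    return 1.0 - d_new / d_reference


# D-errors reported in Table 5 for rho_pw = .5 (COPE vs. 2sCOPE).
gain = precision_gain(0.002613, 0.001614)
print(round(gain, 3))   # 0.382
```

The printed value matches the 38.2% precision gain stated for ρ_pw = .5; d_error itself would be applied to an estimated coefficient covariance matrix, which is not reported in the text.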

We perform a further simulation study for a small sample size using the same DGP as that described previously but with sample size T = 200. The results presented in Web Appendix Table W1 demonstrate that OLS estimates have endogeneity bias and Copula_Origin reduces the endogeneity bias, but significant bias remains nonetheless. The proposed 2sCOPE performs well, yielding unbiased estimates for the small sample size T = 200. The efficiency gain of 2sCOPE relative to COPE appears to be greater for a smaller sample size. For example, when the correlation between P* and W* is .5, the D-error measures are .0166 and .0091 for COPE and 2sCOPE (Web Appendix Table W1), respectively, meaning that 2sCOPE increases estimation precision by 1 − .0091/.0166 = 46% compared with COPE. Thus, sample size can be reduced by almost half (∼50%) for 2sCOPE to achieve the same estimation precision as that achieved by COPE.

Case 2: Normal Regressors

Next, we examine a case in which the endogenous regressor and/or the correlated exogenous regressor are normally distributed. This case is of particular interest given that normality is not allowed for endogenous regressors in Park and Gupta (2012). We use the same DGP as that described in Equations 20 to 23, with the exception that the marginal CDFs for regressors, H(·) and L(·), are selected based on the distributions listed in the first two columns of Table 6, which summarizes the estimation results. As expected, the OLS estimates are biased. Copula_Origin produces biased estimates whenever the endogenous regressor P follows a normal distribution. Its estimates are also biased when P follows a gamma distribution (first row of Table 6) for a different reason: P and W are correlated. Similar to Copula_Origin, the COPE estimators in all three scenarios are biased when either P_t or W_t is normal. When W_t is normal, β is .323 away from the true value −1; when P_t is normally distributed, α is .684 away from the true value; when both P_t and W_t are normal, α is .663 away from the true value 1 and β is .324 away from the true value −1. This is expected because COPE adds $P_{t}^{*}$ and $W_{t}^{*}$, the copula transformations of the regressors, as additional regressors, which causes perfect collinearity and the model nonidentification problem whenever at least one of these regressors is normally distributed.

Table 6.
Results of Simulation Study Case 2: Normal Regressors.

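| Distribution of P | Distribution of W | Parameter | True | OLS Mean | OLS SE | OLS t_bias | Copula_Origin Mean | Copula_Origin SE | Copula_Origin t_bias | COPE Mean | COPE SE | COPE t_bias | 2sCOPE Mean | 2sCOPE SE | 2sCOPE t_bias |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Gamma | Normal | μ | 1 | .431 | .045 | 12.63 | 1.018 | .078 | .227 | 1.017 | .080 | .217 | 1.015 | .077 | .190 |
| Gamma | Normal | α | 1 | 1.569 | .037 | 15.40 | .979 | .070 | .302 | .979 | .070 | .296 | .985 | .070 | .212 |
| Gamma | Normal | β | −1 | −1.259 | .030 | 8.619 | −1.333 | .028 | 11.78 | −1.323 | .433 | .746 | −.997 | .045 | .067 |
| Gamma | Normal | ρ_pξ | .5 | n/a | n/a | n/a | .640 | .039 | 3.556 | .589 | .141 | .631 | .506 | .036 | .151 |
| Gamma | Normal | σ_ξ | 1 | .861 | .019 | 7.240 | 1.064 | .046 | 1.394 | 1.135 | .162 | .837 | 1.005 | .038 | .134 |
| Normal | Exp | μ | 1 | 1.286 | .042 | 6.777 | 1.286 | .045 | 6.374 | .994 | .073 | .081 | 1.023 | .070 | .334 |
| Normal | Exp | α | 1 | 1.628 | .031 | 20.36 | 1.532 | .462 | 1.152 | 1.684 | .437 | 1.568 | 1.048 | .126 | .381 |
| Normal | Exp | β | −1 | −1.286 | .032 | 8.956 | −1.287 | .032 | 8.960 | −.992 | .066 | .127 | −1.024 | .062 | .383 |
| Normal | Exp | ρ_pξ | .5 | n/a | n/a | n/a | .089 | .419 | .980 | −.167 | .384 | 1.738 | .465 | .074 | .473 |
| Normal | Exp | σ_ξ | 1 | .829 | .018 | 9.492 | .940 | .151 | .394 | .981 | .151 | .129 | .980 | .063 | .318 |
| Normal | Normal | μ | 1 | 1.001 | .026 | .046 | 1.002 | .030 | .052 | 1.001 | .033 | .024 | 1.002 | .028 | .057 |
| Normal | Normal | α | 1 | 1.668 | .030 | 22.38 | 1.663 | .450 | 1.474 | 1.663 | .460 | 1.441 | 1.655 | .395 | 1.657 |
| Normal | Normal | β | −1 | −1.335 | .029 | 11.44 | −1.335 | .029 | 11.42 | −1.324 | .438 | .740 | −1.328 | .197 | 1.668 |
| Normal | Normal | ρ_pξ | .5 | n/a | n/a | n/a | .006 | .412 | 1.198 | .001 | .412 | 2.426 | .010 | .303 | 1.616 |
| Normal | Normal | σ_ξ | 1 | .816 | .019 | 9.687 | .917 | .155 | .534 | 1.003 | .211 | .016 | .879 | .092 | 1.317 |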

By contrast, the proposed 2sCOPE method provides consistent estimates as long as P_t and W_t are not both normally distributed. Both α and β are tightly distributed near the true value whenever P_t or W_t is nonnormally distributed. Unlike Copula_Origin and COPE, 2sCOPE adds the residual term obtained from regressing $P_{t}^{*}$ on $W_{t}^{*}$ as the generated regressor. Therefore, provided P_t and W_t are not both normally distributed, the residual term is not perfectly collinear with the original regressors, permitting model identification. Only when both P_t and W_t are normally distributed (the last scenario in Table 6) does the residual term added into the structural regression model become a linear combination of P_t and W_t, causing perfect collinearity and model nonidentification. Overall, this simulation study demonstrates the proposed 2sCOPE's ability to relax the nonnormality assumption in Copula_Origin provided either P_t or W_t is nonnormally distributed.

Case 3: Insufficient Nonnormality of Endogenous Regressors

The preceding case illustrates the proposed 2sCOPE's capability to deal with normal endogenous regressors while Copula_Origin and COPE cannot. Here, we examine the more frequent scenario of close-to-normal regressors. Although models are identified asymptotically (i.e., infinite sample size), appreciable finite-sample bias may occur with the realistic sample sizes commonly seen in marketing studies, if the endogenous regressor is too close to a normal distribution (Becker, Proksch, and Ringle 2021; Eckert and Hohberger 2023; Haschka 2022). Becker, Proksch, and Ringle (2021) suggest a minimum absolute skewness of 2 for an endogenous regressor for Copula_Origin to have good performance in sample sizes less than 1,000. This requirement can significantly limit the use of copula correction methods in practical applications. Given that 2sCOPE can handle normal endogenous regressors, we expect that 2sCOPE will be better able to handle the finite-sample bias caused by insufficient regressor nonnormality than the existing copula correction methods. We use the DGP described in Equations 20 to 23 to generate data, with the exception that the marginal CDF for the endogenous regressor (H(·)) varies across some common distributions with varying closeness to normality. Specifically, we consider uniform, lognormal, t, mixture normal, gamma, beta, and normal distributions, and use the average absolute estimation bias of all the regression parameters (μ, α, β) in the structural model to measure the performance.

Figure 1 plots the estimation bias with different distributions of the endogenous regressor P. The results reveal that the Copula_Origin estimates are biased with correlated endogenous and exogenous regressors, consistent with our theoretical proof (Theorem 1). COPE performs well when P has sufficient nonnormality (t(2), lognormal, gamma) and exhibits no bias even for a sample size as small as 200. However, COPE cannot handle a normal endogenous regressor and yields a large estimation bias that remains unchanged as the sample size increases, consistent with our theoretical proof in Theorem 3 (see the Web Appendix) and the simulation result in Case 2. Furthermore, COPE suffers from finite-sample bias when the endogenous regressor P has distributions with insufficient nonnormality (e.g., beta(2,2), t(df = 30)). Moreover, COPE's estimation bias is larger when the sample size is smaller or the distribution of the endogenous regressor P is closer to normal. For instance, the t distribution with 30 degrees of freedom is closer to normal than the t distributions with 10, 5, and 2 degrees of freedom, yielding a larger estimation bias. For t(df = 30), which is very close to normal, increasing the sample size from T = 200 to 1,000 scarcely changes the size of the estimation bias. By contrast, our proposed 2sCOPE method yields consistent estimates for all normal and close-to-normal regressor distributions and has negligible finite-sample bias even for a sample size as small as 200 (bias < 5% of parameter values).

Case 4: Random Coefficient Linear Panel Model

We investigate the performance of 2sCOPE in the random coefficient linear panel model. We use the copula function and marginal distributions of [P_it, W_it, ξ_it] as specified in Case 1 (Equations 20–22). We assign ρ_pw = .7 as an example. We then generate the outcome Y_it using the following standard random coefficient linear panel model:
$Y_{it} = \bar{μ} + μ_{i} + P_{it} (\bar{α} + a_{i}) + W_{it} (\bar{β} + b_{i}) + ξ_{it} = 1 + μ_{i} + P_{it} (1 + a_{i}) + W_{it} (- 1 + b_{i}) + ξ_{it},$
where [μ_i, a_i, b_i] ∼ N(0, I₃), t = 1, …, 50 indexes occasions for repeated measurements, and i = 1, …, 500 indexes the individual units. The preceding random coefficients model permits individual units to have heterogeneous baseline preferences (μ_i) and heterogeneous responses to regressors (a_i, b_i). Marketing studies frequently use such random coefficient models to capture individual heterogeneity and to profile and target individuals. The correlation between ξ_it and P_it creates the regressor endogeneity problem, which can cause biased estimates for standard linear random coefficient estimation methods that ignore the regressor–error correlation. We generate individual-level panel data as described previously 1,000 times and use the data for estimation. Table 7 presents the estimation results. LME is the standard estimation method for linear mixed models assuming all regressors are exogenous, as implemented in the lme function in R. LME and Copula_Origin are biased due to endogeneity and correlated exogenous regressors, respectively. Our proposed method, 2sCOPE, provides unbiased estimates that are tightly distributed around all parameters’ true values.

Table 7.
Results of Simulation Study Case 4: Random Coefficient Linear Panel Model.

| Parameter | True | LME Mean | LME SE | LME t_bias | Copula_Origin Mean | Copula_Origin SE | Copula_Origin t_bias | 2sCOPE Mean | 2sCOPE SE | 2sCOPE t_bias |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| $\bar{μ}$ | 1 | .722 | .046 | 6.052 | 1.314 | .049 | 6.399 | 1.004 | .048 | .091 |
| $\bar{α}$ | 1 | 1.853 | .045 | 18.83 | 1.293 | .045 | 6.469 | 1.000 | .046 | .008 |
| $\bar{β}$ | −1 | −1.557 | .045 | 12.39 | −1.598 | .044 | 13.56 | −1.000 | .044 | .005 |
| σ_μ | 1 | .985 | .033 | .459 | .982 | .033 | .547 | .984 | .031 | .522 |
| σ_α | 1 | .988 | .036 | .326 | .987 | .034 | .397 | .989 | .035 | .316 |
| σ_β | 1 | .993 | .031 | .235 | .992 | .033 | .249 | .992 | .033 | .248 |
| ρ_pξ | .5 | — | — | — | .646 | .009 | 16.33 | .507 | .005 | 1.365 |
| σ_ξ | 1 | .794 | .004 | 57.71 | .957 | .010 | 4.439 | .985 | .009 | 1.640 |

Notes: σ_μ, σ_α, and σ_β are standard deviations of μ_i, a_i, and b_i.

Additional Simulation Results and Robustness Checks

The Web Appendix provides additional simulation results on a small sample size; model estimation with multiple endogenous regressors; estimation with multiple exogenous control covariates including binary and close-to-normal control covariates; 2sCOPE's robustness to misspecifications of the structural error distribution and to misspecifications of the copula dependence structure; a test of Assumption 5(b); experimental studies to obtain practical recommendations for using 2sCOPE; the performance of 2sCOPE with one “strongly nonnormal” exogenous regressor versus multiple “weakly nonnormal” exogenous regressors for handling an endogenous regressor with insufficient nonnormality; the random coefficient logit model using 2sCOPE; 2sCOPE's performance when W is endogenous; and 2sCOPE's ability to leverage the empirical correlation between P and W. Overall, these results verify that 2sCOPE is robust to small sample sizes and reasonable violations of normal error and Gaussian copula assumptions, show that it is flexible to leverage control covariates and handle nonlinear models for choice outcomes, and provide guidance for 2sCOPE's use to obtain good performance, as summarized in the next section. Interestingly, the results reported in the Web Appendix indicate that a “strongly nonnormal” W is considerably more effective than multiple “weakly nonnormal” Ws in helping the identification of the causal effect for an endogenous regressor with insufficient nonnormality.

Guidelines for Using 2sCOPE

To summarize, we have established theoretical results that guarantee 2sCOPE's desirable large sample properties where correlated exogenous regressors exist (Theorem 2) and endogenous regressors have normal distributions (Theorem 3). As expected, simulation studies demonstrate that 2sCOPE performs well when the sample size is sufficiently large. Meanwhile, simulation studies also reveal that, for 2sCOPE to perform well in finite samples, it may require sufficient regressor nonnormality and relevance between P and W (e.g., Figure 1, Panel A). To provide actionable guidelines for 2sCOPE's application to the data at hand, we conduct systematic simulation studies to establish the boundary conditions for using 2sCOPE. Specifically, the studies employ a factorial experimental design, which systematically varies the distributions of P and W, sample sizes, the level of endogeneity, and the strength of correlation between P and W. We evaluate 2sCOPE's performance using the relative bias of the structural model parameter estimates. The Web Appendix provides details of the experimental design and results.

Figure 2 presents the decision tree for 2sCOPE's use based on the findings of the simulation studies. The decision tree comprises three steps. In Step 1, we test Assumption 5 (or 5(b) for multiple endogenous regressors) to choose between 2sCOPE and Copula_Origin. When Assumption 5 (or 5(b)) is satisfied, Copula_Origin is preferred over 2sCOPE because, although both methods provide consistent estimates, Copula_Origin is more efficient (see the Web Appendix). Becker, Proksch, and Ringle (2021) provide a flowchart for the use of Copula_Origin. Violation of Assumption 5 (or 5(b)) suggests the presence of relevant exogenous regressors that 2sCOPE may leverage to better handle endogeneity. In Step 2, we test the nonnormality of the endogenous regressor P using the Kolmogorov–Smirnov (KS) test (see the Web Appendix for the rationale of using the test of normality). If the KS test rejects the null at the .05 level,⁹ P possesses sufficient nonnormality, and 2sCOPE has a high probability of success in correcting endogeneity bias based on the results detailed in the Web Appendix. Otherwise, P has a close-to-normal distribution, which requires related exogenous regressors with sufficient nonnormality and relevance to help identification. Thus, in Step 3, we check W's nonnormality and relevance to P. Results in the Web Appendix show that if the p-value of the KS test of an exogenous regressor W is smaller than .001 (i.e., sufficient nonnormality of W) and the relevance is sufficient (F-statistic for the effect of W* on P* > 10 in the first-stage regression), 2sCOPE will have a high probability of success.
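The three steps above can be condensed into a small helper. The following Python sketch is illustrative only: the function name, argument names, and returned labels are hypothetical, the test results are assumed to have been computed beforehand, and treating Step 3 as satisfied when at least one exogenous regressor passes both checks is an assumption of this sketch rather than a rule stated in the text.

```python
def choose_method(assumption5_holds, ks_p_endog, ks_p_exog, first_stage_f):
    """Suggest a method from the decision-tree inputs (illustrative sketch).

    assumption5_holds : bool, outcome of the test of Assumption 5 (or 5(b))
    ks_p_endog        : KS-test p-value for the endogenous regressor P
    ks_p_exog         : KS-test p-values, one per exogenous regressor W
    first_stage_f     : first-stage F-statistics for W* on P*, one per W
    """
    # Step 1: if Assumption 5 holds, both methods are consistent and
    # Copula_Origin is the more efficient choice.
    if assumption5_holds:
        return "Copula_Origin"
    # Step 2: a rejected KS test (.05 level) signals sufficient nonnormality of P.
    if ks_p_endog < 0.05:
        return "2sCOPE"
    # Step 3: P is close to normal, so look for an exogenous regressor that is
    # both strongly nonnormal (p < .001) and relevant (F > 10).
    if any(p < 0.001 and f > 10 for p, f in zip(ks_p_exog, first_stage_f)):
        return "2sCOPE"
    # Sufficient conditions not met: 2sCOPE may still work, but its
    # finite-sample bias should be checked with the bootstrap in Algorithm 1.
    return "2sCOPE with bootstrap check (Algorithm 1)"
```

A call such as `choose_method(False, 0.2, [0.3], [4.0])` falls through all three steps and returns the bootstrap-check label, which corresponds to the rightmost branch of the decision tree.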

We have provided sufficient conditions of endogenous and exogenous regressors in Steps 2 and 3 for 2sCOPE to have good finite-sample performance. These conditions are not necessary but are conservative. In particular, we consider extreme cases in which either the exogenous regressor in Step 2 or the endogenous regressor in Step 3 follows the normal distribution. In practice, regressors are more likely to have close-to-normal than exact normal distributions. The failure of W's sufficient condition tests does not necessarily preclude the use of 2sCOPE. For example, the estimation result of Scenario 1 in Web Appendix Table W12 (P and W are close-to-normal and weakly nonnormal, respectively) demonstrates that 2sCOPE may still show acceptable finite-sample performance when the preceding (conservative) sufficient conditions are not satisfied. In this scenario (the rightmost branch in Figure 2), one can employ our proposed bootstrap resampling Algorithm 1 to evaluate 2sCOPE's finite-sample performance on a case-by-case basis.
Algorithm 1.
A Bootstrap Algorithm for Evaluating Finite-Sample Bias of 2sCOPE

Input: data Y, P, W; sample size N; $\hat{θ}(Y, P, W)$, the 2sCOPE estimates of the structural model parameters; $(\hat{H}, \hat{L})$, the empirical CDFs of P and W; and $\hat{Σ}$, the Gaussian copula correlation structure estimate. If ${\hat{ρ}}_{Pξ}$ is small and not significantly different from zero, set ${\hat{ρ}}_{Pξ} = \pm .5$ in $\hat{Σ}$.

for b = 1 to B do

Simulate $P_{b}^{*}$, $W_{b}^{*}$, $ξ_{b}^{*}$ from the Gaussian copula $Ψ_{\hat{Σ}}(Φ^{-1}(U_{P}), Φ^{-1}(U_{W}), Φ^{-1}(U_{ξ}))$, sample size = N.

Obtain $P_{b} = {\hat{H}}^{-1}(Φ(P_{b}^{*}))$, $W_{b} = {\hat{L}}^{-1}(Φ(W_{b}^{*}))$, and $ξ_{b} = {\hat{σ}}_{ξ} \cdot ξ_{b}^{*}$, where ${\hat{σ}}_{ξ}$ is the 2sCOPE estimate of the standard deviation of the structural error term.

Obtain $Y_{b} = f(P_{b}, W_{b}, ξ_{b}, \hat{θ}(Y, P, W))$, where f is the linear regression in this setting.

Obtain the 2sCOPE estimate ${\hat{θ}}_{b} = \hat{θ}(Y_{b}, P_{b}, W_{b})$ using the bth bootstrap sample.

end for

Calculate the potential bias of the 2sCOPE estimator: $\frac{1}{B} \sum_{b = 1}^{B} {\hat{θ}}_{b} - \hat{θ}(Y, P, W)$.

Bootstrap simulations can be used to evaluate the bias size in parameter estimates when sample size is small to moderate (Efron and Tibshirani 1994, chap. 10; Hooker and Mentch 2018),¹⁰ even if the estimation performs well for large samples. Specifically, Algorithm 1 randomly draws the same number of observations from the underlying copula model and the structural model estimated using the original sample,¹¹ and then performs the 2sCOPE estimation on the bootstrap sample as in the original sample. We repeat this simulation B times and obtain a distribution for each model coefficient estimate. We then compare the mean of each coefficient estimate's distribution with the corresponding coefficient estimate using the original data, which is the true parameter value in our model-based bootstrap resampling. The small-sample bias of a coefficient estimate is the difference between the average coefficient estimate from bootstrap samples and the original sample's coefficient estimate.
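The resampling loop of Algorithm 1 can be sketched in Python for the case of one endogenous regressor P and one exogenous regressor W. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `inverse_ecdf` and `bootstrap_bias` are hypothetical, the estimator is passed in as a callable so that any implementation of 2sCOPE can be plugged in, and any adjustment of ρ̂_Pξ to ±.5 in the correlation matrix is left to the caller.

```python
import numpy as np
from scipy.stats import norm

def inverse_ecdf(sample, u):
    """Empirical quantile function: the order statistic x_(k) with k = ceil(n * u)."""
    x = np.sort(np.asarray(sample, dtype=float))
    k = np.clip(np.ceil(u * len(x)).astype(int) - 1, 0, len(x) - 1)
    return x[k]

def bootstrap_bias(p, w, theta_hat, sigma_xi_hat, corr_hat, estimator,
                   n_boot=200, seed=0):
    """Model-based bootstrap estimate of finite-sample bias (illustrative sketch).

    theta_hat    : (intercept, coefficient on P, coefficient on W) from the original sample
    sigma_xi_hat : estimated standard deviation of the structural error
    corr_hat     : 3x3 Gaussian copula correlation matrix of (P*, W*, xi*)
    estimator    : callable (y, p, w) -> array with the same layout as theta_hat
    """
    rng = np.random.default_rng(seed)
    theta_hat = np.asarray(theta_hat, dtype=float)
    n = len(p)
    estimates = []
    for _ in range(n_boot):
        # Draw latent (P*, W*, xi*) from the fitted Gaussian copula.
        z = rng.multivariate_normal(np.zeros(3), corr_hat, size=n)
        # Map the latent draws back to the observed scales via inverse empirical CDFs.
        p_b = inverse_ecdf(p, norm.cdf(z[:, 0]))
        w_b = inverse_ecdf(w, norm.cdf(z[:, 1]))
        xi_b = sigma_xi_hat * z[:, 2]
        # Rebuild the outcome from the linear structural model, treating theta_hat as truth.
        y_b = theta_hat[0] + theta_hat[1] * p_b + theta_hat[2] * w_b + xi_b
        estimates.append(estimator(y_b, p_b, w_b))
    # Bias = average bootstrap estimate minus the original-sample estimate.
    return np.mean(estimates, axis=0) - theta_hat
```

Passing the estimator as an argument also makes it easy to run the same bootstrap samples through OLS for comparison, as is done in the empirical application.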

Empirical Application

In this section, we illustrate the proposed method to address the price endogeneity issue using store-level toothpaste category sales data for Chicago over 373 weeks from 1989 to 1997.¹² To control for product size, we select the most common weight, which is 6.4 oz. Specifically, we estimate the following sales model:
$\log(\mathrm{Sales}_{t}) = β_{0} + \log(\mathrm{RetailPrice}_{t}) \times β_{1} + W_{t}^{'} β_{2} + ξ_{t}, \qquad (24)$
where t = 1, 2, …, T indexes the week. Store/category managers and policy makers are often interested in the effect of price on the category demand (e.g., Li, Linn, and Muehlegger 2014; Nijs et al. 2001). Empirical estimates of category price elasticity (i.e., β₁) are key to optimal pricing, enabling retailers/manufacturers to expand category demands (the first source of profitable growth), and critical to designing effective interventions (e.g., gasoline or soda tax) for policy makers to achieve policy goals. Meanwhile, retail price is typically considered endogenous in the category sales demand model (Haschka 2022; Li, Linn, and Muehlegger 2014; Nijs et al. 2001; Park and Gupta 2012). Retail price endogeneity may derive from unmeasured product characteristics or demand shocks that can influence both consumers’ and retailers’ decisions. Unobserved by researchers, these variables are absorbed into the structural error, creating the endogeneity problem. Prices of different stores are correlated and often used as an IV for each other. This enables us to test the proposed 2sCOPE method's performance in an empirical setting with a good IV. Besides the endogenous price, two promotion-related variables—bonus promotion and direct price reduction—would also affect demand. The promotion decisions during the study period were typically made on a quarterly basis or less frequently, plus a long lead time (e.g., several weeks) for implementation; thus, they were unlikely to be correlated with the weekly unobserved demand shock and can be considered exogenous (Chintagunta 2002; Sriram, Balachander, and Kalwani 2007).¹³ We focus on category sales in two large stores in Chicago (Stores 1 and 2). We convert retail price, in-store promotion, and sales from UPC level to the aggregate category level, computed as weekly market share–weighted averages of UPC-level variables.

The correlation between log retail price and bonus promotion in Store 1 (Store 2) is −.31 (−.17), and the correlation between log retail price and price reduction promotion in Store 1 (Store 2) is −.22 (−.31). The appreciable correlations between price and promotion variables provide a suitable context for testing our method with correlated endogenous and exogenous regressors. The moderate sample size (T = 373) also allows us to evaluate 2sCOPE's finite-sample performance in the presence of potentially insufficient regressor nonnormality in real data. Table 8 reports summary statistics of the key variables.

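Table 8.
Summary Statistics.

| Variable | Store 1 Mean | Store 1 SD | Store 1 Max | Store 1 Min | Store 2 Mean | Store 2 SD | Store 2 Max | Store 2 Min |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Sales (unit) | 115 | 53.0 | 720 | 35 | 100.8 | 53.5 | 654 | 33 |
| Price ($) | 2.06 | .20 | 2.48 | 1.46 | 2.09 | .21 | 2.47 | 1.51 |
| Bonus | .18 | .21 | .80 | .00 | .16 | .19 | .81 | .00 |
| PriceReduction | .10 | .19 | .69 | .00 | .11 | .21 | .76 | .00 |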

Figure 3 plots the log sales and log retail prices of toothpaste at Store 1 over time (Store 2 is very similar). To control for the possible time trend of retail price, we use detrended log retail prices (and detrended log values for IVs as well) for estimation. Figure 4 shows the histograms of detrended log retail prices and the two promotion variables. All three are continuous variables.

Figure 3.
Log Sales and Log Retail Price of Toothpaste in Store 1.

Figure 4.
Histograms of Log Retail Price, Bonus, and Price Reduction in Store 1.

The flowchart in Figure 2 guides our use of 2sCOPE. In Store 1, the correlations between logP* and the exogenous regressors are −.44 for bonus promotion and −.27 for price reduction promotion, both of which differ substantially from zero (p-value < 2.2 × 10⁻¹⁶ and p-value = 7.542 × 10⁻⁸, respectively), indicating a violation of Assumption 5, which Copula_Origin requires to yield consistent estimates. Next, we check the endogenous regressor's sufficient nonnormality. The KS test of the endogenous price yields a p-value of .063 > .05, indicating insufficient nonnormality of the endogenous price that Copula_Origin (or COPE) cannot handle. We then proceed to check the exogenous regressors’ nonnormality and their relevance to the endogenous regressor. The bonus variable is strongly nonnormal (p-value of KS test = 3.159 × 10⁻¹²) and sufficiently relevant (F-statistic = 89.5 > 10). Price reduction is also strongly nonnormal (p-value of KS test < 2.2 × 10⁻¹⁶) and sufficiently relevant (F-statistic = 27.3 > 10). Thus, according to Figure 2, the Store 1 dataset is appropriate for using 2sCOPE to correct endogeneity, and 2sCOPE is expected to have a high probability of achieving good finite-sample performance.

We next go through the flowchart for Store 2. First, the correlations between logP* and the exogenous regressors are −.32 for bonus promotion and −.39 for price reduction promotion, both of which differ substantially from zero (p-value = 1.47 × 10⁻¹⁰ and p-value = 5.32 × 10⁻¹⁵, respectively), indicating a violation of Assumption 5 required for Copula_Origin to yield consistent estimates. Next, we check the nonnormality of the endogenous regressor. The KS test of the endogenous price yields a p-value of .0053 < .05, indicating sufficient nonnormality of the endogenous price. Thus, according to the decision tree in Figure 2, the Store 2 dataset is also appropriate for using 2sCOPE to correct endogeneity.¹⁴

We use the IV-based two-stage least squares (TSLS) estimator to cross-validate 2sCOPE's performance and the IV used. We use retail price at the other store as an instrument for price, which is commonly used in the literature (Park and Gupta 2012; Rossi 2014). This variable can be a valid instrument as it satisfies the two key requirements. First, retail prices across stores in a market can be highly correlated because wholesale prices are typically the same (or very similar). The Pearson correlation between the detrended log retail prices in Stores 1 and 2 is .76, providing strong explanatory power on the endogenous price. The correlation is comparable to that in Park and Gupta (2012). Second, unmeasured product characteristics such as shelf-space allocation, shelf location, and category location are determined by retailers and are usually not systematically related to wholesale prices (exclusion restriction). Meanwhile, unobserved national advertisement is not expected to affect production cost and wholesale price on a weekly basis and is thus expected to exert only a small effect on the variance of weekly wholesale price. Given that national advertisement occurs only in a few instances in any given planning time horizon (e.g., a quarter or a year), one would expect these demand shocks to be highly correlated and have a small variance over time at the weekly frequency (Rossi 2014). These considerations suggest that the exclusion restriction condition is reasonably satisfied in the presence of unobserved national advertisement. However, like other IVs, the validity claim cannot be fully verified, and is debatable. We therefore perform both 2sCOPE and TSLS so they can cross-validate each other. Congruent results from the two methods increase our confidence in endogeneity correction. Like TSLS, 2sCOPE includes (and uses) the existing exogenous regressors in the first-stage regression; however, unlike TSLS, 2sCOPE neither includes nor needs IVs. 
We first regress $\log P^{*} = Φ^{-1}(\hat{H}(\log P))$ on $Bonus^{*} = Φ^{-1}({\hat{L}}_{1}(Bonus))$ and $PriceReduction^{*} = Φ^{-1}({\hat{L}}_{2}(PriceReduction))$, and then add the residual as the only “generated regressor” to the outcome regression. $\hat{H}(\cdot)$, ${\hat{L}}_{1}(\cdot)$, and ${\hat{L}}_{2}(\cdot)$ are all estimated CDFs using the univariate empirical distribution for each regressor. Standard errors of parameter estimates are obtained using bootstrapping (see the Web Appendix).
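The two-stage procedure just described can be sketched in a few lines of Python for a single endogenous regressor. This is a minimal illustration, not the authors' code: the function names are hypothetical, and rescaling the ranks by n + 1 so that the empirical CDF stays strictly inside (0, 1) is an assumption of this sketch. Standard errors, which the text obtains by bootstrapping, are not computed here.

```python
import numpy as np
from scipy.stats import norm, rankdata

def copula_transform(x):
    """Latent copula data Phi^{-1}(ECDF(x)) for one regressor."""
    x = np.asarray(x, dtype=float)
    # Ranks divided by n + 1 avoid Phi^{-1}(1) = inf at the sample maximum
    # (an assumption of this sketch); ties receive their average rank.
    u = rankdata(x, method="average") / (len(x) + 1)
    return norm.ppf(u)

def two_stage_cope(y, p, w):
    """Point estimates for y = b0 + b1 * p + w @ b2 + error, p endogenous.

    Returns [intercept, coefficient on p, coefficients on w..., coefficient
    on the first-stage residual].
    """
    y = np.asarray(y, dtype=float)
    p = np.asarray(p, dtype=float)
    w = np.asarray(w, dtype=float).reshape(len(y), -1)
    ones = np.ones(len(y))
    p_star = copula_transform(p)
    w_star = np.column_stack([copula_transform(w[:, j]) for j in range(w.shape[1])])
    # Stage 1: regress P* on W* and keep the residual.
    x1 = np.column_stack([ones, w_star])
    gamma_hat = np.linalg.lstsq(x1, p_star, rcond=None)[0]
    resid = p_star - x1 @ gamma_hat
    # Stage 2: add the residual as the only generated regressor.
    x2 = np.column_stack([ones, p, w, resid])
    return np.linalg.lstsq(x2, y, rcond=None)[0]
```

In the toothpaste application, `p` would be the detrended log retail price and `w` a two-column array holding the bonus and price reduction variables.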

Table 9 reports the estimation results. The OLS estimates in Store 1 differ significantly from TSLS estimates, suggesting the price endogeneity issue. Instrumenting for retail price changes the price coefficient estimate from −.767 to −1.797, implying a positive correlation between unobserved product characteristics and price. The 2sCOPE estimate of ρ, representing the correlation between the endogenous regressor P_t and the error term, is .297 (t-value = 3.34) and significantly positive, further confirming our conclusion and consistent with previous empirical findings (e.g., Chintagunta, Dubé, and Goh 2005; Villas-Boas and Winer 1999).

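Table 9.
Estimation Results: Toothpaste Sales.

| Store | Parameter | OLS Est | OLS SE | OLS t-value | TSLS Est | TSLS SE | TSLS t-value | 2sCOPE Est | 2sCOPE SE | 2sCOPE t-value |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Store 1 | Constant | 1.301 | 1.197 | 1.09 | −2.993 | 1.646 | 1.82 | −3.908 | 2.314 | 1.69 |
| Store 1 | Price | −.767 | .288 | 2.66 | −1.797 | .396 | 4.54 | −2.014 | .555 | 3.63 |
| Store 1 | Bonus | .371 | .122 | 3.04 | .104 | .141 | .74 | .064 | .171 | .37 |
| Store 1 | PriceReduction | .498 | .115 | 4.33 | .285 | .125 | 2.28 | .275 | .143 | 1.92 |
| Store 1 | ρ | — | — | — | — | — | — | .297 | .089 | 3.34 |
| Store 2 | Constant | −3.898 | 1.246 | 3.13 | .763 | 1.943 | .39 | .001 | 2.702 | .00 |
| Store 2 | Price | −1.982 | .300 | 6.61 | −.864 | .467 | 1.85 | −1.048 | .648 | 1.62 |
| Store 2 | Bonus | .062 | .116 | .53 | .286 | .148 | 1.93 | .239 | .151 | 1.58 |
| Store 2 | PriceReduction | .283 | .111 | 2.55 | .540 | .137 | 3.94 | .467 | .152 | 3.07 |
| Store 2 | ρ | — | — | — | — | — | — | −.188 | .109 | 1.72 |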

This positive price–error correlation causes upward bias (i.e., less price sensitivity) in the OLS price estimate. By directly accounting for this price–error dependence and controlling the first-stage residual, which captures unobserved product characteristics causing the positive correlation between the endogenous price and the error term, 2sCOPE corrects the classic upward endogeneity bias of price elasticity from −.767 to −2.014. The 2sCOPE price elasticity estimate of −2.014 is close to that of −1.797 from the TSLS method. Both 2sCOPE and TSLS price estimates show greater price sensitivity, suggesting that both correct the price endogeneity problem in the right direction. The TSLS and 2sCOPE estimates are reasonable because the price elasticity of the toothpaste category is approximately −2.0 (Chen and Lim 2022; Hoch et al. 1995; Mackiewicz and Falkowski 2015).

Unlike Store 1, Store 2's results indicate that the retail price is not endogenous. The estimate of ρ (the correlation between price and the error term) does not differ significantly from 0 for 2sCOPE (t-value ≤ 1.96 in the 2sCOPE column for Store 2 in Table 9). The OLS price estimate is −1.982, which is very close to the estimates of TSLS and 2sCOPE in Store 1, suggesting Store 2's lack of price endogeneity. Overall, the price elasticity estimates from TSLS and 2sCOPE are close for Store 2, and the observed differences between them and the OLS estimate may be attributed to the estimation variability incurred from the use of more complicated models compared with the OLS model.

In the preceding application, the convergence of results between TSLS and the proposed 2sCOPE method in both stores supports the validity of the proposed method in addressing the endogeneity issue. The flowchart in Figure 2 also suggests our empirical data satisfy the boundary conditions under which 2sCOPE is expected to have good finite-sample performance. Although here it is unnecessary to empirically evaluate the finite-sample performance using the bootstrap resampling in Algorithm 1, we apply the algorithm to illustrate its usage in the empirical application. Specifically, we apply the bootstrap algorithm (Algorithm 1) to our empirical application with the true parameter values set as Store 1's 2sCOPE estimates (Table 9) rounded to one significant digit when generating bootstrap samples. We also consider the case in which ρ is set at .5, somewhat larger than the estimated value of ρ (.3), to assess the robustness of the bootstrap findings. The steps used to generate these bootstrap samples are detailed in the Web Appendix.

Table 10 summarizes the means and standard deviations of parameter estimates for OLS and 2sCOPE over the 1,000 bootstrap samples, unlike the estimation result on a single observed dataset reported in Table 9. The estimation results are broadly consistent with those in Table 9. In both cases (ρ = .3 and .5), the estimates of 2sCOPE are distributed closely to the true values, demonstrating that 2sCOPE corrects the bias of OLS estimates and performs well with little finite-sample bias in our empirical application.

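Table 10.
Finite-Sample Performance of Copula Correction.

| Parameter | True (ρ = .3) | OLS Est (ρ = .3) | OLS SE (ρ = .3) | OLS t_bias (ρ = .3) | 2sCOPE Est (ρ = .3) | 2sCOPE SE (ρ = .3) | 2sCOPE t_bias (ρ = .3) | True (ρ = .5) | OLS Est (ρ = .5) | OLS SE (ρ = .5) | OLS t_bias (ρ = .5) | 2sCOPE Est (ρ = .5) | 2sCOPE SE (ρ = .5) | 2sCOPE t_bias (ρ = .5) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Constant | −4 | 1.514 | .777 | 7.098 | −3.782 | 1.619 | .135 | −4 | 5.256 | .635 | 14.57 | −3.601 | 1.393 | .287 |
| Price | −2 | −.678 | .186 | 7.099 | −1.946 | .388 | .139 | −2 | .220 | .152 | 14.59 | −1.904 | .334 | .287 |
| Bonus | .1 | .458 | .088 | 4.046 | .113 | .128 | .103 | .1 | .706 | .073 | 8.290 | .126 | .112 | .236 |
| PriceReduction | .3 | .571 | .089 | 3.058 | .309 | .112 | .079 | .3 | .764 | .075 | 6.160 | .323 | .095 | .240 |
| ρ | .3 | — | — | — | .284 | .071 | .222 | .5 | — | — | — | .483 | .048 | .360 |

Notes: “Est” and “SE” denote the mean and standard deviation of the estimates over 1,000 bootstrap samples of Store 1 data.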
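As a reading aid for the t_bias columns, the short Python sketch below assumes that t_bias is the absolute difference between the mean estimate and the true value, divided by the reported standard deviation of the estimates. The statistic is not defined in this section, so the formula is an assumption, but it reproduces the tabulated values up to rounding of the displayed inputs. The function name is hypothetical.

```python
def t_bias(mean_estimate, standard_error, true_value):
    """Absolute bias of the mean estimate in units of its standard error (assumed definition)."""
    return abs(mean_estimate - true_value) / standard_error

# OLS price coefficient in the rho = .3 case: |-.678 - (-2)| / .186, roughly 7.1
ols_price = t_bias(-0.678, 0.186, -2.0)
# 2sCOPE price coefficient in the same case: |-1.946 - (-2)| / .388, roughly 0.14
cope_price = t_bias(-1.946, 0.388, -2.0)
```

Under this reading, the OLS price estimate lies about seven standard errors from the true value, whereas the 2sCOPE estimate lies well within one.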

Conclusion

Observational studies often require rigorous study designs and methodologies to overcome endogeneity concerns. While it is preferable to bring exogeneity via good instruments for identification, this is not always possible. In this article, we focus on the IV-free copula method to handle endogenous regressors, proposing our generalized 2sCOPE method that extends the existing copula correction methods (Becker, Proksch, and Ringle 2021; Eckert and Hohberger 2023; Haschka 2022; Park and Gupta 2012) to more general settings. Specifically, 2sCOPE permits the correlation of exogenous regressors with endogenous regressors and relaxes the nonnormality assumption on the endogenous regressors. Similar to Copula_Origin, 2sCOPE corrects endogeneity by adding “generated regressors” derived from existing regressors and is straightforward to use. However, unlike Copula_Origin, which adds the latent copula transformations of endogenous regressors directly to the model, 2sCOPE has two stages. The first obtains the residuals from regressing latent copula data for the endogenous regressor on the latent copula data for the exogenous regressors, while the second uses the first-stage residual as a “generated regressor” in the structural regression model. We prove that 2sCOPE yields consistent causal effect estimates when the normally distributed structural error and all regressors follow a Gaussian copula correlation structure. The 2sCOPE method can also relax the nonnormality assumption on endogenous regressors and substantially improve copula correction's finite-sample performance.

We evaluate 2sCOPE's performance via simulation studies and demonstrate its use in an empirical application. The simulation results show that 2sCOPE yields consistent estimates under relaxed assumptions. Moreover, 2sCOPE outperforms Copula_Origin (and COPE) in terms of dealing with close-to-normal or normal endogenous regressors and improving estimation efficiency. Endogenous regressors are allowed to have close-to-normal or even normal distributions with the help of exogenous regressors (Figure 2). The efficiency gain relative to COPE is substantial and may reach ∼80% in simulation studies (see the Web Appendix), implying that 2sCOPE can reduce by ∼80% the sample size needed to achieve the same estimation efficiency as COPE, which does not exploit the correlations between endogenous and exogenous regressors. Finally, robustness checks indicate that 2sCOPE is reasonably robust to the structural error distributional assumption and non-Gaussian copula correlation structure (see the Web Appendix). We further apply 2sCOPE to a public dataset in marketing. Regarding endogenous price, we find that the estimated price coefficient using our proposed 2sCOPE is very close to the TSLS estimate and the price coefficient reported in the literature, while the OLS estimator indicates substantial biases.

These findings have rich implications regarding the practical use of the copula-based IV-free methods for handling endogeneity. A known critical assumption for Copula_Origin is the nonnormality of endogenous regressors. The users of the method have largely checked and verified this assumption. However, our findings indicate that this is insufficient: it is also necessary to check Assumption 5 for the single endogenous regressor case and Assumption 5(b) for the case of multiple endogenous regressors. Neither assumption is the same as checking the pairwise correlations between the endogenous and exogenous regressors. Assumption 5 evaluates pairwise correlations involving copula transformation of the endogenous regressor, which, as shown in the literature (Danaher and Smith 2011) and in our specific empirical application, may differ substantially from the pairwise correlations using the original variables. Assumption 5(b) evaluates the correlations between exogenous regressors and the linear combination of generated regressors, which differ even more considerably from pairwise correlations on the regressors themselves. We expect that Assumptions 5 and 5(b) will be violated in the majority of practical applications (see the result of real-data application in footnote 15 and the Web Appendix), for which 2sCOPE should be used. Even in exceptional cases that satisfy the preceding assumptions,¹⁵ 2sCOPE can still be used successfully (see the Web Appendix). Yet in these exceptional cases, Copula_Origin may be considered (Figure 2, Step 1) since the simpler and valid model gains estimation/prediction efficiency and is easier to understand, use, and communicate with as a decision calculus tool (Little 2004).

For endogenous regressors with insufficient nonnormality or exogenous regressors that violate Assumptions 5 or 5(b), 2sCOPE outperforms Copula_Origin and is recommended. When all endogenous regressors have sufficient nonnormality, 2sCOPE is expected to perform well. If any endogenous regressor has insufficient nonnormality, 2sCOPE exploits exogenous regressors with sufficient relevance and nonnormality levels (Figure 2 details sufficient conditions) for satisfactory model identification in finite samples. One can empirically check and verify whether these conditions are satisfied for data at hand, using normality and relevance tests. For cases in which these conditions are not met, we also propose a novel bootstrap resampling method (Algorithm 1) to directly gauge and validate 2sCOPE's finite-sample performance in real applications on a case-by-case basis, complementing the preceding rules of thumb using tests of normality and relevance. According to the decision-tree check using real-world data, 98% of cases result in using 2sCOPE in the decision tree (Figure 2), demonstrating the overwhelming need for 2sCOPE (Web Appendix Table W11).

Unlike the TSLS method, 2sCOPE requires no IVs that must satisfy the exclusion restriction condition. Exclusion restriction is considerably more stringent than the exogeneity condition in that the IV not only is exogenous but also does not appear in the outcome model, meaning that the IV cannot affect the outcome Y by any means other than the endogenous regressor.¹⁶ It is typically impossible to test exclusion restriction; one must rely on institutional knowledge and theoretical arguments to establish exclusion restriction's credibility, which is often the most challenging aspect of IV applications. By contrast, our approach eliminates the requirement that any variable satisfy the exclusion restriction assumption, which constitutes a significant gain. Using 2sCOPE, one need not argue for exclusion restriction; instead, one argues that the regressors and the error follow a copula dependence structure.

Meanwhile, 2sCOPE is capable of leveraging relevant exogenous variables in W in the outcome model (e.g., in Equation 8) for model identification. Marketing models rarely contain endogenous regressors exclusively; the majority of the outcome models estimated (e.g., using OLS or IVs) in marketing include exogenous variables for various reasons, such as the inclusion of exogenous regressors as control variables to mitigate the endogeneity concern of the primary explanatory variables, to improve model estimation and forecasting accuracy, to make the outcome models substantively complete and relevant, or to render the exclusion restriction assumption of IVs more plausible. These exogenous control variables influence other variables in the marketing model but are determined outside the model and are unaffected by the model outcomes. Common examples include environmental factors (e.g., weather), macroeconomic indicators (e.g., interest rate, GDP growth, inflation rates), government policies and legal rules, natural disasters and events, customer characteristics used by firms for targeting, and marketing activities (e.g., promotions) prearranged on an annual basis and independent of daily/weekly demand shocks.

These exogenous regressors are not used to generate the copula control function in Copula_Origin. By contrast, 2sCOPE can leverage these exogenous variables to improve model identification and estimation. Such exogenous control variables are much more widely available than the IVs because 2sCOPE does not require any of these exogenous variables to satisfy the stringent exclusion restriction condition. Furthermore, no theoretical arguments are required for the direction and intuition of correlation between W and P. An empirical correlation is sufficient (see the Web Appendix).¹⁷ Finally, when endogenous regressors have insufficient nonnormality, 2sCOPE can leverage exogenous regressors with certain nonnormality and relevance levels (Figure 2), which are feasible in many applications, for identification.

To fully benefit from leveraging relevant control covariates in W for handling endogeneity, these control covariates need to be exogenous. In practice, choosing a good set of exogenous control variables requires care in model specification and may pose empirical challenges for data analysts. As with OLS, TSLS, and Copula_Origin, adding endogenous variables to W can yield inconsistent model estimates for 2sCOPE.¹⁸ Thus, variables that violate the exogeneity condition, such as mediators or colliders, should be excluded from W. For example, consumer perceptions of price levels and emotions toward stores have been found to lie on the causal pathway between pricing and demand (Cakici and Tekeli 2022). Such mediators are endogenous (they are affected by the endogenous pricing variable); consequently, including them in W (e.g., as measured by periodic consumer surveys) and treating them as exogenous when estimating a demand model will introduce overcontrol bias (Hernan and Robins 2023). Additionally, if these mediators are also influenced by unobserved determinants of the sales outcome (e.g., store service quality), then the two mediators could also be colliders, influenced by both the primary explanatory variables and the unobserved determinants of the outcome. Including colliders in the regression and treating them as exogenous control variables is known to cause collider bias (Hernan and Robins 2023). The remedy is to exclude mediators and colliders from W. As with other econometric methods, the reasonableness of the assumption that W is exogenous should be evaluated and justified in the context of the objectives and scope of the analysis. In this respect, substantive or institutional knowledge is useful for guiding or justifying the selection of appropriate exogenous control covariates. To avoid violating the exogeneity assumption about W, we recommend practicing clean adjustment, employing only the exogenous control variables necessary to improve causal-effect estimation. Control variables strongly believed to be endogenous should be treated as endogenous regressors in the model or removed from the model.¹⁹ In this respect, 2sCOPE can offer ways of tackling some major challenges encountered in model specification. For example, certain control variables must be included to mitigate endogeneity concerns in OLS estimation, or to make the exclusion restriction assumption of IVs plausible, but are nonetheless believed to be endogenous. Such variables pose a dilemma as to whether they should be included in the model for OLS or TSLS estimation; with 2sCOPE, they can be included and treated as endogenous regressors.
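
The bad-control problem described above can be illustrated with a small simulation. This is a sketch under hypothetical assumptions, not a result from the article: price is made exogenous here so that the only source of bias is the control variable, `perception` stands in for a surveyed mediator, `u` stands in for an unobserved driver such as store service quality, and every coefficient is an invented value.

```python
import numpy as np


def ols(y, X):
    """OLS with an intercept; returns [intercept, slopes...]."""
    X1 = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(X1, y, rcond=None)
    return coef


rng = np.random.default_rng(1)
n = 100_000
price = rng.normal(size=n)     # exogenous in this toy example
u = rng.normal(size=n)         # unobserved determinant of sales
# The mediator is caused by price and by u, so it is also a collider.
perception = 0.8 * price + u + 0.5 * rng.normal(size=n)
sales = 1.0 - 1.0 * price - 0.5 * perception + u + rng.normal(size=n)

# Total effect of price = direct effect + effect through the mediator.
total_effect = -1.0 + (-0.5) * 0.8

b_without = ols(sales, price)[1]                                  # mediator excluded
b_with = ols(sales, np.column_stack([price, perception]))[1]      # mediator in W
print("true total effect:       ", total_effect)
print("excluding the mediator:  ", round(float(b_without), 3))
print("including the mediator:  ", round(float(b_with), 3))
```

With these invented values, excluding the mediator should recover the total price effect of about -1.4, while including it should move the price coefficient to about -1.64, which is neither the total effect nor the direct effect of -1.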

Although 2sCOPE contributes to solving regressor endogeneity by relaxing key assumptions of the existing copula correction methods and extending them to more general settings, it is not without limitations. For 2sCOPE to function well, the distributions of the endogenous regressors must contain adequate information. This condition is violated when the endogenous regressors follow Bernoulli distributions or discrete distributions with small support, as Park and Gupta (2012) note. The proposed 2sCOPE method does not address this limitation. The simplicity of 2sCOPE rests on the assumptions of a normal structural error and a Gaussian copula dependence structure. Our evaluation demonstrates that 2sCOPE is robust to symmetric nonnormal error distributions, linear dependence among endogenous and exogenous regressors, and certain non-Gaussian copula structures (see the Web Appendix). Such robustness may not hold for highly skewed error distributions or for other forms of dependence or copula structure. Future research is needed to test and relax these assumptions using more flexible copula methods. Despite these limitations, we expect that 2sCOPE will provide a useful alternative for a broad range of empirical problems in which instruments are unavailable. Although our empirical application considered only linear sales models, potential empirical applications of 2sCOPE abound. We have derived 2sCOPE and evaluated its performance using simulated data for a range of other commonly used marketing models, including linear panel models with mixed effects, random coefficient logit models, and models with slope endogeneity. Future empirical studies may apply 2sCOPE to these and many other cases not studied here.

Supplemental Material

sj-pdf-1-mrj-10.1177_00222437241296453 - Supplemental material for Addressing Endogeneity Using a Two-Stage Copula Generated Regressor Approach

Supplemental material, sj-pdf-1-mrj-10.1177_00222437241296453 for Addressing Endogeneity Using a Two-Stage Copula Generated Regressor Approach by Fan Yang, Yi Qian and Hui Xie in Journal of Marketing Research

Features	Park and Gupta (2012)	Haschka (2022)	2sCOPE
Nonnormality of endogenous regressors^a	Required	Required	Not required^b
Handle correlated exogenous regressors	No	Yes	Yes
Intercept included	Yes	No^c	Yes
Theoretical proof	Yes	No	Yes
Estimation method	Control function and MLE	MLE	Control function
Structural model	Linear regression; RCL; slope endogeneity	LPM-FE	Linear regression; LPM-FE, LPM-RE, LPM-ME; RCL, slope endogeneity

ρ_pw	Parameters	True	OLS			Copula_Origin			COPE			2sCOPE
ρ_pw	Parameters	True	Mean	SE	t_bias	Mean	SE	t_bias	Mean	SE	t_bias	Mean	SE	t_bias
.5	μ	1	.689	.045	6.964	1.231	.081	2.849	1.012	.093	.129	1.009	.059	.157
	α	1	1.571	.036	15.75	1.055	.069	.791	.985	.072	.213	.986	.070	.197
	β	−1	−1.259	.031	8.236	−1.289	.031	9.169	−.997	.067	.038	−.995	.042	.123
	ρ_pξ	.5	—	—	—	.570	.047	1.504	.505	.055	.090	.504	.038	.097
	σ_ξ	1	.862	.020	6.902	1.011	.043	.244	1.008	.041	.206	1.006	.040	.143
	D-error		—			—			.002613			.001614
.7	μ	1	.730	.041	6.629	1.307	.076	4.037	1.011	.085	.124	1.005	.053	.088
	α	1	1.800	.041	19.67	1.260	.068	3.838	.988	.078	.148	.991	.075	.118
	β	−1	−1.529	.037	14.21	−1.567	.037	15.36	−.997	.071	.041	−.994	.056	.110
	ρ_pξ	.5	—	—	—	.633	.043	3.130	.503	.057	.048	.500	.026	.000
	σ_ξ	1	.799	.018	11.18	.980	.044	.468	1.007	.041	.160	1.003	.040	.084
	D-error		—			—			.002902			.001760
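
The t_bias entries in the simulation tables can be checked by hand. The snippet below is a small sketch that assumes t_bias is the absolute difference between the mean estimate and the true value, divided by the standard error; this definition is an assumption, although it agrees with the tabulated OLS values for ρ_pw = .5 up to the rounding of the printed means and standard errors. The helper name `t_bias` is introduced only for this example.

```python
def t_bias(mean, true, se):
    """Assumed definition: |mean estimate - true value| / standard error."""
    return abs(mean - true) / se


# (mean, true, SE, reported t_bias) for the OLS columns at rho_pw = .5
rows = {
    "mu": (0.689, 1.0, 0.045, 6.964),
    "alpha": (1.571, 1.0, 0.036, 15.75),
    "beta": (-1.259, -1.0, 0.031, 8.236),
}

recomputed = {name: t_bias(m, t, s) for name, (m, t, s, _) in rows.items()}
for name, value in recomputed.items():
    print(name, round(value, 2), "reported:", rows[name][3])
```

Small gaps between the recomputed and reported values are expected, because the table prints means and standard errors to only three decimals.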

Distribution		Parameters	True	OLS			Copula_Origin			COPE			2sCOPE
P	W	Parameters	True	Mean	SE	t_bias	Mean	SE	t_bias	Mean	SE	t_bias	Mean	SE	t_bias
Gamma	Normal	μ	1	.431	.045	12.63	1.018	.078	.227	1.017	.080	.217	1.015	.077	.190
		α	1	1.569	.037	15.40	.979	.070	.302	.979	.070	.296	.985	.070	.212
		β	−1	−1.259	.030	8.619	−1.333	.028	11.78	−1.323	.433	.746	−.997	.045	.067
		ρ_pξ	.5	—	—	—	.640	.039	3.556	.589	.141	.631	.506	.036	.151
		σ_ξ	1	.861	.019	7.240	1.064	.046	1.394	1.135	.162	.837	1.005	.038	.134
Normal	Exp	μ	1	1.286	.042	6.777	1.286	.045	6.374	.994	.073	.081	1.023	.070	.334
		α	1	1.628	.031	20.36	1.532	.462	1.152	1.684	.437	1.568	1.048	.126	.381
		β	−1	−1.286	.032	8.956	−1.287	.032	8.960	−.992	.066	.127	−1.024	.062	.383
		ρ_pξ	.5	—	—	—	.089	.419	.980	−.167	.384	1.738	.465	.074	.473
		σ_ξ	1	.829	.018	9.492	.940	.151	.394	.981	.151	.129	.980	.063	.318
Normal	Normal	μ	1	1.001	.026	.046	1.002	.030	.052	1.001	.033	.024	1.002	.028	.057
		α	1	1.668	.030	22.38	1.663	.450	1.474	1.663	.460	1.441	1.655	.395	1.657
		β	−1	−1.335	.029	11.44	−1.335	.029	11.42	−1.324	.438	.740	−1.328	.197	1.668
		ρ_pξ	.5	—	—	—	.006	.412	1.198	.001	.412	2.426	.010	.303	1.616
		σ_ξ	1	.816	.019	9.687	.917	.155	.534	1.003	.211	.016	.879	.092	1.317

Parameters	True	LME			Copula_Origin			2sCOPE
Parameters	True	Mean	SE	t_bias	Mean	SE	t_bias	Mean	SE	t_bias
$\bar{μ}$	1	.722	.046	6.052	1.314	.049	6.399	1.004	.048	.091
$\bar{α}$	1	1.853	.045	18.83	1.293	.045	6.469	1.000	.046	.008
$\bar{β}$	−1	−1.557	.045	12.39	−1.598	.044	13.56	−1.000	.044	.005
σ_μ	1	.985	.033	.459	.982	.033	.547	.984	.031	.522
σ_α	1	.988	.036	.326	.987	.034	.397	.989	.035	.316
σ_β	1	.993	.031	.235	.992	.033	.249	.992	.033	.248
ρ_pξ	.5	—	—	—	.646	.009	16.33	.507	.005	1.365
σ_ξ	1	.794	.004	57.71	.957	.010	4.439	.985	.009	1.640

Variables	Store 1				Store 2
Variables	Mean	SD	Max	Min	Mean	SD	Max	Min
Sales (unit)	115	53.0	720	35	100.8	53.5	654	33
Price ($)	2.06	.20	2.48	1.46	2.09	.21	2.47	1.51
Bonus	.18	.21	.80	.00	.16	.19	.81	.00
PriceReduction	.10	.19	.69	.00	.11	.21	.76	.00

Store	Parameters	OLS			TSLS			2sCOPE
Store	Parameters	Est	SE	t-value	Est	SE	t-value	Est	SE	t-value
Store 1	Constant	1.301	1.197	1.09	−2.993	1.646	1.82	−3.908	2.314	1.69
	Price	−.767	.288	2.66	−1.797	.396	4.54	−2.014	.555	3.63
	Bonus	.371	.122	3.04	.104	.141	.74	.064	.171	.37
	PriceReduction	.498	.115	4.33	.285	.125	2.28	.275	.143	1.92
	ρ	—	—	—	—	—	—	.297	.089	3.34
Store 2	Constant	−3.898	1.246	3.13	.763	1.943	.39	.001	2.702	.00
	Price	−1.982	.300	6.61	−.864	.467	1.85	−1.048	.648	1.62
	Bonus	.062	.116	.53	.286	.148	1.93	.239	.151	1.58
	PriceReduction	.283	.111	2.55	.540	.137	3.94	.467	.152	3.07
	ρ	—	—	—	—	—	—	−.188	.109	1.72

Parameters	True	OLS			2sCOPE			True	OLS			2sCOPE
Parameters	True	Mean	SE	t_bias	Mean	SE	t_bias	True	Mean	SE	t_bias	Mean	SE	t_bias
Constant	−4	1.514	.777	7.098	−3.782	1.619	.135	−4	5.256	.635	14.57	−3.601	1.393	.287
Price	−2	−.678	.186	7.099	−1.946	.388	.139	−2	.220	.152	14.59	−1.904	.334	.287
Bonus	.1	.458	.088	4.046	.113	.128	.103	.1	.706	.073	8.290	.126	.112	.236
PriceReduction	.3	.571	.089	3.058	.309	.112	.079	.3	.764	.075	6.160	.323	.095	.240
ρ	.3	—	—	—	.284	.071	.222	.5	—	—	—	.483	.048	.360

Footnotes

Acknowledgments

The authors are very grateful to the JMR review team and seminar and conference participants for many constructive comments that have significantly improved the article. All inferences, opinions, and conclusions drawn in this study are those of the authors, and do not reflect the opinions or policies of the funding agencies and data stewards. No personal identifying information was made available as part of this study. Procedures used were in compliance with British Columbia's Freedom in Information and Privacy Protection Act. Ethics approval was obtained from the University of British Columbia's Behavioral Research Ethics Board (H15-00887).

Coeditor

Raghuram Iyengar

Associate Editor

Marnik Dekimpe

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Social Sciences and Humanities Research Council of Canada (grants 435-2018-0519 and 435-2023-0306), Natural Sciences and Engineering Research Council of Canada (grants RGPIN-2018-04313 and RGPIN-2024-06629), U.S. National Institutes of Health (grant R01CA178061), and AoE research fund from NEOMA Business School, France (grant 416006).

ORCID iD

Yi Qian

Notes

References

Aghion, Philippe, Nick Bloom, Richard Blundell, Rachel Griffith, and Peter Howitt (2005), “Competition and Innovation: An Inverted-U Relationship,” Quarterly Journal of Economics, 120 (2), 701–28.

Anderson, Eric T. and Duncan I. Simester (2004), “Long-Run Effects of Promotion Depth on New Versus Established Customers: Three Field Studies,” Marketing Science, 23 (1), 4–20.

Arora, Neeraj and Joel Huber (2001), “Improving Parameter Estimates and Model Prediction by Aggregate Customization in Choice Experiments,” Journal of Consumer Research, 28 (2), 273–83.

Ataman, M. Berk, Harald J. Van Heerde, and Carl F. Mela (2010), “The Long-Term Effect of Marketing Strategy on Brand Sales,” Journal of Marketing Research, 47 (5), 866–82.

Atefi, Yashar, Michael Ahearne, James G. Maxham III, D. Todd Donavan, and Brad D. Carlson (2018), “Does Selective Sales Force Training Work?” Journal of Marketing Research, 55 (5), 722–37.

Athey, Susan and Guido W. Imbens (2006), “Identification and Inference in Nonlinear Difference-in-Differences Models,” Econometrica, 74 (2), 431–97.

Becker, Jan-Michael, Dorian Proksch, and Christian M. Ringle (2021), “Revisiting Gaussian Copulas to Handle Endogenous Regressors,” Journal of the Academy of Marketing Science, 50 (1), 1–21.

Cakici, A. Celil and Sena Tekeli (2022), “The Mediating Effect of Consumers’ Price Level Perception and Emotions Towards Supermarkets,” European Journal of Management and Business Economics, 31 (1), 57–76.

Chen, Hao and Alvin Lim (2022), “Were Consumers Less Price Sensitive to Life Necessities During the COVID-19 Pandemic? An Empirical Study on Dutch Consumers,” in Proceedings of SAI Intelligent Systems Conference. Springer, 79–100.

Chintagunta, Pradeep K. (2002), “Investigating Category Pricing Behavior at a Retail Chain,” Journal of Marketing Research, 39 (2), 141–54.

Chintagunta, Pradeep, Jean-Pierre Dubé, and Khim Yong Goh (2005), “Beyond the Endogeneity Bias: The Effect of Unmeasured Brand Characteristics on Household-Level Brand Choice Models,” Management Science, 51 (5), 832–49.

Chintagunta, Pradeep, Tülin Erdem, Peter E. Rossi, and Michel Wedel (2006), “Structural Modeling in Marketing: Review and Assessment,” Marketing Science, 25 (6), 604–16.

Christopoulos, Dimitris, Peter McAdam, and Elias Tzavalis (2021), “Dealing with Endogeneity in Threshold Models Using Copulas,” Journal of Business & Economic Statistics, 39 (1), 166–78.

Danaher, Peter J. (2007), “Modeling Page Views Across Multiple Websites with an Application to Internet Reach and Frequency Prediction,” Marketing Science, 26 (3), 422–37.

Danaher, Peter J. and Michael S. Smith (2011), “Modeling Multivariate Distributions Using Copulas: Applications in Marketing,” Marketing Science, 30 (1), 4–21.

Datta, Hannes, Bram Foubert, and Harald J. van Heerde (2015), “The Challenge of Retaining Customers Acquired with Free Trials,” Journal of Marketing Research, 52 (2), 217–34.

Dotson, Jeffrey P. and Greg M. Allenby (2010), “Investigating the Strategic Influence of Customer and Employee Satisfaction on Firm Financial Performance,” Marketing Science, 29 (5), 895–908.

Ebbes, Peter, Michel Wedel, and Ulf Böckenholt (2009), “Frugal IV Alternatives to Identify the Parameter for an Endogenous Regressor,” Journal of Applied Econometrics, 24 (3), 446–68.

Ebbes, Peter, Michel Wedel, Ulf Böckenholt, and Ton Steerneman (2005), “Solving and Testing for Regressor-Error (In)Dependence When No Instrumental Variables Are Available: With New Evidence for the Effect of Education on Income,” Quantitative Marketing and Economics, 3 (4), 365–92.

Eckert, Christine and Jan Hohberger (2023), “Addressing Endogeneity Without Instrumental Variables: An Evaluation of the Gaussian Copula Approach for Management Research,” Journal of Management, 49 (4), 1460–95.

Efron, Bradley and Robert J. Tibshirani (1994), An Introduction to the Bootstrap. CRC Press.

Elshiewy, Ossama and Yasemin Boztug (2018), “When Back of Pack Meets Front of Pack: How Salient and Simplified Nutrition Labels Affect Food Sales in Supermarkets,” Journal of Public Policy & Marketing, 37 (1), 55–67.

Godes, David and Dina Mayzlin (2009), “Firm-Created Word-of-Mouth Communication: Evidence from a Field Test,” Marketing Science, 28 (4), 721–39.

Hartmann, Wesley, Harikesh S. Nair, and Sridhar Narayanan (2011), “Identifying Causal Marketing Mix Effects Using a Regression Discontinuity Design,” Marketing Science, 30 (6), 1079–97.

Haschka, Rouven E. (2022), “Handling Endogenous Regressors Using Copulas: A Generalization to Linear Panel Models with Fixed Effects and Correlated Regressors,” Journal of Marketing Research, 59 (4), 860–81.

Heitmann, Mark, Jan R. Landwehr, Thomas F. Schreiner, and Harald J. van Heerde (2020), “Leveraging Brand Equity for Effective Visual Product Design,” Journal of Marketing Research, 57 (2), 257–77.

Hernan, Miguel A. and James M. Robins (2023), Causal Inference: What If. CRC Press.

Hoch, Stephen J., Byung-Do Kim, Alan L. Montgomery, and Peter E. Rossi (1995), “Determinants of Store-Level Price Elasticity,” Journal of Marketing Research, 32 (1), 17–29.

Hooker, Giles and Lucas Mentch (2018), “Bootstrap Bias Corrections for Ensemble Methods,” Statistics and Computing, 28 (1), 77–86.

Johnson, Garrett A., Randall A. Lewis, and Elmar I. Nubbemeyer (2017), “Ghost Ads: Improving the Economics of Measuring Online Ad Effectiveness,” Journal of Marketing Research, 54 (6), 867–84.

Kim, Sungjin, Clarence Lee, and Sachin Gupta (2020), “Bayesian Synthetic Control Methods,” Journal of Marketing Research, 57 (5), 831–52.

Lewbel, Arthur (1997), “Constructing Instruments for Regressions with Measurement Error When No Additional Data Are Available, with an Application to Patents and R&D,” Econometrica, 65 (5), 1201–13.

Li, Shanjun, Joshua Linn, and Erich Muehlegger (2014), “Gasoline Taxes and Consumer Behavior,” American Economic Journal: Economic Policy, 6 (4), 302–42.

Li, Yang and Asim Ansari (2014), “A Bayesian Semiparametric Approach for Endogeneity and Heterogeneity in Choice Models,” Management Science, 60 (5), 1161–79.

Little, John D.C. (2004), “Models and Managers: The Concept of a Decision Calculus,” Management Science, 50 (12, Suppl.), 1841–53.

Mackiewicz, Robert and Andrzej Falkowski (2015), “The Use of Weber Fraction as a Tool to Measure Price Sensitivity: A Gain and Loss Perspective,” in Advances in Consumer Research, Vol. 43, Kristin Diehl and Carolyn Yoon, eds. Association for Consumer Research, 384–87.

McShane, Blakeley B., Eric T. Bradlow, John G. Lynch Jr., and Robert J. Meyer (2024), “‘Statistical Significance’ and Statistical Reporting: Moving Beyond Binary,” Journal of Marketing, 88 (3), 1–19.

Mendelson, Haim (2000), “Organizational Architecture and Success in the Information Technology Industry,” Management Science, 46 (4), 513–29.

Narayanan, Sridhar and Kirthi Kalyanam (2015), “Position Effects in Search Advertising and Their Moderators: A Regression Discontinuity Approach,” Marketing Science, 34 (3), 388–407.

Nijs, Vincent R., Marnik G. Dekimpe, Jan-Benedict E.M. Steenkamp, and Dominique M. Hanssens (2001), “The Category-Demand Effects of Price Promotions,” Marketing Science, 20 (1), 1–22.

Novak, Sharon and Scott Stern (2009), “Complementarity Among Vertical Integration Decisions: Evidence from Automobile Product Development,” Management Science, 55 (2), 311–32.

Otter, Thomas, Timothy J. Gilbride, and Greg M. Allenby (2011), “Testing Models of Strategic Behavior Characterized by Conditional Likelihoods,” Marketing Science, 30 (4), 686–701.

Papies, Dominik, Peter Ebbes, and Harald J. Van Heerde (2017), “Addressing Endogeneity in Marketing Models,” in Advanced Methods for Modeling Markets. Springer, 581–627.

Park, Sungho and Sachin Gupta (2012), “Handling Endogenous Regressors by Joint Estimation Using Copulas,” Marketing Science, 31 (4), 567–86.

Petrin, Amil and Kenneth Train (2010), “A Control Function Approach to Endogeneity in Consumer Choice Models,” Journal of Marketing Research, 47 (1), 3–13.

Qian, Yi (2008), “Impacts of Entry by Counterfeiters,” Quarterly Journal of Economics, 123 (4), 1577–1609.

Qian, Yi, Anthony Koschmann, and Hui Xie (2024), “A Practical Guide to Endogeneity Correction Using Copulas,” NBER Working Paper 32231, https://www.nber.org/papers/w32231.

Qian, Yi and Hui Xie (2022), “Simplifying Bias Correction for Selective Sampling: A Unified Distribution-Free Approach to Handling Endogenously Selected Samples,” Marketing Science, 41 (2), 336–60.

Qian, Yi, Hui Xie, and Anthony Koschmann (2022), “Should Copula Endogeneity Correction Include Generated Regressors for Higher-Order Terms? No, It Hurts,” National Bureau of Economic Research Working Paper 29978, https://www.nber.org/papers/w29978.

Rigobon, Roberto (2003), “Identification Through Heteroskedasticity,” Review of Economics and Statistics, 85 (4), 777–92.

Rossi, Peter E. (2014), “Even the Rich Can Make Themselves Poor: A Critical Examination of IV Methods in Marketing Applications,” Marketing Science, 33 (5), 655–72.

Rutz, Oliver J. and George F. Watson (2019), “Endogeneity and Marketing Strategy Research: An Overview,” Journal of the Academy of Marketing Science, 47 (3), 479–98.

Shi, Huanhuan, Shrihari Sridhar, Rajdeep Grewal, and Gary Lilien (2017), “Sales Representative Departures and Customer Reassignment Strategies in Business-to-Business Markets,” Journal of Marketing, 81 (2), 25–44.

Sklar, M. (1959), “Fonctions de Répartition à N Dimensions et Leurs Marges,” Publications de l’Institut Statistique de l’Université de Paris, 8, 229–31.

Sorescu, Alina, Nooshin L. Warren, and Larisa Ertekin (2017), “Event Study Methodology in the Marketing Literature: An Overview,” Journal of the Academy of Marketing Science, 45, 186–207.

Sriram, Srinivasaraghavan, Subramanian Balachander, and Manohar U. Kalwani (2007), “Monitoring the Dynamics of Brand Equity Using Store-Level Data,” Journal of Marketing, 71 (2), 61–78.

Sudhir, Karunakaran (2001), “Competitive Pricing Behavior in the Auto Market: A Structural Analysis,” Marketing Science, 20 (1), 42–60.

Sun, Baohong (2005), “Promotion Effect on Endogenous Consumption,” Marketing Science, 24 (3), 430–43.

Van Heerde, Harald J., Maarten J. Gijsenberg, Marnik G. Dekimpe, and Jan-Benedict E.M. Steenkamp (2013), “Price and Advertising Effectiveness over the Business Cycle,” Journal of Marketing Research, 50 (2), 177–93.

Villas-Boas, J. Miguel and Russell S. Winer (1999), “Endogeneity in Brand Choice Models,” Management Science, 45 (10), 1324–38.

Wang, Yixin and David M. Blei (2019), “The Blessings of Multiple Causes,” Journal of the American Statistical Association, 114 (528), 1574–96.

Wooldridge, Jeffrey M. (2010), Econometric Analysis of Cross Section and Panel Data. MIT Press.

Yang, Sha, Yuxin Chen, and Greg M. Allenby (2003), “Bayesian Analysis of Simultaneous Demand and Supply,” Quantitative Marketing and Economics, 1 (3), 251–75.



