Abstract
In models with endogenous regressors, a standard regression approach is to exploit just-identifying or overidentifying orthogonality conditions by using instrumental variables. In just-identified models, the identifying orthogonality assumptions cannot be tested without the imposition of other nontestable assumptions. While formal testing of overidentifying restrictions is possible, its interpretation still hinges on the validity of an initial set of untestable just-identifying orthogonality conditions. We present the
Keywords
1 Introduction
The empirical literature on causal inference in linear regression models with endogenous regressors is dominated by estimation methods based on instrumental variables (IVs). For valid inference under conventional asymptotic theory, instruments must be relevant and exogenous. The former condition requires that the instruments are sufficiently strongly correlated with the endogenous regressors. If this correlation is weak, coefficient estimates can be severely biased, finite-sample distributions are poorly approximated with conventional asymptotic theory, and statistical tests using conventional standarderror estimates can exhibit large size distortions. To address these concerns, an extensive literature emerged on detecting instrument weakness and conducting robust statistical inference under the presence of weak instruments. The latter methods, however, usually lead to wide confidence intervals that may not be very informative.
1
In Stata, tests for weak instruments and methods for weak-instruments robust inference are implemented in the community-contributed packages
A noteworthy complication of the quest for good instruments is that the same features that make an instrument relevant can also be a source of a violation of the exogeneity condition (Hall, Rudebusch, and Wilcox 1996). To be exogenous, an IV needs to be uncorrelated with the regression error term. This necessitates that the instrument is validly excluded from the structural model, that is, that the instrument only has an indirect effect on the dependent variable via the instrumented endogenous regressors. If the model is just-identified, that is, there are as many excluded instruments as endogenous regressors, then the exclusion restriction is untestable in the standard IV framework. Intuitively, we cannot use the same instrument to identify the effect of an endogenous regressor and its own direct effect on the dependent variable. For identification of the former, IV-based estimators assume that the latter is known to be 0. Even in overidentified models, the validity of all instruments cannot be jointly tested. Routinely used overidentification tests still rely on the maintained (and untested) assumption that at least as many instruments are validly excluded from the model as there are endogenous regressors, and even then they may not be informative about the instruments’ ability to identify the parameters of interest (Parente and Santos Silva 2012).
In this article, we discuss an identification strategy that does not rely on such exclusion restrictions but instead imposes assumptions on the degree of regressor endogeneity, which is left unrestricted in an IV world. The kinky least-squares (KLS) approach developed by Kiviet (2013, 2020a,b ) achieves set identification of the regression coefficients by confining the admissible correlation of the regressors with the error term within plausible bounds. No excluded instruments are needed. Instead, the bias of the ordinary leastsquares (OLS) estimator is analytically corrected for all values on a grid of endogeneity correlations. This provides a set of consistent coefficient estimates in accordance with the postulated endogeneity range. Asymptotically conservative confidence intervals can be obtained as the union of the confidence intervals over the considered grid.
For a reasonably narrow range of postulated endogeneity correlations, these KLS confidence intervals are—as a general rule—narrower than those from IV/two-stage leastsquares (2SLS) estimations, particularly if the instruments are relatively weak. Thus, KLS inference is often more informative, and it avoids the problems associated with the search for strong and valid instruments. On top of that, the KLS approach enables testing of any potential exclusion restrictions. Because IVs are not needed for identification, their direct effect is (set) identifiable by adding them to the KLS regression (Kiviet 2020a,b).
Similar approaches with the aim to bound a causal effect of a single endogenous regressor in the absence of suitable instruments have been recently proposed by Krauth (2016) and Oster (2019) and implemented in their packages
Undeniably, instrument-free inference is not a panacea to the problems of instrumentbased methods. It replaces one set of possibly strong though speculative assumptions with another set of hopefully less restrictive conjectural assumptions. In many applications, it might be easier to specify a credible range for the correlation of an endogenous regressor with the error term than to convincingly present strong and valid instruments. For example, theoretical considerations might plausibly inform us about the sign of the endogeneity. Yet, if the chosen endogeneity range is too narrow, it may not include the true correlation value, potentially leading to serious bias. If it is too wide, the resulting confidence intervals could be less informative than those from a 2SLS estimation with strong and valid instruments.
Assuming that we have reasonable prior information about the range of endogeneity correlations, KLS confidence intervals and test procedures can provide reliable inference even in the absence of valid and strong IVs. If instruments are available, the KLS inference can facilitate sensitivity checks for IV-based procedures. Because the different methods have different strengths and weaknesses, it is often reasonable to consider the instrument-free approach as a complement rather than a substitute to instrument-based procedures, possibly in addition to other methods that relax some of the assumptions underlying the traditional instrument-based inference. For instance, Conley, Hansen, and Rossi (2012) propose the construction of conservative confidence intervals that allow for a mild violation of the exclusion restrictions, assuming a plausible range of direct effects for the instruments. Nevo and Rosen (2012) derive bounds for the effect size in the presence of imperfect instruments, making assumptions about the sign and the maximum strength of the correlation of the instruments with the error term. These two procedures can be applied with the community-contributed commands
The KLS approach to statistical inference under confined regressor endogeneity is implemented in the new
2 KLS inference
2.1 Coefficient estimates and confidence intervals
Consider the linear regression model with i = 1, 2,…, N observations, an endogenous regressor x
1
i
, and a column vector of exogenous (or predetermined) variables
All variables are transformed into deviations from their means. 3 The restriction to a single endogenous regressor is mainly for expositional purposes. The methodology can be applied to any number of endogenous variables.
The standard approach to fitting models with endogenous regressors is by using IV techniques. However, instruments-based inference can be unreliable if the IVs
Kiviet (2020a,b) suggests an alternative instrument-free approach that uses a nonorthogonality condition for the endogenous regressor in (1): E(x
1
iεi
) = ρ σ
1
σε
, where ρ denotes the correlation coefficient between x
1
i
and εi
, and σ
1 and σε
are the standard deviations (SD) of x
1
i
and εi
.
4
Clearly, this approach is infeasible unless ρ, σ
1, and σε
are known or can be estimated consistently. For the moment, assume that ρ is indeed known. σ
1 can be easily estimated from the observed data as the square root of
where
Notice that the KLS estimator is point-symmetric around ρ = 0, and
For inference on the coefficients
However, the correlation coefficient ρ is unknown, and without imposing additional restrictions, a consistent estimate of ρ is unattainable. Instead of tying oneself to a particular value ρ = r, we can assume that the true value is contained within a set ρ ∊ [rl, ru
]. Often, there might be prior information about the magnitude or the sign of the endogeneity that allows us to pin down reasonable boundaries for this interval. We can then obtain the KLS estimator
As a more illuminating approach, we can also plot the coefficient estimates with corresponding confidence intervals over the chosen range of endogeneity correlations. This shows immediately for which values of ρ we can reject (or not reject) the null hypothesis that a coefficient of interest equals a certain value, most prominently whether the coefficient is statistically significantly different from 0. Such graphs are the main output of the new
The choice of rl
and ru
is restricted by certain feasibility bounds. To rule out a negative estimate of
Thus, unless the endogenous regressor is uncorrelated (in the sample) with the exogenous regressors, that is,
2.2 Specification tests
Just like after OLS or 2SLS estimation, we usually want to scrutinize our model specification. Based on the KLS coefficient and variance estimates, we can calculate and visualize the p-values for any desired test statistic over the range r ∊ [rl, ru
]. Such tests can be conventional tests of linear hypotheses H
0 :
or alternatively the corresponding F statistic if small-sample statistics are desired. It is then straightforward to test the valid exclusion of a set of variables
The results inform us which values of r ∊ [rl, ru
] are compatible with the valid exclusion of
Testing exclusion restrictions is particularly useful in the context of IV/2SLS estimation. Instead of fitting (1) by KLS, we might choose
This is where KLS comes into play. By constraining the endogeneity of x
1
i
, we can test the valid exclusion of
The KLS approach can also be applied to other specification tests. Closely related to the exclusion restrictions test is the Ramsey (1969) regression equation specification error test (RESET). By testing the valid exclusion of polynomials in the fitted values or right-hand-side variables, insights are provided about whether we used the correct functional form. However, the presence of the endogenous regressor x
1
i
will cause the fitted values,
An operationalized RESET version, implemented by
The KLS estimator (3) is derived by assuming a constant variance
In a time-series setting, the KLS approach rests on the assumption that there is no serial error correlation. If we suspect serial correlation, we could add lags of the dependent variable and the right-hand-side variables to the regression model to obtain a dynamically complete model (Wooldridge 2020, chap. 11.4). To be adequate for a model with a lagged dependent variable, a test for serial correlation should allow for regressors that are not strictly exogenous. This is the case for the “alternative test” of Durbin (1970), implemented by
3 The kinkyreg command
3.1 Syntax
[
3.2 Options
With
With
The twoway options
with additional plots; see [G-3]
line_options are options allowed by
fitarea_options are options allowed by
number of observations.
display_options:
The following options are specific to
or
3.3 Stored results
4 Postestimation commands
The
4.1 Syntax
where test_spec is a coefficient list or expression.
4.2 Options
test_options are standard options allowed by the
endogeneity correlation. If # does not match a value on the estimation grid, the results for the closest grid point to # are displayed.
If the twoway option
With
With
With
With
With
With
With
With
With
With
With
4.3 Stored results
All postestimation commands except
5 Example
5.1 KLS estimation with a single endogenous regressor
We reanalyze data from the National Longitudinal Survey of Young Men used by Griliches (1976) to estimate the returns to schooling while accounting for individual differences in ability. Further control variables are labor market experience, job-specific tenure, location in the South, residence in a metropolitan area, and a set of year dummies.
Because ability as a joint predictor of men’s wages and the achieved level of schooling is unobserved, the returns to schooling cannot be consistently estimated by OLS. This omitted-variable bias can be mitigated by using a proxy variable for ability. In the following, it is assumed that by controlling for an individual’s IQ score, we can account for the relationship between the completed years of schooling and the unobserved ability. However, being an imperfect measure of ability, such a proxy variable usually suffers from measurement error and thus needs to be treated as endogenous. 17 The standard approach is to find IVs that are both relevant and exogenous, that is, sufficiently correlated with the endogenous variable, validly excluded from the model, and uncorrelated with the measurement error. Such candidate instruments might be the age and the marital status of the individuals. 18
The 2SLS estimates yield a relatively high wage return of 34% to one additional year of schooling, while the significantly negative ability effect seems odd. To economize on space, we do not show the detailed output of the postestimation commands. The key statistics of interest are the Sargan test, 1.39 with a p-value of 0.238, and the first-stage F statistic, 2.72. While the overidentification test seems to indicate that the instruments are valid, 19 their relevance is questionable given a first-stage F statistic well below 10.
Baum, Schaffer, and Stillman (2007) use this example to illustrate how their

KLS and 2SLS coefficient estimates and confidence intervals for

KLS and 2SLS coefficient estimates and confidence intervals for
The wide confidence intervals of the 2SLS estimates immediately strike the eye. This is a well-known consequence of weak instruments. The KLS confidence intervals for a given endogeneity correlation are much narrower. 20 However, the true correlation is unknown, and we should consider the union of the confidence intervals over a reasonable range of correlations. In our example, over the whole range from −0.75 to 0.75, the union of KLS confidence intervals is about as wide as the 2SLS confidence interval, although the former is inconclusive regarding the sign of the effect. The KLS and 2SLS confidence intervals only overlap for relatively large positive endogeneity correlations, and it is noteworthy that the 2SLS point estimates are always outside of the KLS intervals over the whole considered range. This observation casts serious doubt on the appropriateness of the chosen IVs.
With prior information on the reasonable range of the endogeneity, we can substantially sharpen the KLS inference. For example, we might be confident that it is less than 0.4 in absolute terms. Moreover, if measurement error is the only source of endogeneity, the correlation of the IQ score with the error term is negative by construction. Because of the resulting attenuation bias, the OLS estimates of the IQ coefficient (which are the KLS estimates with an endogeneity correlation of 0) are biased toward 0. Moreover, we would generally expect the effect of ability on wages to be nonnegative, which is incompatible with positive endogeneity correlations given our KLS estimates but also at odds with the 2SLS estimate.
Sticking to the measurement-error story with an endogeneity range [−0.4, 0], the unions of KLS confidence intervals span the bands [0.001, 0.021] for the IQ coefficient and [0.001, 0.076] for the return to schooling. Instead of reading these numbers from the graphs, we can also display regression output with the confidence intervals for specific endogeneity correlations by replaying the
The second output is simply the OLS results. Both the IQ and schooling effects are statistically significantly positive, as we would generally expect, but the KLS estimate of the return to schooling is substantially smaller than the 2SLS point estimate. Also, these KLS intervals do not overlap with the corresponding 2SLS confidence intervals, further reducing the confidence in the 2SLS approach, provided our assumptions on the model and the postulated endogeneity range are correct. Admittedly, and as a word of caution, our choice for the lower bound of the endogeneity range is quite arbitrary. If we relax that restriction, the KLS return-to-schooling estimate would turn statistically insignificant. Yet the confidence interval would expand in the opposite direction from the 2SLS estimate.
While we have seen above that the conventional overidentification tests after the 2SLS regression did not reject the null hypothesis, the weakness of the instruments or the nonexistence of a valid linear combination of the instruments might have been detrimental to the reliability of the test. The KLS approach instead allows us to perform instrument-free inference on the exclusion restrictions with the

p-values for three KLS exclusion restriction tests in specification A
The KLS exclusion restriction tests presented in figure 3 substantiate our claim that age and marital status are unlikely to be valid instruments. Only for very large positive endogeneity correlations do we not reject the null hypothesis that the instruments are validly excluded from the model. Aside from questioning the reliability of the 2SLS estimates, this result also has implications for the KLS approach. If age and marital status are not validly excluded from the model, the KLS estimates would suffer from omitted-variables bias if any of the included regressors is correlated with the excluded variables. In this sample of young men aged between 16 and 30 years, it is in particular age that is substantially correlated with the schooling regressor, but also with experience and tenure. Clearly, the youngest men in the sample cannot be among those with highest years of schooling or experience.
In the following specification B, we have added age and marital status as regressors. If we were to apply 2SLS again, we would have to find another instrument for the endogenous IQ score. The advantage of the KLS approach is that we can obtain valid inference without any instruments. For a compact presentation of the results, we combine all the graphs of interest in the single figure 4. To improve the visibility of the axis titles and labels, we also manipulate a few of the graph settings with standard

KLS coefficient estimates and confidence intervals in specification B
Directly implied by the previous exclusion restrictions test, the coefficient of age and the marriage premium are statistically significant. Both have a positive sign over the whole range of the IQ endogeneity correlations. We could interpret this positive age effect as the wage return to being more mature, which might be associated with the ability to perform more responsible tasks. Another explanation would be legal working-age restrictions for some higher-paying jobs.
Focusing again on the endogeneity range [−0.4, 0], the estimated ability effect remains significantly positive, hardly affected by the inclusion of the two additional variables. The schooling effect, however, now turned statistically insignificant after controlling for age and marital status. It appears that the previously found positive return to schooling resulted primarily from the fact that men with many years of school attendance are also older. Labor market experience and job tenure also no longer seem to have a significant effect at this early stage of the individual’s labor market career. The full returns to schooling or experience may only be reaped in later years, while ability makes a difference from the start. 22
Above, we used the exclusion restrictions test to investigate whether age and marriage were validly excluded from the model. A similar model misspecification test is the RESET test (Ramsey 1969). By testing the valid exclusion of polynomials in the fitted values, it can hint toward possible functional-form misspecification. By default, the

p-values for KLS RESET tests in specification B
At the 5% significance level, we do not reject the null hypothesis of correct model specification when we use at least a third-order polynomial. However, the evidence is not too comfortable for the endogeneity correlation range that is of particular interest to us.
Because the IQ score may not be an ideal proxy for ability, let us follow Griliches (1976) by considering the knowledge in world of work (KWW) test score as an alternative proxy variable. He suggests to use one of the potential proxy variables as an instrument for the other. While we could carry out the KLS analysis again without any instrument, it is insightful to compare the results for this specification C with just-identified 2SLS estimates with the IQ score as the IV.
For brevity, detailed 2SLS results are omitted. The point estimates are 0.028 for the KWW coefficient and 0.003 for the return to schooling. The latter is neither statistically nor economically significant. These 2SLS results are now in line with our KLS evidence, and the confidence intervals are substantially smaller than with the potentially weak and invalid age and marriage instruments. A noteworthy deviation from the previous KLS results is that the 2SLS estimate of the age effect is not statistically significant.
The first-stage F statistic is 46.1, providing confidence that the instrument is sufficiently strong. The Durbin–Wu–Hausman F statistic of 8.68 with a p-value of 0.003 supports the assumption that KWW is endogenous. However, this conclusion relies on the validity of the instrument, which is untestable in the 2SLS framework because the model is just-identified. 24 The negative sign of the t statistic version of the Durbin– Wu–Hausman test further indicates a negative endogeneity correlation, in line with the measurement-error story. 25
Given that the two ability measures are not perfect substitutes, the IQ score might still have a direct effect on wages even after controlling for the KWW test score, thus violating the exclusion restriction. Before we again use our instrument-free machinery to test the valid exclusion of the IQ score, let us consider another instrument-based approach that has been proposed recently. Conley, Hansen, and Rossi (2012) propose to obtain interval estimates over a range of plausible values for the direct effect of the excluded instrument in the regression model. Because the support of this direct effect is in principle unbounded, forming a prior belief about a plausible range for it could generally be harder than agreeing on a reasonable range of endogeneity correlations. If this plausible range is chosen too large, the resulting confidence bands will become uninformatively wide. If the range is chosen too small, it might miss the true value.
Earlier, we obtained KLS estimates of a direct effect of the IQ score that is positive but below 0.018, based on the 95% union of confidence intervals within the endogeneity range [−0.4, 0]. To treat the IQ score as plausibly exogenous (PE) in the sense of Conley, Hansen, and Rossi (2012), we assume that this effect is at least halved once we control for the KWW score. For direct effects of the IQ score within the interval [0, 0.009], we can then use the
Instead of showing the output from the

KLS, 2SLS, and PE coefficient estimates and confidence intervals in specification C
While the instrument-based analysis becomes more robust if we allow the IQ score to have a (small) nonnegative direct effect, the resulting widened PE confidence bands make it harder to infer meaningful implications. 27 Most notably, we would no longer have conclusive evidence of a positive ability effect. In contrast, the KLS inference remains informative as long as we restrict our attention to a reasonable subset of endogeneity correlations.
Maintaining the assumption that the endogeneity of the ability proxy is due to measurement error and therefore negative, the KLS estimate of the ability effect is still significantly positive. The schooling and age profiles over different endogeneity values are now remarkably similar, in contrast to the earlier results with the IQ score as the ability proxy. When KWW is just mildly endogenous, the returns to both schooling and age are statistically significantly positive. Over the endogeneity range [−0.4, 0], the union of KLS confidence intervals covers [0.001, 0.041] for the ability effect, [−0.025, 0.046] for the return to schooling, and [−0.006, 0.046] for the age coefficient. All three intervals encompass the respective 2SLS point estimate. This provides some indication that the IQ score could indeed be a valid and relevant instrument. As a further investigation of this matter, let us look again at the KLS exclusion restrictions test. The p-values are shown in figure 7.

p-values for the KLS exclusion restrictions test of
If the KWW score was subject to only minor measurement error, the test would still reject the hypothesis of valid exclusion of the IQ score. The output table of the
To reinforce the trust in our KLS results, we can look at further specification tests. For example, we might suspect that squares and interaction terms of some of the regressors have predictive power. Instead of running the less specific RESET test again, we can test the exclusion restrictions for some of these terms, one at a time. The corresponding p-value curves are shown in figure 8.

p-values for various KLS exclusion restrictions tests in specification C
Most squares and interaction terms appear to be validly excluded, aside from the interaction effect between tenure and age. 29 This indicates that the return to tenure varies with age but does not yet tell us anything about the magnitude or sign of this effect. In our specification D, we therefore include this interaction term in our regression model and compute the marginal return of tenure at three different ages, 18, 24, and 30 years:
We can do this with the
We observe that the return to tenure increases with age. For the youngest, who just started their labor market careers, tenure does not determine the wage outcome, irrespective of the postulated endogeneity of the ability measure. At an age of 24 years, the point estimate of the return to tenure is positive throughout, although it is economically small and statistically significant only for a moderate endogeneity of ability. Because there is still not much difference between job-specific tenure and overall labor market experience at such an early age, it is not surprising to find no statistical difference between the two effects. For the oldest in our sample, the marginal effect of tenure rises further and is now statistically significant over the whole range of endogeneity correlations that we considered to be reasonable, r ∊ [−0.4, 0]. Moreover, we now reject the null hypothesis that the returns to tenure and experience are equal. At this age, the accumulation of job-specific knowledge and skills eventually pays off.

KLS estimates and confidence intervals in specification D of the return to
Let us scrutinize our regression specification D again with some specification tests. Figure 10 displays the results from RESET tests. The left subfigure shows p-value curves for tests based on polynomials in the fitted values. The right subfigure considers polynomials in all the right-hand-side variables. 30 To economize on the degrees of freedom, we consider only secondand third-order polynomials for this second variant of the test.

p-values for KLS RESET tests in specification D
The results are now much more reassuring than those for our initial model specification. The RESET tests with polynomials of the fitted values in the left-hand graph of figure 10 would still cause some worries if we believed in a quite strong negative endogeneity correlation of the KWW score. 31
Next we use the

p-values for KLS heteroskedasticity tests in specification D
The joint hypotheses tests do not reject the null hypothesis of no conditional heteroskedasticity within our range of most reasonable endogeneity correlations. Just for the most flexible specification, iii, we find at least one regressor in the auxiliary regression with a statistically significant coefficient for negative endogeneity correlations of at least −0.25. While we could add further interaction terms to our regression model in an attempt to mitigate any heteroskedasticity concerns, the quantitative and qualitative conclusions would hardly change. Because most specification tests are already supportive for our chosen model, we are confident that the insights we have drawn from our KLS analysis are meaningful and statistically well grounded. Having said that, the analysis stands and falls with our maintained assumption that the ability proxy has a moderately negative correlation with the error term, consistent with a measurement-error story, and that all remaining regressors are exogenous.
The KLS procedure is related to the alternative instrument-free approaches proposed by Krauth (2016) and Oster (2019). Here we briefly illustrate that all three methods coincide by translating the endogeneity correlation into the respective sensitivity parameters of the other two approaches. Krauth (2016) places bounds on an RCR parameter λ. This is the ratio of the endogeneity correlation to the correlation of the endogenous regressor with an index of the control variables. For a given choice of the endogeneity correlation, say, our lower bound rl
= −0.4, we can obtain

Corresponding RCR values λ and δ for specification D
We immediately notice that the functions
For rl
= −0.4, we have obtained
The KWW coefficient estimates from the KLS and both RCR procedures are identical. The standard errors reported by the
A disadvantage of the
While we can numerically match the coefficient estimates with the different estimators, at least as long as relevant control variables are present, measurement error as the source of endogeneity is not the ideal example for the RCR methods. The RCR sensitivity parameters are usually interpreted as a measure for “the relative selection on observables and unobservables” (Oster 2019) in an evaluation of the OLS robustness to omitted-variables bias. In this sense, measurement error would not be seen as an omitted control variable. Oster considers an additional sensitivity parameter, the maximum R-squared, that is attainable from a hypothetical regression that includes all unobserved control variables. If there is remaining unexplained variation, for instance, due to measurement error, this maximum R-squared would be smaller than its default value 1. Yet the illustrated equivalence of the three instrument-free methods only holds if this hypothetical maximum R-squared is set to 1 in Oster’s approach. This does not invalidate the KLS approach, which is completely flexible regarding the source of the endogeneity, but the corresponding RCR sensitivity parameters would have to be interpreted with caution.
In general, it might be difficult to pick reasonable intervals for δ or λ, not least because there are no natural bounds for these sensitivity parameters. 36 In contrast, the endogeneity correlation ρ is bounded by construction and a restriction of its sign is often credible.
5.2 KLS estimation with multiple endogenous regressors
For the exclusion restriction test that is underlying figure 7, the attentive reader might have noticed that we implicitly assumed the variable
The
To illustrate this approach, let us vary the endogeneity correlation of the KWW score automatically over the range [−0.75, 0.75] but choose fixed values for the correlation of the IQ score with the error term from the set {−0.4, −0.2, 0}, one at a time. We do this with a simple loop and the option

KLS coefficient estimates and confidence intervals in specification E
Notice that some of the graphs do not extend over the full range from −0.75 to 0.75. This is because the feasible range of endogeneity correlations becomes tighter when we have multiple endogenous variables. To avoid distorted pictures from very wide confidence intervals toward the boundaries, we have truncated the y axis with the
The rightmost column displays results when
We could carry out further specification tests and redo our analysis for the returns to tenure and labor market experience, but to economize on space we leave this as an exercise to the interested reader. Instead, we illustrate how one can produce three-dimensional surface plots and contour plots, varying both endogeneity correlations. To achieve this, we use the

Surface plots for the KLS coefficient estimates of
This figure highlights again the positive relationship between the return to schooling and the correlations of the ability proxies with the error term. A statistically significantly positive return to schooling is only consistent with a small negative endogeneity of both ability variables, while large negative endogeneities yield implausible statistically significantly negative returns to schooling.
6 Conclusion
In this article, we introduced the
Supplemental Material
Supplemental Material, sj-zip-1-stj-10.1177_1536867X211045575 - kinkyreg: Instrument-free inference for linear regression models with endogenous regressors
Supplemental Material, sj-zip-1-stj-10.1177_1536867X211045575 for kinkyreg: Instrument-free inference for linear regression models with endogenous regressors by Sebastian Kripfganz and Jan F. Kiviet in The Stata Journal
Footnotes
7 Acknowledgments
We thank Eric Melse and an anonymous referee for providing valuable feedback.
8 Programs and supplemental materials
To install a snapshot of the corresponding software files as they existed at the time of publication of this article, type
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
