Abstract

Watts, T. W. (2020). Academic achievement and economic attainment: Reexamining associations between test scores and long-run earnings. AERA Open, 6(2). https://doi.org/10.1177/2332858420928985
On working with the data for this project after the publication of the article, the author discovered several small coding errors involving control variables used in the analysis. In general, these errors involved responses for “I don’t know” or “NA” receiving numeric codes on several variables. None of the primary measures used in the analysis (i.e., test scores or earnings) were affected by this error. The family income variable for age 16 was also recoded because approximately 16% of cases had family income undercounted due to inconsistencies in missing data values between those reporting monthly versus weekly income (updated descriptives for age-16 family income are shown below). These coding updates led to almost no change in the key results.
The updated data-cleaning syntax has been added to the OpenICSPR page corresponding to this article, along with an additional README explaining which variables were altered (https://www.openicpsr.org/openicpsr/project/119183/). Please consult this updated README before using the posted syntax files for analytic purposes. It is recommended that anyone using the syntax files for this analysis use the updated cleaning file for data preparation. The multiple imputation and analysis files have not been altered.
For sake of clarity, the key results tables from the main text of the article have been reproduced here using the updated data preparation file with the recoded control variables. As these tables reflect, alterations to the control variables led to very minor changes in the results—with most regression coefficients altered at only the third decimal place. These minor changes do not affect the findings or conclusions from the published article. Changes to tables shown in the supplemental material for the article are similarly minor, and the updated supplemental material is available in the online version of the original article.
The corrected Tables 2 to 7 are provided below:
High School Descriptive Characteristics by Average Adult Earnings
Note. The top 50% of earners represent men or women that, on average, were at or above the 50th percentile in earnings across each of the earnings waves (sample restricted to participants who indicated full-time work at a given follow-up wave). The behavioral and personality scales were measured on a continuum from “1” to “5” (e.g., for “Timid to Aggressive” a value of “1” would indicate maximum timidness and a value of “5” would indicate maximum aggressiveness). Prop. = proportion.
Associations Between High School Test Scores and Log-Monthly Earnings Conditional on Working Full Time for Men
Note. Robust standard errors are presented in parentheses. Test scores were transformed to z scores. The “p-value of difference” rows list p values from post hoc tests that tested whether the math and reading coefficients were equal to one another. The first row of the table lists the additional control variables added in the models presented in each column (e.g., for the models listed in column C, behavioral and personality measures were added to the already included set of family background and health controls). For the full list of control variables included in each column, see Supplementary Table A1.
p < .05. **p < .01. ***p < .001.
Associations Between High School Test Scores and Log-Monthly Earnings Conditional on Working Full Time for Women
Note. See Table 3 note.
p < .05. **p < .01. ***p < .001.
Associations Between Composite Math and Reading Scores and Log-Monthly Earnings Conditional on Working Full Time for Men
Note. See Table 3 note. The “achievement composite” variable is the standardized average of the age-16 math and reading tests.
p < .05. **p < .01. ***p < .001.
Associations Between Composite Math and Reading Scores and Log-Monthly Earnings Conditional on Working Full Time for Women
Note. See Table 5 note.
p < .05. **p < .01. ***p < .001.
Pooled Models: Associations Between High School Test Scores and Log-Average Earnings Between Age 33 and Age 50 Conditional on Working Full Time
Note. Robust standard errors were adjusted for student-level clustering and are presented in parentheses. All independent variables shown were transformed to z scores. All models include the full set of controls used in the “Column F” models of Table 3. Pooled models were generated by treating each respective earnings measure (taken at ages 33, 41, 46, and 50, respectively) as an independent observation, and “follow-up” wave fixed effects were included in each regression. Columns 1 and 2 correspond to the models shown in Tables 3 and 4 (i.e., math and reading achievement modeled independently). Columns 3 and 4 correspond to the “achievement composite” estimates shown in Tables 5 and 6. In columns 5 and 6, measures indicating high school quality were added to the model. Inc. = included.
p < .05. **p < .01. ***p < .001.
